Talkscriber Text-to-Speech WebSocket API Documentation
Welcome to the Talkscriber Text-to-Speech WebSocket API documentation. This API is designed to provide seamless, real-time text-to-speech conversion using a secure WebSocket connection. Whether you're developing a TTS application or integrating speech synthesis capabilities into your platform, this documentation will guide you through the process.
Key Features
- Ultra-Low Latency Streaming: Speech starts in less than 0.1 seconds with real-time audio chunk processing
- Real-time Audio Playback: Get immediate audio output as text is processed
- Configurable Buffering: Adjustable buffer size for optimal latency vs. quality balance
- Multiple Output Options: Support for real-time playback, file saving, or both
- Cross-Platform: Works on Windows, macOS, and Linux
- Secure Connection: Uses Secure WebSocket (WSS) for secure communication
Quick Links
Getting Started
To start using the TTS API, connect to our test WebSocket URL:
wss://api.talkscriber.com:9099
For detailed instructions on setting up and using the API, follow the links in the Quick Links section.
Supported Audio Formats
- Sample Rate: 24kHz (matches server configuration)
- Channels: Mono (1 channel)
- Bit Depth: 16-bit PCM
- Protocol: WebSocket binary streaming