Advanced Voice Synthesis Features
Discover the advanced features available in Voicelab’s voice synthesis engine.Multispeaker Text to Speech
Create conversations and dialogues with multiple distinct voices in a single audio file.Basic Multispeaker Usage
Generate conversations with different speakers:Advanced Multispeaker Scenarios
Create complex dialogues for various use cases:Customer Service Training
Educational Content
Voice Selection Best Practices
Choose appropriate voices for different scenarios:Professional Contexts
- Business presentations: Use
professional-female-1orprofessional-male-1 - Corporate training: Professional voices for instructors, conversational for participants
- Customer service: Professional voices for representatives
Casual and Creative Content
- Podcasts: Mix of conversational and expressive voices
- Audiobooks: Expressive voices for characters, conversational for narration
- Social media content: Conversational voices for relatability
Voice Pairing Guidelines
When using multiple voices in a conversation:- Contrast is key: Use different genders or voice types for clarity
- Consistency: Keep the same voice for each character throughout
- Context matching: Match voice formality to the scenario
Streaming Audio Output
Voicelab automatically streams audio bytes as they are generated, providing optimal performance for real-time applications.Benefits of Streaming
- Lower latency: Audio starts playing before generation is complete
- Memory efficiency: No need to buffer entire audio files
- Better user experience: Immediate audio feedback
Implementation Tips
Error Handling and Best Practices
Common Error Scenarios
Rate Limiting Best Practices
Performance Optimization
Text Length Considerations
- Optimal length: 100-1000 characters per request
- Maximum length: 5000 characters for single TTS
- Multispeaker limits: 500 characters per line, 10 lines maximum