Generate natural-sounding speech from text using AI voice models
POST /audio/speech
Convert text to natural-sounding speech audio.
tts-1
)
tts-1-hd
)
gpt-4o-mini-tts
)
elevenlabs
)
playai-tts
)
kokoro-82m
)
microsoft-tts
)
Provider | Special Parameters | Usage |
---|---|---|
GPT-4o Mini TTS | instructions | Natural language voice control |
Dia 1.6B | speaker_transcript , cfg_scale , cfg_filter_top_k | Advanced voice conditioning |
Microsoft TTS | speech_rate , pitch_adjustment , emotional_style | Voice modulation and emotions |
MeloTTS | lang | Language selection (en, fr, es, etc.) |
All Models | speed , temperature , top_p | Common generation controls |
cheerful
- Happy and upbeatsad
- Melancholic toneangry
- Frustrated or upsetfearful
- Nervous or scaredcalm
- Relaxed and peacefulgentle
- Soft and caringnewscast
- Professional news anchorcustomerservice
- Helpful and politeEnter your API key (starts with 'ek-')
Audio file
The response is of type file
.