Models and Pricing
Text to Speech
The Text to Speech API converts text into natural speech, billed per input character. Supports multiple voices, streaming and batch output, in MP3, WAV, PCM, μ-law, and A-law formats.
The Text to Speech API is currently in beta. Pricing and rate limits may change when the API becomes generally available.
Request increased rate limits
At a glance
| Details | |
|---|---|
| Modalities | Text → Audio |
| Pricing | $4.20 / 1M characters (Beta Pricing) |
| Region | us-east-1 |
Pricing
| Details | |
|---|---|
| Per 1M characters | $4.20 / 1M characters (Beta Pricing) |
Rate Limits
| Details | |
|---|---|
| Requests per minute | 600 RPM |
| Concurrent requests | 10 per team |
Capabilities
- Multiple voices
- Streaming output
- Batch output
- MP3 / WAV / PCM / μ-law / A-law formats
Availability
| Details | |
|---|---|
| Cluster | us-east-1 |
Documentation
- Text to Speech Guide — Getting started with text to speech
- API Reference — TTS endpoint reference
- Models and Pricing — Full pricing overview
Did you find this page helpful?