Text to Speech

The Text to Speech API converts text into natural speech and is billed per input character. It supports multiple voices, streaming and batch output, and the MP3, WAV, PCM, μ-law, and A-law audio formats.

At a glance

Modalities: Text → Audio
Pricing: $4.20 / 1M characters
Region: us-east-1

Pricing

Per 1M characters: $4.20
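
As a rough illustration of per-character billing, the sketch below estimates the cost of a piece of input text at the listed rate. The helper name `estimate_cost` is hypothetical, and the sketch assumes one billed character per Python string code point; this page does not say how characters are counted, so treat the result as an estimate only.

```python
from decimal import Decimal

# Listed rate from the pricing table: $4.20 per 1M input characters.
PRICE_PER_MILLION_CHARS = Decimal("4.20")


def estimate_cost(text: str) -> Decimal:
    """Estimate the USD cost of synthesizing `text`.

    Assumption: one billed character per Python code point (len(text)).
    The actual counting rule is not documented on this page.
    """
    return PRICE_PER_MILLION_CHARS * len(text) / Decimal(1_000_000)


if __name__ == "__main__":
    sample = "Hello, world!"
    print(f"{len(sample)} characters cost about ${estimate_cost(sample)}")
```

`Decimal` is used instead of floats so that small per-request amounts do not pick up binary rounding noise when summed.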

Rate Limits

Requests per minute: 3,000 RPM
Requests per second: 50 RPS
Concurrent sessions: 100 per team
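
The two request limits above are consistent with each other (3,000 requests per minute averages 50 per second). One way a client might stay under them is to space its requests at least 1/50 of a second apart. The sketch below is a minimal client-side pacer under that assumption; the class name `RequestPacer` is hypothetical, it is not part of any SDK, and it does not track the concurrent-session limit. How the service enforces its limits is not described on this page.

```python
import time


class RequestPacer:
    """Spaces calls at least 1/max_rps seconds apart (client-side only).

    Not thread-safe: wrap wait() in a lock if several threads share one pacer.
    """

    def __init__(self, max_rps=50, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_rps
        self._clock = clock          # injectable so the pacer can be tested
        self._sleep = sleep
        self._next_allowed = None    # earliest time the next call may start

    def wait(self):
        """Block until the next request may be sent; return seconds waited."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.min_interval
        return delay


if __name__ == "__main__":
    pacer = RequestPacer(max_rps=50)
    for i in range(3):
        waited = pacer.wait()
        # A real client would send its request here.
        print(f"request {i}: waited {waited:.3f}s")
```

Even spacing is the simplest choice; a token bucket would allow short bursts while keeping the same average rate, at the cost of a little more state.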

Capabilities

  • Multiple voices
  • Streaming output
  • Batch output
  • MP3 / WAV / PCM / μ-law / A-law formats

Availability

Cluster: us-east-1

Last updated: April 15, 2026