Models

Text to Speech

The Text to Speech API converts text into natural speech, billed per input character. Supports multiple voices, streaming, and batch output, in MP3, WAV, PCM, μ-law, and A-law formats.

How to increase my rate limits?


At a glance

Details
ModalitiesText → Audio
Pricing$15.00 / 1M chars
Regionus-east-1

Pricing

Details
Per 1M chars$15.00 / 1M chars

Rate Limits

Details
Requests per minute3,000 RPM
Requests per second50 RPS
Concurrent sessions100 per team

Capabilities

  • Multiple voices
  • Streaming output
  • Batch output
  • MP3 / WAV / PCM / μ-law / A-law formats

Availability

Details
Clusterus-east-1

Documentation


Last updated: May 27, 2026