Models and Pricing

Text to Speech

The Text to Speech API converts text into natural speech, billed per input character. Supports multiple voices, streaming and batch output, in MP3, WAV, PCM, μ-law, and A-law formats.

The Text to Speech API is currently in beta. Pricing and rate limits may change when the API becomes generally available.

Request increased rate limits

At a glance

	Details
Modalities	Text → Audio
Pricing	$4.20 / 1M characters (Beta Pricing)
Region	us-east-1

Pricing

	Details
Per 1M characters	$4.20 / 1M characters (Beta Pricing)

Rate Limits

	Details
Requests per minute	600 RPM
Concurrent requests	10 per team

Capabilities

Multiple voices
Streaming output
Batch output
MP3 / WAV / PCM / μ-law / A-law formats

Availability

	Details
Cluster	us-east-1

Documentation

Text to Speech Guide — Getting started with text to speech
API Reference — TTS endpoint reference
Models and Pricing — Full pricing overview

Did you find this page helpful?