The Text to Speech API converts text into natural speech, billed per input character. Supports a full lineup of expressive voices, streaming, and batch output, in MP3, WAV, PCM, μ-law, and A-law formats.

How to increase my rate limits?

At a glance

	Details
Modalities	Text → Audio
Pricing	$15.00 / 1M chars
Region	us-east-1

Pricing

	Details
Per 1M chars	$15.00 / 1M chars

Rate Limits

	Details
Requests per second	50 RPS
Concurrent sessions	100 per team

Capabilities

Expressive built-in voices
Streaming output
Batch output
MP3 / WAV / PCM / μ-law / A-law formats

Availability

	Details
Cluster	us-east-1

Documentation

Text to Speech Guide — Getting started with text to speech
API Reference — Text to Speech endpoint reference
Pricing — Full pricing overview

Last updated: July 23, 2026