Models and Pricing

Speech to Text

View as Markdown

The Speech to Text API transcribes audio into text. Use the REST endpoint for file-based batch transcription, or the streaming endpoint for real-time low-latency transcription.

How to increase my rate limits?


At a glance

Details
ModalitiesAudio → Text
REST pricing$0.10 / hr
Streaming pricing$0.20 / hr
Regionus-east-1

Pricing

Details
REST (per hour)$0.10 / hr
Streaming (per hour)$0.20 / hr

Rate Limits

RESTStreaming
RPM (Requests per minute)600600
RPS (Requests per second)1010
Concurrent sessions100 per team

Capabilities

  • REST and streaming transcription
  • Multiple audio formats (WAV, MP3, WebM, OGG, M4A)
  • Multiple languages
  • Real-time interim results (streaming)

Availability

Details
Clusterus-east-1

Documentation


Did you find this page helpful?

Last updated: April 15, 2026