Models

/

Grok 4 Fast

We're excited to release grok-4-fast, our latest advancement in cost-efficient reasoning models.

View as Markdown

At a glance

Modalities

Text, Image

Text

Context window

2,000,000

Pricing

Capabilities

Function calling

Connect the xAI model to external tools and systems.

Structured outputs

Return responses in specific, organized formats.

Reasoning

The model can think before responding.

Pricing

Input

Tokens

$0.20/ 1M tokens

Cached tokens

$0.05/ 1M tokens

Output

Tokens

$0.50/ 1M tokens

You are charged for each token used when making calls to our API.

Using cached input tokens can significantly reduce your costs.

This model is available on multiple clusters, you can find full regional based pricing below.

We charge different rates for requests which exceed the 128K context window

Details

Model name

Aliases

Regionus-east-1, eu-west-1
Rate limits
Requests per minute600
Tokens per minute4,000,000