Excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science.
At a glance
Modalities
Context window
131,072
Pricing
Capabilities
Function calling
Connect the xAI model to external tools and systems.
Structured outputs
Return responses in specific, organized formats.
Reasoning
The model can think before responding.
Pricing
Input
Tokens
$3.00/ 1M tokens
Cached tokens
$0.75/ 1M tokens
Output
Tokens
$15.00/ 1M tokens
You are charged for each token used when making calls to our API.
Using cached input tokens can significantly reduce your costs.
This model is available on multiple clusters, you can find full regional based pricing below.
Show batch API pricing
Details
Model name
Aliases
| Region | us-east-1, eu-west-1 |
|---|---|
| Pricing per million tokens * | |
| Input | $3.00 |
| Cached input | $0.75 |
| Output | $15.00 |
| Rate limits | |
| Requests per minute | 900 |
| Tokens per minute | 4,000,000 |
| Batch pricing | |
Quickstart
from xai_sdk import Client
from xai_sdk.chat import user, system
client = Client(api_key="<YOUR_XAI_API_KEY_HERE>")
chat = client.chat.create(model="grok-3", temperature=0)
chat.append(system("You are a PhD-level mathematician."))
chat.append(user("What is 2 + 2?"))
response = chat.sample()
print(response.content)