Inference API
Chat
Chat completions
/v1/chat/completions
Create new response
/v1/responses
Retrieve previous response
/v1/responses/{response_id}
Delete previous response
/v1/responses/{response_id}
Get deferred chat completions
/v1/chat/deferred-completion/{request_id}
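As a rough illustration of how the paths listed above might be addressed from a client, the sketch below builds (but does not send) one request per endpoint. Several things here are assumptions, not taken from this listing: the base URL `https://api.example.com` is a placeholder, the HTTP methods are inferred from the endpoint names (for example, GET for "Retrieve" and DELETE for "Delete" on the same `/v1/responses/{response_id}` path), the Bearer-token header is a common convention that this API may or may not use, and the names `ENDPOINTS` and `build_request` are hypothetical helpers.

```python
from urllib.parse import quote
from urllib.request import Request

# Placeholder base URL (assumption); substitute the real API host.
BASE_URL = "https://api.example.com"

# Paths come from the listing above. The HTTP methods are assumptions
# inferred from the endpoint names, not confirmed by the listing.
ENDPOINTS = {
    "chat_completions": ("POST", "/v1/chat/completions"),
    "create_response": ("POST", "/v1/responses"),
    "retrieve_response": ("GET", "/v1/responses/{response_id}"),
    "delete_response": ("DELETE", "/v1/responses/{response_id}"),
    "deferred_chat_completion": ("GET", "/v1/chat/deferred-completion/{request_id}"),
}


def build_request(name, api_key, **path_params):
    """Build an unsent urllib Request for one of the listed endpoints.

    Raises KeyError if the endpoint name is unknown or a path
    parameter required by the template is missing.
    """
    method, template = ENDPOINTS[name]
    # Percent-encode each path parameter so it cannot alter the path structure.
    encoded = {key: quote(str(value), safe="") for key, value in path_params.items()}
    url = BASE_URL + template.format(**encoded)
    # Bearer authentication is an assumption; check the API's auth scheme.
    headers = {"Authorization": f"Bearer {api_key}"}
    return Request(url, method=method, headers=headers)


if __name__ == "__main__":
    req = build_request("retrieve_response", "YOUR_API_KEY", response_id="resp_123")
    print(req.get_method(), req.full_url)
```

Keeping the method next to the path in one table makes the retrieve/delete pair, which share a path, easy to tell apart. Request bodies for the POST endpoints are omitted because the listing does not describe their schemas.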