Inference API

Chat

View as Markdown

Chat completions

/v1/chat/completions


Create new response

/v1/responses


Retrieve previous response

/v1/responses/{response_id}


Delete previous response

/v1/responses/{response_id}


Get deferred chat completions

/v1/chat/deferred-completion/{request_id}


Did you find this page helpful?