Inference API

Chat


Chat completions

/v1/chat/completions


Create new response

/v1/responses


Retrieve previous response

/v1/responses/{response_id}


Delete previous response

/v1/responses/{response_id}


Get deferred chat completions

/v1/chat/deferred-completion/{request_id}