/v1/responses)是替代 Chat Completions 的新一代统一接口,主打有状态、内置工具、多模态、流式细粒度事件。下面从端点、请求体、响应体、流式事件、常用参数对比五部分完整介绍。POST /v1/responsesAuthorization: Bearer sk-xxx{
"model": "gpt-4o",
"input": "什么是Embedding?"
}{
"model": "gpt-4o-2025-04-14",
"input": [
{"role": "user", "content": "什么是Embedding?"}
],
"instructions": "你是AI技术专家,用通俗语言解释",
"max_output_tokens": 1024,
"temperature": 0.7,
"top_p": 1.0,
"store": true,
"previous_response_id": "resp_xxx",
"tools": [
{"type": "web_search"},
{"type": "code_interpreter"}
],
"tool_choice": "auto",
"response_format": {
"type": "json_schema",
"json_schema": {
"name": "Explanation",
"strict": true,
"schema": {"type": "object", "properties": {"answer": {"type": "string"}}}
}
},
"stream": false
}model:模型名(如 gpt-4o、o3)input:输入内容,支持字符串或消息数组(role/content)instructions:系统级指令(替代旧版 system 角色)max_output_tokens:最大输出 token(旧版 max_tokens)store:是否持久化响应(默认 false,多轮设 true)previous_response_id:多轮对话的上一轮响应 ID(有状态核心)tools:内置工具(web_search/code_interpreter/file_search 等)response_format:结构化输出(支持 json_schema,旧版 response_format 迁移)stream:是否流式返回(true 启用 SSE){
"id": "resp_67cb32528d6881909eb2859a55e18a85",
"object": "response",
"created_at": 1741369938.0,
"model": "gpt-4o-2024-08-06",
"output": [
{
"id": "msg_67cb3252cfac8190865744873aada798",
"type": "message",
"role": "assistant",
"content": [
{
"type": "output_text",
"text": "Embedding是将文本转为高维向量的技术...",
"annotations": []
}
]
}
],
"output_text": "Embedding是将文本转为高维向量的技术...",
"usage": {
"input_tokens": 28,
"output_tokens": 42,
"total_tokens": 70
},
"error": null,
"status": "completed"
}id:响应唯一 ID(多轮用 previous_response_id)object:固定为 responseoutput:输出数组,包含消息/工具调用/推理等项output_text:快捷获取纯文本内容(无需遍历 output)usage:token 消耗统计status:状态(completed/in_progress/failed)stream: true)event: 和 data: 开头,最后以 data: [DONE] 结束。event: response.created
data: {"id":"resp_xxx","status":"in_progress"}
event: response.completed
data: {"id":"resp_xxx","status":"completed","usage":{...}}event: response.output_text.delta
data: {"delta":"Embedding"}
event: response.output_text.delta
data: {"delta":"是将文本转为"}
event: response.output_text.done
data: {"text":"Embedding是将文本转为高维向量的技术..."}event: response.web_search_call.created
data: {"id":"wsc_xxx","action":"search","queries":["什么是Embedding"]}
event: response.web_search_call.completed
data: {"id":"wsc_xxx","status":"completed"}| 特性 | Chat Completions | Responses API |
|---|---|---|
| 端点 | /v1/chat/completions | /v1/responses |
| 输入 | messages 数组 | input(字符串/数组) |
| 系统指令 | system 角色 | instructions 字段 |
| 最大输出 | max_tokens | max_output_tokens |
| 多轮对话 | 手动传历史消息 | previous_response_id(有状态) |
| 工具调用 | 需自定义 functions | 内置 web_search/code_interpreter |
| 流式事件 | 粗粒度 choices.delta | 细粒度 output_text.delta/tool_call 事件 |