Responses API

接口：

text

POST /v1/responses

SDK：

python

client.responses.create(...)

请求字段

字段	类型	必填	说明
`model`	string	是	使用 `/v1/models` 返回的模型 ID
`input`	string/array	是	字符串或 OpenAI message 数组
`instructions`	string	否	系统指令
`max_output_tokens`	number	否	最大输出 token 数
`temperature`	number	否	采样温度
`top_p`	number	否	核采样参数
`stream`	boolean	否	`true` 时返回 Responses SSE
`tools`	array	否	支持 function tools
`tool_choice`	string/object	否	支持 `auto`、`none`、`required` 和指定函数
`metadata`	object	否	请求元数据

响应字段

非流式返回 Responses 风格对象：

字段	说明
`id`	response ID
`object`	`response`
`status`	`completed`、`cancelled` 等
`model`	响应模型
`output`	输出数组
`output_text`	SDK 侧可聚合文本
`usage`	token 用量

流式返回 Responses SSE。事件类型：

事件	说明
`response.created`	请求已创建，包含 `response.id`（取消请求需用此 id）
`response.output_item.added`	输出项开始
`response.content_part.added`	内容块开始
`response.output_text.delta`	文本增量
`response.output_text.done`	文本输出结束
`response.content_part.done`	内容块结束
`response.output_item.done`	输出项结束
`response.completed`	正常完成
`response.cancelled`	请求被取消（`output` 可能为空）
`response.failed`	请求失败，payload 含 `error.code` / `error.message`

取消请求

接口：

text

POST /v1/responses/{response_id}/cancel

SDK：

python

client.responses.cancel("resp_xxx")

HTTP：

bash

curl -X POST https://llm.xiaoyue9527.xyz/v1/responses/resp_xxx/cancel \
  -H 'Authorization: Bearer sk-gtw-REPLACE_ME'

边界

边界	说明
`previous_response_id`	暂不支持
background mode	暂不支持
文件输入	暂不支持
内置工具	暂不支持
取消范围	仅可取消当前进程内仍活跃的流式 Responses 请求；取消凭证 `response_id` 需从 `response.created` 事件获取
断流缓存	`GET /v1/responses/{request_id}` 目前仅命中 Chat/Messages 流式写入的缓存；Responses 流式请求暂不写入断流缓存，断流后无法凭 request_id 取回，客户端需自行重试

示例

bash

curl -N https://llm.xiaoyue9527.xyz/v1/responses \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer sk-gtw-REPLACE_ME' \
  --data-raw '{
    "model": "qwen3.6-plus",
    "input": "解释什么是 RESTful API",
    "max_output_tokens": 2048,
    "stream": true
  }'

Responses API ​

请求字段 ​

响应字段 ​

取消请求 ​

边界 ​

示例 ​

Responses API

请求字段

响应字段

取消请求

边界

示例