mingyan
OpenAI Responses API
Unified response endpoint using the OpenAI Responses API format
baseUrl
http://apillm.globalaiopc.com/gw_llm_powerCompatible with the OpenAI Responses API. You can integrate directly with the following reference:
- OpenAI Responses API: https://platform.openai.com/docs/api-reference/responses/create
Authentication
Get KeyUse the
Authorization header for authentication.Required headers:
cURL
Authorization: Bearer {{YOUR_API_KEY}}
Content-Type: application/json
Notes:
Authorization: API key in the formatBearer {{YOUR_API_KEY}}
Supported Models
The Responses API currently supports the GPT models listed in the OpenAI Chat API:
GPT
| Official Version | Official Version | Premium Account Pool | Mixed Account Pool / Proxy Version | Low-cost Version |
|---|---|---|---|---|
gpt-5.4 | gpt-5.4-official | gpt-5.4-hc | gpt-5.4-fd | gpt-5.4-low |
gpt-5.5 | gpt-5.5-official | gpt-5.5-hc | gpt-5.5-fd | gpt-5.5-low |
Version Notes
| Suffix / Type | Description |
|---|---|
| no suffix | Official version |
-official | Official version |
-hc | Premium account pool version |
-fd | Account pool / proxy version |
-low | Low-cost version |
Request Parameters
modelstringrequiredModel name to use, for example:
gpt-5.4, gpt-5.5, gpt-5.5-officialinputstring/arrayrequiredModel input. Can be a plain text string or a message array using the OpenAI Responses API format
instructionsstringDeveloper or system-level instructions that define model behavior, role, and output requirements
previous_response_idstringPrevious response ID used to continue a conversation
temperaturenumberSampling temperature to control output randomness. Recommended range is 0 to 2
top_pnumberNucleus sampling parameter. It is recommended to tune either
temperature or top_p, not bothstreambooleanWhether to enable streaming responses. When set to
true, the response is returned as SSEmax_output_tokensintegerMaximum output token count
toolsarrayTool definitions available to the model, such as function tools, web search, or file search
tool_choicestring/objectTool selection policy. The model can choose tools automatically, or you can force a specific tool
textobjectText output configuration, including structured output formats such as JSON Schema
reasoningobjectReasoning configuration. When supported by the selected model, this can set reasoning effort and related options
Input Array Structure
input[].rolestringrequiredMessage role. Supported values:
user, assistant, system, developerinput[].contentstring/arrayrequiredMessage content. Can be a plain text string or a multimodal content array
input[].content[].typestringrequiredContent type. Common values include
input_text, input_image, input_file, and output_textinput[].content[].textstringText content used when
type is input_text or output_textinput[].content[].image_urlstringImage URL used when
type is input_imageinput[].content[].file_idstringFile ID used when
type is input_fileResponse
Returns an OpenAI Responses API-compatible response format.
idstringUnique response identifier. It can be used as
previous_response_idobjectstringObject type, usually
responsecreated_atnumberResponse creation timestamp
statusstringResponse status, such as
completed, failed, or in_progressmodelstringActual model name used
outputarrayOutput array, which can contain messages, tool calls, and other output items
output[].typestringOutput item type, such as
message or function_calloutput[].content[].typestringOutput content type, such as
output_textoutput[].content[].textstringGenerated text content
output_textstringAggregated text output for direct plain-text reading
usage.input_tokensintegerInput token count
usage.output_tokensintegerOutput token count
usage.total_tokensintegerTotal token count
Notes
- The request endpoint is
http://apillm.globalaiopc.com/gw_llm_power/v1/responses - Pass the API key using the
Authorizationheader - Responses API uses
input, not the Chat Completionsmessagesfield - Streaming responses use SSE events following the OpenAI Responses API format

