API

Keep the API story explicit: local compatibility and managed alternatives

The upstream docs cover both a local OpenAI-compatible API via `openai_api.py` and a managed DashScope option for hosted access.

OpenAI-style API · Function calling · DashScope

Two API paths

The local API example installs FastAPI, Uvicorn, `openai<1.0`, Pydantic, and `sse_starlette`, then runs `openai_api.py`.

If you do not want to run local serving infrastructure, the README separately points to DashScope as the managed API entry.

OpenAI-compatible client call

```python
# Point the openai<1.0 client at the local openai_api.py server.
import openai

openai.api_base = "http://localhost:8000/v1"
openai.api_key = "none"  # the local server does not validate the key

response = openai.ChatCompletion.create(
    model="Qwen",
    messages=[{"role": "user", "content": "你好"}],
    stream=False,
    stop=[],
)

print(response.choices[0].message.content)
```
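The openai<1.0 client above is a thin wrapper over a plain HTTP POST. The stdlib-only sketch below shows the wire format under that assumption; the URL and port assume the local `openai_api.py` server from the install step.

```python
import json
import urllib.request

# Request body in the OpenAI chat-completions shape.
payload = {
    "model": "Qwen",
    "messages": [{"role": "user", "content": "你好"}],
    "stream": False,
}

req = urllib.request.Request(
    "http://localhost:8000/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json", "Authorization": "Bearer none"},
)

# Requires the local server to be running:
# with urllib.request.urlopen(req) as resp:
#     body = json.load(resp)
#     print(body["choices"][0]["message"]["content"])
```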

API notes

Local API

Function calling

The upstream README states that function calling is supported on the local API path, though for now only with `stream=False`.
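A minimal sketch of what a function-calling request looks like, following the openai<1.0 conventions (a `functions` array in the request, a `function_call` object in the reply). The `get_weather` tool is a hypothetical example, not part of the upstream docs.

```python
import json

# Hypothetical tool schema in the OpenAI function-calling format.
functions = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    }
]

payload = {
    "model": "Qwen",
    "messages": [{"role": "user", "content": "What's the weather in Beijing?"}],
    "functions": functions,
    "stream": False,  # per the README, function calling currently requires stream=False
}

def parse_function_call(message):
    """Extract (name, arguments) from a reply message, if the model chose a tool.

    In the OpenAI format, "arguments" arrives as a JSON-encoded string.
    """
    call = message.get("function_call")
    if call is None:
        return None
    return call["name"], json.loads(call["arguments"])
```

With the local server running, `payload` is POSTed to `/v1/chat/completions` as in the client example above; a reply that invokes the tool carries a `function_call` field that `parse_function_call` unpacks.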

Managed API

DashScope

Use DashScope when you need a hosted entry point rather than a local compatibility layer.
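For the managed path, a hedged sketch of a DashScope call: it assumes the official `dashscope` Python SDK (`pip install dashscope`), a `DASHSCOPE_API_KEY` credential, and the illustrative model name "qwen-turbo"; check DashScope's own docs for the current names.

```python
def build_request(model, messages):
    """Assemble the keyword arguments passed to dashscope.Generation.call.

    result_format="message" asks DashScope for an OpenAI-style message object.
    """
    return {"model": model, "messages": messages, "result_format": "message"}

request = build_request("qwen-turbo", [{"role": "user", "content": "Hello"}])

# The actual call (requires network, the SDK, and a DASHSCOPE_API_KEY):
# import dashscope
# response = dashscope.Generation.call(**request)
# print(response.output.choices[0].message.content)
```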


Serving stack

FastChat OpenAI server

FastChat also exposes an OpenAI-compatible server as part of its vLLM deployment flow.

Complete documentation route map

The docs surface stays mirrored in a fixed order, with the current page highlighted inside the shared route map.

- Benchmarks: historical performance tables for the original Qwen release line, preserved with source attribution.
- API Surface (current page): OpenAI-compatible local API patterns, function calling, and managed API references for the original Qwen line.
- Long Context: long-context techniques and evaluation blocks for the original Qwen release line.
- FAQ: a public FAQ layer derived from the README-only source surface and the boundary conditions stated by the blueprint.

Source anchors

API Surface | Qwen Code