Runtime baseline
The upstream README calls out Python 3.8+, PyTorch 1.12+, Transformers 4.32+, and CUDA 11.4+ as the baseline environment; a quick check against these floors is sketched after the list below.
Flash Attention is optional, but the README recommends it for supported fp16 or bf16 devices to improve efficiency and reduce memory usage.
- Python 3.8 or newer
- PyTorch 1.12 or newer, with 2.0+ recommended
- Transformers 4.32 or newer
- CUDA 11.4+ for GPU-oriented paths
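Before adding extras, it can help to confirm that the interpreter and libraries actually meet those floors. A minimal sketch, assuming only the baseline above plus `packaging` (which ships as a Transformers dependency):

```python
# Sanity-check the environment against the README's stated floors.
import sys

import torch
import transformers
from packaging import version

assert sys.version_info >= (3, 8), "Python 3.8+ required"
assert version.parse(torch.__version__) >= version.parse("1.12"), "PyTorch 1.12+ required"
assert version.parse(transformers.__version__) >= version.parse("4.32"), "Transformers 4.32+ required"

# CUDA toolkit version this PyTorch build targets (None on CPU-only builds).
print("CUDA build:", torch.version.cuda)
```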
Quickstart flow
Install baseline dependencies
Start with `pip install -r requirements.txt` if you want the simplest source-aligned local environment.
Add Flash Attention only when the hardware supports it
Treat flash-attention as an optimization layer, not a prerequisite, because the upstream README explicitly says the project still runs without it.
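One way to honor that is to probe the environment before switching the optimization on. The helper below is a sketch, not upstream code; treating compute capability 8.0+ (Ampere or newer) as a proxy for strong bf16 flash-attention support is an assumption, since fp16 kernels also run on some older parts.

```python
# Hypothetical helper: decide whether flash-attention is worth enabling.
import importlib.util

import torch

def flash_attn_usable() -> bool:
    if not torch.cuda.is_available():
        return False
    major, _minor = torch.cuda.get_device_capability()
    has_package = importlib.util.find_spec("flash_attn") is not None
    # Assumption: compute capability 8.0+ as a proxy for bf16-capable hardware.
    return has_package and major >= 8

print("enable flash-attention:", flash_attn_usable())
```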
Load the chat checkpoint with `trust_remote_code=True`
The official quickstart shows `AutoTokenizer` and `AutoModelForCausalLM` loading the chat model directly from the public model hub.
Minimal Transformers example
The upstream quickstart centers the local experience on a direct `model.chat()` flow.
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# trust_remote_code=True is required: the chat interface lives in the
# custom modeling code shipped with the checkpoint.
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-7B-Chat", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-7B-Chat",
    device_map="auto",  # spread weights across available devices
    trust_remote_code=True,
).eval()

# First turn: history=None starts a fresh conversation.
response, history = model.chat(tokenizer, "你好", history=None)
print(response)
```
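The returned `history` feeds straight back into the next call, which is how the quickstart carries multi-turn state (the follow-up prompt here is illustrative):

```python
# Second turn: pass the accumulated history back in to keep context.
response, history = model.chat(
    tokenizer,
    "Can you introduce yourself in English?",
    history=history,
)
print(response)
```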
Where builders usually branch next
Hugging Face
Use the public Qwen organization when you want the standard open-source model-card and checkpoint flow.
ModelScope
Use ModelScope, the China-friendly distribution hub referenced throughout the original docs, when you want a mirror of the same model line.
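A sketch of that path, assuming `pip install modelscope` and that the checkpoint is namespaced as `qwen/Qwen-7B-Chat` on ModelScope; `snapshot_download` pulls the files locally, after which the Transformers flow above applies unchanged:

```python
# Sketch: fetch the same checkpoint from ModelScope instead of the HF hub.
from modelscope import snapshot_download

# Assumption: the ModelScope model ID mirrors the HF one with a lowercase namespace.
local_dir = snapshot_download("qwen/Qwen-7B-Chat")

# Point the Transformers loaders at the downloaded directory, e.g.:
# AutoTokenizer.from_pretrained(local_dir, trust_remote_code=True)
print("checkpoint downloaded to:", local_dir)
```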
Docker images
The README also points to prebuilt Docker images for faster environment setup when you do not want to build from scratch.