Model comparison
| Model | Release date | Max context | System prompt | Pretrained tokens | Q-LoRA memory | Int4 generation memory (2048 tokens) | Tool use |
|---|---|---|---|---|---|---|---|
| Qwen-1.8B | 2023-11-30 | 32K | Yes | 2.2T | 5.8 GB | 2.9 GB | Yes |
| Qwen-7B | 2023-08-03 | 32K | No | 2.4T | 11.5 GB | 8.2 GB | Yes |
| Qwen-14B | 2023-09-25 | 8K | No | 3.0T | 18.7 GB | 13.0 GB | Yes |
| Qwen-72B | 2023-11-30 | 32K | Yes | 3.0T | 61.4 GB | 48.9 GB | Yes |
Values are mirrored from the upstream README tables and should be treated as source-cited historical product data.
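As a rough sanity check on the memory columns, weight storage scales with parameter count times bits per weight. The sketch below is illustrative only: the parameter count for Qwen-7B is an assumption (the table does not list it), and real usage adds activations, KV cache, optimizer state (for Q-LoRA), and quantization overhead, which is why the measured numbers above are higher than these lower bounds.

```python
def weight_memory_gb(n_params: float, bits_per_weight: int) -> float:
    """Storage for the weights alone, in gigabytes (1 GB = 2**30 bytes)."""
    return n_params * bits_per_weight / 8 / 2**30

# Assumed parameter count for Qwen-7B (~7.7e9); illustrative only.
fp16 = weight_memory_gb(7.7e9, 16)  # ~14.3 GB for full-precision weights
int4 = weight_memory_gb(7.7e9, 4)   # ~3.6 GB for Int4-quantized weights

print(f"fp16: {fp16:.1f} GB, int4: {int4:.1f} GB")
```

The gap between the ~3.6 GB weight floor and the table's 8.2 GB figure for Qwen-7B Int4 generation is the runtime overhead the estimate deliberately ignores.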
Variant cards
Qwen-1.8B
The smallest family member still ships 32K context and system prompt support in the chat variant.
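Qwen chat checkpoints format conversations in a ChatML-style layout, which is where the system prompt support above comes in. A minimal sketch of that layout follows; the `<|im_start|>`/`<|im_end|>` pair matches the documented special tokens, but in practice you would let the tokenizer's chat template assemble this string rather than formatting it by hand.

```python
def chatml_prompt(system: str, user: str) -> str:
    """Build a ChatML-style prompt with a system turn, a user turn,
    and an open assistant turn for the model to complete."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

prompt = chatml_prompt("You are a helpful assistant.", "Hello!")
print(prompt)
```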
Qwen-7B
The most practical open deployment target in the original line, with base, chat, Int4, and Int8 checkpoints.
Qwen-14B
The 14B release pushed the original line deeper into coding and Chinese knowledge while retaining tool-use support.
Qwen-72B
The flagship open release in the original repo, combining 32K context, stronger system prompts, and the top benchmark results.
Model distribution and ecosystem
Model hubs
The official model cards are mirrored across both ModelScope and Hugging Face, so download paths are visible in both English- and Chinese-language contexts.
DashScope API
The upstream README points to DashScope when you need a managed API surface instead of local model serving.
Qwen-Agent
The tool-use and code-interpreter sections connect directly to Qwen-Agent for evaluation and agent workflows.
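The tool-use flow the README describes is ReAct-style prompting: the model is shown a tool list and a Thought/Action/Observation format, then picks a tool per turn. The builder below is a hedged illustration of that format, not Qwen-Agent's actual template; the `web_search` tool name and its description are hypothetical.

```python
def react_prompt(question: str, tools: list[dict]) -> str:
    """Assemble an illustrative ReAct-style tool-use prompt from a
    question and a list of {'name', 'description'} tool specs."""
    tool_lines = "\n".join(f"{t['name']}: {t['description']}" for t in tools)
    names = ", ".join(t["name"] for t in tools)
    return (
        "Answer the following questions as best you can. "
        "You have access to the following tools:\n\n"
        f"{tool_lines}\n\n"
        "Use the following format:\n"
        "Question: the input question\n"
        "Thought: reasoning about what to do\n"
        f"Action: one of [{names}]\n"
        "Action Input: the input to the action\n"
        "Observation: the result of the action\n"
        "Final Answer: the final answer\n\n"
        f"Question: {question}\n"
    )

# Hypothetical tool spec for illustration only.
tools = [{"name": "web_search", "description": "Search the web for a query."}]
print(react_prompt("What is Qwen-72B's context length?", tools))
```

Qwen-Agent packages this loop (prompt construction, action parsing, observation feedback) so you do not have to maintain the template yourself.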
qwen.cpp
The original README highlights qwen.cpp as a lighter runtime path for the historical model line.