QwenLM/Qwen3-Coder
💜 Qwen Chat | 🤗 Hugging Face | 🤖 ModelScope | 📑 Blog | 📖 Documentation
| 🌍 WebDev | 💬 WeChat (微信) | 🫨 Discord | 📄 Arxiv | 👽 Qwen Code
Visit our Hugging Face or ModelScope organization (click links above), search checkpoints with names starting with Qwen3-Coder-, and you will find all you need! Enjoy!
Qwen3-Coder: Agentic Coding in the World.
Introduction
Today, we’re announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder is available in multiple sizes, but we’re excited to introduce its most powerful variant first: Qwen3-Coder-480B-A35B-Instruct — a 480B-parameter Mixture-of-Experts model with 35B active parameters, offering exceptional performance in both coding and agentic tasks. Qwen3-Coder-480B-A35B-Instruct sets new state-of-the-art results among open models on Agentic Coding, Agentic Browser-Use, and Agentic Tool-Use, comparable to Claude Sonnet.
💻 Significant Performance: state-of-the-art results among open models on Agentic Coding, Agentic Browser-Use, and other foundational coding tasks, comparable to Claude Sonnet;
📚 Long-context Capabilities: native support for 256K tokens, extendable up to 1M tokens using YaRN, optimized for repository-scale understanding;
🛠 Agentic Coding: supports most platforms, such as Qwen Code and CLINE, and features a specially designed function call format.
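The arithmetic behind extending the native 256K context to roughly 1M tokens can be sketched as follows. This is an illustration under stated assumptions: it reads 256K as 262,144 tokens and 1M as 1,048,576 tokens, and the returned dictionary only mirrors the general shape of a `rope_scaling` entry in a Hugging Face style model config. The helper name `yarn_scaling_config` is hypothetical, and the exact keys and values for Qwen3-Coder should be taken from the official documentation.

```python
NATIVE_CONTEXT = 256 * 1024    # 262,144 tokens (assumed meaning of "256K")
TARGET_CONTEXT = 1024 * 1024   # 1,048,576 tokens (assumed meaning of "1M")


def yarn_scaling_config(native, target):
    """Return a hypothetical rope_scaling-style dict for a YaRN extension."""
    if target < native:
        raise ValueError("target context must not be smaller than native context")
    return {
        "rope_type": "yarn",                          # assumed key name
        "factor": target / native,                    # how far positions are stretched
        "original_max_position_embeddings": native,   # assumed key name
    }


config = yarn_scaling_config(NATIVE_CONTEXT, TARGET_CONTEXT)
print(config["factor"])
```

Under these assumptions the scaling factor is simply the ratio of the target length to the native length.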
Basic information
- ✨ Supporting long context understanding and generation with the context length of 256K tokens;
- ✨ Supporting 358 coding languages;
- ✨ Retaining strengths in math and general capabilities from the base model.
[!Important]
Qwen3-Coder function calling relies on our new tool parser, qwen3coder_tool_parser.py, available here. We updated both the special tokens and their corresponding token IDs to maintain consistency with Qwen3. Please make sure to use the new tokenizer.
| model name | type | length | Download |
|---|---|---|---|
| Qwen3-Coder-480B-A35B-Instruct | instruct | 256k | 🤗 Hugging Face • 🤖 ModelScope |
| Qwen3-Coder-480B-A35B-Instruct-FP8 | instruct | 256k | 🤗 Hugging Face • 🤖 ModelScope |
Detailed performance and introduction are shown in this 📑 blog.
Quick Start
[!Important]
Qwen3-Coder-480B-A35B-Instruct is an instruction model for chatting. This model supports only non-thinking mode and does not generate <think></think> blocks in its output. Specifying enable_thinking=False is no longer required.
👉🏻 Chat with Qwen3-Coder-480B-A35B-Instruct
You can chat with Qwen3-Coder-480B-A35B-Instruct in just a few lines of code with transformers. Essentially, we build the tokenizer and the model with the from_pretrained method, and we use the generate method to chat, with the help of the chat template provided by the tokenizer. Below is an example of how to chat with Qwen3-Coder-480B-A35B-Instruct:
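A minimal sketch of that flow, assuming the Hugging Face repo id `Qwen/Qwen3-Coder-480B-A35B-Instruct` and enough accelerator memory to load the checkpoint. The helper names `build_messages` and `chat` are illustrative and not part of any library:

```python
MODEL_NAME = "Qwen/Qwen3-Coder-480B-A35B-Instruct"  # assumed Hugging Face repo id


def build_messages(prompt, system_prompt=None):
    """Format a system prompt (optional) and a user turn as a messages list."""
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": prompt})
    return messages


def chat(prompt, model_name=MODEL_NAME, max_new_tokens=512):
    """Load the model and return its reply to a single user prompt."""
    # Imported lazily so build_messages works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name, torch_dtype="auto", device_map="auto"
    )

    # Render the messages with the tokenizer's chat template.
    text = tokenizer.apply_chat_template(
        build_messages(prompt), tokenize=False, add_generation_prompt=True
    )
    model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

    generated_ids = model.generate(**model_inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the newly generated reply is decoded.
    new_tokens = [
        output[len(inputs):]
        for inputs, output in zip(model_inputs.input_ids, generated_ids)
    ]
    return tokenizer.batch_decode(new_tokens, skip_special_tokens=True)[0]
```

Calling `chat("Write a quick sort algorithm.")` would download the weights and generate a reply; nothing is loaded until `chat` is called.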
The apply_chat_template() function is used to convert the messages into a format that the model can understand.
The add_generation_prompt argument appends a generation prompt, <|im_start|>assistant\n, to the input. As in our previous releases, chat models use the ChatML template.
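To make the effect of add_generation_prompt concrete, here is a simplified pure-Python rendering of a ChatML-style prompt. It is an illustration only: the function `render_chatml` is hypothetical, the `<|im_end|>` turn terminator is assumed from the usual ChatML convention, and the real template shipped with the tokenizer handles more cases (tools, for example).

```python
def render_chatml(messages, add_generation_prompt=True):
    """Render messages in a simplified ChatML layout (illustrative only)."""
    parts = []
    for message in messages:
        # Each turn is wrapped as <|im_start|>{role}\n{content}<|im_end|>\n
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues with its reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


example = render_chatml(
    [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hi"},
    ]
)
print(example)
```

With `add_generation_prompt=False` the rendered text stops after the last `<|im_end|>`, which is the form used for complete dialogs rather than for prompting a reply.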
The max_new_tokens argument sets the maximum length of the response, and the tokenizer.batch_decode() function decodes the response. On the input side, the messages list above is an example of how to format your dialog history and system prompt.
You can use instruct models of other sizes in the same way.
Fill in the middle with Qwen3-Coder-480B-A35B-Instruct
The code insertion task, also referred to as the “fill-in-the-middle” challenge, requires the insertion of code segments in a manner that bridges the gaps within a given code context. For an approach aligned with best practices, we recommend adhering to the formatting guidelines outlined in the paper “Efficient Training of Language Models to Fill in the Middle”[arxiv].
The prompt should be structured as follows:
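As a sketch of the prefix-suffix-middle layout described in that paper, the prompt can be assembled as below. The special token strings `<|fim_prefix|>`, `<|fim_suffix|>`, and `<|fim_middle|>` are assumptions based on the tokens used by Qwen coder tokenizers; check them against the tokenizer you load. The helper `build_fim_prompt` is hypothetical.

```python
# Assumed special tokens; verify against the tokenizer's vocabulary.
FIM_PREFIX = "<|fim_prefix|>"
FIM_SUFFIX = "<|fim_suffix|>"
FIM_MIDDLE = "<|fim_middle|>"


def build_fim_prompt(prefix_code, suffix_code):
    """Place the code before and after the gap around the FIM markers."""
    # The model is expected to generate the missing middle after FIM_MIDDLE.
    return f"{FIM_PREFIX}{prefix_code}{FIM_SUFFIX}{suffix_code}{FIM_MIDDLE}"


fim_prompt = build_fim_prompt("def add(a, b):\n", "    return result\n")
print(fim_prompt)
```

The prompt ends with the middle marker, so whatever the model generates next is the code meant to fill the gap.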
Following the approach mentioned, an example would be structured in this manner:
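A concrete illustration with a quicksort function whose `left` partition is missing. The special tokens are the same assumed strings as in the Qwen coder tokenizers, and `example_middle` is a hand-written stand-in for what a model might return, not real model output:

```python
prefix_code = (
    "def quicksort(arr):\n"
    "    if len(arr) <= 1:\n"
    "        return arr\n"
    "    pivot = arr[len(arr) // 2]\n"
)
suffix_code = (
    "    middle = [x for x in arr if x == pivot]\n"
    "    right = [x for x in arr if x > pivot]\n"
    "    return quicksort(left) + middle + quicksort(right)\n"
)

# Prompt in prefix-suffix-middle order (token strings are assumptions).
prompt = (
    "<|fim_prefix|>" + prefix_code
    + "<|fim_suffix|>" + suffix_code
    + "<|fim_middle|>"
)

# Hand-written stand-in for the model's completion of the gap.
example_middle = "    left = [x for x in arr if x < pivot]\n"

# Stitch the completion back between the prefix and the suffix.
completed_source = prefix_code + example_middle + suffix_code

namespace = {}
exec(completed_source, namespace)  # define quicksort from the stitched source
print(namespace["quicksort"]([3, 1, 2]))
```

In practice the text generated after the middle marker takes the place of `example_middle` before the pieces are stitched together.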
Use Cases
Example: Physics-Based Chimney Demolition Simulation with Controlled Explosion
Prompt with Qwen Chat Web Dev
Example: Multicolor and Interactive Animation
Prompt with Cline [act mode]
Example: 3D Google Earth
Prompt with Qwen Chat Web Dev
Example: Testing Your WPM with a Famous Quote
Prompt with Qwen-Code CLI
Example: Bouncing Ball in Rotation Hypercube
Prompt with Qwen Chat Web Dev
Example: Solar System Simulation
Prompt with Cline [act mode]
Example: DUET Game
Prompt with Cline [act mode]
Citation
If you find our work helpful, please cite it.
Contact Us
If you would like to leave a message for our research team or product team, join our Discord or WeChat groups!