阿里巴巴

Qwen3

阿里巴巴Slug: qwen-3

Qwen3 instruct models with various sizes and reasoning modes

Models

Evaluations

Plan supports

Models in this family

43 models

Qwen3 0.6B (Non-reasoning)

15 evaluations

阿里巴巴

Qwen3 0.6B is a lightweight, non-reasoning variant of the Qwen3 series with only 0.6 billion parameters. It is optimized for fast inference, low latency, and minimal resource consumption, making it suitable for edge deployment, simple conversational tasks, and applications requiring rapid response times.

Input / 1M tokens

$0.11

Output / 1M tokens

$0.42

Qwen3 0.6B (Reasoning)

15 evaluations

阿里巴巴

A lightweight reasoning model from the Qwen3 series, optimized for fast inference and cost-effective deployment. It excels in logical reasoning tasks with a focus on chain-of-thought capabilities.

Input / 1M tokens

$0.11

Output / 1M tokens

$1.26

Qwen3 1.7B (Non-reasoning)

15 evaluations

阿里巴巴

Qwen3 1.7B is a lightweight language model from Alibaba's Qwen series, optimized for fast and efficient inference. It is designed for non-reasoning tasks, providing quick responses with minimal computational resources.

Input / 1M tokens

$0.11

Output / 1M tokens

$0.42

Qwen3 1.7B (Reasoning)

15 evaluations

阿里巴巴

A compact 1.7B parameter model from Alibaba's Qwen3 series, optimized for efficient reasoning tasks. It is designed to deliver strong logical and analytical performance in resource-constrained environments, offering a balance of speed and capability.

Input / 1M tokens

$0.11

Output / 1M tokens

$1.26

Qwen3 14B (Non-reasoning)

15 evaluations

阿里巴巴

Qwen3 14B is a 14-billion parameter model from Alibaba's Qwen3 series, optimized for general-purpose dialogue and instruction following. As a non-reasoning variant, it focuses on efficient and responsive text generation, making it suitable for applications requiring quick, cost-effective, and high-quality conversational AI.

Input / 1M tokens

$0.235

Output / 1M tokens

$0.82

Qwen3 14B (Reasoning)

15 evaluations

阿里巴巴

Qwen3 14B (Reasoning) is a 14-billion parameter model from Alibaba's Qwen3 series, specifically optimized for complex reasoning tasks. It excels at chain-of-thought and step-by-step logical problem-solving, offering a strong balance between advanced reasoning capabilities and computational efficiency.

Input / 1M tokens

$0.235

Output / 1M tokens

$2.22

Qwen3 235B A22B (Non-reasoning)

15 evaluations

阿里巴巴

Qwen3 235B A22B is a large-scale Mixture-of-Experts (MoE) language model from Alibaba's Qwen series, with a total of 235 billion parameters but only 22 billion activated per inference. This non-reasoning variant is optimized for general-purpose tasks, offering strong multilingual capabilities, coding proficiency, and efficient performance due to its MoE architecture.

Input / 1M tokens

$0.45

Output / 1M tokens

$1.80

Qwen3 235B A22B (Reasoning)

15 evaluations

阿里巴巴

Qwen3 235B A22B (Reasoning) is a large-scale language model from Alibaba's Qwen3 series, optimized for complex reasoning tasks. It utilizes a Mixture-of-Experts (MoE) architecture with 235B total parameters and 22B activated parameters, balancing high performance with computational efficiency. The model excels in instruction following and multi-step logical reasoning.

Input / 1M tokens

$0.70

Output / 1M tokens

$8.40

Qwen3 235B A22B 2507 (Reasoning)

15 evaluations

阿里巴巴

This is a reasoning-optimized variant of the Qwen3 235B model from Alibaba Cloud. It is designed to excel in complex logical, mathematical, and coding tasks that require multi-step reasoning. As a large-scale model, it supports long context windows and is part of the advanced Qwen3 series.

Input / 1M tokens

$0.40

Output / 1M tokens

$2.15

Qwen3 235B A22B 2507 Instruct

15 evaluations

阿里巴巴

Qwen3 235B A22B is a large-scale Mixture-of-Experts (MoE) language model from Alibaba's Qwen series. It features 235 billion total parameters with 22 billion activated per token, designed for strong instruction following, complex reasoning, and multilingual tasks.

Input / 1M tokens

$0.20

Output / 1M tokens

$0.825

Qwen3 30B A3B (Non-reasoning)

15 evaluations

阿里巴巴

Qwen3 30B A3B is a 30-billion parameter model from Alibaba's Qwen3 series, optimized for general-purpose instruction following and fast response generation. As a non-reasoning variant, it prioritizes efficiency and speed over complex chain-of-thought tasks, making it suitable for cost-sensitive and latency-critical applications.

Input / 1M tokens

$0.08

Output / 1M tokens

$0.29

Qwen3 30B A3B (Reasoning)

15 evaluations

阿里巴巴

Qwen3 30B A3B is a reasoning-optimized language model from Alibaba, designed for enhanced logical inference and problem-solving tasks.

Input / 1M tokens

$0.09

Output / 1M tokens

$0.45

Qwen3 30B A3B 2507 (Reasoning)

15 evaluations

阿里巴巴

This is a 30-billion parameter reasoning model from Alibaba's Qwen3 series, optimized for complex logical and analytical tasks. It features enhanced chain-of-thought capabilities to improve accuracy in multi-step problem-solving.

Input / 1M tokens

$0.28

Output / 1M tokens

$1.85

Qwen3 30B A3B 2507 Instruct

15 evaluations

阿里巴巴

Qwen3 30B A3B is a 30-billion parameter instruction-tuned model from Alibaba's Qwen3 series, likely utilizing a Mixture-of-Experts architecture with 3 billion active parameters. It is optimized for strong instruction following, reasoning, and multilingual (especially Chinese) performance, balancing capability with inference efficiency.

Input / 1M tokens

$0.15

Output / 1M tokens

$0.40

Qwen3 32B (Non-reasoning)

12 evaluations

阿里巴巴

Qwen3 32B (Non-reasoning) is a 32-billion parameter instruction-tuned model from Alibaba's Qwen series. It is designed for general-purpose dialogue and content generation, balancing performance and efficiency. This model excels at following instructions and handling a wide range of tasks without specialized reasoning modes.

Input / 1M tokens

$0.15

Output / 1M tokens

$0.59

Qwen3 32B (Reasoning)

15 evaluations

阿里巴巴

Qwen3 32B (Reasoning) is a 32-billion parameter model from Alibaba's Qwen3 series, specifically optimized for complex reasoning tasks. It excels in chain-of-thought processes, logical deduction, and problem-solving, while also maintaining strong coding and long-context capabilities.

Input / 1M tokens

$0.195

Output / 1M tokens

$0.52

Qwen3 4B (Non-reasoning)

8 evaluations

阿里巴巴

Qwen3 4B (Non-reasoning) is a lightweight, 4-billion parameter language model from Alibaba's Qwen3 series, optimized for fast and cost-effective inference. It is designed for general-purpose tasks and edge deployment, offering a balance of performance and efficiency without the overhead of complex reasoning chains.

Input / 1M tokens

$0.11

Output / 1M tokens

$0.42

Qwen3

Models in this family

Qwen3 0.6B (Non-reasoning)

Qwen3 0.6B (Reasoning)

Qwen3 1.7B (Non-reasoning)

Qwen3 1.7B (Reasoning)

Qwen3 14B (Non-reasoning)

Qwen3 14B (Reasoning)

Qwen3 235B A22B (Non-reasoning)

Qwen3 235B A22B (Reasoning)

Qwen3 235B A22B 2507 (Reasoning)

Qwen3 235B A22B 2507 Instruct

Qwen3 30B A3B (Non-reasoning)

Qwen3 30B A3B (Reasoning)

Qwen3 30B A3B 2507 (Reasoning)

Qwen3 30B A3B 2507 Instruct

Qwen3 32B (Non-reasoning)

Qwen3 32B (Reasoning)

Qwen3 4B (Non-reasoning)

Qwen3 4B (Reasoning)

Qwen3 4B 2507 (Reasoning)

Qwen3 4B 2507 Instruct

Qwen3 8B (Non-reasoning)

Qwen3 8B (Reasoning)

Qwen3 Coder 30B A3B Instruct

Qwen3 Coder 480B A35B Instruct

Qwen3 Coder Next

Qwen3 Max

Qwen3 Max (Preview)

Qwen3 Max Thinking

Qwen3 Max Thinking (Preview)

Qwen3 Next 80B A3B (Reasoning)

Qwen3 Next 80B A3B Instruct

Qwen3 Omni 30B A3B (Reasoning)

Qwen3 Omni 30B A3B Instruct

Qwen3 VL 235B A22B (Reasoning)

Qwen3 VL 235B A22B Instruct

Qwen3 VL 30B A3B (Reasoning)

Qwen3 VL 30B A3B Instruct

Qwen3 VL 32B (Reasoning)

Qwen3 VL 32B Instruct

Qwen3 VL 4B (Reasoning)

Qwen3 VL 4B Instruct

Qwen3 VL 8B (Reasoning)

Qwen3 VL 8B Instruct

User ratings

Discussion