Qwen3 0.6B (Non-reasoning)
15 evaluations阿里巴巴
Qwen3 0.6B is a lightweight, non-reasoning variant of the Qwen3 series with only 0.6 billion parameters. It is optimized for fast inference, low latency, and minimal resource consumption, making it suitable for edge deployment, simple conversational tasks, and applications requiring rapid response times.
Input / 1M tokens
$0.11
Output / 1M tokens
$0.42


