Models

Qwen3.5 0.8B (Non-reasoning)

A lightweight, 0.8 billion parameter model from the Qwen3.5 series, optimized for fast inference and low-cost deployment. It is designed for simple, non-reasoning tasks and is suitable for edge devices or applications requiring rapid response times.

FastCheapReasoning
Input / 1M tokens
$0.01
Output / 1M tokens
$0.05
Output tokens/s
105.07
First-token seconds
0.29s
Supported plans
0

Benchmark history

Evaluations

9

TAU2

Measured May 14, 2026Source

Score

0.65

Terminalbench Hard

Measured May 14, 2026Source

Score

0

Lcr

Measured May 14, 2026Source

Score

0.07

Ifbench

Measured May 14, 2026Source

Score

0.22

Scicode

Measured May 14, 2026Source

Score

0.03

Hle

Measured May 14, 2026Source

Score

0.05

Gpqa

Measured May 14, 2026Source

Score

0.24

Artificial Analysis Coding Index

Measured May 14, 2026Source

Score

1

Artificial Analysis Intelligence Index

Measured May 14, 2026Source

Score

9.9

Plan availability

Products and plans that support this model

0
No products or plans have been linked to this model yet.

Discussion

Thinking... Make sure you are connected to GitHub server