Qwen3.5 Omni Flash

Qwen3.5 Omni Flash is a multimodal model from Alibaba's Qwen series, designed for fast, efficient processing of text, images, and possibly other modalities. It is optimized for low latency, which makes it suitable for real-time interactive scenarios.

Tags: Multimodal, Fast, Coding, Reasoning, Long context
Input / 1M tokens: $0.10
Output / 1M tokens: $0.80
Output tokens/s: 253.73
First-token seconds: 1.08
Supported plans: 4
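
As a rough illustration of what the listed figures imply for a single request, here is a minimal Python sketch. The four constants are copied from the listing above; the function names and the example token counts are hypothetical, and the latency formula (first-token time plus output tokens divided by throughput) is a simplifying assumption that ignores network overhead, queueing, and any plan-specific billing.

```python
# Figures taken from the listing; everything else is illustrative.
INPUT_USD_PER_M = 0.10        # USD per 1M input tokens
OUTPUT_USD_PER_M = 0.80       # USD per 1M output tokens
OUTPUT_TOKENS_PER_S = 253.73  # measured output throughput
FIRST_TOKEN_S = 1.08          # measured time to first token, in seconds


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost at the listed per-token prices."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_latency_s(output_tokens: int) -> float:
    """Estimated wall-clock time: first-token delay plus streaming time."""
    return FIRST_TOKEN_S + output_tokens / OUTPUT_TOKENS_PER_S


if __name__ == "__main__":
    # Hypothetical request: 20,000 input tokens, 1,000 output tokens.
    cost = estimate_cost_usd(20_000, 1_000)
    latency = estimate_latency_s(1_000)
    print(f"cost ~ ${cost:.4f}, latency ~ {latency:.2f}s")
```

For that hypothetical request the estimate comes to about $0.0028 and roughly 5 seconds end to end, with output tokens dominating the cost because they are priced at eight times the input rate.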

Benchmark history

Evaluations: 9 (all measured May 14, 2026)

TAU2: 0.85
Terminal-Bench Hard: 0.08
LCR: 0.44
IFBench: 0.38
SciCode: 0.26
HLE: 0.07
GPQA: 0.74
Artificial Analysis Coding Index: 14
Artificial Analysis Intelligence Index: 25.9

Plan availability

Products and plans that support this model: 1

Apertis Coding Plan

Apertis Coding Plan is a subscription-based AI coding service providing unified access to 30+ AI models (GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, and more) through a single API key. Designed for developers using coding agents like Claude Code, Cursor, Cline, and OpenCode, it offers predictable monthly pricing, free prompt caching, auto-failover, and quota-based billing across OpenAI, Anthropic, Google, and other providers.
