Models

Kimi K2.5 (Non-reasoning)

Kimi K2.5 (Non-reasoning) is a fast-response variant of the Kimi K2.5 series, optimized for low-latency interactions. It excels in rapid content generation, chat, and multimodal understanding tasks where immediate answers are prioritized over deep, step-by-step reasoning.

FastMultimodalLong context
Input / 1M tokens
$0.60
Output / 1M tokens
$3.00
Output tokens/s
56.41
First-token seconds
1.19s
Supported plans
10

Benchmark history

Evaluations

9

TAU2

Measured May 14, 2026Source

Score

0.81

Terminalbench Hard

Measured May 14, 2026Source

Score

0.19

Lcr

Measured May 14, 2026Source

Score

0.59

Ifbench

Measured May 14, 2026Source

Score

0.44

Scicode

Measured May 14, 2026Source

Score

0.4

Hle

Measured May 14, 2026Source

Score

0.12

Gpqa

Measured May 14, 2026Source

Score

0.79

Artificial Analysis Coding Index

Measured May 14, 2026Source

Score

25.8

Artificial Analysis Intelligence Index

Measured May 14, 2026Source

Score

37.3

CODPL speed

Provider ranking

8

腾讯云

tencent_token_plan

64.91
1,073 ms
100%
3
1h
Rank #6

百炼(阿里云)

aliyun_bailian

55.46
476 ms
100%
3
1h
Rank #2

Plan availability

Products and plans that support this model

2

Discussion

Thinking... Make sure you are connected to GitHub server