GLM-5 (Non-reasoning)

GLM-5 (Non-reasoning) is a variant of the GLM-5 series optimized for high-speed, low-latency responses. It excels in tasks requiring quick turnaround and cost efficiency, while maintaining strong capabilities in coding, multimodal understanding, and long-context processing.

Coding · Fast · Cheap · Long context · Multimodal

Input / 1M tokens: $1.00
Output / 1M tokens: $3.20
Output tokens/s: 66.6
First-token latency: 1.36 s
Supported plans: 3
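
As a rough illustration of how the listed prices and speed figures combine, the sketch below estimates the cost and wall-clock time of a single request. The function names are hypothetical, and the latency model (first-token wait plus steady streaming at the listed rate) is a simplifying assumption, not something the listing states.

```python
# Figures copied from the listing; everything else is illustrative.
INPUT_USD_PER_M = 1.00       # input price per 1M tokens
OUTPUT_USD_PER_M = 3.20      # output price per 1M tokens
OUTPUT_TOKENS_PER_S = 66.6   # listed output throughput
FIRST_TOKEN_S = 1.36         # listed first-token latency


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the listed per-million-token prices."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_latency_s(output_tokens: int) -> float:
    """ASSUMPTION: latency = first-token wait + output_tokens / throughput."""
    return FIRST_TOKEN_S + output_tokens / OUTPUT_TOKENS_PER_S


if __name__ == "__main__":
    # Hypothetical request: 8,000 prompt tokens, 500 generated tokens.
    print(f"cost:    ${estimate_cost_usd(8_000, 500):.4f}")
    print(f"latency: {estimate_latency_s(500):.2f} s")
```

Measured throughput and latency vary by provider and load, so these estimates are only a starting point for budgeting.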

Benchmark history

Evaluations: 9

All scores measured May 14, 2026.

TAU2: 0.97
Terminal-Bench Hard: 0.39
LCR: 0.37
IFBench: 0.55
SciCode: 0.38
HLE: 0.07
GPQA: 0.67
Artificial Analysis Coding Index: 39
Artificial Analysis Intelligence Index: 40.6

CODPL speed

Provider ranking

11

Tencent Cloud (腾讯云), tencent_token_plan: 104.97 · 1,266 ms · 100% · 3 · 1h · Rank #3

Alibaba Cloud Bailian (百炼, 阿里云), aliyun_bailian: 46.88 · 881 ms · 100% · 3 · 1h · Rank #7

CTyun (天翼云), ctyun: 60.64 · 1,691 ms · 100% · 3 · 1h · Rank #8
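
The provider figures trade throughput against first-token delay, so which provider finishes first depends on response length. The sketch below makes that concrete under a loud assumption: the listing does not label its columns, and the code takes the first figure per provider as output tokens per second and the millisecond figure as first-token latency. The `Provider` class and `fastest` helper are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    tokens_per_s: float    # ASSUMPTION: first listed figure is output tokens/s
    first_token_ms: float  # ASSUMPTION: the "ms" figure is first-token latency

    def total_seconds(self, output_tokens: int) -> float:
        """First-token wait plus streaming time at a constant rate."""
        return self.first_token_ms / 1000 + output_tokens / self.tokens_per_s


# Values copied from the provider ranking; interpretation is assumed.
PROVIDERS = [
    Provider("tencent_token_plan", 104.97, 1266),
    Provider("aliyun_bailian", 46.88, 881),
    Provider("ctyun", 60.64, 1691),
]


def fastest(output_tokens: int) -> Provider:
    """Provider with the lowest estimated completion time for this length."""
    return min(PROVIDERS, key=lambda p: p.total_seconds(output_tokens))


if __name__ == "__main__":
    for n in (10, 100, 1000):
        best = fastest(n)
        print(f"{n:>5} tokens: {best.name} ({best.total_seconds(n):.2f} s)")
```

If the assumed column meanings hold, very short replies favor the provider with the lowest first-token delay, while longer replies favor the one with the highest throughput.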

Plan availability

Products and plans that support this model

1
GLM Coding Plan

GLM Coding Plan is a subscription service by Z AI (Zhipu AI) designed for AI-powered coding. It provides access to GLM models (GLM-5.1, GLM-5-Turbo, GLM-4.7, GLM-4.5-Air) through official integrations with 20+ coding tools including Claude Code, Cline, Kilo Code, Cursor, and VS Code. Plans include dedicated MCP tools for vision understanding, web search, and repository access.
