GLM-4.5V (Non-reasoning)

GLM-4.5V is a multimodal model from Z AI optimized for fast, non-reasoning tasks. It excels at processing visual inputs alongside text and is tuned for efficient, low-latency responses, particularly for Chinese language contexts.

Multimodal, Fast, Coding
Input / 1M tokens: $0.60
Output / 1M tokens: $1.80
Output tokens/s: 49.23
First-token latency: 30.02 s
Supported plans: 3
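
The listed prices and speeds can be turned into a rough per-request estimate. The sketch below is illustrative only: the function names are hypothetical, it assumes billing is strictly linear in token counts, and it assumes response time is the first-token latency plus output tokens divided by the listed throughput. Actual billing and latency may differ.

```python
# Figures copied from the listing above.
INPUT_USD_PER_M = 0.60       # USD per 1M input tokens
OUTPUT_USD_PER_M = 1.80      # USD per 1M output tokens
OUTPUT_TOKENS_PER_S = 49.23  # listed output throughput
FIRST_TOKEN_S = 30.02        # listed first-token latency in seconds


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost, assuming purely linear per-token pricing."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_latency_s(output_tokens: int) -> float:
    """Estimate total response time: first-token wait plus streaming time."""
    return FIRST_TOKEN_S + output_tokens / OUTPUT_TOKENS_PER_S


if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens, 500 output tokens.
    print(f"cost: ${estimate_cost_usd(2_000, 500):.4f}")
    print(f"latency: {estimate_latency_s(500):.1f} s")
```

Under these assumptions, the first-token latency dominates short responses, so the throughput figure matters mainly for long outputs.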

Benchmark history

Evaluations: 15

TAU2: 0.2 (measured May 14, 2026)
Terminal-Bench Hard: 0.07 (measured May 14, 2026)
LCR: 0 (measured May 14, 2026)
IFBench: 0.29 (measured May 14, 2026)
AIME 25: 0.15 (measured May 14, 2026)
SciCode: 0.19 (measured May 14, 2026)
LiveCodeBench: 0.35 (measured May 14, 2026)
HLE: 0.04 (measured May 14, 2026)
GPQA: 0.57 (measured May 14, 2026)
MMLU Pro: 0.75 (measured May 14, 2026)
Artificial Analysis Math Index: 15.3 (measured May 14, 2026)
Artificial Analysis Coding Index: 10.8 (measured May 14, 2026)
Artificial Analysis Intelligence Index: 12.7 (measured May 14, 2026)
AIME: 0.87 (measured May 14, 2026)
MATH 500: 0.98 (measured May 14, 2026)

Plan availability

Products and plans that support this model: 1
GLM Coding Plan

GLM Coding Plan is a subscription service by Z AI (Zhipu AI) designed for AI-powered coding. It provides access to GLM models (GLM-5.1, GLM-5-Turbo, GLM-4.7, GLM-4.5-Air) through official integrations with 20+ coding tools including Claude Code, Cline, Kilo Code, Cursor, and VS Code. Plans include dedicated MCP tools for vision understanding, web search, and repository access.
