TAU2
Measured May 14, 2026Source
Score
0.2
Devstral Medium is a mid-sized model from Mistral's Devstral family, specifically optimized for code generation and software engineering tasks. It balances strong coding and reasoning capabilities with efficient performance and cost-effectiveness.
Benchmark history
Score
0.2
Score
0.09
Score
0.29
Score
0.3
Score
0.05
Score
0.07
Score
0.71
Score
0.29
Score
0.34
Score
0.04
Score
0.49
Score
0.71
Score
4.7
Score
15.9
Score
18.7
Plan availability

Thinking... Make sure you are connected to GitHub server