TAU2
Measured May 14, 2026Source
Score
0.25
Mistral
This is Mistral's latest flagship model, featuring strong reasoning and multilingual capabilities with support for a long context window.
Benchmark history
Score
0.25
Score
0.16
Score
0.35
Score
0.36
Score
0.38
Score
0.36
Score
0.47
Score
0.04
Score
0.68
Score
0.81
Score
38
Score
22.7
Score
22.8
Plan availability

Thinking... Make sure you are connected to GitHub server