Frontier LLM Comparison
All major frontier large-language models, ranked by Arena Elo, context window, and output speed — across the labs that ship them
Models
41
Avg Elo
1369
Avg Context
800K
Avg Speed (t/s)
5.2K
| Rank | Model | Arena Elo ▼ | Context | HQ |
|---|---|---|---|---|
| 🥇 | Claude Opus 4.7 Claude Opus 4.7 | 1510 | 200K | 🇺🇸 |
| 🥈 | Claude Opus 4.6 (Thinking) Claude Opus 4.6 (思考模式) | 1504 | 200K | 🇺🇸 |
| 🥉 | GPT-5.5 GPT-5.5 | 1495 | 400K | 🇺🇸 |
| 4 | Gemini 3.1 Pro Gemini 3.1 Pro | 1493 | 1000K | 🇺🇸 |
| 5 | Grok 4.20 Grok 4.20 | 1491 | 256K | 🇺🇸 |
| 6 | GPT-5.4 (High) GPT-5.4 (高推理) | 1484 | 400K | 🇺🇸 |
| 7 | Claude Sonnet 4.6 Claude Sonnet 4.6 | 1465 | 1000K | 🇺🇸 |
| 8 | GLM-5.1 GLM-5.1 | 1465 | 128K | 🇨🇳 |
| 9 | ERNIE 5.0 文心一言 5.0 | 1460 | 128K | 🇨🇳 |
| 10 | DeepSeek V4 Pro DeepSeek V4 Pro | 1455 | 1000K | 🇨🇳 |
| 11 | Gemini 3 Pro Gemini 3 Pro | 1450 | 1000K | 🇺🇸 |
| 12 | Qwen 3.6-Max-Preview 通义千问 3.6-Max-Preview | 1448 | 256K | 🇨🇳 |
| 13 | Kimi K2.6 Kimi K2.6 | 1442 | 256K | 🇨🇳 |
| 14 | GPT-5.2 GPT-5.2 | 1430 | 400K | 🇺🇸 |
| 15 | Claude Sonnet 4.5 (1M) Claude Sonnet 4.5 (1M) | 1420 | 1000K | 🇺🇸 |
| 16 | o3 o3 | 1418 | 200K | 🇺🇸 |
| 17 | Llama 5 Llama 5 | 1408 | 5000K | 🇺🇸 |
| 18 | Mistral Large 3 Mistral Large 3 | 1395 | 128K | 🇫🇷 |
| 19 | Gemini 3.1 Flash Gemini 3.1 Flash | 1378 | 1000K | 🇺🇸 |
| 20 | DeepSeek V4 Flash DeepSeek V4 Flash | 1370 | 1000K | 🇨🇳 |
| 21 | GPT-4.1 GPT-4.1 | 1365 | 1000K | 🇺🇸 |
| 22 | DeepSeek V3.2 DeepSeek V3.2 | 1355 | 128K | 🇨🇳 |
| 23 | Hunyuan 3.0 混元 3.0 | 1352 | 256K | 🇨🇳 |
| 24 | ByteDance Seed 2.0 Pro 豆包 Seed 2.0 Pro | 1340 | 256K | 🇨🇳 |
| 25 | Llama 4 Maverick Llama 4 Maverick | 1335 | 1000K | 🇺🇸 |
| 26 | Qwen3 Max 通义千问 3 Max | 1330 | 262K | 🇨🇳 |
| 27 | Gemini 3.1 Flash-Lite Gemini 3.1 Flash-Lite | 1318 | 1000K | 🇺🇸 |
| 28 | Claude Haiku 4.6 Claude Haiku 4.6 | 1310 | 200K | 🇺🇸 |
| 29 | GPT-5.5 mini GPT-5.5 mini | 1305 | 200K | 🇺🇸 |
| 30 | o3-mini o3-mini | 1295 | 200K | 🇺🇸 |
| 31 | Llama 4 Scout Llama 4 Scout | 1280 | 10000K | 🇺🇸 |
| 32 | DeepSeek R1 DeepSeek R1 | 1275 | 128K | 🇨🇳 |
| 33 | DeepSeek-Coder V4 DeepSeek-Coder V4 | 1268 | 128K | 🇨🇳 |
| 34 | Mistral 3 (14B) Mistral 3 (14B) | 1265 | 128K | 🇫🇷 |
| 35 | Qwen3-Coder Plus 通义千问3 Coder Plus | 1260 | 1000K | 🇨🇳 |
| 36 | Yi-Large 2 Yi-Large 2 | 1255 | 200K | 🇨🇳 |
| 37 | Cohere Command A Cohere Command A | 1255 | 256K | 🇨🇦 |
| 38 | Hunyuan HY 2.0 Think 混元 HY 2.0 Think | 1252 | 128K | 🇨🇳 |
| 39 | MiniMax M2.7 MiniMax M2.7 | 1245 | 1000K | 🇨🇳 |
| 40 | Hermes 4 Hermes 4 | 1240 | 128K | 🇺🇸 |
| 41 | AI21 Jamba 2 AI21 Jamba 2 | 1230 | 256K | 🇮🇱 |