AA Coding Index

Category: code · Unit: % · Last refreshed

Artificial Analysis composite coding score: equally-weighted average of SciCode, Terminal-Bench Hard, and LiveCodeBench. Higher = better.

Top 25 models

RankModelScoreCaptured
1 GPT-5 59.1% 2026-06-16
2 GPT-5.4 57.2% 2026-06-16
3 Claude Opus 4 56.7% 2026-06-16
4 Gemini 3.1 Pro 55.5% 2026-06-16
5 GPT-5.3 Codex 53.1% 2026-06-16
6 Claude Opus 4.7 53.1% 2026-06-16
7 Claude Sonnet 4.6 50.9% 2026-06-16
8 GPT-5.2 48.7% 2026-06-16
9 Claude Opus 4.6 48.1% 2026-06-16
10 Claude Opus 4.5 47.8% 2026-06-16
11 Kimi K2 47.1% 2026-06-16
12 Gemini 2.5 Pro 46.7% 2026-06-16
13 Gemini 3 Pro 46.5% 2026-06-16
14 GPT-5.1 44.7% 2026-06-16
15 GLM-5 44.2% 2026-06-16
16 Gemini 3 Flash 42.6% 2026-06-16
17 Grok 4 42.2% 2026-06-16
18 MiniMax M2 41.9% 2026-06-16
19 Claude Sonnet 4.5 38.6% 2026-06-16
20 o3 38.4% 2026-06-16
21 DeepSeek V3 37.9% 2026-06-16
22 MiniMax M2.5 37.4% 2026-06-16
23 Claude Opus 4.1 36.5% 2026-06-16
24 GLM-4.7 36.3% 2026-06-16
25 Claude Sonnet 4 34.1% 2026-06-16