GPT-5
Benchmark scores
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| Aider Polyglot | code | #1 | 88.0% | 2025-08-23 |
| AA Coding Index | code | #1 | 59.1% | 2026-06-15 |
| Terminal-Bench 2.0 | agents | #4 | 88.1% | 2026-04-30 |
| Chatbot Arena | chat | #7 | 1489 | 2026-06-10 |
| OpenRouter · Weekly Usage | usage | #8 | #20 | 2026-06-09 |
| SWE-bench Verified | agents | #9 | 74.4% | 2025-10-15 |
| METR Time Horizon | agents | #9 | 3h23m | 2025-08-07 |