GPT-5
Benchmark scores
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| Aider Polyglot | code | #1 | 88.0% | 2025-08-23 |
| Terminal-Bench 2.0 | agents | #3 | 81.8% | 2026-03-07 |
| OpenRouter · Weekly Usage | usage | #3 | #4 | 2026-05-02 |
| Chatbot Arena | chat | #5 | 1473 | 2026-04-30 |
| SWE-bench Verified | agents | #9 | 74.4% | 2025-10-15 |