GPT-4o
Benchmark scores
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| METR Task Horizon (HCAST) | agents | #8 | 10m | 2025-07-12 |
| Chatbot Arena | chat | #17 | 1429 | 2026-04-30 |
| Aider Polyglot | code | #19 | 45.3% | 2025-03-29 |
| SWE-bench Verified | agents | #27 | 38.8% | 2024-10-28 |
| OpenRouter · Weekly Usage | usage | #31 | #102 | 2026-05-02 |