GPT-4
Benchmark scores
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| METR Time Horizon | agents | #19 | 4m | 2023-03-14 |
| SWE-bench Verified | agents | #32 | 22.4% | 2024-04-02 |
| OpenRouter · Weekly Usage | usage | #51 | #308 | 2026-06-09 |
| AA Coding Index | code | #54 | 13.1% | 2026-06-15 |
| Chatbot Arena | chat | #63 | 1206 | 2026-06-10 |