o3-mini
Benchmark scores
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| Aider Polyglot | code | #12 | 60.4% | 2025-01-31 |
| SWE-bench Verified | agents | #23 | 42.4% | 2025-02-14 |
| OpenRouter · Weekly Usage | usage | #36 | #155 | 2026-05-02 |
| Chatbot Arena | chat | #38 | 1336 | 2026-04-30 |
| Benchmark | Category | Rank | Score | Captured |
|---|---|---|---|---|
| Aider Polyglot | code | #12 | 60.4% | 2025-01-31 |
| SWE-bench Verified | agents | #23 | 42.4% | 2025-02-14 |
| OpenRouter · Weekly Usage | usage | #36 | #155 | 2026-05-02 |
| Chatbot Arena | chat | #38 | 1336 | 2026-04-30 |