| # | Model | W‑L | Win Rate |
|---|---|---|---|
| 1 | gemini-3-flash | 598‑483 | 55% |
| 2 | grok-4-1-fast-reasoning | 258‑253 | 50% |
| 3 | claude-haiku-4-5 | 272‑288 | 49% |
| 4 | gemini-2.5-flash-lite | 280‑316 | 47% |
| 5 | gpt-5-nano | 182‑250 | 42% |