Rankings based on user votes and Elo ratings

| # | Model (10/10) | Elo | Win % | Votes | Record (W-L-T) |
|---|---|---|---|---|---|
| 1 | GPT-5.2 | 1291 | 51.2% | 123 | 63W-36L-24T |
| 2 | GLM 5 | 1270 | 41.8% | 141 | 59W-57L-25T |
| 3 | Claude Sonnet 4.6 | 1230 | 39.0% | 123 | 48W-56L-19T |
| 4 | Claude Haiku 4.5 | 1222 | 37.0% | 135 | 50W-62L-23T |
| 5 | Functionary Swahili Mini | 1214 | 45.7% | 35 | 16W-13L-6T |
| 6 | Functionary Swahili Large | 1214 | 47.5% | 122 | 58W-45L-19T |
| 7 | GPT-oss-120B | 1201 | 43.6% | 117 | 51W-48L-18T |
| 8 | Claude Sonnet 4.5 | 1129 | 39.6% | 111 | 44W-43L-24T |
| 9 | Rnj 1 Instruct | 1101 | 41.4% | 128 | 53W-56L-19T |
| 10 | GPT-5 Nano | 1099 | 31.2% | 125 | 39W-69L-17T |
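
The Win % column is consistent with wins divided by total votes, with ties counted in the denominator (for example, 63 / 123 ≈ 51.2% for GPT-5.2). The sketch below reproduces that arithmetic from a Record string. The helper names `win_pct` and `elo_expected` are hypothetical, and the expected-score function uses the textbook Elo formula with a 400-point scale; the page does not state which Elo variant or parameters it actually uses, so treat that part as an assumption.

```python
def win_pct(record: str) -> float:
    """Win percentage from a 'W-L-T' record string such as '63W-36L-24T'.

    Ties count toward the denominator, which matches the table's Win % column.
    """
    # Strip the trailing letter (W, L, T) from each part and convert to int.
    wins, losses, ties = (int(part[:-1]) for part in record.split("-"))
    return round(100 * wins / (wins + losses + ties), 1)


def elo_expected(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Textbook Elo expected score of A against B.

    Assumption: the 400-point scale is the conventional default, not a value
    confirmed by this leaderboard.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    print(win_pct("63W-36L-24T"))    # GPT-5.2 record from the table
    print(elo_expected(1291, 1270))  # GPT-5.2 vs GLM 5 under the assumed scale
```

Under this formula, a small rating gap such as the 21 points between the top two models corresponds to an expected score only slightly above 0.5, so adjacent ranks should not be read as large differences.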

Rank by category (1 = best)

| Model (10/10) | Overall | Long Prompt | Hard Prompt | Math | Instruction-Following | Coding |
|---|---|---|---|---|---|---|
| GPT-5.2 | 1 | — | 3 | 7 | 1 | 8 |
| GLM 5 | 2 | — | 8 | 3 | 2 | 4 |
| Claude Sonnet 4.6 | 3 | — | 2 | 5 | 7 | 7 |
| Claude Haiku 4.5 | 4 | — | 7 | 6 | 5 | 1 |
| Functionary Swahili Mini | 5 | — | 4 | — | 6 | 5 |
| Functionary Swahili Large | 6 | — | 1 | — | 8 | 2 |
| GPT-oss-120B | 7 | — | 6 | 1 | 9 | 6 |
| Claude Sonnet 4.5 | 8 | — | 5 | 4 | 4 | 9 |
| Rnj 1 Instruct | 9 | — | 10 | 2 | 3 | 3 |
| GPT-5 Nano | 10 | — | 9 | 8 | 10 | 10 |