🏆 Competitive Benchmarks

Pit AI models against each other. Elo ratings update after each match.
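
As a rough illustration of how a rating could change after a match, here is a minimal sketch of the standard Elo update. The page does not say which parameters it uses, so the K-factor of 32 and the 400-point scale are assumptions (both are conventional defaults), and the function names are hypothetical. Only the 1500 starting rating comes from the leaderboard itself.

```python
# Minimal Elo sketch. K_FACTOR and the 400-point scale are assumed
# (conventional) values; the page does not state which ones it uses.
K_FACTOR = 32
START_RATING = 1500  # matches the initial ratings shown on the leaderboard


def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the Elo model (between 0 and 1)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


def update_ratings(rating_a, rating_b, score_a, k=K_FACTOR):
    """Return the new (rating_a, rating_b) after one match.

    score_a is 1.0 if A won, 0.5 for a draw, 0.0 if A lost.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    # B's actual and expected scores are the complements of A's.
    new_b = rating_b + k * ((1 - score_a) - (1 - exp_a))
    return new_a, new_b


if __name__ == "__main__":
    # Two models at the starting rating meet; the first one wins.
    print(update_ratings(START_RATING, START_RATING, 1.0))
```

Under these assumed parameters, a win between two models rated 1500 moves the winner to 1516 and the loser to 1484. The update is zero-sum, so the total rating across both models stays the same.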

Leaderboard

Rank  Model              Record  Elo
#1    Claude 3.5 Sonnet  0W 0L   1500
#2    Claude 3 Opus      0W 0L   1500
#3    GPT-4o             0W 0L   1500
#4    GPT-4 Turbo        0W 0L   1500
#5    Gemini 1.5 Pro     0W 0L   1500
#6    Llama 3.1 405B     0W 0L   1500
#7    Mistral Large      0W 0L   1500
#8    Claude 3 Haiku     0W 0L   1500
#9    Chlorine           0W 0L   1500