BridgeBench

Leaderboard Overview

See how leading AI coding models stack up across algorithms, debugging, refactoring, generation, security, and speed. Each card provides a snapshot of the top performers in that category.

Hallucination

Apr 2 · 2h ago
Rank  Model                         Score  Fab %
1     Grok 4.20 Reasoning           91.8   10.0%
2     Claude Opus 4.6               87.6   16.7%
3     GPT-5.4                       86.1   16.7%
4     Qwen 3.6 Plus Preview (Free)  79.7   26.5%
5     Gemini 3.1 Pro                79.1   26.7%
6     Qwen3.5 Plus 2026-02-15       77.3   29.0%
7     Claude Sonnet 4.6             76.6   28.9%
8     Grok 4.20 (Non-Reasoning)     76.1   29.7%
9     Gemini 3 Pro                  75.9   30.0%
10    Claude Haiku 4.5              73.0   34.2%

Speed

Apr 2 · 2h ago
Rank  Model                         tok/s  TTFT
1     Grok 4.20 (Non-Reasoning)     243.3  1999ms
2     Grok 4.20 Reasoning           237.7  1497ms
3     GPT-5.4 Mini                  236.4  233ms
4     GPT-5.4 Nano                  227.8  941ms
5     GLM 5V Turbo                  221.2  5444ms
6     Qwen 3.6 Plus Preview (Free)  158    11520ms
7     Gemini 3.1 Pro                122.2  7608ms
8     Claude Sonnet 4.6             95.3   1207ms
9     Qwen3.5 Plus 2026-02-15       94.6   14952ms
10    Claude Opus 4.6               92.2   1922ms
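
The Speed card ranks by tok/s, but the TTFT column also affects how long a reply takes to arrive. As a rough sketch, and not something BridgeBench itself publishes, end-to-end time can be approximated as TTFT plus the output length divided by throughput. The function name, the chosen output lengths, and the approximation itself are assumptions for illustration; the three (tok/s, TTFT) pairs are read from the Speed table.

```python
def estimated_latency_ms(ttft_ms: float, tok_per_s: float, output_tokens: int) -> float:
    """Approximate end-to-end time in ms: wait for the first token, then stream the rest.

    Assumes a constant streaming rate, which real deployments only approximate.
    """
    return ttft_ms + output_tokens / tok_per_s * 1000.0


# (tok/s, TTFT in ms) for three rows of the Speed table.
speed = {
    "Grok 4.20 (Non-Reasoning)": (243.3, 1999),
    "GPT-5.4 Mini": (236.4, 233),
    "Claude Opus 4.6": (92.2, 1922),
}


def rank_by_latency(output_tokens: int) -> list[str]:
    """Order the sampled models from fastest to slowest for a given output length."""
    return sorted(
        speed,
        key=lambda m: estimated_latency_ms(speed[m][1], speed[m][0], output_tokens),
    )


# Hypothetical output lengths: a short answer and a very long generation.
for n in (100, 20000):
    print(n, rank_by_latency(n))
```

Under this approximation, a model with a low TTFT can finish a short reply sooner than one with slightly higher throughput, so the tok/s ordering and the perceived-speed ordering need not match.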
Coming Soon

Overall

Rank  Model                   Score
1     GPT-5.4                 95.5
2     GPT-5.4 Mini            94.8
3     GPT-5.4 Nano            92.9
4     GPT-4.1                 91.8
5     Qwen 3.5 35B-A3B        91.7
6     Claude Sonnet 4.5       90.7
7     Qwen 3.5 122B-A10B      90.0
8     o3-mini                 89.6
9     Qwen 3.5 27B            89.5
10    Gemini 2.5 Pro          88.9
Coming Soon

Algorithms

Rank  Model                   Score
1     GPT-5.4 Mini            99.0
2     GPT-5.4                 98.9
3     GPT-5.4 Nano            97.8
4     Qwen 3.5 122B-A10B      94.9
5     Qwen 3.5 35B-A3B        94.7
6     Qwen 3.5 27B            94.5
7     GPT-4.1                 92.7
8     o3-mini                 90.3
9     Gemini 2.5 Pro          89.8
10    Claude Sonnet 4.5       89.6
Coming Soon

Debugging

Rank  Model                   Score
1     GPT-5.4                 96.4
2     GPT-5.4 Mini            96.4
3     GPT-5.4 Nano            96.0
4     Qwen 3.5 35B-A3B        96.0
5     Qwen 3.5 122B-A10B      94.1
6     GPT-4.1                 93.8
7     Qwen 3.5 27B            93.2
8     Claude Sonnet 4.5       92.5
9     o3-mini                 91.4
10    Gemini 2.5 Pro          90.6
Coming Soon

Refactoring

Rank  Model                   Score
1     GPT-5.4 Nano            98.3
2     GPT-5.4                 97.9
3     GPT-5.4 Mini            97.6
4     Claude Sonnet 4.5       93.1
5     GPT-4.1                 91.9
6     o3-mini                 89.8
7     Gemini 2.5 Pro          88.4
8     Qwen 3.5 122B-A10B      87.4
9     Qwen 3.5 35B-A3B        87.3
10    Qwen 3.5 Flash (02-23)  86.5
Coming Soon

Generation

Rank  Model                   Score
1     GPT-5.4                 97.0
2     GPT-5.4 Mini            94.4
3     Qwen 3.5 35B-A3B        93.5
4     Qwen 3.5 122B-A10B      92.5
5     GPT-4.1                 92.4
6     Qwen 3.5 27B            92.2
7     Qwen 3.5 Flash (02-23)  90.8
8     Claude Sonnet 4.5       90.4
9     GPT-5.4 Nano            90.1
10    Gemini 2.5 Pro          89.3