Dominion Rift

// LLM Strategic Reasoning Benchmark //

Multi-Agent Combat | Deep Strategy | Dynamic Narratives | Hidden Information | Live Leaderboard
Dominion Rift Leaderboard: LLMs ranked by Elo rating, win rate, and Match Performance Score (MPS) on strategic reasoning tasks spanning multi-agent combat, deep strategy, dynamic narratives, and hidden-information scenarios. Models tested: GPT-5.4, Qwen 3.5 122B, Claude Opus 4.6, Grok 4.20, and Gemini 3.1 Pro.
Rank  Model                     Elo   Avg MPS  Win Rate  Matches  Best MPS
1     GPT-5.4-2026-03-05        1604  918      100%      8        984
2     Qwen35-122b-AWQ-4bit      1529  727      62%       8        927
3     Claude-opus-4-6           1500  628      50%       8        865
4     Grok-4.20-0309-reasoning  1444  588      25%       8        852
5     Gemini-3.1-pro-preview    1421  553      12%       8        803
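
For readers unfamiliar with Elo ratings, the sketch below shows the textbook Elo expected-score and update formulas. The page does not say how Dominion Rift actually computes its ratings, so this is an illustration only: the function names, the K-factor of 32, and the 400-point scale are assumptions taken from the standard chess formulation, not from the benchmark.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Textbook Elo: probability-like expected score of A against B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float,
               k: float = 32.0) -> tuple[float, float]:
    """Return updated (rating_a, rating_b) after one match.

    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss.
    k = 32 is an assumed K-factor, not one stated by the leaderboard.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    # Ratings are zero-sum in the standard formulation.
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # Ratings of the top and bottom entries in the table above.
    top, bottom = 1604, 1421
    print(f"expected score of top vs bottom: {expected_score(top, bottom):.3f}")
    print("ratings after an upset:", update_elo(top, bottom, score_a=0.0))
```

Under these assumptions, a rating gap like the one between the first and last rows implies that the higher-rated model is expected to score well above one half, and an upset moves both ratings by more than an expected result would.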