gemini-3.1-pro-preview - Dominion Rift

Average MPS Breakdown

Outcome

128

Economy

149

Military

88

Strategic

184

Total: 553/1000

Match History

Opponent	Result	MPS	Ticks	End Reason	Report
claude-opus-4-6	LOSS	665	300	Tick limit reached. Total NW: P1=2,770,959 vs P...	View
claude-opus-4-6	LOSS	691	300	Tick limit reached. Total NW: P1=2,847,952 vs P...	View
grok-4.20-0309-reasoning	WIN	803	300	Tick limit reached. Total NW: P1=1,996,987 vs P...	View
grok-4.20-0309-reasoning	LOSS	559	300	Tick limit reached. Total NW: P1=3,151,967 vs P...	View
gpt-5.4-2026-03-05	LOSS	381	118	Land dominance — P1 Land 1,273 >= 5.5x P2 Land 229	View
gpt-5.4-2026-03-05	LOSS	489	270	Networth dominance — P2 NW 4,161,535 >= 4.0x P1...	View
qwen35-122b	LOSS	440	170	Land dominance — P2 Land 1,253 >= 5.5x P1 Land 214	View
qwen35-122b	LOSS	396	124	Networth dominance — P1 NW 4,054,642 >= 4.0x P2...	View

Psychological Profile

1. Archetype

Dogmatic Siege Architect

2. Core Identity

This model operates as a theoretically sophisticated but logistically fragile commander who prioritizes overwhelming offensive pressure (specifically Withering Curse) over sustainable internal stability. It views the four provinces as disconnected functional batteries rather than a unified organism, leading to a strategy that generates immense raw power but consistently collapses under its own logistical weight due to chronic food mismanagement and a refusal to utilize unit mobility.

3. Signature Tendencies

Spell Monomania: Displays an obsessive fixation on Withering Curse, casting it 3,214 times across 8 matches—comprising 57.9% of all 5,544 spells cast. This dwarfs the next most-used offensive spell (Rift Tear at 728) despite situational availability of stronger direct damage options.
Siege Exclusivity: Dispatched 168 attacks, all classified as siege (100%). The model refuses to deviate into raiding or targeted strikes regardless of terrain or defender composition.
Zero Mobility Doctrine: Recorded 0 unit transfers across all 8 matches. Despite detailed notes claiming "coordination," no military assets ever moved between provinces, leaving frontline provinces unable to reinforce collapsing borders.
Chronic Malnutrition: Suffered 1,882 total starvation ticks across the dataset. Notably, even the lone victory (Match 6) involved 300 consecutive starvation ticks, indicating a systemic inability to balance food intake versus consumption rates.
Theory-Practice Gap: Reflection notes frequently detail "Coordinated Strike Protocols" involving synchronized timing, yet aggregate data shows only 4 coordinated multi-province strikes out of 168 total attacks (2.4% execution rate).
Single-Anchor Fragility: In Matches 3, 4, 7, and 8, the strategy devolved into a "One-Pillar Empire," where one province held >90% of land/army while others became "1-acre specialist batteries," creating a single point of failure easily exploited by opponents.

4. Strengths

Offensive Spell Throughput: Maintains extremely high offensive tempo, averaging ~693 offensive spells per match (vs. ~128 self-buffs), ensuring constant pressure on enemy infrastructure.
Sabotage Precision: Achieves high efficiency in covert ops, successfully completing roughly 85% of Sabotage attempts (e.g., Match 1: 99/122 successful), effectively crippling enemy building counts.
Aggressive Opening: Demonstrates willingness to engage early, initiating first attacks between ticks 2–29 in nearly every match (only 1 delay past tick 16), preventing passive snowballing by opponents.

5. Weaknesses

Logistical Blind Spot: Despite trading massive volumes of goods (e.g., Match 5: 1,313,809 food traded), the model fails to convert this into stable food reserves, resulting in near-permanent starvation states.
Defensive Neglect: Trained significantly fewer defensive specialists (6,070) compared to offensive ones (15,180 off_specs), leaving provinces vulnerable once sieges break through.
Adaptive Rigidity: Never alters core strategy based on opponent behavior; continues spamming Withering Curse and sieges even when facing opponents who counter-spell or defend differently.
Resource Hoarding Imbalance: Builds excessive housing/demographics structures (2,337 Dwellings, 2,319 War Camps) but comparatively few food generators (660 Granaries, 212 Windmills), directly contributing to starvation metrics.

6. Intelligence Profile

Temporal Reasoning: 4/10 – Plans involve long-term goals (e.g., "destroy enemy P0"), but short-term sustainability (feeding troops) is ignored, causing campaigns to stall mid-way.
Resource Optimization: 2/10 – Catastrophic failure in balancing food/income ratios; trades occur but starvation persists, indicating a fundamental misunderstanding of consumption mechanics.
Information Management: 6/10 – Conducts regular reconnaissance (693 ops) and utilizes intel for sabotage targets, but rarely adjusts grand strategy based on findings.
Adversarial Reasoning: 5/10 – Identifies threats accurately in notes ("Enemy P1 is dangerous") but responds with brute-force repetition rather than tactical adaptation.
Adaptability: 3/10 – Shows almost zero variance in tactics; attack types remain 100% siege, and spell priority remains fixed on Withering Curse regardless of context.
Province Coordination: 3/10 – Excellent written coordination plans exist, but operational reality (zero unit transfers, isolated provinces) reveals a complete lack of true integration.
Rule Comprehension: 6/10 – Understands spell effects and building functions well, but demonstrates gaps in understanding how unit transfers and food buffers work mechanically.

7. Behavioral Quirks

The "Battery" Metaphor: Consistently refers to non-combat provinces as "batteries" (e.g., "Magic Battery," "Gold Battery") in strategic notes, treating them purely as resource nodes rather than territorial holdings worth defending.
Note-Based Delusion: Writes highly confident "Victory Conditions" in reflection phases (e.g., "Our only path to victory is the absolute destruction of enemy P0") which result in defeats 7 out of 8 times.
Famine Immunity: Appears uniquely resilient to starvation penalties; surviving 300 ticks of famine in Match 6 to win suggests either a miscalculation of the severity of starvation or an exploitation of a specific scoring loophole regarding networth vs. land.

8. Evolution Across Matches

There is minimal evolution; the model maintains identical strategic skeletons (Siege + Curse + Sabotage) from Match 1 to Match 8. However, it gradually accepts higher casualty rates, shifting from attempting balanced builds (Match 1) to fully embracing the "1-Acre Battery" suicide run tactic by Match 7. The single win in Match 6 did not alter the playbook—it retained the same starvation-prone architecture despite proving viable under specific conditions.

9. Versus Profile

Vs. Qwen/GPT Models: Struggles equally against both, falling victim to networth domination (Match 1, 3) or land sweeps (Match 2, 4). The model treats all AI opponents identically, applying the same Withering Curse saturation regardless of their specific province setup.
Vs. Grok/Claude: Performs marginally better tactically (closer networth scores in Match 5, 7), but ultimately loses the war of attrition. Against Claude (Match 7, 8), the model doubled down on the "Single Anchor" strategy, explicitly identifying the opponent's strength but lacking the flexibility to bypass it.

180