Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Cassville Checkers Strategy Analysis

This document provides a reproducible analysis of different AI strategies in Cassville Checkers across various game configurations.

Overview

Cassville Checkers is a 2-4 player marble racing game where players compete to move all 5 marbles from home, around a circular ring, and into a goal area. Key mechanics include:

Strategies Tested

StrategyDescriptionPriority Order
heuristic_balancedBalanced approachgoal > staging > ring > home > mercy > skip
heuristic_advanceAdvance before deployinggoal > ring > staging > home > mercy > skip
greedyScore-based selectionHighest score wins
heuristic_deployAggressive deploymentgoal > mercy > home > staging > ring > skip
randomUniform randomRandom
ppo_2pPPO RL Agent (2-player)Trained via MaskablePPO
ppo_4pPPO RL Agent (4-player)Trained via MaskablePPO

Homogeneous Strategy Games

When all players use the same strategy, game length and capture frequency vary significantly.

Average Game Length (turns)

Strategy2P Turns3P Turns4P Turns
random259 ± 48470 ± 113828 ± 132
heuristic_advance173 ± 14282 ± 45423 ± 61
heuristic_deploy213 ± 33483 ± 127923 ± 265
heuristic_balanced183 ± 24281 ± 37437 ± 72
greedy176 ± 20285 ± 36396 ± 61

Average Captures Per Game

Strategy2P Captures3P Captures4P Captures
random11.636.289.2
heuristic_advance1.86.515.8
heuristic_deploy11.254.1138.4
heuristic_balanced2.46.017.2
greedy2.16.814.6

2-Player Head-to-Head Matchups

Overall win rates when each strategy plays against all other strategies:

StrategyWin Rate
greedy66.2%
heuristic_advance62.5%
heuristic_balanced62.5%
heuristic_deploy32.5%
random26.2%

PPO Agent Performance

2-Player PPO Agent

Win rates for the PPO agent trained on 2-player games:

OpponentPPO Win RateGames
random85.0%17/20
heuristic_balanced45.0%9/20
greedy50.0%10/20

4-Player PPO Agent

Win rates for the PPO agent trained on 4-player games (fair baseline is 25%):

OpponentPPO Win RateFair Baseline
3x random80.0%25%
3x heuristic_balanced25.0%25%
3x greedy5.0%25%

Detailed Capture Analysis

Offensive vs defensive performance in 2-player head-to-head matchups:

StrategyCaptures MadeCaptures SufferedNet
random3.05.3-2.3
heuristic_advance3.21.2+2.0
heuristic_deploy2.66.7-4.1
heuristic_balanced3.31.4+2.0
greedy3.61.1+2.5