UM Stratego
Colin Schepers, Daan Veltman, Enno Ruijters, Leon Gerritsen,
Niek den Teuling, Yannick Thimister
Content

Introduction (Yannick)
The game of Stratego (Daan)
Evaluation Function (Leon)
Monte Carlo (Colin)
Genetic Algorithm (Enno)
Opponent modeling and strategy (Niek)
Conclusion (Yannick)
The game of Stratego

Board of 10x10
Setup field of 4x10
The game of Stratego

B  Bombs
1  Marshal
2  General
3  Colonels
4  Majors
5  Captains
6  Lieutenants
7  Sergeants
8  Miners
9  Scouts
S  Spy
F  Flag
The game of Stratego

Win: capture the enemy flag, or the opponent has no movable pieces left
Draw: neither player has movable pieces left, or the maximum number of moves is reached
Starting Positions

Flag placed
Bombs placed
Remaining pieces placed randomly
Starting Positions

Distance to Freedom
Being bombed in
Partial obstruction
Adjacency
Flag defence
Startup Pieces
Starting Positions
Distance to Freedom
Starting Positions
Startup Pieces
Evaluation Function

Sub-functions of the evaluation function:
Material value
Information value
Near enemy piece value
Near flag value
Progressive bonus value
First-move penalty
Evaluation Function

How it works:
All the sub-functions return a value
These values are weighted and added together
The higher the total value, the better that move is for the player
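The weighted sum described above can be sketched as follows; the sub-function and helper names are illustrative placeholders, not the project's actual implementation:

```python
# Sketch of the weighted-sum evaluation: each sub-function scores the
# board, the scores are weighted and summed, and the move leading to
# the highest total is preferred. Names here are hypothetical.
def evaluate(board, weights, sub_functions):
    """Return the weighted sum of all sub-function values for a board."""
    return sum(w * f(board) for w, f in zip(weights, sub_functions))

def best_move(board, moves, apply_move, weights, sub_functions):
    """Pick the move whose resulting board scores highest."""
    return max(moves,
               key=lambda m: evaluate(apply_move(board, m),
                                      weights, sub_functions))
```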
Evaluation Function

Material value:
Used for comparing the two players' board strengths
Each piece type has a value
The total value of the opponent's board is subtracted from the player's board value
A positive value means a strong player board; a negative value means a weak one
Evaluation Function

Information value:
Stimulates collecting information about the opponent's pieces while keeping information about one's own pieces hidden
Each piece type has a certain information value
The values on each side are summed and then subtracted from each other
A marshal being discovered is worse than a scout being discovered
Evaluation Function

Near enemy piece value:
Checks whether a movable piece can defeat a piece next to it
If the adjacent piece can be defeated, return a positive score
If not, return a negative one
If the piece is unknown, return 0
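The three cases above can be sketched as a small function; the rank comparison is simplified (spy and bomb special rules omitted), and the bonus magnitude is a placeholder:

```python
# Sketch of the near-enemy-piece term: positive if the adjacent enemy
# can be defeated, negative if not, and 0 when its rank is unknown.
# Higher rank wins here; spy/bomb special cases are omitted.
def near_enemy_value(own_rank, enemy_rank_or_none, bonus=1):
    if enemy_rank_or_none is None:     # unknown enemy piece
        return 0
    if own_rank > enemy_rank_or_none:  # adjacent enemy can be defeated
        return bonus
    return -bonus                      # adjacent enemy would defeat us
```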
Evaluation Function

Near flag value:
Stimulates the defence of the own flag and the attacking of the enemy's flag
Constructs an array of possible enemy flag locations
If an enemy piece is near the own flag, return a negative number
If an own piece is near a possible enemy flag location, return a positive number
Evaluation Function

Progressive bonus value:
Stimulates the advancement of pieces towards enemy lines
Returns a positive value if a piece moves forward
Negative if it moves backward
Evaluation Function

First-move penalty:
Keeps pieces from giving away information
Keeps the number of unmoved pieces high
Monte Carlo

A subset of all possible moves is played
No strategy or weights used
An evaluation value is received after every move
At the end, a comparison of evaluation values determines the best move
A depth limit is used so the tree doesn't grow too big and the algorithm terminates
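The depth-limited sampling described above can be sketched as follows; the game interface (legal_moves, apply_move, evaluate) is a hypothetical stand-in:

```python
import random

# Sketch of depth-limited Monte Carlo: for each candidate move, play
# random continuations down to a fixed depth and keep the move with the
# best average evaluation value.
def monte_carlo_move(state, legal_moves, apply_move, evaluate,
                     depth=3, samples=10, rng=random):
    def playout(s, d):
        for _ in range(d):          # depth limit keeps the tree small
            moves = legal_moves(s)
            if not moves:
                break
            s = apply_move(s, rng.choice(moves))
        return evaluate(s)

    def score(move):
        after = apply_move(state, move)
        return sum(playout(after, depth - 1) for _ in range(samples)) / samples

    return max(legal_moves(state), key=score)
```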
Monte Carlo

Advantages:
Simple implementation
Can be changed quickly
Easy observation of behavior
Good documentation
Good for partial-information situations
Monte Carlo

Disadvantages:
Generally not smart
Dependent on the evaluation function
Computationally slow
Tree grows very fast
Monte Carlo Experiments

MC against lower-depth MC

Player   Wins   Losses   Draws
MC        28      59      49
MC-LD     59      28      49
Monte Carlo Experiments

MC against no-depth MC

Player   Wins   Losses   Draws
MC        15       2      12
MC-ND      2      15      12
Monte Carlo Experiments

MC against deeper-depth but narrower MC

Player   Wins   Losses   Draws
MC         5       2      11
MC-DDN     2       5      11
Monte Carlo Experiments

MC against narrower MC

Player   Wins   Losses   Draws
MC        62      18      85
MC-N      18      62      85
Genetic Algorithm

Evolve the weights of the terms in the evaluation function
The AI uses a standard expectiminimax search tree
Evolution strategies (the evolution parameters are themselves evolved)
Genetic Algorithm

Genome:
G = (σ, α, w₁, …, wₙ)

Mutation:
σ_n = σ_{n−1} · e^{N(0, τ)}
α_n = α_{n−1} + α_{n−1} · N(0, σ)
w_{i,n} = w_{i,n−1} + w_{i,n−1} · N(0, σ)
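A sketch of the self-adaptive mutation, assuming the update rules as reconstructed above (log-normal update of the step size σ, Gaussian perturbation of α and each weight scaled by its previous value; τ here is an illustrative value):

```python
import math
import random

# Sketch of the self-adaptive mutation: sigma is updated log-normally,
# then alpha and every weight are perturbed with Gaussian noise N(0, sigma)
# scaled by their previous value. The genome is (sigma, alpha, weights).
def mutate(genome, tau=0.1, rng=random):
    sigma, alpha, weights = genome
    sigma = sigma * math.exp(rng.gauss(0, tau))
    alpha = alpha + alpha * rng.gauss(0, sigma)
    weights = [w + w * rng.gauss(0, sigma) for w in weights]
    return (sigma, alpha, weights)
```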
Genetic Algorithm

Crossover:
σ and α of the parents are averaged
Weights: averaged if 1/α < ratio < α, else randomly chosen from the parents
Genetic Algorithm

Fitness function:
Win bonus
Number of own pieces left
Number of turns spent
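The three fitness components above can be sketched as one function; the coefficients are illustrative placeholders:

```python
# Sketch of the fitness function: a bonus for winning, plus credit for
# surviving pieces, minus a cost per turn spent. Coefficients are
# hypothetical, not the project's actual values.
def fitness(won, own_pieces_left, turns_spent,
            win_bonus=1000, piece_value=10, turn_cost=1):
    return ((win_bonus if won else 0)
            + piece_value * own_pieces_left
            - turn_cost * turns_spent)
```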
Genetic Algorithm

Reference AI: Monte Carlo AI
Self-selecting reference genome:
Select the average genome from each generation
Pick the winner between this genome and the previous reference
Hill climbing

The GA takes too long to train
Hill climbing is faster
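Hill climbing as a faster alternative to the GA can be sketched as follows; the scoring function (e.g. win rate against a reference AI) is a hypothetical stand-in supplied by the caller:

```python
import random

# Sketch of hill climbing over evaluation weights: repeatedly perturb
# one weight and keep the change only if the score improves.
def hill_climb(weights, score, steps=100, step_size=0.1, rng=random):
    best = list(weights)
    best_score = score(best)
    for _ in range(steps):
        cand = list(best)
        i = rng.randrange(len(cand))
        cand[i] += rng.uniform(-step_size, step_size)
        s = score(cand)
        if s > best_score:          # keep only improvements
            best, best_score = cand, s
    return best
```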
Opponent modeling

Observing moves
Ruling out pieces
Stronger pieces are moved towards you
Weaker pieces are moved away
Opponent modeling

No knowledge about enemy pieces at the start
Updating the probabilities:
Update the probability of the moving piece
Update the probabilities of all other pieces
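One concrete part of the update above can be sketched: when an enemy piece moves, bombs and the flag are ruled out for it and its distribution is renormalized. The representation (one type-to-probability dict per unknown piece) is an assumption:

```python
# Sketch of ruling out unmovable types: a moving piece cannot be a bomb
# or the flag, so those entries are dropped and the remaining
# probabilities are renormalized to sum to 1.
def rule_out_unmovable(dist):
    """dist maps piece type -> probability for one unknown enemy piece."""
    new = {t: p for t, p in dist.items() if t not in ("bomb", "flag")}
    total = sum(new.values())
    return {t: p / total for t, p in new.items()}
```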
Monte Carlo Experiments

MC against MC with opponent modeling, using a database of human-versus-human games

Player   Wins   Losses   Draws
MC        39      44      58
MC-OM     44      39      58
Monte Carlo Experiments

MC against MC with opponent modeling, using a database of MC-versus-MC games

Player   Wins   Losses   Draws
MC
MC-OM
Strategy

Split the game up into phases:
Exploration phase: until 25% of enemy pieces are identified
Elimination phase: until 70% of enemy pieces are killed
End-game phase
Alter the evaluation function
Conclusion

Both AIs are very slow
The genetic AI takes too long to train
In the case of Stratego, tweaking a few weights may not be an optimal way to create an intelligent player
Recommended