The Multikernel: A new OS architecture for scalable multicore systems
Download
Report
Transcript The Multikernel: A new OS architecture for scalable multicore systems
EVOLUTIONARY ALGORITHMS
VS.
POKER GAMES
Yikan Chen ([email protected])
Weikeng Qin ([email protected])
1
OUTLINE
Evolutionary
Algorithm
Poker!
2
EVOLUTIONARY ALGORITHM
3
EVOLUTIONARY ALGORITHM
Evolution Process
Crossover
Mutation
Natural Selection
Evolutionary Algorithm
4
EVOLUTIONARY ALGORITHM
Encoding and Crossover
1
1
1
0
0
1
1
0
0
1
0
0
1
0
1
1
0
1
0
0
0
1
1
0
1
1
1
0
1
0
1
1
5
EVOLUTIONARY ALGORITHM
Mutation
1
1
1
0
0
1
1
0
1
1
0
0
0
1
1
1
6
EVOLUTIONARY ALGORITHM
Natural Selection
Run the roulette-wheel selection based on the
fitness value of candidates
7
EVOLUTIONARY ALGORITHM
Important Parameters
Crossover
rate
Mutation rate
Elite rate
Fitness function
Demo
http://userweb.elec.gla.ac.uk/y/yunli/ga_demo/
8
EVOLUTIONARY ALGORITHM & POKER
AKQ 2-player game
$1
blinds for each player
Player1 bet or fold
Player2 call or fold
9
EVOLUTIONARY ALGORITHM & POKER
Derive the optimal strategy using EA
Chromosomal representations
Fij:
fold threshold when Pi got Cardj
Card1
Card2
Card3
P1
2/3
0
0
P2
1
2/3
0
Fitness functions
10
EVOLUTIONARY ALGORITHM & POKER
Fitness functions
Fi:
fitness function
Wij: money won by candidate I against candidate j
11
12
13
EVOLUTIONARY ALGORITHM & POKER
Decreased fluctuation
Further decreased
fluctuation
400-500
Var(f11) ;
generations Var(f22)
Mean(f11);
Mean(f22)
Count only
wins
.065;
.067
.67;
.60
Penalize
failure
.037;
.035
.67;
.70
Penalize
Failure
heavier
.028;
.024
.67;
.74
14
EVOLUTIONARY ALGORITHM & POKER
Real Texas Hold’em
Encoding Strategy (Turn and River)
Hand
strength (player confidence)
Fraction of opponent raise (opponent confidence)
Total raise (profit)
15
EVOLUTIONARY ALGORITHM & POKER
Fitness Criterion
16
EVOLUTIONARY ALGORITHM & POKER
Performance
17
ARTIFICIAL NEURAL NETWORK: REVIEW
18
ARTIFICIAL NEURAL NETWORK: REVIEW
a1
a2
w1
w2
……
∑
f
output
wn
an
b
1
19
ARTIFICIAL NEURAL NETWORK: REVIEW
Input
output
Hidden Layer
20
E-ANN (EVOLUTIONARY ANN)
a
Simplest Encoding Method
b
c
d
d
c
b
a
21
NEAT E-ANN
http://www.cs.utexas.edu/users/nn/
Neuro Evolution of Augmenting Topologies
Encoding Strategy: Node-based
Neuron
gene table
Link gene table
Innovation number
Global
database of innovations
Each innovation has unique ID number
22
NEAT E-ANN
23
NEAT E-ANN
Mutation
Perturb
weights
Add a link gene
Add a neuron gene
Crossover
By
innovation number
24
NEAT E-ANN
Crossover
4
4
6
5
1
1
1->4
5
2
2
2->4
1
1->4
3
3->4
2
2->4
3
4
2->5
3
3->4
1
5
5->4
4
2->5
2
3
8
1->5
5
5->4
6
5->6
7
6->4
9
3>5
10
1->6
25
NEAT E-ANN
Crossover
4
6
5
1
1
1->4
2
2->4
3
3->4
2
4
2->5
5
5->4
3
6
5->6
7
6->4
8
1->5
9
3>5
10
1->6
26
E-ANN & POKER
Simplified Poker Model
1-10
Initial
credit: 10 chips
One chip ante at the beginning
Call, raise (1 chip each time), fold
Tournament
27
E-ANN & POKER
Two player game
28
E-ANN & POKER
29
E-ANN & POKER
Four different types of opponents
Tight Aggressive (TA)
Loose Aggressive (LP)
Tight Passive (TP)
Loose Passive (LP)
30
E-ANN & POKER
α: min win probability to call
β: min win probability to raise
31
E-ANN & POKER
A: player type
B: player action
32
E-ANN & POKER
33
E-ANN & POKER
Bluffing……
34
Thanks!
35