bt-opponent_modeling_in_bayesian_poker3

Download Report

Transcript bt-opponent_modeling_in_bayesian_poker3

Outline

Background




Aims





What is Texas Hold'em?
What are Bayesian Networks?
What is BPP?
Initial opponent model
Adaptive opponent model
Performance testing
Further Work
Conclusion
Opponent Modeling in Bayesian Poker
Brendon Taylor (BSE)
http://www.allposters.com/-sp/Poker-Pups-II-Posters_i1611677_.htm
Supervisors: Ann Nicholson
Kevin Korb
What is Texas Hold'em?
Poker Hands
From strongest to weakest
Poker Bayesian Network
What is BPP?





Bayesian Poker Program
1993: Initial version (Jitnah)
1999: First publication (Korb, Nicholson, Jitnah)
2000: Decision network (Carlton)
2003: Adapted to Texas Hold'em (Boulton)
Personality Types
Aggressive behaviour
 More likely to bet/raise
 Conservative behaviour
 More likely to fold/check/call

AAAI 2006 Results - Bankroll
Hyperborean
(U Alberta)
Hyperborean
(U Alberta)
Bluffbot
(Finland)
Monash
(Monash U)
Teddy
(USA)
0.0514
±0.0171
0.7227
±0.0161
0.4067
±0.0247
0.5271
±0.0197
-0.1895
±0.0289
Bluffbot
(Finland)
-0.0514
±0.0171
Monash
(Monash U)
-0.7227
±0.0161
-0.5271
±0.0197
Teddy
(USA)
-0.4067
±0.0247
0.1895
±0.0289
1.1678
±0.0427
-1.1678
± 0.0427
Initial opponent model
CONSERVATIVE
AGGRESSIVE
New Network Structure
New node
Final opponent model
Generating different opponents
using Betting Curves
Aggressive
Conservative
Adapted from Carlton (2006)
Results - Opponent Type
Further Work

BPP's game play




Improved bluffing strategy.
Adding sand bagging.
Avoiding predictable game play
Network structure



Adding a OppTight node to the network.
Adding a OppBluff node to the network.
Adding a BppBehaviour node to the network.
Conclusion



BPP is an ongoing research project and still
requires further work.
The improved opponent model has improved
BPP's ability to adapt to an opponent.
This project has been challenging and taken
me outside my comfort zone.
References





AAAI Computer Poker Competition (2006).
http://www.cs.ualberta.ca/~pokert/2006/index.html
Aces High Casino Parties and Rentals San Antonio Texas
(2007). http://www.aceshighcasinoparties.com
Carlton, J. (2000). Bayesian poker, Honours thesis, School
of Computer Science and Software Engineering, Monash
University.
Poker Pups II Prints by Jenny Newland at AllPosters.com
(2007). http://www.allposters.com/-sp/Poker-Pups-IIPosters_i1611677_.htm
Taylor, B. (2007). Opponent Modeling in Bayesian Poker,
Honours Thesis, School of Computer Science and Software
Engineering, Monash University.
Aggressive opponent model
Conservative opponent model
Lessons Learnt





Honours is more challenging than undergraduate units.
Artificial Intelligence and decision making.
Machine learning and structures.
How to effectively research a topic.
What to expect if I was to undertake further
post-graduate studies.