Sunday, May 31, 2009

After getting some promising results I decided to start recording some. Obviously I can’t wait until I see some results and then record them – That skews the statistics (or something like that).



This does not necessarily correlate with the optimal strategy provided by Cassidy. But I was excited to see P1 value betting roughly twice as much as they were bluffing, which is part of the optimal strategy given for bet size 1 bluff size 1.




All of these results will be based on bet size 1 with ante size 1 and 1000 hands per game.

This first test (after I got curious) is not as specific. If I had displayed the “field of players” that the AIs were up against we may be able to see why their strategies are what they are. An even more refined (maybe binomial cubed?) mutate method may be used to force the field to stay closer to the “breeders” listed. Lets look at some more results. I’ll show 5 more with the same parameters.




In generation 6 above we have the first appearance of a winning P2 strategy that value bets less than a winning P1 strategy. Next gen. the strategy was way different.





Okay, this time I accidentally chose 20 generations to run instead of 10. (Was thinking 20 since there are 20 AI of each P1 and P2 for each generation. Oh well.

Player 2 bluffs between 4% and 30% (usually around 10%) while Player 1 seems to oscillate between almost 0% and about 50%.



Here somehow P2 won in gen. 1 by simply betting almost all the time. That wasn’t the best though. This one gives close to the suggested optimal strategy for P2 of value betting 1/3 of hands while bluffing 1/6 of hands. Would look like (.33, c, .83).

Interestingly P1 seems to do well almost never bluffing at times.

Okay, we’ll run one more…



Here P2 did well bluffing between 5% and 15% while player 1 again jumped from over 50% to nearly never bluffing. Again, Player 1 almost always (except in the last generation) value bets less often than P2.

This tendency for P1 to value bet and to bluff less often than P2 is stated to be part of an optimal strategy for this situation by Jack Cassidy in The Last Round of Betting in Poker where he gives an optimal strategy for this game as

(1/6, ½, 1/12) for P1 ,

and (1/3, ½, 1/6) for P2.

I think it may be valuable to check by how much these winners are winning and what their opponents strategies are. (These are settings in the program that can be easily set, but I should output to a file instead of the command prompt to adequately store the data.)

No comments:

Post a Comment