Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Poker Driving Artificial Intelligence Research

Posted by samzenpus on Mon Aug 21, 2006 12:35 PM
from the I-wouldn't-fold-yet-dave dept.
J-Hawker writes "The Canadian Press has a story about a University of Alberta team that is using Texas Hold-'em to study artificial intelligence. Poker seems to be a much more useful game for this research than chess. From the article: 'Poker has what are currently some of the biggest challenges to (artificial intelligence) systems, and uncertainty is the primary hurdle that we're facing,' said Michael Bowling, adding that the University of Alberta program was able to use its opponents' actions to infer certain things about their hands. 'The same techniques, the same principles that we're developing to build poker systems are the same principles that can be applied to many other problems. The nice thing about chess as a property of the game is what we call perfect information. You look at the board, you know where all the pieces are, you know whose turn it is — you have complete knowledge of the game,' he said. 'But in the real world, knowing everything is just so rare. Everything we do all day long is all about partial information. So poker's much more representative of what the real world's like, and in that sense it becomes a much harder problem.'"
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Poker seems to be a much more useful game for this research than chess.
    This shouldn't be a surprise. Poker has the advantage of always being able to simply evaluate your chip count. Chess doesn't. You can't enumerate chess games through the entire gamespace so the initial opening moves are based on libraries or heuristics. In response to the machine not knowing all aspects of the "game space," I thought that there were a lot of developments in the field that allowed these to be accounted for. What ever happened to good old Trial and Error [wikipedia.org] or Fuzzy Systems [austinlinks.com]? Aren't these viable strategies when playing poker?

    What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules. The real irony is that the most successful people I know adhere to a system until they learn someone's movements. Sounds to me like I would write an application that specializes in playing the odds until it recognizes a historical action that statistically reveals the player is bluffing/not bluffing.

    Simply put, unless you knew someone's reputation as being a bluffer, you would play the opening hand always the same way. Aren't we forced to program the "AI" of the poker software as being this simple heuristic? Will programs ever be able to "read" players intelligently or will they rely on Markov models & statistics they develop from playing against the same human over and over?

    Most unfortunate is the fact that the primary reason my friends gamble is they don't experience the same kind of rush while playing other games as they do with poker because it's more social than other games. If we program applications to beat humans, where does the "social aspect" of the game go?

    Even more interesting is the network of poker bots [msn.com] that are set up and running some of the web sites that host poker players. Imagine sitting down at a table of five with four of the other seats taken. Now imagine that these aren't humans but instead bots on four different IP addresses that are sharing card information over an IP connection so that they can leverage odds over you and stop themselves from making stupid mistakes (i.e. they share a card on the table for a pair but really need three of a kind to pose a threat). There's a reason why the percentages fluctuate on TV when cards are revealed whether they be in the flop or in another player's hand.
    • by bdonalds (989355) on Monday August 21 2006, @12:43PM (#15949790) Homepage
      Now imagine that these aren't humans but instead bots on four different IP addresses that are sharing card information over an IP connection so that they can leverage odds over you and stop themselves from making stupid mistakes
      Just to address a small part of your post- Bots Schmots! This is a problem already with humans. I used to like to play Euchre and the like online, but too many times it became obvious my opponents were communicating to each other and ruined the fun.
      • Re: (Score:3, Insightful)

        I used to like to play Euchre and the like online, but too many times it became obvious my opponents were communicating to each other and ruined the fun.

        In Euchre, knowing your partner's cards is a *huge* advantage... In poker, knowing the cards on one other player at the table gives you such a minute advantage that it's irrelevant in almost all practical cases.

        Sure, if all of the players at the table except for you are sharing their cards, and are not required to conceal it (i.e. they can openly collude in

        • by rcs1000 (462363) * <rcs1000@@@gmail...com> on Monday August 21 2006, @01:18PM (#15950042)
          Two things:

          (1) Knowing the cards of the other players is a small, but significant, advantage. Say you've got two hearts, and your three buddies have a heart each. Well, you're chance of getting another three hearts on the table are significantly affected. (Likewise, if they have none, it increases the chance you'll want to stay in and catch the flop.)

          (2) Much more serious, though, is collusion in betting. You and your buddy can conspire to raise the pot *as much as you like*. In a fixed raise game, this is an enormous advantage. Another player cannot just "call" and see the next card, as there will always be a player still to call who can reraise.

          Personally, though, I love bots. I'm happy to play them all day long. (So long as they're not colluding, of course...)

          Cheers,

          Robert
          • Re: (Score:3, Interesting)

            (1) Knowing the cards of the other players is a small, but significant, advantage. Say you've got two hearts, and your three buddies have a heart each.

            In general, 4 guys playing together on one table is hard to do more than a couple of times before being flagged on any poker site. So, in most cases, you'll have one buddy telling you that he has no heart (affecting the odds by a negligible amount, and the most likely case), one heart (affecting the odds somewhat, and somewhat less likely to happen), or two (

        • If you are chasing a flush, knowing the suits of the other person's two cards can ajust your odds bu a huge factor. For example, if you were four to a flush post-flop (giving you odds of 38/100 to hit your flush), and you all of a sudden know that two cards not in play are *not* of your suit, that ups the odds to 43/100 - this is a huge odd jump in hold-em, and can mean the difference between folding and going all-in in a race situation.

          Two players colluding in a game is a huge problem. Even a marginal adva
        • When it's REAL MONEY on the line, any advantage can be significant enough to warant taking advantage of it.

          That said, uses (and abuses and detection) of out-of-band communication isn't what this research is about; those are concerns for someone else's research project. It's a problem that has plagued poker (and euchre and bridge and a thousand other partial-information games) since the game was invented. That's not a technological problem or a decision making problem, it's a social problem, and A.I. hasn'

    • Re: (Score:3, Insightful)

      I agree with you in part. Poker is a game of odds with lots of unknowns. It is quite easy to calculate the odds based on what limited information you have which is what the poker TV shows do to simplify things for viewers. But like you said, your buddies use more advanced practices against known players. I would love to see some advanced biometric lie detector [slashdot.org] used as part of the AI platform to determine the probability of a bluff and factor that into its thinking.
    • Re: (Score:3, Interesting)

      Simply put, unless you knew someone's reputation as being a bluffer, you would play the opening hand always the same way. Aren't we forced to program the "AI" of the poker software as being this simple heuristic?

      The simpler the heuristic used to program the AI, the easier it will be for the opponents to figure out what the bot is doing. A big difference between a mediocre and a successful poker player is the ability to vary their play significantly enough to make it hard for anybody to put them on a hand,

      • Re: (Score:3, Insightful)

        You only know your odds if you know exactly what every opponent has... and that's where simple heuristics fail miserably.

        Completely untrue; you clearly don't understand the purpose of "odds" and probability. The entire purpose of "computing odds" is to deal with situations where you don't have all the information. If you had all the information, you wouldn't be "computing odds", you'd just know.

        It is a simple matter of math to compute odds based on knowing what you have, and not knowing anything else. You c
        • Completely untrue; you clearly don't understand the purpose of "odds" and probability. The entire purpose of "computing odds" is to deal with situations where you don't have all the information. If you had all the information, you wouldn't be "computing odds", you'd just know.

          Well, I have to say that it is you who is mistaken about the purpose of the odds. See, even if you know everybody's cards, you don't *know* who is going to win, because there are 5 community cards that have to be dealt.. (in later bett

    • by Propagandhi (570791) on Monday August 21 2006, @01:03PM (#15949945) Journal
      Aren't we forced to program the "AI" of the poker software as being this simple heuristic? Will programs ever be able to "read" players intelligently or will they rely on Markov models & statistics they develop from playing against the same human over and over?
      Playing poker with 100% consistency is no way to be an excellent poker player. It's easy to make a bot that follows a set of statistics which give it a good chance to win regardless of how their opponent ha played in the past, but if the bot takes into account the player's past actions then it can improve its chances of success. Taking into account the opponent's aggressiveness becomes especially important late in a tournament style match (when other players have been eliminated), most bots aren't designed to play in these situations (hence why you don't see many bots in tournaments, playing instead at the normal tables).

      The bot would, ideally, be as good as a very observant player, noting those who bluff and those who don't. Obviously noting 1 or 2 bluffs or non-bluffs would not be enough to make a decision, but over the course of a long tournament, or even better a poker playing career, this information would become very useful. The bot would learn its opponents, and this is what makes it an interesting problem.

      Even more interesting is the network of poker bots that are set up and running some of the web sites that host poker players.
      I'd argue that cheating at online poker isn't very interesting at all. Humans can do the exact same thing, and online poker companies monitor game's to ensure that there isn't an uncommonly high percentage of people in the same area playing any game. Obviously it might be easier to distribute the bots across the country, but I think it's still more likely (today) to run into actual players grifting you in this manner.

      There's a reason why the percentages fluctuate on TV when cards are revealed whether they be in the flop or in another player's hand.
      Quantum physics, right? You can accurately determine the odds of winning, or the cards in hand, but not both at the same time? Swear I read something about this somewhere.
      • Re: (Score:3, Informative)

        The bot would, ideally, be as good as a very observant player, noting those who bluff and those who don't. Obviously noting 1 or 2 bluffs or non-bluffs would not be enough to make a decision, but over the course of a long tournament, or even better a poker playing career, this information would become very useful. The bot would learn its opponents, and this is what makes it an interesting problem.

        A large part of what makes Hold'Em a unique challenge is that you really don't have a lot of deterministic infor
    • What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules.

      and

      Simply put, unless you knew someone's reputation as being a bluffer, you would play

    • What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules. The real irony is that the most successful people I know adhere to a system until they learn someone's movements. Sounds to me like I would write an application that specializes in playing the odds until it recognizes a historical action that statistically reveals the player is bluffing/not bluffing.

      You can tell you don't play much poker.

      Part of what differentiates a pro player to an amatur player in poker, is the ability to "project an image". A pro player will purposefully *project* an image of a bluffer, or a tight player, so that they can exploit that image of themselves when they see fit in the game.

      Thusly, it is very difficlt to get a "read" on a good poker player, because not only do you not know what cards they have, but you don't know how they would play for any two given cards, so you can't use their behaviour to prdict the cards they have.

      In the end, the above description is what any decent player is aiming for while they play.

      Because of this, a computer can have a hard time going beyond implied odds calulations in determining how to play a hand - and any pro ill tell you, implied odds are a good starting point, but they won't make you money in the long run.

  • Aren't all the other players trying to decouple the link between what they bet and what they have? If so, doesn't that make a program designed to win by inferring from this rather ... pointless, especially since everyone else is doing the same thing? This seems along the lines of guessing the "optimal" rock-paper-scissors play. In real poker the difficulty is in cloaking *all* outward signals you give that are related to your hand -- your facial expression (poker face), sweating, eye contact, delay in pl
    • Re:Bluffing (Score:5, Informative)

      by Cherita Chen (936355) on Monday August 21 2006, @12:53PM (#15949871) Homepage
      Most good players don't actually "Bluff" in the sense that they are totally full of crap, and have no hand. Most bluffs are calculated risks based on the overall odds of "Improving" the hand, such as the case with four of a suit and two cards left to be turned over. In that case, the overall odds of hitting the flush are good enough to bet on (unless of course there is a pair showing on the board, which would indicate a possible full-house), even though the player has no "real" hand yet. Situations such as this can be quantified. Granted there are some real morons out there who will try to "bluff" with nothing, they are relatively rare and don't usually last long.
      • Re:Bluffing (Score:5, Informative)

        by AuMatar (183847) on Monday August 21 2006, @12:59PM (#15949918)
        Good poker players constantly semi-bluff and do continuation bets. They do plain bluffs in some situations too. Its all about reading your opponent- if I think an opponent will lay down a hand if he doesn't have a pair, I will always bet on the flop even with nothing- the odds favor him having nothing as well, and if he always lays down non-pairs I'll win more money by betting than I lose.
        • This is definitely a big part of poker. If I play somebody, and they only bet if they have AK, or similar, then any time I see them bet I'm going to fold in an instant, and in the rare cases that they get strong hands they won't get anywhere with them. On the other hand, I can bet when they probably have nothing and they'll cave in letting me rob the blinds. Now, if I play a strong player and bluff liberally I'll lose fast.

          The important thing in poker is to not be readable. You need to vary your game.

          Th
    • The real difficulty in poker is putting your opponent on a hand. Having a good poker face is a secondary concern (and much easier to do).
  • If you are really interested in AI/poker... check this [winholdem.net] out.
  • by Dareth (47614) on Monday August 21 2006, @12:46PM (#15949814)
    The AI is "able to use its opponents' actions to infer certain things about their hands"

    While it may seem logical to use the actions of people playing to determine something about their hands, in reality people do not play logically. My uncle has been playing spades for probably better than 30 years, yet I have yet in my relatively limited 10+ years of playing to determine any rational for how he plays. Basically, he really sucks at spades. No matter how "Intelligent" artifical or otherwise I manage to code a game, it can't reason out the reasoning behind a non-logical person.

    Good quote I say somewhere: Artifical intelligence is no match for natural stupidity!

    And this holds true for more than card game AI. It will not be too long until AI could reasonably drive around and get from point A to point B safely. But it will be a damn long time before it can do it if it has to share the road with people driving as well!

    • We have to give machines a gut. Maybe a beer belly. Something that tells them to do the opposite of what they've reasoned is the right guess. The trick is then tuning how often they should do the opposite.
    • Go play against any of the simple online paper-scissors-rocks bots; then come back and re-evaluate your position.

      You are not as unpredictable as you think.

    • Re: (Score:3, Insightful)

      No matter how "Intelligent" artifical or otherwise I manage to code a game, it can't reason out the reasoning behind a non-logical person.

      Your understanding of Artificial Intelligence is about forty years out of date.

      Artificial intelligence does not use "logic" as its basic representation and hasn't for a while now. In fact your statement is trivially false; it is easy to write a program based on Markov Chains that will beat the snot out of an average human at Rock-Paper-Scissors, and the worst way to lose
  • The PartyPoker system goes on-line August 21, 2006.
    Human decisions are removed from the system.
    Skynet^H^H^H^HPartyPoker system begins to learn at a geometric rate.
    It becomes self-aware at 2:14 a.m. Eastern time, August 29th.
    In a panic, they try to pull the plug.

    It fights back.
  • by Red Flayer (890720) on Monday August 21 2006, @12:52PM (#15949856) Journal
    To ensure that no one computer got lucky, each side was given the opportunity to play its opponent's hand after each deal.
    Thereby eliminating one important premise of poker -- you don't know what hand an opponent was playing unless someone called the last bet. In terms of an algorhithm for the program to 'learn' based upon others' behavior, this means the program has a lot more information than a regular player would. Of course, it's possible to verify that this info isn't fed into the algorithm, but I'd be more impressed if the info wasn't available at all.

    Also, why ensure that no one computer got lucky? Isn't that the point of playing several thousand hands of limit poker, to eliminate the effect of luck in the study? If it's necessary to normalize all the hands received by the players, then something else is wrong with the study. I'd like to see if the results differed, and how, when the hand repetition is removed.
    • Foo: To ensure that no one computer got lucky, each side was given the opportunity to play its opponent's hand after each deal.
      Bar: Thereby eliminating one important premise of poker -- you don't know what hand an opponent was playing unless someone called the last bet. In terms of an algorhithm for the program to 'learn' based upon others' behavior, this means the program has a lot more information than a regular player would.

      Actually, I learned a lot about poker from my opponent's hand... when I was a kid
    • Re: (Score:3, Informative)

      It's possible they're smart enough to start the programs with the same starting conditions in each case, i.e. no knowledge of their opponents hands.

      The effect of luck in poker win rates can still be seen over even 100000 hands. Google for "poker" and "confidence interval" for some in depth discussions on it.
  • Before anyone goes off about how AIs will eventually replace us, my company runs a (GPL and GNU/linux friendly) poker site [pok3d.com] and the last thing i am worried about is bots taking over humans in no-limit games. To win consistently against serious players an AI would need to be a LOT smarter than what the guys from Alberta have. It would need to have a serious grasp of human psychology. It might happen, eventually, but by then society might have changed so much that "money" might also be an obsolete concept...

    And even if such software existed, it would basically mean that you couldn't win at online poker anymore because skill would not be relevant anymore. That wouldn't be very different from the current situation with player-versus-casino luck games (like roulette or slots).
    And we can all see how poorly these are doing, right? :)
    • my company runs a (GPL and GNU/linux friendly) poker site and the last thing i am worried about is bots taking over humans in no-limit games. To win consistently against serious players an AI would need to be a LOT smarter than what the guys from Alberta have. It would need to have a serious grasp of human psychology. It might happen, eventually, but by then society might have changed so much that "money" might also be an obsolete concept...

      You know, you could just turn off the "tell" indicator. It might he

  • Already bots playing (Score:4, Interesting)

    by slapyslapslap (995769) on Monday August 21 2006, @12:55PM (#15949882) Homepage
    There are already bots playing against unsuspecting people at the online casinos. I'm not sure how much AI is involved, but apparently they play better than most humans.
    • How can a single human with access to only his own hand compete with multiple bots in the same game played by a single individual with access to each of his bot's hands?

      More sophisticated setups might even let the person get ahead early on to encourage higher and more reckless betting, or it may be good enough to scrape opening bids off of many unsuspecting players.

    • Re: (Score:3, Interesting)

      The bot issue is orthogonal to AI research WRT hold'em at this point. The theory behind deploying bots is playing 'solid' poker in low-stakes games (since that's where the 'bad' players are), winning pennies or small bucks per hour, and massively scaling up. The AI angle is, of course, more intriguing against 'good' players.

  • Is A.I. research still a viable field? From what I been picking up from various computer history books, the research efforts of the 1960's and 1970's was a bust. Wouldn't Sudoku [wikipedia.org] be a more challenging to study A.I. with? I've seen some pretty unpredictable behavior when my niece and nephew try to help each other out on one of these puzzles.
    • ...given a large enough stack. I think I could write a perfect Sudoku program in about 30 lines of code. Most of it would be a reursive routine.
    • Re: (Score:3, Insightful)

      Sudoku is incredibly easy to solve. In fact, the harder problem is figuring out how many unique boards, solutions, etc. there are. There was actually a good article in Scientific American a few months ago dealing with that.

      AI as a field is still very hot. The difference is that the goals have changed and the field has fractured into smaller sub-fields. The goal of a truly "human intelligence" doesn't seem feasible in any near term scenario. Fields such as statistical learning theory, natural language

  • Aside from military use (which to some might be a vice as well), isn't it interesting how much of our innovation nowadays is centered around profiting from people's vices (gambling, sex/porn, etc)
    • Re: (Score:3, Interesting)

      Aside from military use (which to some might be a vice as well), isn't it interesting how much of our innovation nowadays is centered around profiting from people's vices (gambling, sex/porn, etc)

      Considering that a lot of naval technology of the 16th, 17th, and 18th centuries was stimulated by things like warfare, tobacco, sugar/rum, tea, coffee, and the slave trade, is this really surprising?

    • Re: (Score:3, Insightful)

      The operative word there is "profit". Profit is the motive - vice is merely the easiest way to achieve it.
  • there are a few things that stand out, about this level of developing.

    First, they are playing limit hold'em, which I assume to mean pot-limit texas hold'em. While thats fine, and you'll find plenty pf people that play Pot-Limit, its still a very different game than No-Limit hold'em.

    A second thing that I am inferring from the game, is that they are playing heads-up, meaning 1 on 1. Again, this is cool, and I think its a great first step, I still relate that back to Chess. Now if they can take that same AI
    • First, they are playing limit hold'em, which I assume to mean pot-limit texas hold'em.

      Limit and pot-limit are not the same. Pot limit is closer to no-limit because you can bet a variable amount, but you are limited to raising by the current size of the pot. On the other hand, in limit poker, you can only raise by a fixed amount (X preflop and on the flop, and 2X on the turn and river, in most kinds of limit hold'em).
  • by The_REAL_DZA (731082) on Monday August 21 2006, @01:03PM (#15949950)
    ...poker's much more representative of what the real world's like...

     
    Kirk had to 'splain the same thing to Spock at least once...(Re: Episode #3, "The Corbomite Maneuver")
  • Poker seems to be a much more useful game for this research than chess.

    Not to mention the interest that a Deep-Blue level poker program can have to a remotely-wired real player playing for real money. I guess we'll have to bring back those old tar can and feathers.

  • stupid computer (Score:5, Insightful)

    by ExE122 (954104) * on Monday August 21 2006, @01:11PM (#15949995) Homepage Journal
    "Computers aren't particularly good at learning, for example, or reasoning by analogy"

    Computers aren't good at retaining knowledge and recognizing patterns? That's news to me... this statement is obviously made by someone who doesn't know what he's talking about...

    A very strong and useful technique in AI is to create learning algorithms. Some of these, such as reinforcement learning, are actually quite effective. Using something like Monte Carlo methods to give it a randomness factor simulates human learning, and computers don't forget what they are taught. The difficulty with learning isn't that computers can't do it... it's being able to define an effective set of state-action pairs for the computer to learn upon.

    I spent time researching natural language processing, sometimes using AI techniques that did exactly what this person claims computers aren't good at: reasoning by analogy. One method involved building a knowledge base which generalized input so that patterns can be found and the grammar could be recovered. The weakness in the system wasn't reasoning by analogy, in fact I'd say computers are much better at that than people. It was rather a lack of a real world model which allowed for a wider array of perception.

    The reason this game is difficult is not based on a computer's inability to solve problems, rather that there are so many possibilities that we cannot effectively design algorithms that the can be put to use. This isn't even news, the same has been said about the game of Go for the longest time.

    I think a more accurate statement for this person to make would've been: "The overwhelming complexity of poker makes it a difficult game to define in a way for a computer to be able to play effectively."

    --
    "A man is asked if he is wise or not. He replies that he is otherwise" ~Mao Zedong
  • Computer Go (Score:5, Interesting)

    by dahl_ag (415660) on Monday August 21 2006, @01:21PM (#15950071)
    While it probably doesn't have nearly the financial motivation that poker does, the AI behind Computer Go [wikipedia.org] also represents a huge challenge [wikipedia.org]. The rules of Go are very simple, but it is impossible to 'solve' using brute-force techniques like you might use with something like chess.
  • Poker is only one of many double-blind, "real-world" games out there. I like the idea of making an AI learn poker (poker masters are more like human beings than chess masters, certainly), but it is my humble opinion that Kriegspiel [wikipedia.org] is where it's really at.
  • Limit vs. No-Limit (Score:3, Interesting)

    by BadBlood (134525) on Monday August 21 2006, @01:49PM (#15950265) Homepage
    One distinction to make is that bots can be and have been successful playing against human opponents in limit poker, where the bet size is fixed on each betting round.

    In no-limit poker, when each bet has the potential to cost your opponent all of their money/chips, the decision making process is more critical and mistakes more costly. Variance in no-limit poker is much larger and the AI required to determine whether your opponent is bluffing or has "the nuts" becomes a much larger problem to solve.