AI just won a poker tournament against professional players

Technology

A poker-playing artificial intelligence has claimed victory against humans, winning with a lead of $1.7 million by constantly tweaking its strategy

By Timothy Revell

31 January 2017

Poker pro playing AI — Losing cause
Tim Kaulen/Carnegie Mellon University

An AI just claimed another gaming victory over humans by winning a 20-day poker tournament. The AI, called Libratus, took on four of the world’s best Heads-Up No-Limit Texas Hold ‘Em poker players at a Pennsylvania casino. After 120,000 hands, Libratus won with a lead of over $1.7 million in chips.

“I’m feeling great,” says Tuomas Sandholm, a computer scientist at Carnegie Mellon University who was part of the team that created the AI. “This is a David versus Goliath story, and Libratus was able to throw a pebble.”

A poker-proficient AI is remarkable because poker is a game of “imperfect information”: players don’t know what cards their opponents have, so never have a full view of the state of play. This means the AI has to take into account how its opponent is playing and rework its approach so it doesn’t give away when it has a good hand or is bluffing.

“It’s a really important milestone for artificial intelligence,” says a researcher at the University of Malta. “This is like reality. The real world is a game of imperfect information, so by solving poker we become one step closer to general artificial intelligence.”

Libratus’s algorithms are not specific to poker, or even just to games. The AI has not been taught any strategies and instead has to work out its own way to play based on the information it’s given – in this case, the rules of poker. This means that Libratus could be applied to any situation that requires a response based on imperfect information.

“There are applications in cybersecurity, negotiations, military settings, auctions and more,” says Sandholm. His lab has also been looking at how AI can bolster the fight against infections, by viewing treatment plans as game strategies. “You can learn to battle diseases better even if you have no extra medicines at your disposal – you just use them smarter,” says Sandholm.

Spilling the beans

The Carnegie Mellon team has previously been tight-lipped about Libratus’s methods, fearing that any explanation could assist its human competitors. But now Sandholm is willing to say more about how it works.

Libratus has three main parts. The first has not changed much since 2015 when Sandholm’s team first entered its AI in a similar tournament against professional players (that time, humans won). This part computed a big list of strategies the AI could use when play began. At the outset of the tournament, Libratus had spent the equivalent of 15 million hours of computation honing its strategies.

The second part, now completely redesigned by Sandholm and his PhD student Noam Brown, worked to improve Libratus’s strategy with each hand. Called the “endgame solver”, it took into account “mistakes” the AI’s opponents made – instances where they left themselves open to exploitation – to predict the result of each hand. The team couldn’t tell from statistical analysis if the earlier version of the endgame solver improved the AI’s play at all, says Sandholm. “But this new one is just awesome.”

The final part of the AI looked for its own strategic weaknesses so it could change how it played before the next session. This sought to identify things its opponents were exploiting, such as a giveaway “tell” that another player had noticed.

This was important as, in the last tournament, the human players were able to work out how the AI played when it had different cards and change the way they bet accordingly.

Tougher to play every day

“It’s insanely good this time around – quite remarkable,” said Jason Les, one of the professional players, as the tournament entered the final days. “It seems to have some sort of strategy update component that is learning how to best play us. Its strategy seems to be improving as time goes on and it is tougher and tougher every day.”

Despite their loss, the professional players will split a $200,000 prize pot based on their performances – and the researchers won’t actually take home any winnings. Following its victory, the Libratus team plans to publish the AI’s algorithms in a peer-reviewed journal.

There is still a long way to go before AI can take on the real world, says a researcher at the University of Essex, UK. “In the real world, you often have a lot more choices than in a card game. The possibilities are more open-ended,” he says.

But it’s still a fantastic achievement as poker is a complex game, he says. “It’s an impressive step forward and a big deal.”

Topics: Artificial intelligence / Computing / games / Machine learning