Jump to content

OpenAI Five

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Conscious AI (talk | contribs) at 21:34, 11 October 2018 (→‎Architecture). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

OpenAI Five is an artificial intelligence (AI) software system that plays the Dota 2 esport video game.[1] It was developed by OpenAI, a non-profit artificial intelligence (AI) research lab founded by Elon Musk, Sam Altman and others with the mission to develop safe and friendly AI in a way that benefits humanity.[2][3][4]

In August 2017, the first version of the OpenAI Dota 2 bot played in a 1v1 game against a top professional player, Dendi, where an AI system defeated him at The International 2017.[5][6] Greg Brockman, co-founder and CTO of OpenAI, explained that the AI system had learned by playing against itself for two weeks of real time, and that the learning software was a step in the direction of creating software that can handle complex tasks "like being a surgeon".[7][8]

During 2018, OpenAI has been developing a system to be able to play Dota against a team of five humans. In June 2018, the ability of OpenAI Five expanded to play together as a team of five and were able to defeat teams of amateur and semi-professional players.[9][10] In August 2018, OpenAI Five beat former Dota professionals ranked in the 99.95th percentile at the Benchmark game winning two games and losing one.[11][12]In August 2018, OpenAI Five played against a pro team of five people at The International 2018, where the AI system lost two games to the human team.[13][14]

According to OpenAI, the lab uses Dota as a test system for general-purpose AI to capture the unpredictability and continuous nature of the real world, e.g. teamwork, long time horizons, and hidden patterns.[15] The training system used for Dota is also applied to control a robotic hand.[16][17][18]

History

Dota 2 is listed among the best and most complex video games[19], with professionals who train full time for many years[20]to gain a part of Dota’s annual $40M prize pool, the largest in esports gaming.[21]

Previous AI vs human milestones have been in games such as Jeopardy!, Chess, and Go.[22][23][24] Currently, AI labs are trying to develop AI systems to be able to compete in more complex video games that require strategic thinking, team play, reasoning, etc.[25][26]

According to the OpenAI website, the work on the AI system for Dota 2 started at the end of 2016.[27]In August 2017, the system beat top professional, Dendi, in a 1v1 game during The International 7.[28][29]

In June 2018, OpenAI Five defeated an in-house OpenAI team in a fairly restricted game of 5v5 players.[30][31]

In August 2018, during the Benchmark, OpenAI Five defeated former Dota professionals with a somewhat restricted game of 5v5 players with 5 invulnerable couriers and 18 heroes for selection in the AI system.[32][33]

In August 2018, the OpenAI Five team played against top professional Dota 2 players at The International 8 and lost both games. The first game was against paiN Gaming, listed in the top 18 Dota 2 teams who won more than $450,000 in career earnings.[34]The second game was with a team of Chinese super professional players. Three of them had played together in the same team with an average of $1 million tournament earnings for each of these players.[35]The AI system lost the second game after 45 minutes handling a competitive back-and-forth game.[36]After The International 8 match Garry Kasparov wrote in his tweet: “They'll get there, and sooner than expected. Machines will be superior in any closed system eventually.”[37]

According to the OpenAI official blog post, unlike the Benchmark game the International one was played against significantly stronger human players, a pro team that has been training for the game full-time. Another difference was that hero lineups were provided by a third party rather than by OpenAI Five. Also, the AI system developers removed the last major restriction that most Dota professionals consider “Real Dota” gameplay: OpenAI heroes got a single mortal team courier where previously each hero had its own invulnerable courier to deliver items for them.[38]

Architecture

Each OpenAI Five network contains a single layer with a 1024-unit LSTM[39] that observes the current game state extracted from the Dota developer’s API. The neural network conducts actions via numerous possible action heads (no human data involved), and every head has meaning. For instance, the number of ticks to delay an action, what action to select – the X or Y coordinate of this action in a grid around the unit. In addition, action heads are computed independently. The AI system observes the world as a list of 20,000 numbers and takes an action by conducting a list of 8 enumeration values. Also, it selects different actions and targets to understand how to encode every action and observes the world.[40]

OpenAI Five has been developed as a general-purpose reinforcement learning training system on the Rapid infrastructure. Rapid consists of two layers: it spins up thousands of machines and helps them ‘talk’ to each other and a second layer runs software. OpenAI Five has played 180 years’ worth of games in a self-play technique (which is a part of the reinforcement learning approach) using a scaled-up version of Proximal Policy Optimization.[41][42]running on 256 GPUs and 128,000 CPU cores.[43]

Comparison chart
OpenAI 1v1 bot OpenAI Five
CPUs 60,000 CPU cores on Azure 40128,000 preemptible CPU cores on GCP
GPUs 256 K80 GPUs on Azure 256 P100 GPUs on GCP
Experience collected ~300 years per day ~180 years per day (~900 years per day counting each hero separately)
Size of observation ~3.3 kB ~36.8 kB
Observations per second of gameplay 10 7.5
Batch size 8,388,608 observations 1,048,576 observations
Batches per minute ~20 ~60

Comparison with Chess and Go games

Jeopardy!, Chess and Go are among milestones in AI vs human gaming. But unlike these, Dota 2 is a real-time strategy game played between two teams that consists of five players.[44]

An AI system playing Dota 2 has the following features:

Long run view: Dota games run at 30 frames per second for an average of 45 minutes, which results in 80,000 ticks per game. Some discrete actions, like town portal running, can significantly impact the game. OpenAI Five observes every fourth frame, generating 20,000 moves. By comparison, chess usually ends before 40 moves, Go before 150 moves.[45]

Partially observed state of the game: Elements can see only the area around them. The rest of the map is covered in a fog hiding enemies and their approaches. Professional play requires making inferences based on incomplete data, as well as recreating (prototyping) what one’s opponent might be up to. Chess and Go, by comparison, are full-information games.[46][47]

Continuous action space: each hero can take dozens of actions, and many actions target either another unit or a position. The OpenAI Five developers discretize the space into 170,000 possible actions per hero. Without counting the perpetual aspects of the game, there are an average of ~1,000 valid actions each tick. By comparison, the average number of actions in chess is 35 and 250 in Go.[48]

Continuous observation space: Dota is played on a large ceaseless map with ten heroes, dozens of buildings and NPC units, along with countless game features, like runes, trees and wards, and so on. The OpenAI system observes the state of a game though developers’ Bot API, as 20,000 numbers that constitute all information a human is allowed to get access to. A chess board is represented as about 70 lists (an 8x8 board of 6-piece types – here is historical info); where a Go board has about 400 enumerations (a 19x19 board of 2-piece types counting Ko).[49]

AI and tech community acknowledgment

OpenAI Five advancements have received acknowledgement from the AI, tech and Dota community. Bill Gates tweeted after the OpenAI team’s in-house game in June[50][51]:

#AI bots just beat humans at the video game Dota 2. That’s a big deal, because their victory required teamwork and collaboration – a huge milestone in advancing artificial intelligence.

After The International 8 match Garry Kasparov his tweeted[52] : “They'll get there, and sooner than expected. Machines will be superior in any closed system eventually.”

According The Verge[53], Andreas Theodorou, an AI researcher at the University of Bath who uses computer games to study collaboration, says the latest research on 5v5 games is a big step forward, although he notes that perhaps the most “significant achievement” is OpenAI’s use of visualizations to debug their agents. “These techniques show how even reinforcement learning and machine learning systems, in general, can be transparent”.

In a conversation with MIT Technology Review[54], AI experts said that the achievement of the OpenAI Five system is significant. “Dota 2 is an extremely complicated game, so even beating strong amateurs is truly impressive,” says Noam Brown, a researcher at Carnegie Mellon University in Pittsburgh. “In particular, dealing with hidden information in a game as large as Dota 2 is a major challenge.”

References

  1. ^ "A team of AI algorithms just crushed humans in a complex computer game". MIT Tech Review. Retrieved 25 June 2018.
  2. ^ "ELON MUSK'S BILLION-DOLLAR AI PLAN IS ABOUT FAR MORE THAN SAVING THE WORLD". WIRED. 15 December 2015. Retrieved 15 December 2015.
  3. ^ "Artificial Intelligence Nonprofit OpenAI Launches With Backing From Elon Musk And Sam Altman". TechCrunch. Retrieved 11 December 2015.
  4. ^ "INSIDE OPENAI, ELON MUSK'S WILD PLAN TO SET ARTIFICIAL INTELLIGENCE FREE". WIRED. 27 April 2016. Retrieved 27 April 2016.
  5. ^ "For the first time, AI crushed a professional Dota 2 player at the game's biggest tournament". QUARTZ. Retrieved 12 August 2017.
  6. ^ "OpenAI's bot beats top Dota 2 player so badly that he quits". VentureBeat. 12 August 2017. Retrieved 11 August 2017.
  7. ^ "Dota 2". blog.openai.com. 11 August 2017. Retrieved 12 August 2017.
  8. ^ "More on Dota 2". blog.openai.com. 16 August 2017. Retrieved 16 August 2017.
  9. ^ Simonite, Tom (25 June 2018). "Can Bots Outwit Humans in One of the Biggest Esports Games?". Wired. Retrieved 25 June 2018.
  10. ^ Vincent, James (25 June 2018). "AI bots trained for 180 years a day to beat humans at Dota 2". The Verge. Retrieved 25 June 2018.
  11. ^ "OpenAI's bot handily beat a team of professional Dota players". VentureBeat. 6 August 2018. Retrieved 6 August 2018.
  12. ^ "The OpenAI Dota 2 bots just defeated a team of former pros". Verge. 6 August 2018. Retrieved 6 August 2018.
  13. ^ "OPENAI'S DOTA 2 DEFEAT IS STILL A WIN FOR ARTIFICIAL INTELLIGENCE". Verge. 28 August 2018. Retrieved 28 August 2018.
  14. ^ "The International 2018: Results". blog.openai.com. 24 August 2018. Retrieved 23 August 2018.
  15. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  16. ^ "Learning Dexterity". blog.openai.com. 30 July 2018. Retrieved 30 July 2018.
  17. ^ "OpenAI sets new benchmark for robot dexterity". Verge. 30 July 2018. Retrieved 30 July 2018.
  18. ^ "OpenAI's Dactyl system improves the dexterity of robot hands". Engadget. Retrieved 30 July 2018.
  19. ^ "DOTA 2 Guide". Purgegamers.
  20. ^ "Inside the home of top-ranked Dota 2 esports team Evil Geniuses". Venturebeat. 12 February 2017. Retrieved 12 February 2017.
  21. ^ "E-sports earnings". esportsearnings.
  22. ^ Gabbatt, Adam (17 February 2011). "IBM computer Watson wins Jeopardy clash". The Guardian. Retrieved 17 February 2011.
  23. ^ "Chess grandmaster Garry Kasparov on what happens when machines 'reach the level that is impossible for humans to compete'". Business Insider. Retrieved 29 December 2017.
  24. ^ "DeepMind's Go-playing AI doesn't need human help to beat us anymore". Verge. 18 October 2017. Retrieved 18 October 2017.
  25. ^ "Humans Are Still Better Than AI at StarCraft for Now". MIT Tech Review. Retrieved 1 November 2017.
  26. ^ "CAN BOTS OUTWIT HUMANS IN ONE OF THE BIGGEST ESPORTS GAMES?". WIRED. Retrieved 25 June 2018.
  27. ^ "OpenAI Five". openai.com/five.
  28. ^ "OpenAI bot bursts into the ring, humiliates top Dota 2 pro gamer in 'scary' one-on-one bout". The Register. Retrieved 12 August 2017.
  29. ^ "Elon Musk's $1 billion AI startup made a surprise appearance at a $24 million video game tournament — and crushed a pro gamer". Business Insider. Retrieved 11 August 2017.
  30. ^ "OpenAI's bot can now defeat skilled Dota 2 teams". VentureBeat. 25 June 2018. Retrieved 25 June 2018.
  31. ^ "OpenAI reveals progress on their 5v5 Dota AI". Cybersport. Retrieved 25 June 2018.
  32. ^ "OpenAI Five Benchmark: Results". blog.openai.com. 6 August 2018. Retrieved 6 August 2018.
  33. ^ "How the OpenAI Five tore apart a team of Dota 2 pros". 11 August 2018. Retrieved 11 August 2018.
  34. ^ "paiN Gaming". Liquipedia.
  35. ^ "BurNIng". Liquipedia.
  36. ^ "The International 2018: Results". blog.openai.com. 24 August 2018. Retrieved 23 August 2018.
  37. ^ "Gary Kasparov's Twitter". 24 August 2018. Retrieved 24 August 2018.
  38. ^ "The International 2018: Results". blog.openai.com. 24 August 2018. Retrieved 23 August 2018.
  39. ^ "Understanding LSTM Networks". colah's blog. Retrieved 27 August 2015.
  40. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  41. ^ "Why are AI researchers so obsessed with games?". QUARTZ. Retrieved 4 August 2018.
  42. ^ Schulman, John; Wolski, Filip; Dhariwal, Prafulla; Radford, Alec; Klimov, Oleg (2017). "Proximal Policy Optimization Algorithms". arXiv:1707.06347 [cs.LG].
  43. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  44. ^ "This AI trains on 180 years of gaming every day, and is now beating humans". New Atlas. Retrieved 26 June 2018.
  45. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  46. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  47. ^ "A team of AI algorithms just crushed humans in a complex computer game". MIT Tech Review. Retrieved 25 June 2018.
  48. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  49. ^ "OpenAI Five". blog.openai.com. 25 June 2018. Retrieved 25 June 2018.
  50. ^ "Bill Gates says gamer bots from Elon Musk-backed nonprofit are 'huge milestone' in A.I." CNBC. 28 June 2018. Retrieved 28 June 2018.
  51. ^ "Bill Gates hails 'huge milestone' for AI as bots work in a team to destroy humans at video game 'Dota 2'". Business Insider. Retrieved 27 June 2018.
  52. ^ "Gary Kasparov's Twitter". 24 August 2018. Retrieved 24 August 2018.
  53. ^ "AI bots trained for 180 years a day to beat humans at Dota 2". Verge. 25 June 2018. Retrieved 25 June 2018.
  54. ^ "A team of AI algorithms just crushed humans in a complex computer game". MIT Tech Review. Retrieved 25 June 2018.

See also