AlphaGo Zero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing AlphaGo Zero, a version created without using data from human games, and stronger than any previous version.[1] By playing games against itself, AlphaGo Zero surpassed the strength of AlphaGo Lee in three days by winning 100 games to 0, reached the level of AlphaGo Master in 21 days, and exceeded all the old versions in 40 days.[2]
Training artificial intelligence (AI) without datasets derived from human experts has significant implications for the development of AI with superhuman skills because expert data is "often expensive, unreliable or simply unavailable."[3] Demis Hassabis, the co-founder and CEO of DeepMind, said that AlphaGo Zero was so powerful because it was "no longer constrained by the limits of human knowledge".[4] Furthermore, AlphaGo Zero performed better than standard reinforcement deep learning models (such as DQN implementations[5]) due to its integration of Monte Carlo tree search. David Silver, one of the first authors of DeepMind's papers published in Nature on AlphaGo, said that it is possible to have generalised AI algorithms by removing the need to learn from humans.[6]
Google later developed AlphaZero, a generalized version of AlphaGo Zero that could play chess and Shōgi in addition to Go. In December 2017, AlphaZero beat the 3-day version of AlphaGo Zero by winning 60 games to 40, and with 8 hours of training it outperformed AlphaGo Lee on an Elo scale. AlphaZero also defeated a top chess program (Stockfish) and a top Shōgi program (Elmo).[7][8]
^Silver, David; Schrittwieser, Julian; Simonyan, Karen; Antonoglou, Ioannis; Huang, Aja; Guez, Arthur; Hubert, Thomas; Baker, Lucas; Lai, Matthew; Bolton, Adrian; Chen, Yutian; Lillicrap, Timothy; Fan, Hui; Sifre, Laurent; Driessche, George van den; Graepel, Thore; Hassabis, Demis (19 October 2017). "Mastering the game of Go without human knowledge" (PDF). Nature. 550 (7676): 354–359. Bibcode:2017Natur.550..354S. doi:10.1038/nature24270. ISSN 0028-0836. PMID 29052630. S2CID 205261034. Archived (PDF) from the original on 18 July 2018. Retrieved 2 September 2019.
^Hassabis, Demis; Siver, David (18 October 2017). "AlphaGo Zero: Learning from scratch". DeepMind official website. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
^"Google's New AlphaGo Breakthrough Could Take Algorithms Where No Humans Have Gone". Yahoo! Finance. 19 October 2017. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
^Knapton, Sarah (18 October 2017). "AlphaGo Zero: Google DeepMind supercomputer learns 3,000 years of human knowledge in 40 days". The Telegraph. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
^mnj12 (7 July 2021), mnj12/chessDeepLearning, retrieved 7 July 2021{{citation}}: CS1 maint: numeric names: authors list (link)
^"DeepMind AlphaGo Zero learns on its own without meatbag intervention". ZDNet. 19 October 2017. Archived from the original on 20 October 2017. Retrieved 20 October 2017.
^Silver, David; Hubert, Thomas; Schrittwieser, Julian; Antonoglou, Ioannis; Lai, Matthew; Guez, Arthur; Lanctot, Marc; Sifre, Laurent; Kumaran, Dharshan; Graepel, Thore; Lillicrap, Timothy; Simonyan, Karen; Hassabis, Demis (5 December 2017). "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm". arXiv:1712.01815 [cs.AI].
^Knapton, Sarah; Watson, Leon (6 December 2017). "Entire human chess knowledge learned and surpassed by DeepMind's AlphaZero in four hours". The Telegraph. Archived from the original on 2 December 2020. Retrieved 5 April 2018.
AlphaGoZero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing...
uses an approach similar to AlphaGoZero. On December 5, 2017, the DeepMind team released a preprint paper introducing AlphaZero, which within 24 hours of...
2017). "China censored Google's AlphaGo match against world's best Go player" – via The Guardian. "【录像】浙江卫视解说柯洁对战Alphago专题节目". m.baidu.com. Retrieved 26...
AlphaGo versus Lee Sedol, also known as the DeepMind Challenge Match, was a five-game Go match between top Go player Lee Sedol and AlphaGo, a computer...
version, AlphaGoZero, defeated AlphaGo in a hundred out of a hundred games. Later that year, AlphaZero, a modified version of AlphaGoZero, gained superhuman...
author of chess engine Sjeng and Go engine Leela. Leela Zero's algorithm is based on DeepMind's 2017 paper about AlphaGoZero. Unlike the original Leela, which...
verify the methods in the AlphaZero paper as applied to the game of chess. Like Leela Zero and AlphaGoZero, Leela Chess Zero starts with no intrinsic...
shared properties between them. AlphaZero is a modified version of AlphaGoZero which is able to play Shogi, chess, and Go. The modified agent starts with...
is developed by David Wu. Based on techniques used by DeepMind's AlphaGoZero, KataGo implements Monte Carlo tree search with a convolutional neural network...
AlphaGo used Monte Carlo tree search to score the resulting positions. A later version of AlphaGo, AlphaGoZero, eschewed learning from existing Go games...
AlphaGo versus Fan Hui was a five-game Go match between European champion Fan Hui, a 2-dan (out of 9 dan possible) professional, and AlphaGo, a computer...
g., BERT and GPT models such as ChatGPT), the AlphaGoZero system, the AlphaStar system, and the AlphaFold system. In 2012, AlexNet was developed for...
October 2015, the computer program AlphaGo became the first artificial intelligence program to defeat a professional Go player on a full size board and on...
of AlphaGo project in 2014. He is one of the first authors of DeepMind's paper on AlphaGo Fan in 2016 and a major author of the paper on AlphaGoZero in...
learning. AlphaZero, a generalized version of AlphaGoZero using Monte Carlo tree search, reinforcement learning and deep learning. Leela Chess Zero, a free...
2017. Greenemeier, Larry (18 October 2017). "AI versus AI: Self-Taught AlphaGoZero Vanquishes Its Predecessor". Scientific American. Archived from the original...
Watch. Retrieved 2017-05-27. "AlphaGo官方解读让三子 对人类高手没这种优势" (in Chinese). Sina.com. 25 May 2017. Retrieved 1 June 2017. "各版alphago实力对比 master能让李世石版3子" (in Chinese)...
compile commentaries on the matches on AlphaGo's website. Fan is one of the authors of DeepMind's paper on AlphaGoZero published in the journal Nature on...
distributions rather than by attempting to ensure that the gain/loss of ratings is zero sum. A variation of the Elo rating system called WHR ('Whole History Rating')...
The rules of Go have seen some variation over time and from place to place. This article discusses those sets of rules broadly similar to the ones currently...
Google DeepMind AlphaFold AlphaGo vs. Fan Hui vs. Ke Jie vs. Lee Sedol film Future of Go Summit AlphaGoZeroAlphaStar AlphaZero Master MuZero WaveNet DoubleClick...
(2020-01-18). "The Evolution of AlphaGo to MuZero". Medium. Retrieved 2020-06-07. Shah, Rohin. "[AN #75]: Solving Atari and Go with learned game models, and...
Ehrenfest, history of artificial intelligence, and Lee Sedol's Go match against AlphaGo. The book received mostly positive reviews from critics. John von...