Global Information Lookup Global Information

AlphaGo Zero information


AlphaGo Zero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing AlphaGo Zero, a version created without using data from human games, and stronger than any previous version.[1] By playing games against itself, AlphaGo Zero surpassed the strength of AlphaGo Lee in three days by winning 100 games to 0, reached the level of AlphaGo Master in 21 days, and exceeded all the old versions in 40 days.[2]

Training artificial intelligence (AI) without datasets derived from human experts has significant implications for the development of AI with superhuman skills because expert data is "often expensive, unreliable or simply unavailable."[3] Demis Hassabis, the co-founder and CEO of DeepMind, said that AlphaGo Zero was so powerful because it was "no longer constrained by the limits of human knowledge".[4] Furthermore, AlphaGo Zero performed better than standard reinforcement deep learning models (such as DQN implementations[5]) due to its integration of Monte Carlo tree search. David Silver, one of the first authors of DeepMind's papers published in Nature on AlphaGo, said that it is possible to have generalised AI algorithms by removing the need to learn from humans.[6]

Google later developed AlphaZero, a generalized version of AlphaGo Zero that could play chess and Shōgi in addition to Go. In December 2017, AlphaZero beat the 3-day version of AlphaGo Zero by winning 60 games to 40, and with 8 hours of training it outperformed AlphaGo Lee on an Elo scale. AlphaZero also defeated a top chess program (Stockfish) and a top Shōgi program (Elmo).[7][8]

  1. ^ Silver, David; Schrittwieser, Julian; Simonyan, Karen; Antonoglou, Ioannis; Huang, Aja; Guez, Arthur; Hubert, Thomas; Baker, Lucas; Lai, Matthew; Bolton, Adrian; Chen, Yutian; Lillicrap, Timothy; Fan, Hui; Sifre, Laurent; Driessche, George van den; Graepel, Thore; Hassabis, Demis (19 October 2017). "Mastering the game of Go without human knowledge" (PDF). Nature. 550 (7676): 354–359. Bibcode:2017Natur.550..354S. doi:10.1038/nature24270. ISSN 0028-0836. PMID 29052630. S2CID 205261034. Archived (PDF) from the original on 18 July 2018. Retrieved 2 September 2019.Closed access icon
  2. ^ Hassabis, Demis; Siver, David (18 October 2017). "AlphaGo Zero: Learning from scratch". DeepMind official website. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
  3. ^ "Google's New AlphaGo Breakthrough Could Take Algorithms Where No Humans Have Gone". Yahoo! Finance. 19 October 2017. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
  4. ^ Knapton, Sarah (18 October 2017). "AlphaGo Zero: Google DeepMind supercomputer learns 3,000 years of human knowledge in 40 days". The Telegraph. Archived from the original on 19 October 2017. Retrieved 19 October 2017.
  5. ^ mnj12 (7 July 2021), mnj12/chessDeepLearning, retrieved 7 July 2021{{citation}}: CS1 maint: numeric names: authors list (link)
  6. ^ "DeepMind AlphaGo Zero learns on its own without meatbag intervention". ZDNet. 19 October 2017. Archived from the original on 20 October 2017. Retrieved 20 October 2017.
  7. ^ Silver, David; Hubert, Thomas; Schrittwieser, Julian; Antonoglou, Ioannis; Lai, Matthew; Guez, Arthur; Lanctot, Marc; Sifre, Laurent; Kumaran, Dharshan; Graepel, Thore; Lillicrap, Timothy; Simonyan, Karen; Hassabis, Demis (5 December 2017). "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm". arXiv:1712.01815 [cs.AI].
  8. ^ Knapton, Sarah; Watson, Leon (6 December 2017). "Entire human chess knowledge learned and surpassed by DeepMind's AlphaZero in four hours". The Telegraph. Archived from the original on 2 December 2020. Retrieved 5 April 2018.

and 25 Related for: AlphaGo Zero information

Request time (Page generated in 0.8228 seconds.)

AlphaGo Zero

Last Update:

AlphaGo Zero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing...

Word Count : 1928

AlphaGo

Last Update:

AlphaGo Zero, which was completely self-taught without learning from human games. AlphaGo Zero was then generalized into a program known as AlphaZero...

Word Count : 7997

AlphaZero

Last Update:

uses an approach similar to AlphaGo Zero. On December 5, 2017, the DeepMind team released a preprint paper introducing AlphaZero, which within 24 hours of...

Word Count : 2507

AlphaGo versus Ke Jie

Last Update:

2017). "China censored Google's AlphaGo match against world's best Go player" – via The Guardian. "【录像】浙江卫视解说柯洁对战Alphago专题节目". m.baidu.com. Retrieved 26...

Word Count : 868

AlphaGo versus Lee Sedol

Last Update:

AlphaGo versus Lee Sedol, also known as the DeepMind Challenge Match, was a five-game Go match between top Go player Lee Sedol and AlphaGo, a computer...

Word Count : 6011

Google DeepMind

Last Update:

version, AlphaGo Zero, defeated AlphaGo in a hundred out of a hundred games. Later that year, AlphaZero, a modified version of AlphaGo Zero, gained superhuman...

Word Count : 7410

Leela Zero

Last Update:

author of chess engine Sjeng and Go engine Leela. Leela Zero's algorithm is based on DeepMind's 2017 paper about AlphaGo Zero. Unlike the original Leela, which...

Word Count : 552

Leela Chess Zero

Last Update:

verify the methods in the AlphaZero paper as applied to the game of chess. Like Leela Zero and AlphaGo Zero, Leela Chess Zero starts with no intrinsic...

Word Count : 3218

Machine learning in video games

Last Update:

shared properties between them. AlphaZero is a modified version of AlphaGo Zero which is able to play Shogi, chess, and Go. The modified agent starts with...

Word Count : 3879

KataGo

Last Update:

is developed by David Wu. Based on techniques used by DeepMind's AlphaGo Zero, KataGo implements Monte Carlo tree search with a convolutional neural network...

Word Count : 554

Computer Go

Last Update:

AlphaGo used Monte Carlo tree search to score the resulting positions. A later version of AlphaGo, AlphaGoZero, eschewed learning from existing Go games...

Word Count : 6503

AlphaGo versus Fan Hui

Last Update:

AlphaGo versus Fan Hui was a five-game Go match between European champion Fan Hui, a 2-dan (out of 9 dan possible) professional, and AlphaGo, a computer...

Word Count : 1107

Residual neural network

Last Update:

g., BERT and GPT models such as ChatGPT), the AlphaGo Zero system, the AlphaStar system, and the AlphaFold system. In 2012, AlexNet was developed for...

Word Count : 2828

Attention Is All You Need

Last Update:

Versions AlphaGo (2015) Master (2016) AlphaGo Zero (2017) AlphaZero (2017) MuZero (2019) Competitions Fan Hui (2015) Lee Sedol (2016) Ke Jie (2017) In...

Word Count : 419

List of Go games

Last Update:

October 2015, the computer program AlphaGo became the first artificial intelligence program to defeat a professional Go player on a full size board and on...

Word Count : 2181

Aja Huang

Last Update:

of AlphaGo project in 2014. He is one of the first authors of DeepMind's paper on AlphaGo Fan in 2016 and a major author of the paper on AlphaGo Zero in...

Word Count : 435

Monte Carlo tree search

Last Update:

learning. AlphaZero, a generalized version of AlphaGo Zero using Monte Carlo tree search, reinforcement learning and deep learning. Leela Chess Zero, a free...

Word Count : 4694

Timeline of artificial intelligence

Last Update:

2017. Greenemeier, Larry (18 October 2017). "AI versus AI: Self-Taught AlphaGo Zero Vanquishes Its Predecessor". Scientific American. Archived from the original...

Word Count : 4397

Future of Go Summit

Last Update:

Watch. Retrieved 2017-05-27. "AlphaGo官方解读让三子 对人类高手没这种优势" (in Chinese). Sina.com. 25 May 2017. Retrieved 1 June 2017. "各版alphago实力对比 master能让李世石版3子" (in Chinese)...

Word Count : 1647

Fan Hui

Last Update:

compile commentaries on the matches on AlphaGo's website. Fan is one of the authors of DeepMind's paper on AlphaGo Zero published in the journal Nature on...

Word Count : 498

Go ranks and ratings

Last Update:

distributions rather than by attempting to ensure that the gain/loss of ratings is zero sum. A variation of the Elo rating system called WHR ('Whole History Rating')...

Word Count : 2027

Rules of Go

Last Update:

The rules of Go have seen some variation over time and from place to place. This article discusses those sets of rules broadly similar to the ones currently...

Word Count : 10634

Sundar Pichai

Last Update:

Google DeepMind AlphaFold AlphaGo vs. Fan Hui vs. Ke Jie vs. Lee Sedol film Future of Go Summit AlphaGo Zero AlphaStar AlphaZero Master MuZero WaveNet DoubleClick...

Word Count : 2581

MuZero

Last Update:

(2020-01-18). "The Evolution of AlphaGo to MuZero". Medium. Retrieved 2020-06-07. Shah, Rohin. "[AN #75]: Solving Atari and Go with learned game models, and...

Word Count : 1211

The MANIAC

Last Update:

Ehrenfest, history of artificial intelligence, and Lee Sedol's Go match against AlphaGo. The book received mostly positive reviews from critics. John von...

Word Count : 2451

PDF Search Engine © AllGlobal.net