AlphaGo Zero

AlphaGo Zero, an updated version of AlphaGo, is a computer program developed by Google DeepMind to play the board game Go using reinforcement learning.

deepmind.com/blog/alphago-zero-starting-from-scratch
Is a: Software, Product

Product attributes

Industry: Machine learning, Software development, Artificial Intelligence (AI), Board game
Launch Date: October 18, 2017
Product Parent Company: Google DeepMind

Other attributes

Official Name: AlphaGo Zero
Wikidata ID: Q42259287
Overview

AlphaGo Zero, an updated version of AlphaGo, is a computer program developed by Google DeepMind to play the board game Go using reinforcement learning. In March 2016, AlphaGo became the first computer program to beat a world-champion Go player. AlphaGo used search trees to evaluate positions and neural networks to select moves. These neural networks were initially trained using thousands of human amateur and professional games (supervised learning) before using reinforcement learning via self-play. AlphaGo Zero is based solely on reinforcement learning; it doesn't use human data, guidance, or any knowledge beyond the game rules.

Starting from completely random play, AlphaGo Zero used a novel form of reinforcement learning to become its own teacher. A single neural network is trained to predict AlphaGo Zero's own move selections and also the winner of its self-play games. This neural network improves the strength of the tree search, resulting in higher-quality move selection and stronger self-play with each iteration.
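
The two prediction targets described above (the program's own move selections and the game winner) can be combined into a single training loss. The sketch below follows the form of the loss given in the Nature paper, a squared error on the predicted winner plus a cross-entropy between the search's move probabilities and the network's move probabilities, plus an L2 penalty on the weights. It is a minimal illustration in plain Python, not DeepMind's implementation: the function name, argument names, and the small clamp inside the logarithm are assumptions made for this example.

```python
import math


def alphago_zero_style_loss(policy_probs, value_pred, search_probs, outcome,
                            weights=(), l2_coeff=1e-4):
    """Loss for one training position (illustrative sketch, hypothetical names).

    policy_probs: network's move probabilities p for this position
    value_pred:   network's predicted outcome v, in [-1, 1]
    search_probs: move probabilities pi produced by the tree search
    outcome:      actual game result z from the current player's view (+1 or -1)
    weights:      flat iterable of network parameters (assumed representation)
    l2_coeff:     regularisation strength c (default is an assumption here)
    """
    # Value term: squared error between predicted and actual winner.
    value_loss = (outcome - value_pred) ** 2

    # Policy term: cross-entropy between search probabilities and network output.
    # The clamp avoids log(0); it is a numerical convenience added for this sketch.
    policy_loss = -sum(
        pi * math.log(max(p, 1e-12))
        for pi, p in zip(search_probs, policy_probs)
        if pi > 0
    )

    # L2 regularisation on the parameters.
    l2_penalty = l2_coeff * sum(w * w for w in weights)

    return value_loss + policy_loss + l2_penalty


if __name__ == "__main__":
    # Two legal moves; the search strongly prefers the first, the network is unsure.
    loss = alphago_zero_style_loss(
        policy_probs=[0.5, 0.5],
        value_pred=0.0,
        search_probs=[1.0, 0.0],
        outcome=1.0,
    )
    print(f"loss = {loss:.4f}")
```

Minimising this quantity pulls the network's move probabilities toward those found by the search and its value estimate toward the eventual winner, which is the sense in which the program acts as its own teacher.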

AlphaGo Zero was announced in a blog post published by Google DeepMind on October 18, 2017. This was followed by a paper published in Nature on October 19, 2017. The paper, titled "Mastering the game of Go without human knowledge," describes the architecture and training of the AlphaGo Zero algorithm in greater detail. Starting in a tabula rasa ("blank slate") condition, AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating version of AlphaGo after only three days of self-play training. After forty days of self-play training, it outperformed the upgraded version of AlphaGo known as "Master."

Further Resources

Title: Mastering the game of Go without human knowledge
Author: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel & Demis Hassabis
Link: https://www.nature.com/articles/nature24270
Type: Academic paper
Date: October 19, 2017
