Google AI algorithm masters ancient game of Go

doi:doi:10.1038/529445a

Nature | News

Google AI algorithm masters ancient game of Go

Deep-learning software defeats human professional for first time.

Elizabeth Gibney

27 January 2016

The computer that mastered Go

Nature Video

A computer has beaten a human professional for the first time at Go — an ancient board game that has long been viewed as one of the greatest challenges for artificial intelligence (AI).

The best human players of chess, draughts and backgammon have all been outplayed by computers. But a hefty handicap was needed for computers to win at Go. Now Google’s London-based AI company, DeepMind, claims that its machine has mastered the game.

Go players react to computer defeat

DeepMind’s program AlphaGo beat Fan Hui, the European Go champion, five times out of five in tournament conditions, the firm reveals in research published in Nature on 27 January¹. It also defeated its silicon-based rivals, winning 99.8% of games against the current best programs. The program has yet to play the Go equivalent of a world champion, but a match against South Korean professional Lee Sedol, considered by many to be the world’s strongest player, is scheduled for March. “We’re pretty confident,” says DeepMind co-founder Demis Hassabis.

“This is a really big result, it’s huge,” says Rémi Coulom, a programmer in Lille, France, who designed a commercial Go program called Crazy Stone. He had thought computer mastery of the game was a decade away.

Nature Podcast

Hear from the makers of the AI that mastered Go — and the professional player it beat.

You may need a more recent browser or to install the latest version of the Adobe Flash Plugin.

The IBM chess computer Deep Blue, which famously beat grandmaster Garry Kasparov in 1997, was explicitly programmed to win at the game. But AlphaGo was not preprogrammed to play Go: rather, it learned using a general-purpose algorithm that allowed it to interpret the game’s patterns, in a similar way to how a DeepMind program learned to play 49 different arcade games².

This means that similar techniques could be applied to other AI domains that require recognition of complex patterns, long-term planning and decision-making, says Hassabis. “A lot of the things we’re trying to do in the world come under that rubric.” Examples are using medical images to make diagnoses or treatment plans, and improving climate-change models.

Digital intuition

In China, Japan and South Korea, Go is hugely popular and is even played by celebrity professionals. But the game has long interested AI researchers because of its complexity. The rules are relatively simple: the goal is to gain the most territory by placing and capturing black and white stones on a 19 × 19 grid. But the average 150-move game contains more possible board configurations — 10¹⁷⁰ — than there are atoms in the Universe, so it can’t be solved by algorithms that search exhaustively for the best move.

Nature Video

Go, a complex game popular in Asia, has frustrated the efforts of artificial-intelligence researchers for decades.

Abstract strategy

Computer science: The learning machines

Chess is less complex than Go, but it still has too many possible configurations to solve by brute force alone. Instead, programs cut down their searches by looking a few turns ahead and judging which player would have the upper hand. In Go, recognizing winning and losing positions is much harder: stones have equal values and can have subtle impacts far across the board.

To interpret Go boards and to learn the best possible moves, the AlphaGo program applied deep learning in neural networks — brain-inspired programs in which connections between layers of simulated neurons are strengthened through examples and experience. It first studied 30 million positions from expert games, gleaning abstract information on the state of play from board data, much as other programmes categorize images from pixels. Then it played against itself across 50 computers, improving with each iteration, a technique known as reinforcement learning.

“Deep learning is killing every problem in AI.”

The software was already competitive with the leading commercial Go programs, which select the best move by scanning a sample of simulated future games. DeepMind then combined this search approach with the ability to pick moves and interpret Go boards — giving AlphaGo a better idea of which strategies are likely to be successful. The technique is “phenomenal”, says Jonathan Schaeffer, a computer scientist at the University of Alberta in Edmonton, Canada, whose software Chinook solved³ draughts in 2007. Rather than follow the trend of the past 30 years of trying to crack games using computing power, DeepMind has reverted to mimicking human-like knowledge, albeit by training, rather than by being programmed, he says. The feat also shows the power of deep learning, which is going from success to success, says Coulom. “Deep learning is killing every problem in AI.”

Game theorists crack poker

AlphaGo plays in a human way, says Fan. “If no one told me, maybe I would think the player was a little strange, but a very strong player, a real person.” The program seems to have developed a conservative (rather than aggressive) style, adds Toby Manning, a lifelong Go player who refereed the match.

Google’s rival firm Facebook has also been working on software that uses machine learning to play Go. Its program, called darkforest, is still behind commercial state-of-the-art Go AI systems, according to a November preprint⁴.

Hassabis says that many challenges remain in DeepMind’s goal of developing a generalized AI system. In particular, its programs cannot yet usefully transfer their learning about one system — such as Go — to new tasks; a feat that humans perform seamlessly. “We’ve no idea how to do that. Not yet,” Hassabis says.

Game-playing software holds lessons for neuroscience

Go players will be keen to use the software to improve their game, says Manning, although Hassabis says that DeepMind has yet to decide whether it will make a commercial version.

AlphaGo hasn’t killed the joy of the game, Manning adds. Strap lines boasting that Go is a game that computers can’t win will have to be changed, he says. “But just because some software has got to a strength that I can only dream of, it’s not going to stop me playing.”

Journal name:: Nature
Volume:: 529,
Pages:: 445–446
Date published:: (28 January 2016)
DOI:: doi:10.1038/529445a

References

Silver, D. et al. Nature 529, 484–489 (2016).
- Article
Mnih, V. et al. Nature 518, 529–533 (2015).
Schaeffer, J. et al. Science 317, 1518–1522 (2007).
Tian, Y. & Zhu, Y. Preprint at arXiv http://arxiv.org/pdf/1511.06410.pdf (2015).

For the best commenting experience, please login or register as a user and agree to our Community Guidelines. You will be re-directed back to this page where you will see comments updating in real-time and have the ability to recommend comments to other users.

Comments

2 comments Subscribe to comments

Louis Oldershaw • 2016-01-27 07:52 PM

Preamble: If you have not seen the recent film "Ex Machina", by all means do! In terms of subject matter, acting, cinematography, music and directing, it is exquisite. Unless one subscribes to the "Ghost In Machine" version of human thought, this step forward for AI is not the least bit surprising. When Deep Blue beat Gary Kasparov at chess the writing was clearly on the wall, so to speak. The human brain and its thought output may be exceedingly subtle and complex, but the brain is only a very large number of neuronal parts and causal physical interactions. It may take a long time before computers can mimic everything a human can do, but to assume that computers cannot do so in principle is a very dubious assumption -one without much scientific merit. I cannot wait for a really advanced computer system to process all available, well-tested, observational physics/astronomy data and then formulate a cosmological paradigm. I would bet dollars to donuts that it would do one hell of a lot better than our current crop of theoretical physicists.
- Share to Twitter Share to Facebook Share link to this comment
Jonathan Kolber • 2016-01-27 07:49 PM

This article highlights the primary reason why policymakers should be paying attention here. Accelerating automation threatens to displace multitudes of workers. We need a new solution. While optimists claim that advancing technology will create new jobs to replace those lost, this time really is different. The apparent limitation that the machines cannot transfer domains of knowledge is less a problem than it appears. This is because the machines can learn new fields of expertise much faster than people can learn them, especially complex fields where the future occupations are likely to emerge. Then the "knowledgeable" machine can be duplicated as required, and the only thing standing between jobs for people in that line of work and unemployment is a cost/benefit analysis. Those who acknowledge the problem commonly advocate a guaranteed income for all. However, that solution has its own problems: poor countries cannot afford it, and in wealthier divided nations (US, India, much of EU) it will meet massive political resistance, and enormous difficulties passing and collecting such taxes as will be necessary to pay for it. Something truly new is necessary. One possibility: www.ACelebrationSociety.com.
- Share to Twitter Share to Facebook Share link to this comment

[b1] Silver, D. et al. Nature 529, 484–489 (2016).

Article

[2] Article

[b2] Mnih, V. et al. Nature 518, 529–533 (2015).

Article
PubMed
ChemPort

[4] Article

[5] PubMed

[6] ChemPort

[b3] Schaeffer, J. et al. Science 317, 1518–1522 (2007).

Article
PubMed
ChemPort

[8] Article

[9] PubMed

[10] ChemPort

[b4] Tian, Y. & Zhu, Y. Preprint at arXiv http://arxiv.org/pdf/1511.06410.pdf (2015).

The computer that mastered Go

Nature Podcast

Abstract strategy

References

Related stories and links

From nature.com

From elsewhere

Author information

Author details

Elizabeth Gibney

Search for this author in

2 comments Subscribe to comments

See other News & Comment articles from Nature

In retrospect: The selfish gene

Wearable sweat sensor paves way for real-time analysis of body chemistry

Go players react to computer defeat

Google AI algorithm masters ancient game of Go

History: Archive of wonders

The week in science: 22–28 January 2016

How quality control could save your science

Hawking’s latest black-hole paper splits physicists

India needs home-grown GM food to stop starvation

Genetics rewrites story of fish species trapped in single hole

Dog DNA probed for clues to human psychiatric ills

Slaughter of the song birds

Paris climate deal hinges on better carbon accountancy

More evidence emerges for 'transmissible Alzheimer's' theory

Monkeys genetically modified to show autism symptoms

Worst year ever for rhino poaching in Africa

Nature's news quiz: chemical elements, Planet Nine, and a nuclear test.

Research integrity: Don't let transparency damage science

Researchers question design of fatal French clinical trial

Science expresses concern over controversial chemistry paper

Social Media Box - AML

Spot check

Top Content - Article Page

Recent

Read

Commented

Transmissable Alzheimer’s?

Autism research

Black-hole conundrum

Research integrity

Nature Podcast

Science jobs from nature jobs