AlphaGo, the Beginning of a New Era

2016.03.02 Youngsung Son

#Agenda
AlphaGo AlphaGo 9 AlphaGo

#https://m.facebook.com/story.php?story_fbid=10102619979696481&id=4

AlphaGo

#https://gogameguru.com/alpha-go-fan-hui/

Fan Hui, 2-dan professional

European Champion 3 times (2013, 2014, 2015); AlphaGo's opponent

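#5 games each
Formal games: 1 hour main time, then 3 byoyomi periods of 30 seconds
Informal games: 3 byoyomi periods of 30 seconds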

Formal games: Fan Hui 0-5 AlphaGo
Informal games: Fan Hui 2-3 AlphaGo
AlphaGo
http://www.yonhapnews.co.kr/bulletin/2016/02/15/0200000000AKR20160215002700007.HTML

#https://gogameguru.com/alpha-go-fan-hui/

0 - 5 AlphaGo

#https://www.youtube.com/watch?v=SUbqykXVx0A
AlphaGo

#AlphaGo

Mastering the game of Go with deep neural networks and tree search, Nature, 2016

#http://time.com/3705316/deep-blue-kasparov/

Kasparov vs. IBM Deep Blue, May 1997

#http://nautil.us/issue/18/genius/why-the-chess-computer-deep-blue-played-like-a-human

#http://www.chessgames.com/perl/chesscollection?cid=1014770

#http://stanford.edu/~cpiech/cs221/apps/deepBlue.html

https://www.uio.no/studier/emner/matnat/ifi/INF4130/h12/undervisningsmateriale/chess-algorithms-theory-and-practice-ver2012.pdf

Deep Blue Chess Algorithm
Monte Carlo Tree Search
Game Tree
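As a rough illustration of game-tree search, the sketch below runs minimax with alpha-beta pruning over a tiny hand-made tree. Alpha-beta is the classical search family that chess engines such as Deep Blue build on; the tree, its leaf scores, and the name `alphabeta` are assumptions made for this example and are not taken from Deep Blue itself.

```python
import math


def alphabeta(node, maximizing=True, alpha=-math.inf, beta=math.inf):
    """Return the minimax value of `node` using alpha-beta pruning.

    A node is either a number (a leaf score from the maximizer's point
    of view) or a list of child nodes.
    """
    if not isinstance(node, list):
        return node  # leaf: static evaluation
    if maximizing:
        best = -math.inf
        for child in node:
            best = max(best, alphabeta(child, False, alpha, beta))
            alpha = max(alpha, best)
            if alpha >= beta:
                break  # the minimizer above will never allow this branch
        return best
    best = math.inf
    for child in node:
        best = min(best, alphabeta(child, True, alpha, beta))
        beta = min(beta, best)
        if alpha >= beta:
            break  # the maximizer above already has a better option
    return best


# Hypothetical two-ply tree: the maximizer picks a branch,
# then the minimizer picks a leaf inside it.
TOY_TREE = [[3, 5], [2, 9], [0, 7]]
print(alphabeta(TOY_TREE))
```

In the toy tree the second and third branches are cut off after their first leaf, because that leaf is already worse for the maximizer than the first branch. Pruning of this kind helps in chess, but it does not scale to the branching factor of Go.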

#

#

Legal positions (19x19 board): 208168199381979984699478633344862770286522453884530548425639456820927419612738015378525648451698519643907259916015628128546089888314427129715319317557736620397247064840935 (about 2.08 x 10^170)
Orderings of the 361 points: 361!
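To put these magnitudes side by side, the following sketch compares the position count quoted on the slide with two naive bounds: 3^361 (every point empty, black, or white) and 361! (every ordering of the 361 points). The constant and helper names are assumptions introduced for this example.

```python
import math

# Count quoted on the slide for the 19x19 board.
LEGAL_POSITIONS = int(
    "2081681993819799846994786333448627702865224538845305484256394568"
    "2092741961273801537852564845169851964390725991601562812854608988"
    "8314427129715319317557736620397247064840935"
)

POINTS = 19 * 19  # 361 intersections


def digits(n):
    """Number of decimal digits of a positive integer."""
    return len(str(n))


colorings = 3 ** POINTS             # each point: empty, black, or white
orderings = math.factorial(POINTS)  # 361!

print("points:", POINTS)
print("digits in 3^361:", digits(colorings))
print("digits in 361!:", digits(orderings))
print("legal fraction of 3^361:", LEGAL_POSITIONS / colorings)
```

Python integers have arbitrary precision, so none of these values overflow. The last line shows that only a small fraction of all board colorings are legal positions.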

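#Single-machine AlphaGo (48 CPUs): 494 : 1 against other Go programs
Distributed AlphaGo (1,202 CPUs): 5 : 0 against Fan Hui
Training data: KGS games by 5~9 dan players, 160,000 games, 30 million positions
100 self 3
AlphaGo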

#

AlphaGo

#http://deepmind.com/alpha-go.html

AlphaGo Deep Learning

#

AlphaGo MCTS

#

100,000 simulations against the open-source Go program Pachi
AlphaGo training: 13-layer policy network trained on 30 million positions
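The four MCTS steps (selection, expansion, simulation, backpropagation) can be sketched on a toy game. The example below is plain UCT with random rollouts on a small Nim variant (take 1 to 3 stones; whoever takes the last stone wins). It is not AlphaGo's search, which per the Nature paper guides the tree with policy and value networks; the game and the names `Node`, `uct_child`, `rollout`, and `mcts` are assumptions made for illustration.

```python
import math
import random


class Node:
    def __init__(self, stones, parent=None, move=None):
        self.stones = stones  # stones left for the player to move
        self.parent = parent
        self.move = move      # move that led from the parent to this node
        self.children = []
        self.untried = [m for m in (1, 2, 3) if m <= stones]
        self.visits = 0
        self.wins = 0.0       # wins for the player who moved INTO this node


def uct_child(node, c=1.4):
    """UCB1: balance a high win rate against rarely visited children."""
    return max(
        node.children,
        key=lambda ch: ch.wins / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )


def rollout(stones, rng):
    """Play random moves; True if the player to move at `stones` wins."""
    to_move_wins = False  # with 0 stones the player to move has already lost
    turn = 0
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        if stones == 0:
            to_move_wins = turn == 0
        turn ^= 1
    return to_move_wins


def mcts(stones, iterations=2000, seed=0):
    """Return the most visited move from a position with `stones` >= 1."""
    rng = random.Random(seed)
    root = Node(stones)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend while the node is fully expanded.
        while not node.untried and node.children:
            node = uct_child(node)
        # 2. Expansion: add one untried move as a new child.
        if node.untried:
            move = node.untried.pop()
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        # 3. Simulation: random playout from the new position.
        mover_wins = not rollout(node.stones, rng)
        # 4. Backpropagation: flip the winner's perspective at each level.
        while node is not None:
            node.visits += 1
            node.wins += 1.0 if mover_wins else 0.0
            mover_wins = not mover_wins
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).move


print(mcts(5))
```

With 5 stones the winning plan is to leave the opponent a multiple of 4, so the search is expected to settle on taking one stone. Replacing the random rollout and the uniform expansion with learned evaluations is the direction the slides attribute to AlphaGo.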


#

Lee Sedol 9-dan

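#Game 1: March 9 (Wed), 1 PM
Game 2: March 10 (Thu), 1 PM
Game 3: March 12 (Sat), 1 PM
Game 4: March 13 (Sun), 1 PM
Game 5: March 15 (Tue), 1 PM
Stone placement for AlphaGo: Aja Huang, amateur 6-dan (DeepMind)
Time control: 2 hours main time, then 3 byoyomi periods of 1 minute (games expected to take 4~5 hours)
Prize: 1.2 billion won ($1M)
Lee Sedol 9-dan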


#Lee Sedol 9-dan (1/3)

? ?

# ? 9 () - 60 ? ?

Lee Sedol 9-dan (2/3)

2 60 3

Lee Sedol 9-dan (3/3)

? ?

#AlphaGo

#

20 authors: David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel & Demis Hassabis
AlphaGo

#AlphaGo

#

1872 AlphaGo

#References
DeepMind, https://deepmind.com/alpha-go.html
Why the Chess Computer Deep Blue Played Like a Human, http://nautil.us/issue/18/genius/why-the-chess-computer-deep-blue-played-like-a-human
Monte Carlo Tree Search, http://mcts.ai/about/index.html
David Silver et al., Mastering the game of Go with deep neural networks and tree search, Nature, pp. 484-489, Jan. 2016
Volodymyr Mnih et al., Human-level control through deep reinforcement learning, Nature, pp. 529-533, Feb. 2015
AlphaGo, SPRi Issue Report, 2016.02

#Thank you

Youngsung Son
