AlphaGo, 새로운 시대의 시작

1
AlphaGo, 새로운 시대의 시작
2016.03.02
Youngsung Son

2
Agenda
 AlphaGo 사건
 바둑 프로그램의 한계
 AlphaGo 구성
 이세돌9단 대국 계획
 대국 관전 포인트
 AlphaGo 의의
 참고

3
https://m.facebook.com/story.php?story_fbid=10102619979696481&id=4
AlphaGo 사건

4
https://gogameguru.com/alpha-go-fan-hui/
Fan Hui 2단
Europe Champion 4회
(2013,2014,2015,2016)
AlphaGo 사건

5
 5번 대국
– 오전 공식경기: 제한시간 1시간, 30초 3회 초읽기
– 오후 비공식경기: 제한시간 없이 30초 3회 초읽기
 결과
– 오전 : Fan Hui 0-5 AlphaGo
– 오후 : Fan Hui 2-3 AlphaGo
AlphaGo 사건
http://www.yonhapnews.co.kr/bulletin/2016/02/15/0200000000AKR20160215002700007.HTML

6
https://gogameguru.com/alpha-go-fan-hui/
0 - 5
AlphaGo 사건

7
https://www.youtube.com/watch?v=SUbqykXVx0A
AlphaGo 사건

8
Alpha Go 사건
Mastering the game of Go with deep neural
networks and tree search – Nature, 2016

9
Agenda
 Alpha Go 사건
 Alpha Go 구성
 Alpha Go 시사점
 참고

10
http://time.com/3705316/deep-blue-kasparov/
체스마스터 Kasparov vs. IBM DeepBlue
1997.12
바둑 프로그램의 한계

11
http://nautil.us/issue/18/genius/why-the-chess-computer-deep-blue-played-like-a-human

12
http://www.chessgames.com/perl/chesscollection?cid=1014770

13
http://stanford.edu/~cpiech/cs221/apps/deepBlue.html https://www.uio.no/studier/emner/matnat/ifi/INF4130/h12/
undervisningsmateriale/chess-algorithms-theory-and-practice-ver2012.pdf
DeepBlue Chess Algorithm
Monte Carlo Tree SearchGame Tree

14
너무나도 많은 경우의 수

15
2081681993819799846
9947863334486277028
6522453884530548425
6394568209274196127
3801537852564845169
8519643907259916015
6281285460898883144
2712971531931755773
6620397247064840935
(19x19 board) 361!

16
Agenda
 AlphaGo 사건
 AlphaGo 구성
 AlphaGo 시사점
 참고

17
48 CPU 494 : 1 1202 CPU 5 : 0
온라인 KGS
바둑 고수(5~9단)
16만 기보
3000만 착점 학습
100만번 self대국
하루 3만 대국 학습
AlphaGo 구성

19
http://deepmind.com/alpha-go.html
AlphaGo 구성

20
http://deepmind.com/alpha-go.html
AlphaGo 구성

22
AlphaGo 구성
Deep Learning을 통한 착수 후보 선정

24
100,000 simulation against open source Go program, Pachi
AlphaGo 구성
Training 13-layer policy network from 30million positions
2𝜇s of Rollout policy with weight 𝜋, 3ms of Policy network

25
Agenda
 Alpha Go 사건
 Alpha Go 구성
 참고

26
이세돌 9단 대국 계획

27
 일정
– 1국 : 3월 9일 (수) 오후 1시
– 2국 : 3월 10일 (목) 오후 1시
– 3국 : 3월 12일 (토) 오후 1시
– 4국 : 3월 13일 (일) 오후 1시
– 5국 : 3월 15일 (화) 오후 1시
 상대 : Aja Huang 아마 6단 (DeepMind 직원)
 장소 : 서울 광화문 포시즌스 호텔 특별 대국장
 규칙
– 중국규칙, 제한 시간 2시간, 1분 초읽기 3회 (4~5시간
예상)
 상금 12억원($1M) (승자? 5판 모두 승리?)
이세돌 9단 대국 계획

28
Agenda
 Alpha Go 사건
 Alpha Go 구성
 참고

29
이세돌9단 대국 관전 포인트 (1/3)
이세돌의 흔들기?
알파고는
이창호 스타일?

30
중국 규칙
일곱집반 덤
2시간 60초 3번 초읽기

31
알파고도 실수를 할까?초반에 승부를 봐야?

32
Agenda
 AlphaGo 사건
 AlphaGo 구성
 AlphaGo 시사점
 참고

33
AlphaGo 시사점
성공한 기술 마케팅

34
개발팀 20명
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian
Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe,
John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore
Graepel & Demis Hassabis
AlphaGo 시사점

36
존 헨리
1872년
미국 웨스트버지니아
탤컷 터널 공사
증기 드릴과 터널 뚫기
경쟁후 탈진 사망
AlphaGo 시사점

37
References
 DeepMind, https://deepmind.com/alpha-go.html
 Why the Chess Computer Deep Blue Played Like a
Human, http://nautil.us/issue/18/genius/why-the-chess-
computer-deep-blue-played-like-a-human
 Monte Carlo Tree Search, http://mcts.ai/about/index.html
 David Silver, etc, “Mastering the game of Go with deep
neural networks and tree search”,” Nature, p.484-489,
Jan. 2016
 Volodymyr Mnih, etc,” Human-level control through
deep reinforcementlearning”, Nature, p.529-43, Feb.
2015
 김석원 외, “AlphaGo의 인공지능”, SPRi Issue Report,
2016.02

38
Thank you
Youngsung Son
ysson@etri.re.kr

AlphaGo, 새로운 시대의 시작

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (19)

Mehr von Young Sung Son

Mehr von Young Sung Son (6)

AlphaGo, 새로운 시대의 시작

Hinweis der Redaktion