Rich Sutton on Reinforcement Learning: Alpha Go Zero to 60 | by jurvetson

Steve Jurvetson

jurvetson

A fireside chat with Rich Sutton of Google DeepMind and the University of Alberta, one of the founding fathers of reinforcement learning. Here’s the video.

I started by exploring whether the AIs of the future will be human-centric or alien, and why. Why are goals and agency central to intelligence?

He was very excited about DeepMind’s new AlphaGo Zero results, in which the training of the game-playing AI improved dramatically once human training data sets were removed altogether. It bootstrapped itself from just the rules of the game and a series of AI vs. AI matches. It took just 72 hours. (Jump to video segment)

Dr. Sutton is considered one of the founding fathers of modern computational reinforcement learning, having made several significant contributions to the field, including temporal difference learning, policy gradient methods, and the Dyna architecture. In June 2017, Demis Hassabis announced that Sutton would co-lead a new Alberta office of DeepMind while maintaining his professorship at the University of Alberta.

Uploaded on December 8, 2017

Taken on October 26, 2017
