Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

•

0 gefällt mir•576 views

azzeddine chenine

an introduction to Deep Reinforcement Learning followed by a workshop on Deep Q-learning Algorithm in pytorch

Ingenieurwesen

Deep Reinforcement Learning
Azzeddine CHENINE
AI Research Engineer @instadeepai
An introduction workshop

Deep Reinforcement Learning
Azzeddine CHENINE
AI Research Engineer @instadeepai
What? How? What’s hot about it 🔥?

But What if…..
…Your task is manifested by a series of decisions to
reach or keep an optimal performance
10

Reinforcement Learning
• Building agents that are able to learn an optimal policy to preform a task within a
Markovian environment
…What ?
11

Reinforcement Learning
• Building agents that are able to learn an optimal policy to preform a task within a
Markovian environment
• In a Markovian environment the next state depends only on the current state and the
agent that will be preformed by the agent
…What ?
12

Reinforcement Learning
…How ?
Environment
Agent
14

Reinforcement Learning
…How ?
Environment
Agent
State
15

Reinforcement Learning
…How ?
Environment
Agent
Action
State
16

Reinforcement Learning
…How ?
Environment
Agent
Reward New State Action
17

Reinforcement Learning
…How ?
Environment
Agent
Reward
New State
Action
• Reach an optimal policy
𝝿
•
𝝿
can be deterministic or stochastic
• A deterministic version of
𝝿
can be derived from the
action value function Q(S,a)
• You are free to choose your policy type
18

What’s hot 🔥about DeepRL
• Reinforcement Learning existed since the early 80s
19

What’s hot 🔥about DeepRL
• Reinforcement Learning existed since the early 80s
• Reinforcement Learning before the hype of Deep Learning used to rely on Dynamic
programing Algorithms
20

What’s hot 🔥about DeepRL
Bio
Stocks Games
Robots
• Modern environments present complex action and state spaces
23

What’s hot 🔥about DeepRL
Bio
Stocks Games
Robots
• Deep Neural Networks are able to extract features from different state types
24
• Modern environments present complex action and state spaces

What’s hot 🔥about DeepRL
Bio
Stocks Games
Robots
• Deep Neural Networks are able to approximate functions that map an observation to
a desired output space
25
• Deep Neural Networks are able to extract features from different state types
• Modern environments present complex action and state spaces

DeepRL workshop
• Inspecting a dynamic programing version of Q-learning
• Inspecting limitation and Deep Neural network use case
• Implementing Deep Q-learning with Tensor
fl
ow Keras API and Pytorch
• Getting introduced to OpenAI GYM for reinforcement learning environments
• Visualizing the training and inference of a DQN agents
26

Other hot topics
• Multi-agent reinforcement learning
• Imitation learning and behaviour cloning
• The problem of generation in Deep RL
• Policy based methods: PPO, A2C, A3C…
• DeepRL frameworks: RLLib, TF Agents…
27

Resources
• Berkeley DeepRL Bootcamp on Youtube
• Reinforcement Learning, an introduction
• Udacity DeepRL Nanodegree if possible
• RL course by David silver on Youtube
• Open AI gym documentation
28

Weitere ähnliche Inhalte

Ähnlich wie Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

anintroductiontoreinforcementlearning-180912151720.pdfssuseradaf5f

An introduction to reinforcement learningSubrat Panda, PhD

Sippin: A Mobile Application Case Study presented at Techfest LouisvilleDawn Yankeelov

24.09.2021 Reinforcement Learning Algorithms.pptxManiMaran230751

Intro to Deep Reinforcement LearningKhaled Saleh

Deep Learning in RoboticsSungjoon Choi

“Reinforcement Learning: a Practical Introduction,” a Presentation from Micro...Edge AI and Vision Alliance

Orchestration, the conductor's scoreSalesforce Engineering

Introduction2drlShenglin Zhao

Fundamentals of Machine Learning Bootcamp - 24 Nov London 2014 Persontyle

Fundamentals of Machine Learning Bootcamp - 24 Nov London Persontyle

Horizon: Deep Reinforcement Learning at ScaleDatabricks

Bad Advice Unintended Consequences and Broken Paradigms - Think && Act Differ...Steve Werby

孫民/從電腦視覺看人工智慧 : 下一件大事台灣資料科學年會

Susan epstein at ibm csig speaker seriesdiannepatricia

Demystifying Machine Learning and Artificial IntelligenceEPCC, University of Edinburgh

How to Rescue Complex Projects from DisasterPerforce

Is Production RL at a tipping point?M Waleed Kadous

Keynote - From Monolith to Microservices - Lessons Learned in the Real WorldEran Stiller

Types of Artificial Intelligence.pptGEETHAS668001

Ähnlich wie Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day (20)

anintroductiontoreinforcementlearning-180912151720.pdf

An introduction to reinforcement learning

Sippin: A Mobile Application Case Study presented at Techfest Louisville

24.09.2021 Reinforcement Learning Algorithms.pptx

Intro to Deep Reinforcement Learning

Deep Learning in Robotics

“Reinforcement Learning: a Practical Introduction,” a Presentation from Micro...

Orchestration, the conductor's score

Introduction2drl

Fundamentals of Machine Learning Bootcamp - 24 Nov London 2014

Fundamentals of Machine Learning Bootcamp - 24 Nov London

Horizon: Deep Reinforcement Learning at Scale

Bad Advice Unintended Consequences and Broken Paradigms - Think && Act Differ...

孫民/從電腦視覺看人工智慧 : 下一件大事

Susan epstein at ibm csig speaker series

Demystifying Machine Learning and Artificial Intelligence

How to Rescue Complex Projects from Disaster

Is Production RL at a tipping point?

Keynote - From Monolith to Microservices - Lessons Learned in the Real World

Types of Artificial Intelligence.ppt

Kürzlich hochgeladen

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile

UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan

UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth

Introduction to IEEE STANDARDS and its different types.pptxupamatechverse

Introduction to Multiple Access Protocol.pptxupamatechverse

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Roadmap to Membership of RICS - Pathways and RoutesM Maged Hegazy, LLM, MBA, CCP, P3O

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis

Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control

Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Kürzlich hochgeladen (20)

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7

Coefficient of Thermal Expansion and their Importance.pptx

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working

UNIT-III FMM. DIMENSIONAL ANALYSIS

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...

Introduction to IEEE STANDARDS and its different types.pptx

Introduction to Multiple Access Protocol.pptx

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts

Roadmap to Membership of RICS - Pathways and Routes

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...

Water Industry Process Automation & Control Monthly - April 2024

Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts

Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

1. Hi

2. Deep Reinforcement Learning Azzeddine CHENINE AI Research Engineer @instadeepai An introduction workshop

3. Deep Reinforcement Learning Azzeddine CHENINE AI Research Engineer @instadeepai What? How? What’s hot about it 🔥?

4. 4

5. 5

6. 6

7. 7

8. 8

9. 9

10. But What if….. …Your task is manifested by a series of decisions to reach or keep an optimal performance 10

11. Reinforcement Learning • Building agents that are able to learn an optimal policy to preform a task within a Markovian environment …What ? 11

12. Reinforcement Learning • Building agents that are able to learn an optimal policy to preform a task within a Markovian environment • In a Markovian environment the next state depends only on the current state and the agent that will be preformed by the agent …What ? 12

13. Reinforcement Learning • Building agents that are able to learn an optimal policy to preform a task within a Markovian environment • In a Markovian environment the next state depends only on the current state and the agent that will be preformed by the agent …What ? • This task can be episodic or continues 13

14. Reinforcement Learning …How ? Environment Agent 14

15. Reinforcement Learning …How ? Environment Agent State 15

16. Reinforcement Learning …How ? Environment Agent Action State 16

17. Reinforcement Learning …How ? Environment Agent Reward New State Action 17

18. Reinforcement Learning …How ? Environment Agent Reward New State Action • Reach an optimal policy 𝝿 • 𝝿 can be deterministic or stochastic • A deterministic version of 𝝿 can be derived from the action value function Q(S,a) • You are free to choose your policy type 18

19. What’s hot 🔥about DeepRL • Reinforcement Learning existed since the early 80s 19

20. What’s hot 🔥about DeepRL • Reinforcement Learning existed since the early 80s • Reinforcement Learning before the hype of Deep Learning used to rely on Dynamic programing Algorithms 20

21. What’s hot 🔥about DeepRL • Reinforcement Learning existed since the early 80s • Reinforcement Learning before the hype of Deep Learning used to rely on Dynamic programing Algorithms • Monte-carlo, Sarsa (not salsa 💃), Q-learning, expected Sarsa…etc 21

22. What’s hot 🔥about DeepRL • Reinforcement Learning existed since the early 80s • Reinforcement Learning before the hype of Deep Learning used to rely on Dynamic programing Algorithms • Monte-carlo, Sarsa (not salsa 💃), Q-learning, expected Sarsa…etc • Data structures to hold reference for the actions values of each state 22

23. What’s hot 🔥about DeepRL Bio Stocks Games Robots • Modern environments present complex action and state spaces 23

24. What’s hot 🔥about DeepRL Bio Stocks Games Robots • Deep Neural Networks are able to extract features from different state types 24 • Modern environments present complex action and state spaces

25. What’s hot 🔥about DeepRL Bio Stocks Games Robots • Deep Neural Networks are able to approximate functions that map an observation to a desired output space 25 • Deep Neural Networks are able to extract features from different state types • Modern environments present complex action and state spaces

26. DeepRL workshop • Inspecting a dynamic programing version of Q-learning • Inspecting limitation and Deep Neural network use case • Implementing Deep Q-learning with Tensor fl ow Keras API and Pytorch • Getting introduced to OpenAI GYM for reinforcement learning environments • Visualizing the training and inference of a DQN agents 26

27. Other hot topics • Multi-agent reinforcement learning • Imitation learning and behaviour cloning • The problem of generation in Deep RL • Policy based methods: PPO, A2C, A3C… • DeepRL frameworks: RLLib, TF Agents… 27

28. Resources • Berkeley DeepRL Bootcamp on Youtube • Reinforcement Learning, an introduction • Udacity DeepRL Nanodegree if possible • RL course by David silver on Youtube • Open AI gym documentation 28

Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Ähnlich wie Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

Ähnlich wie Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day