PR-153: SNAIL: A Simple Neural Attentive Meta-Learner

•Als PPTX, PDF herunterladen•

1 gefällt mir•503 views

- Title: SNAIL: A Simple Neural Attentive Meta-Learner - Paper: https://arxiv.org/abs/1707.03141 - Youtube: https://youtu.be/zGrwpa5-_0Y Taekmin Kim, http://github.com/tantara

Technologie

A Simple Neural AttentIve Meta-
Learner
PR-153
Mar 31, 2019
Taekmin Kim
1

Machine Learning vs. Human
● Machine Learning
○ Try to learn data points
■ Supervised/Unsupervised Learning
■ Reinforcement Learning
● Human
○ Fast adaptation with prior knowledge
■ Few-shot learning
■ Generalization across tasks
3

● Related Work
○ LEARNING TO REINFORCEMENT LEARN
○ RL^2
○ MAML
○ Auto-Meta
Meta-Learner?
Multi-armed bandit problem
https://blog.floydhub.com/meta-rl/
4

Meta-RL
● Goal: Generalization across tasks
● Notations
○ T: Task distribution e.g., driving, multi-armed bandit problems
○ T_i: Specific task e.g., Sonata, Porsche, ...
○ x_t: state
○ a_t: action
5

RNN-based Meta-RL(Agent)
● sequence-to-sequence problem
○ refer to past experience
● Drawbacks:
○ Temporally-linear dependency
https://blog.floydhub.com/meta-rl/
6

Motivation
● Temporal(Causal) Convolution
○ depends on previous steps
● Soft Attention
○ weighted sum
https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling
https://medium.com/syncedreview/memory-attention-sequences-8522f531dd43
7

Temporal Convolution
https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling
Vanilla 1D TCs
(exponential) Dilated 1D TCs
Vanilla 1D Convolution Temporal Convolution(TC)
8

Attention is All you Need(2017)
https://mchromiak.github.io/articles/2017/Sep/12/Transformer-Attention-is-all-you-need/#.XJ6U6-szZ0c
https://medium.com/@hyponymous/paper-summary-attention-is-all-you-need-22c2c7a5e06
Q: Hidden State of Decoder
K: Hidden State of Encoder
V: (normalized) Weights
9
PR-049: https://www.youtube.com/watch?v=6zGgVIlStXs

Simple Neural AttentIve Learner
Building Blocks
● DenseBlock
● TCBlock
● AttentionBlock
11

Attention Block
Query: Hidden State of Decoder
Key: Hidden State of Encoder
Value: (normalized) Weights
14

Simple Neural AttentIve Learner
Building Blocks
● DenseBlock
● TCBlock
● AttentionBlock
16

Experiments
● Supervised Learning
○ Few-Shot Learning(Image Classification)
■ n-Way DATASET
■ m-shot
● Reinforcement Learning
○ Multi-Armed Bandits
○ Tabular MDPs
○ Continuous Control
○ Visual Navigation
17

Results: Few-shot Learning
MAML
Omniglot
18

MAML: Optimization-based Meta-RL
20
https://arxiv.org/abs/1703.03400
PR-094: MAML https://www.youtube.com/watch?v=fxJXXKZb-ik

Results: Tabular MDPs
23
가깝고도 먼 TRPO(이웅원 님)
https://www.slideshare.net/WoongwonLee/trpo-87165690

Summary
26
● SNAIL
○ Temporal Convolution
○ Soft Attention
● Meta-RL is promising
● Related Work
○ LEARNING TO REINFORCEMENT LEARN
○ RL^2
○ MAML
○ Auto-Meta
● Materials
○ Meta-RL(Chelsea Finn): http://rail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-20.pdf

Weitere ähnliche Inhalte

Was ist angesagt?

GPT-Xgohyunwoong

PR-231: A Simple Framework for Contrastive Learning of Visual RepresentationsJinwon Lee

LLaMA 2.pptxRkRahul16

Variational continual learningNguyen Giang

Meta-Learning PresentationAkshayaNagarajan10

AutoML - The Future of AINing Jiang

[AIoTLab]attention mechanism.pptxTuCaoMinh2

Meta-Learning with Memory-Augmented Neural Networks (MANN)Yeonsu Kim

【論文紹介】Seq2Seq (NIPS 2014)Tomoyuki Hioki

Recurrent Neural Networks, LSTM and GRUananth

【DL輪読会】Transformers are Sample Efficient World ModelsDeep Learning JP

Actor critic algorithmJie-Han Chen

Introduction to Transformer ModelNuwan Sriyantha Bandara

Large scale-lm-part1gohyunwoong

Deep learningKuppusamy P

Convolutional Neural Network (CNN)Muhammad Haroon

Speeding up Deep Learning training and inferenceThomas Delteil

rnn BASICSPriyanka Reddy

BERT: Bidirectional Encoder Representations from TransformersLiangqun Lu

[DL輪読会]ODT: Online Decision TransformerDeep Learning JP

Was ist angesagt? (20)

GPT-X

PR-231: A Simple Framework for Contrastive Learning of Visual Representations

LLaMA 2.pptx

Variational continual learning

Meta-Learning Presentation

AutoML - The Future of AI

[AIoTLab]attention mechanism.pptx

Meta-Learning with Memory-Augmented Neural Networks (MANN)

【論文紹介】Seq2Seq (NIPS 2014)

Recurrent Neural Networks, LSTM and GRU

【DL輪読会】Transformers are Sample Efficient World Models

Actor critic algorithm

Introduction to Transformer Model

Large scale-lm-part1

Deep learning

Convolutional Neural Network (CNN)

Speeding up Deep Learning training and inference

rnn BASICS

BERT: Bidirectional Encoder Representations from Transformers

[DL輪読会]ODT: Online Decision Transformer

Kürzlich hochgeladen

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

APIForce Zurich 5 April Automation LPDGMarianaLemus7

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

Build your next Gen AI Breakthrough - April 2024Neo4j

New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada

Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely

Pigging Solutions in Pet Food ManufacturingPigging Solutions

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

CloudStudio User manual (basic edition):comworks

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Kürzlich hochgeladen (20)

Injustice - Developers Among Us (SciFiDevCon 2024)

08448380779 Call Girls In Friends Colony Women Seeking Men

APIForce Zurich 5 April Automation LPDG

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Are Multi-Cloud and Serverless Good or Bad?

Build your next Gen AI Breakthrough - April 2024

New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024

Unlocking the Potential of the Cloud for IBM Power Systems

Pigging Solutions in Pet Food Manufacturing

Streamlining Python Development: A Guide to a Modern Project Setup

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

CloudStudio User manual (basic edition):

DMCC Future of Trade Web3 - Special Edition

Maximizing Board Effectiveness 2024 Webinar.pptx

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

PR-153: SNAIL: A Simple Neural Attentive Meta-Learner

1. A Simple Neural AttentIve Meta- Learner PR-153 Mar 31, 2019 Taekmin Kim 1

2. 2

3. Machine Learning vs. Human ● Machine Learning ○ Try to learn data points ■ Supervised/Unsupervised Learning ■ Reinforcement Learning ● Human ○ Fast adaptation with prior knowledge ■ Few-shot learning ■ Generalization across tasks 3

4. ● Related Work ○ LEARNING TO REINFORCEMENT LEARN ○ RL^2 ○ MAML ○ Auto-Meta Meta-Learner? Multi-armed bandit problem https://blog.floydhub.com/meta-rl/ 4

5. Meta-RL ● Goal: Generalization across tasks ● Notations ○ T: Task distribution e.g., driving, multi-armed bandit problems ○ T_i: Specific task e.g., Sonata, Porsche, ... ○ x_t: state ○ a_t: action 5

6. RNN-based Meta-RL(Agent) ● sequence-to-sequence problem ○ refer to past experience ● Drawbacks: ○ Temporally-linear dependency https://blog.floydhub.com/meta-rl/ 6

7. Motivation ● Temporal(Causal) Convolution ○ depends on previous steps ● Soft Attention ○ weighted sum https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling https://medium.com/syncedreview/memory-attention-sequences-8522f531dd43 7

8. Temporal Convolution https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling Vanilla 1D TCs (exponential) Dilated 1D TCs Vanilla 1D Convolution Temporal Convolution(TC) 8

9. Attention is All you Need(2017) https://mchromiak.github.io/articles/2017/Sep/12/Transformer-Attention-is-all-you-need/#.XJ6U6-szZ0c https://medium.com/@hyponymous/paper-summary-attention-is-all-you-need-22c2c7a5e06 Q: Hidden State of Decoder K: Hidden State of Encoder V: (normalized) Weights 9 PR-049: https://www.youtube.com/watch?v=6zGgVIlStXs

10. Motivation ● Temporal(Causal) Convolution ○ depends on previous steps ● Soft Attention ○ weighted sum https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling https://medium.com/syncedreview/memory-attention-sequences-8522f531dd43 10

11. Simple Neural AttentIve Learner Building Blocks ● DenseBlock ● TCBlock ● AttentionBlock 11

12. Dense Block 12

13. TC Block 13

14. Attention Block Query: Hidden State of Decoder Key: Hidden State of Encoder Value: (normalized) Weights 14

15. 15

16. Simple Neural AttentIve Learner Building Blocks ● DenseBlock ● TCBlock ● AttentionBlock 16

17. Experiments ● Supervised Learning ○ Few-Shot Learning(Image Classification) ■ n-Way DATASET ■ m-shot ● Reinforcement Learning ○ Multi-Armed Bandits ○ Tabular MDPs ○ Continuous Control ○ Visual Navigation 17

18. Results: Few-shot Learning MAML Omniglot 18

19. MAML 19

20. MAML: Optimization-based Meta-RL 20 https://arxiv.org/abs/1703.03400 PR-094: MAML https://www.youtube.com/watch?v=fxJXXKZb-ik

21. Results: Multi-armed Bandits MAML 21

22. Results: Visual Navigation 22

23. Results: Tabular MDPs 23 가깝고도 먼 TRPO(이웅원 님) https://www.slideshare.net/WoongwonLee/trpo-87165690

24. MAML 24

25. Results: Continous Control 25

26. Summary 26 ● SNAIL ○ Temporal Convolution ○ Soft Attention ● Meta-RL is promising ● Related Work ○ LEARNING TO REINFORCEMENT LEARN ○ RL^2 ○ MAML ○ Auto-Meta ● Materials ○ Meta-RL(Chelsea Finn): http://rail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-20.pdf

PR-153: SNAIL: A Simple Neural Attentive Meta-Learner

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

PR-153: SNAIL: A Simple Neural Attentive Meta-Learner