PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks

Paper review: "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" by C. Finn et al. (ICML 2017)
Presented at the TensorFlow-KR paper review forum (#PR12) by Taesu Kim
Paper link: https://arxiv.org/abs/1703.03400
Video link: https://youtu.be/fxJXXKZb-ik (in Korean)
http://www.neosapience.com

Model-Agnostic Meta Learning
for fast adaptation of deep networks
Presented by Taesu Kim
June 24, 2018
C. Finn, P. Abbeel, S. Levine

Meta-learning: Learning to learn
Meta is a prefix used in English to indicate a concept which is an abstraction behind
another concept, used to complete or add to the latter.
-- From Wikipedia

Naïve approach to fine-tune
Is this the best way? Are you sure?
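
As a contrast to naive fine-tuning, the sketch below illustrates the meta-learning idea from the paper on a deliberately tiny problem: learn an initial weight such that one gradient step on a few points from a new task already gives a low loss. Everything concrete here is an assumption made for illustration and is not from the slides: the task family y = a * x, the one-parameter model, the step sizes `alpha` and `beta`, and all function names. The meta-gradient is averaged over tasks (the paper sums), and the second-order term can be written exactly only because the model is a scalar.

```python
import random


def mse_and_grad(theta, xs, ys):
    """Mean-squared error of the model y = theta * x and its derivative in theta."""
    n = len(xs)
    loss = sum((theta * x - y) ** 2 for x, y in zip(xs, ys)) / n
    grad = sum(2.0 * x * (theta * x - y) for x, y in zip(xs, ys)) / n
    return loss, grad


def sample_task(rng, k=5):
    """Hypothetical task family: y = a * x with a slope a drawn per task."""
    a = rng.uniform(-2.0, 2.0)

    def draw():
        xs = [rng.uniform(-1.0, 1.0) for _ in range(k)]
        return xs, [a * x for x in xs]

    return draw(), draw()  # (support set, query set)


def maml_meta_gradient(theta, support, query, alpha):
    """Query loss after one inner step, and its exact derivative in theta."""
    xs_s, ys_s = support
    xs_q, ys_q = query
    _, g_support = mse_and_grad(theta, xs_s, ys_s)
    theta_adapted = theta - alpha * g_support  # inner (task-specific) update
    query_loss, g_query = mse_and_grad(theta_adapted, xs_q, ys_q)
    # Chain rule through the inner step:
    # d(theta_adapted)/d(theta) = 1 - alpha * (second derivative of support loss)
    hessian_support = 2.0 * sum(x * x for x in xs_s) / len(xs_s)
    return query_loss, g_query * (1.0 - alpha * hessian_support)


def meta_train(steps=200, tasks_per_step=8, alpha=0.1, beta=0.05, seed=0):
    """Outer loop: gradient descent on the post-adaptation query loss."""
    rng = random.Random(seed)
    theta = 3.0  # arbitrary starting initialization
    for _ in range(steps):
        meta_grad = 0.0
        for _ in range(tasks_per_step):
            support, query = sample_task(rng)
            _, g = maml_meta_gradient(theta, support, query, alpha)
            meta_grad += g
        theta -= beta * meta_grad / tasks_per_step
    return theta
```

Calling `meta_train()` returns the meta-learned initial weight. For a real network the derivative through the inner step would come from automatic differentiation rather than a hand-written second derivative.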

Few-shot supervised learning
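For regression: mean-squared error loss, Eq. (2) in the paper
For classification: cross-entropy loss, Eq. (3) in the paper
K-shot, N-way classification: K examples from each of N classes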
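
In the MAML paper the supervised task losses are a squared-error loss for regression and a cross-entropy loss for classification. A minimal sketch of both follows, assuming plain Python lists as inputs; the function names and the list-of-probabilities input format are illustrative choices, not from the slides.

```python
import math


def mse_loss(predictions, targets):
    """Sum of squared errors over the examples of one task."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets))


def cross_entropy_loss(class_probs, labels):
    """Negative log-likelihood of the true labels.

    class_probs: one list of predicted class probabilities per example.
    labels: the integer index of the true class for each example.
    """
    return -sum(math.log(probs[label]) for probs, label in zip(class_probs, labels))
```

For example, `mse_loss([1.0, 2.0], [1.0, 4.0])` sums the squared errors 0 and 4.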

For reinforcement learning
Quickly acquire a policy for a new task using only a small amount of experience in the test setting
Loss: negative expected reward over the episode, Eq. (4) in the paper

Experimental evaluation
› To answer the following questions
– Can MAML enable fast learning of new tasks?
– Can MAML be used for meta-learning in multiple different
domains, including supervised regression, classification, and
reinforcement learning?
– Can a model learned with MAML continue to improve with
additional gradient updates and/or examples?

Regression
› Sine wave fitting
– Amplitude: [0.1, 5.0]
– Phase: [0, π]
– x: sampled uniformly from [-5.0, 5.0]
– f(x): neural network with 2 hidden layers of size 40 with ReLU
– K = {5, 10, 20} data points per task
› Comparison
– Pretraining on all of the tasks
› Regress to random sine functions
› Fine-tune with gradient descent on the K provided points
– Oracle
– Additional multi-task and adaptation methods
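
The sine-wave task distribution above can be sketched as a small sampler. The amplitude, phase, and input ranges are taken from the slide; the functional form y = A * sin(x - phase), uniform sampling of amplitude and phase, and the function names are assumptions for illustration.

```python
import math
import random


def sample_sine_task(rng):
    """Draw one regression task: a sine wave with random amplitude and phase."""
    amplitude = rng.uniform(0.1, 5.0)
    phase = rng.uniform(0.0, math.pi)

    def f(x):
        # Assumed form of the target function
        return amplitude * math.sin(x - phase)

    return amplitude, phase, f


def sample_points(f, k, rng):
    """Draw K inputs uniformly from [-5.0, 5.0] and evaluate the task on them."""
    xs = [rng.uniform(-5.0, 5.0) for _ in range(k)]
    return xs, [f(x) for x in xs]
```

A K-shot training set for one task is then `sample_points(f, k, rng)` with k in {5, 10, 20}.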

Classification
› Omniglot dataset
– 20 instances of each of 1623 characters from 50 different alphabets
– Each instance was drawn by a different person
– Randomly selected 1200 characters for training, and the
remaining for testing
› MiniImagenet dataset
– 64 training classes, 12 validation classes, and 24 test classes
› N-way classification with 1 or 5 shots
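
An N-way, K-shot episode can be built by picking N classes and K labelled examples from each. The sketch below assumes the dataset is a dict mapping a class name to a list of examples; that layout, the extra `n_query` held-out examples per class, and the function name are hypothetical and not from the slides.

```python
import random


def sample_episode(dataset, n_way, k_shot, n_query, rng):
    """Return (support, query) lists of (example, label) pairs for one episode."""
    classes = rng.sample(sorted(dataset), n_way)  # pick N distinct classes
    support, query = [], []
    for label, cls in enumerate(classes):  # relabel classes as 0..N-1
        items = rng.sample(dataset[cls], k_shot + n_query)
        support += [(x, label) for x in items[:k_shot]]
        query += [(x, label) for x in items[k_shot:]]
    return support, query
```

With `n_way=5` and `k_shot=1` the support set holds 5 examples, one per class; the query examples are disjoint from the support examples.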

Reinforcement learning
› rllab benchmark suite
› Neural network policy with two hidden layers of size 100
with ReLU
› Gradient updates are computed using vanilla policy gradient (REINFORCE), with trust-region policy optimization (TRPO) as the meta-optimizer
› Comparison
– Pretraining one policy on all of the tasks and fine-tuning
– Training a policy from randomly initialized weights
– Oracle policy
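
To make the REINFORCE update concrete, here is a minimal sketch of the score-function gradient estimate. It is heavily simplified relative to the slide: a softmax policy over a few discrete actions in a one-step problem with fixed rewards stands in for the neural network policy and the rllab tasks, and no baseline is used. All names are illustrative.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_gradient(logits, rewards, n_samples, rng):
    """Monte Carlo estimate of the gradient of expected reward w.r.t. the logits."""
    probs = softmax(logits)
    n_actions = len(logits)
    grad = [0.0] * n_actions
    for _ in range(n_samples):
        action = rng.choices(range(n_actions), weights=probs)[0]
        reward = rewards[action]
        # Gradient of log pi(action) w.r.t. logits is one_hot(action) - probs
        for i in range(n_actions):
            indicator = 1.0 if i == action else 0.0
            grad[i] += reward * (indicator - probs[i]) / n_samples
    return grad
```

A gradient ascent step on the logits with this estimate raises the probability of higher-reward actions; in the episodic setting the reward would be the return summed over the trajectory.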

Reinforcement learning
› Locomotion
– High-dimensional locomotion tasks with the MuJoCo simulator

Follow-up research
› Meta-SGD: Learning to learn quickly for few-shot learning, Li et al., Sep 2017
› Recasting gradient-based meta-learning as hierarchical Bayes, Grant et al., ICLR 2018
› Gradient-based meta-learning with learned layerwise metric and subspace, Lee et al., ICML 2018
› Probabilistic Model-agnostic meta learning, Finn et al., Jun 2018
› Bayesian Model-Agnostic Meta-Learning, Kim et al., Jun 2018

Follow us:
Contact us:
contact@neosapience.com
For more information:
http://www.neosapience.com

Slides with no extracted body text:
4. How to find good initial weights, a = f(x)
5. Model-agnostic meta-learning algorithm
10. Regression
12. Classification
14. Reinforcement learning › 2D navigation
16. Reinforcement learning