Adversarial training Basics

•

3 gefällt mir•1,038 views

Shamane Siriwardhana

This is an introduction to GAN and WGAN

Daten & Analysen

BENJAMIN PETRY
bpetry@acm.org
www.bpetry.de
Adversarial Training
SHAMANE SIRIWARDHANA

Generative Adversarial Networks(GAN)
Basic Architecture

Min-Max Game
● Generator trying to Fool the discriminator
● Discriminator somehow needs to identify fake from real very well
● Something similar to min-max in game theory
Since it’s Adversary

Somehow Generator needs to fool the Discriminator

After the two way Optimization ...
Start End

Training the Generator
D is Fixed !
Yunjey Choi

Discriminator Optimization Summary
Yunjey Choi

Generator Optimization Summary
Yunjey Choi

Main Problem - Discriminator Saturation
● Discriminator is too Good :(
● There won’t be any chance for the generator to learn something
Yunjey Choi

● GAN's Task is to make the Generated Distribution(Pmodel) same as the Real data
Distribution(Preal)

● There are ways to measure similarity of two distributions Eg:
○ KL divergence
○ Jensen–Shannon divergence
We can easily prove that Optimization of GAN’s loss function is similar to reducing Jensen–Shannon
divergence between the two distributions

When we have an optimal discriminator
Optimization of the Loss = Minimizing the Jensen Shannon Divergence

GAN is not easy to Train !
● Non-convergence: the parameters oscillate, constantly destabilize and unlikely to arrive to
converge (Issues with Nash Equality).
● Mode collapse: generator collapses, leading to produce limited varieties of samples.

Yes! there are more stable methods right now !
❖ Wasserstein GAN
WGAN vs GAN - Similar in terms of Formality & Functionality
Only thing change is the Loss Function !

Now Loss Function is more of a Critic !
❖ Previously the Discriminator and the Generator are working against each
other
❖ But now discriminator is is trying to give the generator an Idea of how different
it’s generated data is deviate from the actual data distribution.
❖ No Log probabilities - No Diminish Gradients
❖ Uses EM(Earth Mover's Distance) distance to model the loss function !

Wasserstein Distance or EM Distance
This is a measurement about how much work that generator has to do to match the
distribution of the real images
This is why we call it a Critic!

Reducing the distance between generated samples and real samples
Generator
distribution
Real
distribution
Critic

We need to clip the weights in the discriminator
● f has to be a 1-Lipschitz function.
● To enforce the constraint, WGAN applies a very simple clipping to restrict the
maximum weight value in f
● The weights of the discriminator must be within a certain range controlled by the
hyperparameters
After every update we need to clip the weights 0f the discriminator

Solving the Vanishing Gradient Issues ..
More stable training ...

Resources
GAN - https://arxiv.org/abs/1406.2661
WGAN - https://arxiv.org/abs/1701.07875
Improved WGAN - https://arxiv.org/abs/1704.00028
Principal Method Of Training GAN - https://openreview.net/pdf?id=Hk4_qw5xe
Amazing series of Article By Jonathan Hui
https://medium.com/@jonathan_hui/gan-whats-generative-adversarial-networks-and-its-application-f39ed278ef09

What we are into this ..
❖ GANhas an amazing ability to enrich Reinforcement Learning such as…
1. Planning
2. Inverse Reinforcement

Imitation Learning
● Learning From Expert’s Demonstrations
● Something in between Supervised Learning and Deep Reinforcement Learning
● There is a clear connection between GAN and Imitation Learning

Generative Adversarial Imitation Learning

What is the difference !
1. Instead of just Images we have expert’s trajectories which means states and
action pairs
2. Now the Generator is an AI Agent

Weitere ähnliche Inhalte

Was ist angesagt?

Machine LearningRahul Kumar

Adversarial Attacks and Defenses in Deep Learning.pdfMichelleHoogenhout

Reinforcement learningDing Li

Ensemble Learning and Random ForestsCloudxLab

Hyperparameter TuningJon Lederman

Convolution Neural Network (CNN)Suraj Aavula

Deep learning for person re-identification哲东郑

Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...Melanie Swan

Adversarial Attacks for Recommender SystemsWQ Fan

Application of Machine Learning in Cyber SecurityDr. Umesh Rao.Hodeghatta

Machine Learning Final presentation AyanaRukasar

Recommender SystemsGirish Khanzode

Decision Tree LearningMilind Gokhale

Fair Recommender Systems Sharmistha Chatterjee

Presentation on supervised learningTonmoy Bhagawati

Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Simplilearn

Recommender systemNilotpal Pramanik

Explainable AI in Industry (FAT* 2020 Tutorial)Krishnaram Kenthapadi

Deep Learning for Recommender SystemsJustin Basilico

Predicting house priceDivya Tiwari

Was ist angesagt? (20)

Machine Learning

Adversarial Attacks and Defenses in Deep Learning.pdf

Reinforcement learning

Ensemble Learning and Random Forests

Hyperparameter Tuning

Convolution Neural Network (CNN)

Deep learning for person re-identification

Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...

Adversarial Attacks for Recommender Systems

Application of Machine Learning in Cyber Security

Machine Learning Final presentation

Recommender Systems

Decision Tree Learning

Fair Recommender Systems

Presentation on supervised learning

Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...

Recommender system

Explainable AI in Industry (FAT* 2020 Tutorial)

Deep Learning for Recommender Systems

Predicting house price

Ähnlich wie Adversarial training Basics

Generative adversarial networks slides- Auckland AI & ML MeetupShamane Siriwardhana

Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Universitat Politècnica de Catalunya

11_gan.pdfAnkush84837

gan.pdfDr.rukmani Devi

ICASSP 2018 Tutorial: Generative Adversarial Network and its Applications to ...宏毅李

Deep Learning for Computer Vision: Generative models and adversarial training...Universitat Politècnica de Catalunya

Generative adversarial network_Ayadi_AlaeddineDeep Learning Italia

Reading group gan - 20170417Shuai Zhang

GAN.pdfNiharikaThakur32

Vladislav Kolbasin “Introduction to Generative Adversarial Networks (GANs)”Lviv Startup Club

Generative Adversarial Networks and Their Applications in Medical ImagingSanghoon Hong

Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...Universitat Politècnica de Catalunya

Deep Generative Models II (DLAI D10L1 2017 UPC Deep Learning for Artificial I...Universitat Politècnica de Catalunya

Generative Adversarial Networks and Their ApplicationsArtifacia

Seeing what a gan cannot generate: paper reviewQuantUniversity

GAN.pptxHemanthKonamanchili1

Alberto Massidda - Scenes from a memory - Codemotion Rome 2019Codemotion

Generative Adversarial Network (GAN) for Image SynthesisRiwaz Mahat

Generative Adversarial Network (GANs).kgandham169

Harnessing the power of Generative Adversarial Networks (GANs) for supervised...Scaleway

Ähnlich wie Adversarial training Basics (20)

Generative adversarial networks slides- Auckland AI & ML Meetup

Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018

11_gan.pdf

gan.pdf

ICASSP 2018 Tutorial: Generative Adversarial Network and its Applications to ...

Deep Learning for Computer Vision: Generative models and adversarial training...

Generative adversarial network_Ayadi_Alaeddine

Reading group gan - 20170417

GAN.pdf

Vladislav Kolbasin “Introduction to Generative Adversarial Networks (GANs)”

Generative Adversarial Networks and Their Applications in Medical Imaging

Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...

Deep Generative Models II (DLAI D10L1 2017 UPC Deep Learning for Artificial I...

Generative Adversarial Networks and Their Applications

Seeing what a gan cannot generate: paper review

GAN.pptx

Alberto Massidda - Scenes from a memory - Codemotion Rome 2019

Generative Adversarial Network (GAN) for Image Synthesis

Generative Adversarial Network (GANs).

Harnessing the power of Generative Adversarial Networks (GANs) for supervised...

Kürzlich hochgeladen

Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7

Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter

Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2

Real-Time AI Streaming - AI Max PrincetonTimothy Spann

办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort

Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research

GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch

detection and classification of knee osteoarthritis.pptxAleenaJamil4

Learn How Data Science Changes Our WorldEduminds Learning

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Universitat Politècnica de Catalunya

20240419 - Measurecamp Amsterdam - SAM.pdfHuman37

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort

RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss

Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly

ASML's Taxonomy Adventure by Daniel Cantervoginip

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics

April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics

Kürzlich hochgeladen (20)

Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...

Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...

Identifying Appropriate Test Statistics Involving Population Mean

Real-Time AI Streaming - AI Max Princeton

办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service

Biometric Authentication: The Evolution, Applications, Benefits and Challenge...

GA4 Without Cookies [Measure Camp AMS]

detection and classification of knee osteoarthritis.pptx

Learn How Data Science Changes Our World

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)

20240419 - Measurecamp Amsterdam - SAM.pdf

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)

RABBIT: A CLI tool for identifying bots based on their GitHub events.

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理

Generative AI for Social Good at Open Data Science East 2024

ASML's Taxonomy Adventure by Daniel Canter

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...

April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...

Adversarial training Basics

1. BENJAMIN PETRY bpetry@acm.org www.bpetry.de Adversarial Training SHAMANE SIRIWARDHANA

2. Discriminative (CNN) Yunjey Choi

3. Generative (GAN,VAE) Yunjey Choi

4. Generative Adversarial Networks(GAN) Basic Architecture

5. Godfather! Godfather !

6. Why Generative Models are Important

7. Insilico Medicine

8. Yes.. This is possible

9. Cycle GAN - Realtime

10. Running Human

11.

12. Min-Max Game ● Generator trying to Fool the discriminator ● Discriminator somehow needs to identify fake from real very well ● Something similar to min-max in game theory Since it’s Adversary

13. Somehow Generator needs to fool the Discriminator

14. Two way Optimization

15. Discriminator

16. Generator

17. After the two way Optimization ... Start End

18.

19. Discriminator & Generator Yunjey Choi

20. Training the Discriminator Yunjey Choi

21. Training the Discriminator Yunjey Choi

22. Training the Generator D is Fixed ! Yunjey Choi

23. Discriminator Optimization Summary Yunjey Choi

24. Generator Optimization Summary Yunjey Choi

25. Visualization

26. Main Problem - Discriminator Saturation ● Discriminator is too Good :( ● There won’t be any chance for the generator to learn something Yunjey Choi

27. Summary of the training process

28. Why GAN Works ? No Magic

29.

30. ● GAN's Task is to make the Generated Distribution(Pmodel) same as the Real data Distribution(Preal)

31. ● There are ways to measure similarity of two distributions Eg: ○ KL divergence ○ Jensen–Shannon divergence We can easily prove that Optimization of GAN’s loss function is similar to reducing Jensen–Shannon divergence between the two distributions

32.

33. When we have an optimal discriminator Optimization of the Loss = Minimizing the Jensen Shannon Divergence

34. Yeah !

35. GAN is not easy to Train ! ● Non-convergence: the parameters oscillate, constantly destabilize and unlikely to arrive to converge (Issues with Nash Equality). ● Mode collapse: generator collapses, leading to produce limited varieties of samples.

36.

37. Yes! there are more stable methods right now ! ❖ Wasserstein GAN WGAN vs GAN - Similar in terms of Formality & Functionality Only thing change is the Loss Function !

38. Now Loss Function is more of a Critic ! ❖ Previously the Discriminator and the Generator are working against each other ❖ But now discriminator is is trying to give the generator an Idea of how different it’s generated data is deviate from the actual data distribution. ❖ No Log probabilities - No Diminish Gradients ❖ Uses EM(Earth Mover's Distance) distance to model the loss function !

39. Wasserstein Distance or EM Distance This is a measurement about how much work that generator has to do to match the distribution of the real images This is why we call it a Critic!

40. Reducing the distance between generated samples and real samples Generator distribution Real distribution Critic

41. WGAN Architecture

42. WGAN vs GAN

43. Finally GAN : WGAN :

44. We need to clip the weights in the discriminator ● f has to be a 1-Lipschitz function. ● To enforce the constraint, WGAN applies a very simple clipping to restrict the maximum weight value in f ● The weights of the discriminator must be within a certain range controlled by the hyperparameters After every update we need to clip the weights 0f the discriminator

45.

46. Solving the Vanishing Gradient Issues .. More stable training ...

47.

48.

49. Resources GAN - https://arxiv.org/abs/1406.2661 WGAN - https://arxiv.org/abs/1701.07875 Improved WGAN - https://arxiv.org/abs/1704.00028 Principal Method Of Training GAN - https://openreview.net/pdf?id=Hk4_qw5xe Amazing series of Article By Jonathan Hui https://medium.com/@jonathan_hui/gan-whats-generative-adversarial-networks-and-its-application-f39ed278ef09

50. What we are into this .. ❖ GANhas an amazing ability to enrich Reinforcement Learning such as… 1. Planning 2. Inverse Reinforcement

51. Imitation Learning ● Learning From Expert’s Demonstrations ● Something in between Supervised Learning and Deep Reinforcement Learning ● There is a clear connection between GAN and Imitation Learning

52. Generative Adversarial Imitation Learning

53. What is the difference ! 1. Instead of just Images we have expert’s trajectories which means states and action pairs 2. Now the Generator is an AI Agent

54. Generative Adversarial Imitation Learning

55. Target Driven Visual Navigation Extra

56. Target Driven Visual Navigation

Adversarial training Basics

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Adversarial training Basics

Ähnlich wie Adversarial training Basics (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Adversarial training Basics