This document provides an overview of autoencoders and variational autoencoders. It discusses how principal component analysis (PCA) is related to linear autoencoders and can be performed using backpropagation. Deep and nonlinear autoencoders are also covered. The document then introduces variational autoencoders, which combine variational inference with autoencoders to allow for probabilistic latent space modeling. It explains how variational autoencoders are trained using backpropagation through reparameterization to maximize the evidence lower bound.
4. PCA for dimensionality reduction
● The $u$ (with $\|u\| = 1$) that maximizes the variance of PC1, $\max_u \mathrm{Var}(Xu)$,
● also minimizes the reconstruction error $\|X - Xuu^\top\|_F^2$
○ Note: this is not the same as OLS, which minimizes the vertical residuals $\sum_i (y_i - x_i^\top \beta)^2$; PCA minimizes the perpendicular distances to the fitted subspace
There are efficient solvers for this, but we could also use backpropagation
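A minimal sketch of the "efficient solver" route, using NumPy's SVD on centered data (the toy data and variable names are illustrative assumptions). It checks both claims above: PC1's direction maximizes variance and minimizes rank-1 reconstruction error.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5)) @ rng.normal(size=(5, 5))  # toy correlated data
Xc = X - X.mean(axis=0)                                  # PCA assumes centered data

# Efficient solver: right singular vectors of Xc are the principal directions
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
u1 = Vt[0]                                               # direction of PC1

# PC1 maximizes variance: Var(Xc @ u1) equals S[0]^2 / (n - 1)
print((Xc @ u1).var(ddof=1), S[0] ** 2 / (len(Xc) - 1))  # equal

# ...and the same u1 minimizes the rank-1 reconstruction error ||Xc - Xc u1 u1^T||_F^2,
# which equals the energy in the remaining singular values
recon_err = np.linalg.norm(Xc - np.outer(Xc @ u1, u1)) ** 2
print(recon_err, (S[1:] ** 2).sum())                     # equal
```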
5. PCA through backpropagation
● [Figure: a network with an input layer, a narrow hidden bottleneck, and an output layer, trained to reproduce its own input]
● This is an autoencoder
● If the neurons are linear, it is similar to PCA (see the sketch after this list)
○ Caveat: PCs are orthogonal, autoencoded components are not, but they will span the same space
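A hedged illustration of that caveat (PyTorch; the data, sizes, and hyperparameters are assumptions, not from the slides): a linear autoencoder trained by backpropagation learns a bottleneck whose span matches the top principal subspace, even though its individual directions are not orthogonal.

```python
import numpy as np
import torch

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10)) @ rng.normal(size=(10, 10))  # toy data
X = torch.tensor(X - X.mean(axis=0), dtype=torch.float32)    # centered

k = 3                                     # bottleneck width
enc = torch.nn.Linear(10, k, bias=False)  # "linear neurons": no activation
dec = torch.nn.Linear(k, 10, bias=False)
opt = torch.optim.Adam(list(enc.parameters()) + list(dec.parameters()), lr=1e-2)

for _ in range(2000):
    opt.zero_grad()
    loss = ((dec(enc(X)) - X) ** 2).mean()  # reconstruction error
    loss.backward()
    opt.step()

# Compare the learned subspace with the top-k principal subspace: the
# cosines of the principal angles should be near 1 even though the
# learned directions themselves are not orthonormal.
W = dec.weight.detach().numpy()                  # 10 x k decoder columns
_, _, Vt = np.linalg.svd(X.numpy(), full_matrices=False)
Q_pca, _ = np.linalg.qr(Vt[:k].T)
Q_ae, _ = np.linalg.qr(W)
print(np.linalg.svd(Q_pca.T @ Q_ae, compute_uv=False))  # ≈ [1, 1, 1]
```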
12. Many variations of autoencoders
● Sparse autoencoders
● Denoising autoencoders (sketched after this list)
● Convolutional autoencoders
○ U-Net is a sort of autoencoder
● And more…
● I’d like to introduce Variational Autoencoders
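To make one of these variants concrete, here is a hedged sketch of the denoising idea: corrupt the input with noise, but compute the loss against the clean target. The architecture, sizes, and noise level are illustrative assumptions.

```python
import torch

# Illustrative sizes: flattened 28x28 images -> 64-d code -> reconstruction
model = torch.nn.Sequential(
    torch.nn.Linear(784, 64), torch.nn.ReLU(),
    torch.nn.Linear(64, 784),
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

def denoising_step(x_clean, noise_std=0.3):
    x_noisy = x_clean + noise_std * torch.randn_like(x_clean)  # corrupt the input
    opt.zero_grad()
    loss = ((model(x_noisy) - x_clean) ** 2).mean()  # reconstruct the CLEAN target
    loss.backward()
    opt.step()
    return loss.item()

x = torch.rand(128, 784)  # stand-in batch; real use would feed image data
for _ in range(100):
    denoising_step(x)
```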
16. Variational Inference (quick overview)
● Generative model: a latent variable $z$ produces an observation $x$, i.e. $p(x, z) = p(x \mid z)\,p(z)$
● We want the posterior $p(z \mid x) = p(x \mid z)\,p(z)\,/\,p(x)$
○ The evidence $p(x) = \int p(x \mid z)\,p(z)\,dz$ is intractable: problematic...
● Variational Inference solution: approximate $p(z \mid x)$ with a $q(z)$, chosen to be a distribution we can work with
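The standard way to turn this approximation into a training objective (implied by the overview's mention of the evidence lower bound; this is the textbook identity, not something specific to these slides) is to decompose the log evidence:

```latex
\log p(x)
  = \underbrace{\mathbb{E}_{q(z)}\!\left[\log p(x \mid z)\right]
    - \mathrm{KL}\!\left(q(z) \,\|\, p(z)\right)}_{\text{ELBO}}
  + \mathrm{KL}\!\left(q(z) \,\|\, p(z \mid x)\right)
```

Since the last KL term is nonnegative, maximizing the ELBO over $q$ both tightens the bound on $\log p(x)$ and pushes $q(z)$ toward the intractable posterior.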
17. Side note on information theory
● Information
○ “How many bits do we need to represent event x if we optimized for p(x)?”
● Entropy
○ “What is the expected amount of information in each event drawn from p(x)?” (how many bits?)
● Cross-entropy
○ “What is the expected amount of information needed to represent events from p(x) if we optimized for q(x)?” (how many bits?)
● Kullback-Leibler divergence: “cross-entropy - entropy”
○ “How many more bits will we need to represent events from p(x) if we optimized for q(x)?”
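In symbols (a standard rendering of the four definitions above, using base-2 logs to match the "bits" phrasing):

```latex
I(x) = -\log_2 p(x), \qquad
H(p) = \mathbb{E}_{x \sim p}\!\left[-\log_2 p(x)\right], \qquad
H(p, q) = \mathbb{E}_{x \sim p}\!\left[-\log_2 q(x)\right],
```
```latex
\mathrm{KL}(p \,\|\, q) = H(p, q) - H(p)
  = \mathbb{E}_{x \sim p}\!\left[\log_2 \frac{p(x)}{q(x)}\right] \ge 0.
```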