2. STACKED WHAT-WHERE AUTO-ENCODERS
Existing popular approaches:
• Pre-train auto-encoders in a layer-wise fashion, then fine-tune the entire stack of encoders in a supervised, discriminative manner.
• The deep Boltzmann machine (DBM) model: provides a unified mechanism for unsupervised and supervised learning, but exhibits poor convergence and mixing properties, ultimately due to its reliance on sampling during training.
In contrast, the SWWAE integrates the discriminative and generative pathways and provides a unified approach to supervised, semi-supervised and unsupervised learning without relying on sampling during training.
3. STACKED WHAT-WHERE AUTO-ENCODERS
• “what” variables inform the next layer about the content, with incomplete information about position
• “where” variables inform the corresponding feed-back decoder about where the interesting (dominant) features are located (see the pooling sketch below)
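In a max-pooling encoder, “what” is the pooled maximum and “where” is the argmax switch location that ordinary max-pooling discards. A minimal PyTorch sketch of this decomposition on toy shapes (not the paper's full architecture):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 1, 8, 8)  # toy feature map: (batch, channels, H, W)

# "what": pooled max values; "where": flat argmax switch locations
what, where = F.max_pool2d(x, kernel_size=2, return_indices=True)

# the feed-back decoder uses "where" to scatter "what" back to the
# original positions; all other entries are filled with zeros
recon = F.max_unpool2d(what, where, kernel_size=2)

print(what.shape, where.shape, recon.shape)
# torch.Size([1, 1, 4, 4]) torch.Size([1, 1, 4, 4]) torch.Size([1, 1, 8, 8])
```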
5. MODEL ARCHITECTURE
The loss combines three terms, L = λNLL·LNLL + λL2rec·LL2rec + λL2M·LL2M, and switching between the three modalities amounts to choosing the λ's (see the sketch after this list):
• Supervised learning: mask out the entire Deconvnet pathway by setting λL2* to 0; the SWWAE falls back to a vanilla Convnet.
• Unsupervised learning: nullify the fully-connected layers on top of the Convnet together with the softmax classifier by setting λNLL = 0; in this setting the SWWAE is equivalent to a deep convolutional auto-encoder.
• Semi-supervised learning: all three terms of the loss are active; the gradient contributions from the Deconvnet can be interpreted as an information-preserving regularizer.
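A minimal sketch of that objective and the three settings, assuming a hypothetical helper with MSE for the L2 terms (the name, signature and loss choices are illustrative, not from the paper):

```python
import torch.nn.functional as F

def swwae_loss(logits, targets, x, x_rec, feats, feats_rec,
               lam_nll=1.0, lam_l2rec=1.0, lam_l2m=1.0):
    """L = lam_nll*L_NLL + lam_l2rec*L_L2rec + lam_l2m*L_L2M.
    lam_l2rec = lam_l2m = 0  -> vanilla Convnet (supervised)
    lam_nll = 0              -> deep convolutional auto-encoder (unsupervised)
    all three > 0            -> semi-supervised
    """
    l_nll = F.cross_entropy(logits, targets)        # classification term
    l_l2rec = F.mse_loss(x_rec, x)                  # input-level reconstruction
    l_l2m = sum(F.mse_loss(fr, f)                   # intermediate reconstructions
                for f, fr in zip(feats, feats_rec))
    return lam_nll * l_nll + lam_l2rec * l_l2rec + lam_l2m * l_l2m
```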
6. EXPERIMENTS
NECESSITY OF “WHERE”
1. “where” carries information that is critical for reconstruction; one can barely obtain well-reconstructed images without preserving “where”.
2. This experiment can also be regarded as an example of using the SWWAE for generative purposes.
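One way to see the effect in isolation is to compare unpooling with the true switches against unpooling with degenerate switches (a toy single-layer sketch, not the paper's experiment):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 1, 8, 8)
what, where = F.max_pool2d(x, 2, return_indices=True)

# decode with the true "where": each max value returns to its position
good = F.max_unpool2d(what, where, 2)

# decode with degenerate "where": send every value to its window's
# top-left corner, i.e. keep "what" but throw the positions away
H, W = x.shape[-2:]
tl = torch.arange(0, H, 2).view(-1, 1) * W + torch.arange(0, W, 2)
bad = F.max_unpool2d(what, tl.view(1, 1, H // 2, W // 2).expand_as(where), 2)

mask = good != 0
print(torch.allclose(good[mask], x[mask]))  # True: maxima restored in place
print(torch.equal(good, bad))               # False: same "what", wrong layout
```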
7. EXPERIMENTS
INVARIANCE AND EQUIVARIANCE
1. “where” learns a highly localized representation: each element of “where” responds approximately linearly to pixel-level translation in either the horizontal or the vertical direction, and learns to be invariant to the other.
2. “what” learns to be locally stable and exhibits strong invariance to input-level translation.
Figure panels: (a) “what” of horizontally translated digits versus original digits; (b) “where” of horizontally translated digits versus original digits; (c) “what” of vertically translated digits versus original digits; (d) “where” of vertically translated digits versus original digits.
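A toy probe of the same behaviour on a single pooling layer (a synthetic smooth blob rather than the paper's trained SWWAE on MNIST digits):

```python
import torch
import torch.nn.functional as F

# smooth blob as a stand-in digit, so the maxima track translation cleanly
ys, xs = torch.meshgrid(torch.arange(28.0), torch.arange(28.0), indexing="ij")
img = torch.exp(-((ys - 14) ** 2 + (xs - 12) ** 2) / 20.0).view(1, 1, 28, 28)
shifted = torch.roll(img, shifts=2, dims=-1)  # 2-pixel horizontal translation

what0, where0 = F.max_pool2d(img, 4, return_indices=True)
what1, where1 = F.max_pool2d(shifted, 4, return_indices=True)

# inspect the window containing the blob's peak (pooled coords 14//4, 12//4)
r, c = 3, 3
print(what1[0, 0, r, c] - what0[0, 0, r, c])    # 0.0: "what" is invariant
print(where1[0, 0, r, c] - where0[0, 0, r, c])  # 2: "where" moves with the shift
```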
11. EXPERIMENTS
LARGE SCALE EXPERIMENTS
(To compare with results from other approaches, we perform the experiments in the common experimental setting that adopts only contrast normalization, small translation and horizontal mirroring for data preprocessing.)
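That preprocessing could look roughly like the following torchvision sketch, assuming 32×32 CIFAR-style images (the exact contrast-normalization scheme and translation range are assumptions, not taken from the paper):

```python
import torchvision.transforms as T

preprocess = T.Compose([
    T.RandomCrop(32, padding=2),   # small translations via a padded random crop
    T.RandomHorizontalFlip(),      # horizontal mirroring
    T.ToTensor(),
    # per-image contrast normalization (a common stand-in for the paper's scheme)
    T.Lambda(lambda x: (x - x.mean()) / (x.std() + 1e-8)),
])
```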