Deep Learning for Video: Motion Estimation (UPC 2018)

•

1 gefällt mir•2,316 views

https://mcv-m6-video.github.io/deepvideo-2018/ Overview of deep learning solutions for video processing. Part of a series of slides covering topics like action recognition, action detection, object tracking, object detection, scene segmentation, language and learning from videos. Prepared for the Master in Computer Vision Barcelona: http://pagines.uab.cat/mcv/

Daten & Analysen

@DocXavi
Xavier Giró-i-Nieto
[http://pagines.uab.cat/mcv/]
Module 6
Deep Learning for Video:
Motion Estimation
22nd March 2018

● MSc course (2017)
● BSc course (2018)
2
Deep Learning online courses by UPC:
● 1st edition (2016)
● 2nd edition (2017)
● 3rd edition (2018)
● 1st edition (2017)
● 2nd edition (2018)
Next edition Autumn 2018 Next edition Winter/Spring 2019Summer School (late June 2018)

Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015 3
End to end supervised learning of optical flow.
Motion: Optical Flow: FlowNet

4
Option A: Stack both input images together and feed them through a generic network.
Motion: Optical Flow: FlowNet (encoder)
Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015

5
Option B: Create two separate, yet identical processing streams for the two images and combine them at a
later stage.
Motion: Optical Flow: FlowNet (encoder)
Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015

6
Option B: Create two separate, yet identical processing streams for the two images and combine them at a
later stage.
Correlation layer:
Convolution of data patches from the layers to combine.
Motion: Optical Flow: FlowNet (encoder)
Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015

7
Upconvolutional layers: Unpooling features maps + convolution.
Upconvolutioned feature maps are concatenated with the corresponding map from the contractive part.
Motion: Optical Flow: FlowNet (decoder)
Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015

8
Convnets trained on these unrealistic data generalize well to existing datasets such as Sintel and KITTI.
Data
augmentation
Motion: Optical Flow: FlowNet (synthetic)
Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical
Flow With Convolutional Networks. ICCV 2015

9
Ilg, Eddy, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox.
"Flownet 2.0: Evolution of optical flow estimation with deep networks." CVPR 2017. [code]

10
Motion: Optical Flow: FlowNet 2.0
Ilg, Eddy, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. "Flownet 2.0: Evolution of
optical flow estimation with deep networks." CVPR 2017. [code]

11
Motion: MP-Net
Tokmakov, Pavel, Karteek Alahari, and Cordelia Schmid. "Learning motion patterns in videos." CVPR 2017
Predict directly if the object is in motion, instead of the optical flow..

Weitere ähnliche Inhalte

Mehr von Universitat Politècnica de Catalunya

Discovery and Learning of Navigation Goals from Pixels in MinecraftUniversitat Politècnica de Catalunya

Learn2Sign : Sign language recognition and translation using human keypoint e...Universitat Politècnica de Catalunya

Intepretability / Explainable AI for Deep Neural NetworksUniversitat Politècnica de Catalunya

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Universitat Politècnica de Catalunya

Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Universitat Politècnica de Catalunya

Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Universitat Politècnica de Catalunya

Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Universitat Politècnica de Catalunya

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

Curriculum Learning for Recurrent Video Object SegmentationUniversitat Politècnica de Catalunya

Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Universitat Politècnica de Catalunya

Deep Learning Representations for All - Xavier Giro-i-Nieto - IRI Barcelona 2020Universitat Politècnica de Catalunya

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and...Universitat Politècnica de Catalunya

Object Detection with Deep Learning - Xavier Giro-i-Nieto - UPC School Barcel...Universitat Politècnica de Catalunya

Self-supervised Audiovisual Learning 2020 - Xavier Giro-i-Nieto - UPC Telecom...Universitat Politècnica de Catalunya

Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC BarcelonaUniversitat Politècnica de Catalunya

Deep Video Object Tracking 2020 - Xavier Giro - UPC TelecomBCN BarcelonaUniversitat Politècnica de Catalunya

Neural Architectures for Video EncodingUniversitat Politècnica de Catalunya

Recurrent Neural Networks RNN - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Mehr von Universitat Politècnica de Catalunya (20)

Discovery and Learning of Navigation Goals from Pixels in Minecraft

Learn2Sign : Sign language recognition and translation using human keypoint e...

Intepretability / Explainable AI for Deep Neural Networks

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020

Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...

Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...

Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020

Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...

Curriculum Learning for Recurrent Video Object Segmentation

Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020

Deep Learning Representations for All - Xavier Giro-i-Nieto - IRI Barcelona 2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and...

Object Detection with Deep Learning - Xavier Giro-i-Nieto - UPC School Barcel...

Self-supervised Audiovisual Learning 2020 - Xavier Giro-i-Nieto - UPC Telecom...

Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona

Deep Video Object Tracking 2020 - Xavier Giro - UPC TelecomBCN Barcelona

Neural Architectures for Video Encoding

Recurrent Neural Networks RNN - Xavier Giro - UPC TelecomBCN Barcelona 2020

Kürzlich hochgeladen

Week-01-2.ppt BBB human Computer interactionfulawalesam

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY

Halmar dropshipping via API with DroFxolyaivanovalion

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692

Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Riyadh +966572737505 get cytotec

Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson

Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

Ravak dropshipping via API with DroFx.pptxolyaivanovalion

Data-Analysis for Chicago Crime Data 2023ymrp368

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls

Carero dropshipping via API with DroFx.pptxolyaivanovalion

Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823

Invezz.com - Grow your wealth with trading signalsInvezz1

Kürzlich hochgeladen (20)

Week-01-2.ppt BBB human Computer interaction

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...

Halmar dropshipping via API with DroFx

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Schema on read is obsolete. Welcome metaprogramming..pdf

Determinants of health, dimensions of health, positive health and spectrum of...

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

Ravak dropshipping via API with DroFx.pptx

Data-Analysis for Chicago Crime Data 2023

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service

Carero dropshipping via API with DroFx.pptx

Best VIP Call Girls Noida Sector 22 Call Me: 8448380779

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...

Invezz.com - Grow your wealth with trading signals

Deep Learning for Video: Motion Estimation (UPC 2018)

1. @DocXavi Xavier Giró-i-Nieto [http://pagines.uab.cat/mcv/] Module 6 Deep Learning for Video: Motion Estimation 22nd March 2018

2. ● MSc course (2017) ● BSc course (2018) 2 Deep Learning online courses by UPC: ● 1st edition (2016) ● 2nd edition (2017) ● 3rd edition (2018) ● 1st edition (2017) ● 2nd edition (2018) Next edition Autumn 2018 Next edition Winter/Spring 2019Summer School (late June 2018)

3. Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015 3 End to end supervised learning of optical flow. Motion: Optical Flow: FlowNet

4. 4 Option A: Stack both input images together and feed them through a generic network. Motion: Optical Flow: FlowNet (encoder) Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015

5. 5 Option B: Create two separate, yet identical processing streams for the two images and combine them at a later stage. Motion: Optical Flow: FlowNet (encoder) Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015

6. 6 Option B: Create two separate, yet identical processing streams for the two images and combine them at a later stage. Correlation layer: Convolution of data patches from the layers to combine. Motion: Optical Flow: FlowNet (encoder) Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015

7. 7 Upconvolutional layers: Unpooling features maps + convolution. Upconvolutioned feature maps are concatenated with the corresponding map from the contractive part. Motion: Optical Flow: FlowNet (decoder) Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015

8. 8 Convnets trained on these unrealistic data generalize well to existing datasets such as Sintel and KITTI. Data augmentation Motion: Optical Flow: FlowNet (synthetic) Dosovitskiy, A., Fischer, P., Ilg, E., Hausser, P., Hazirbas, C., Golkov, V., van der Smagt, P., Cremers, D. and Brox, T., FlowNet: Learning Optical Flow With Convolutional Networks. ICCV 2015

9. 9 Ilg, Eddy, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. "Flownet 2.0: Evolution of optical flow estimation with deep networks." CVPR 2017. [code]

10. 10 Motion: Optical Flow: FlowNet 2.0 Ilg, Eddy, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. "Flownet 2.0: Evolution of optical flow estimation with deep networks." CVPR 2017. [code]

11. 11 Motion: MP-Net Tokmakov, Pavel, Karteek Alahari, and Cordelia Schmid. "Learning motion patterns in videos." CVPR 2017 Predict directly if the object is in motion, instead of the optical flow..

Deep Learning for Video: Motion Estimation (UPC 2018)

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Mehr von Universitat Politècnica de Catalunya

Mehr von Universitat Politècnica de Catalunya (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Deep Learning for Video: Motion Estimation (UPC 2018)