2. ■ Comparison: R-CNN vs Fast R-CNN
■ Image Pyramid
■ Scale Invariance (Multi-scale)
■ Truncated SVD for replacing weights of FC layers
■ Performance Metric: Pascal VOC 2012 vs COCO
Outline
3. Comparison: R-CNN vs Fast R-CNN
■ R-CNN
□ Architecture
□ Classification
□ Regression (localization)
-> BBox encoding: reduces the answer space.
It can be further reduced by the variance trick
(normalizing the regression targets by their standard deviation)
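The bbox encoding above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's code; the target standard deviations `(0.1, 0.1, 0.2, 0.2)` are an assumed example of the variance trick, not values taken from this talk.

```python
import numpy as np

def encode_bbox(proposal, gt, stds=(0.1, 0.1, 0.2, 0.2)):
    """Encode a ground-truth box relative to a proposal (R-CNN parameterization).

    Boxes are (cx, cy, w, h). Dividing by per-coordinate stds is the
    "variance trick": it rescales the regression targets toward unit
    variance, further shrinking the answer space the regressor must cover.
    """
    px, py, pw, ph = proposal
    gx, gy, gw, gh = gt
    t = np.array([
        (gx - px) / pw,   # tx: center shift, normalized by proposal size
        (gy - py) / ph,   # ty
        np.log(gw / pw),  # tw: log-scale ratio
        np.log(gh / ph),  # th
    ])
    return t / np.array(stds)

def decode_bbox(proposal, t, stds=(0.1, 0.1, 0.2, 0.2)):
    """Invert encode_bbox to recover a box from predicted offsets."""
    px, py, pw, ph = proposal
    tx, ty, tw, th = np.asarray(t) * np.array(stds)
    return np.array([px + tx * pw, py + ty * ph,
                     pw * np.exp(tw), ph * np.exp(th)])
```

Encoding followed by decoding is an exact round trip, which is what lets the network predict small, well-scaled offsets instead of raw coordinates.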
4. Comparison: R-CNN vs Fast R-CNN
■ R-CNN
□ Defects
□ Multi-stage training pipeline
(1) Train the ConvNet for localization
(2) Train SVMs on ConvNet features
(3) Replace the softmax by SVMs and fine-tune
□ Training is expensive
□ Convolution for each region proposal, after warping
□ Object detection is slow
5. Comparison: R-CNN vs Fast R-CNN
■ Fast R-CNN
□ Architecture
□ Single-stage training pipeline, combining:
(1) Log loss
(2) Smooth L1 (= Huber loss when delta is 1)
□ Multi-task loss for each RoI
L(p, u, t^u, v) = L_cls(p, u) + λ·[u ≥ 1]·L_loc(t^u, v),
where [u ≥ 1] is an indicator function and u = 0 for background
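The two loss terms above can be sketched directly in NumPy. This is an illustrative sketch of the per-RoI loss, not the paper's implementation; the function and argument names are my own.

```python
import numpy as np

def smooth_l1(x):
    """Smooth L1 loss (= Huber loss with delta = 1), applied elementwise."""
    ax = np.abs(x)
    return np.where(ax < 1.0, 0.5 * x**2, ax - 0.5)

def multitask_loss(p, u, t_u, v, lam=1.0):
    """Fast R-CNN per-RoI loss: log loss + lambda * [u >= 1] * smooth L1.

    p   : predicted class probabilities (softmax output)
    u   : ground-truth class index (0 = background)
    t_u : predicted box offsets for class u
    v   : ground-truth box offsets
    The indicator [u >= 1] switches the localization term off for
    background RoIs, which have no ground-truth box to regress to.
    """
    l_cls = -np.log(p[u])                               # log loss
    l_loc = smooth_l1(np.asarray(t_u) - np.asarray(v)).sum()
    return l_cls + lam * (u >= 1) * l_loc
```

Near zero the smooth L1 term behaves like L2 (stable gradients), while for large errors it behaves like L1, which makes the regression less sensitive to outlier boxes than a pure L2 loss.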
6. Comparison: R-CNN vs Fast R-CNN
■ Fast R-CNN
□ Improvements
□ Feed whole image through ConvNet
□ RoI Pooling (no warping)
[Figure: backprop of RoI pooling over the (x, y) feature-map grid]
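RoI pooling and its backward pass can be sketched for a single-channel feature map. This is a minimal NumPy sketch under simplifying assumptions (one channel, integer bin boundaries), not the CUDA implementation; the key point is that each pooled output routes its gradient only to the argmax cell of its bin.

```python
import numpy as np

def roi_max_pool(feat, roi, out_size=2):
    """RoI max pooling on a single-channel (H, W) feature map.

    roi : (x0, y0, x1, y1) in feature-map coordinates.
    Divides the RoI into out_size x out_size bins, takes the max of each
    bin, and records the argmax location for the backward pass.
    """
    x0, y0, x1, y1 = roi
    out = np.zeros((out_size, out_size))
    argmax = np.zeros((out_size, out_size, 2), dtype=int)
    ys = np.linspace(y0, y1, out_size + 1).round().astype(int)
    xs = np.linspace(x0, x1, out_size + 1).round().astype(int)
    for i in range(out_size):
        for j in range(out_size):
            # Guard against empty bins when the RoI is very small.
            bin_ = feat[ys[i]:max(ys[i + 1], ys[i] + 1),
                        xs[j]:max(xs[j + 1], xs[j] + 1)]
            k = np.unravel_index(np.argmax(bin_), bin_.shape)
            out[i, j] = bin_[k]
            argmax[i, j] = (ys[i] + k[0], xs[j] + k[1])
    return out, argmax

def roi_pool_backward(grad_out, argmax, feat_shape):
    """Scatter each output gradient to its argmax cell (accumulating overlaps)."""
    grad_feat = np.zeros(feat_shape)
    for i in range(grad_out.shape[0]):
        for j in range(grad_out.shape[1]):
            y, x = argmax[i, j]
            grad_feat[y, x] += grad_out[i, j]
    return grad_feat
```

Because gradients accumulate, a feature-map cell that wins the max in several overlapping RoIs receives the sum of their gradients, which is what makes training through shared convolutional features possible.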
7. Comparison: R-CNN vs Fast R-CNN
■ Fast R-CNN
□ Limitation
□ The complete architecture depends on an external RoI proposal algorithm
□ Has to extract a fixed N (= 64) RoIs from each image
□ Hard negative mining:
25% positive: IoU in [0.5, 1]
75% negative: IoU in [0.1, 0.5)
□ Weakly addressed multi-scale invariance
□ Brute-force (fixing image resolution)
□ Image Pyramid: expensive
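The fixed N (= 64) RoIs per image with the 25%/75% IoU split above can be sketched as a sampling routine. This is an illustrative sketch, not the paper's code; the function name and the use of a seeded generator are my own choices.

```python
import numpy as np

def sample_rois(ious, batch_size=64, fg_fraction=0.25, rng=None):
    """Sample RoIs for one image the way Fast R-CNN builds its minibatch.

    ious : per-RoI max IoU with any ground-truth box.
    25% of the batch is foreground (IoU in [0.5, 1]); the rest is
    background drawn from IoU in [0.1, 0.5) -- a mild form of hard
    negative mining (near-misses rather than easy empty regions).
    """
    rng = rng or np.random.default_rng(0)
    ious = np.asarray(ious)
    fg = np.where(ious >= 0.5)[0]
    bg = np.where((ious >= 0.1) & (ious < 0.5))[0]
    n_fg = min(int(batch_size * fg_fraction), len(fg))
    n_bg = min(batch_size - n_fg, len(bg))
    keep = np.concatenate([
        rng.choice(fg, n_fg, replace=False),
        rng.choice(bg, n_bg, replace=False),
    ])
    # Stand-in labels: 1 = foreground, 0 = background.
    # Real code would keep the matched ground-truth class id instead.
    labels = np.concatenate([np.ones(n_fg, dtype=int),
                             np.zeros(n_bg, dtype=int)])
    return keep, labels
```

The lower IoU bound of 0.1 on the background pool is what makes the negatives "hard": proposals with almost no overlap are easy to classify and are simply never sampled.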
Main idea of the paper (same as the first talk)
- Review your goals
- Present & discuss your results
- Comment on your own implementation (what was available, what had to be done, what were the difficulties)
-> 1. data preprocessing (parsing JSON), managing two independent projects, making the code work in general (since we have many variables here: FDA_mode, round, thresholding, and so on)
- Conclusion (e.g. strengths/weaknesses of the paper, potential future work)
-> Not enough time (for training)
-> In fact, the self-supervised setting should include the multi-band average, but we could not do it due to time constraints. In FDA, the main performance gain actually came from this part, so an improvement is also expected for Intra.
Explain meaning of ‘Domain adaptation’ : adapting a model trained with annotated samples from one distribution (source), to operate on a different (target) distribution for which no annotations are given
Our method does not require any training to perform the domain alignment, just a simple Fourier Transform and its inverse. Despite its simplicity, it achieves state-of-the-art performance in the current benchmarks, when integrated into a relatively standard semantic segmentation model
Many methods have been proposed for 'Domain Adaptation'.
However, state-of-the-art methods are complex.
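The "simple Fourier Transform and its inverse" described above can be sketched for a single-channel image. This is a minimal NumPy sketch of the idea, not the paper's implementation (which operates per RGB channel); the function name and the default `beta` are illustrative.

```python
import numpy as np

def fda_source_to_target(src, trg, beta=0.01):
    """Fourier domain alignment sketch: swap low-frequency amplitudes.

    Replaces the low-frequency amplitude of the source image with that
    of the target image while keeping the source phase, then inverts
    the FFT. beta controls the size of the swapped low-frequency square.
    No training is involved -- just FFT, a copy, and inverse FFT.
    """
    fft_src = np.fft.fft2(src)
    fft_trg = np.fft.fft2(trg)
    amp_src, pha_src = np.abs(fft_src), np.angle(fft_src)
    amp_trg = np.abs(fft_trg)

    # Center the spectra so low frequencies form a square in the middle.
    amp_src = np.fft.fftshift(amp_src)
    amp_trg = np.fft.fftshift(amp_trg)
    h, w = src.shape
    b = int(np.floor(min(h, w) * beta))
    ch, cw = h // 2, w // 2
    amp_src[ch - b:ch + b + 1, cw - b:cw + b + 1] = \
        amp_trg[ch - b:ch + b + 1, cw - b:cw + b + 1]
    amp_src = np.fft.ifftshift(amp_src)

    # Recombine the swapped amplitude with the original source phase.
    out = np.fft.ifft2(amp_src * np.exp(1j * pha_src))
    return np.real(out)
```

The phase (which carries the semantic layout) stays untouched; only the low-frequency amplitude (global appearance, illumination) is borrowed from the target, which is why the result keeps the source content in the target's "style".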