Deep learning for object detection


This document discusses and compares deep learning methods for object detection, including region proposal based methods (R-CNN, Fast R-CNN, Faster R-CNN, R-FCN, and Mask R-CNN) and single shot methods (YOLO, YOLOv2, and SSD). Region proposal based methods tend to be more accurate but slower, while single shot methods are faster but less accurate. Newer methods such as Faster R-CNN, R-FCN, YOLOv2, and SSD improve on the speed and accuracy of earlier approaches.


Deep learning for object detection
Wenjing Chen
*Created in March 2017; may be outdated by the time you read this.
Slide credit: CS231n

Outline
1. Introduction
2. Common methods
   Region proposal based methods: R-CNN, Fast R-CNN, Faster R-CNN, R-FCN, Mask R-CNN
   Single shot based methods: YOLO, YOLOv2, SSD
3. Comparison

Introduction
Image classification: one image -> one label
Object detection: one image -> labels + bounding boxes
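The difference between the two tasks shows up in the shape of the output. Below is a minimal sketch of that difference. The type names (`ClassificationResult`, `Detection`), the (x_min, y_min, x_max, y_max) box convention, and the example values are all made up for illustration and do not come from the slides.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ClassificationResult:
    """Classification: a single label for the whole image."""
    label: str


@dataclass
class Detection:
    """Detection: one label, score, and box per object found."""
    label: str
    score: float
    # Assumed convention: (x_min, y_min, x_max, y_max) in pixels.
    box: Tuple[float, float, float, float]


# Hypothetical outputs for the same image (values are illustrative only).
classification = ClassificationResult(label="dog")
detections: List[Detection] = [
    Detection("dog", 0.92, (48.0, 60.0, 210.0, 300.0)),
    Detection("bicycle", 0.81, (30.0, 120.0, 400.0, 380.0)),
]
```

A detector therefore returns a variable-length list, which is why detection pipelines need a step that decides how many boxes to keep.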

Region based methods - R-CNN
Girshick, Ross, et al. "Rich feature hierarchies for accurate object detection and semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014.

Region based methods - Fast R-CNN
Girshick, Ross. "Fast R-CNN." Proceedings of the IEEE International Conference on Computer Vision. 2015.

Region based methods - Faster R-CNN
Ren, Shaoqing, et al. "Faster R-CNN: Towards real-time object detection with region proposal networks." Advances in Neural Information Processing Systems. 2015.

Region based methods - R-FCN
Dai, Jifeng, Yi Li, Kaiming He, and Jian Sun. "R-FCN: Object detection via region-based fully convolutional networks." Advances in Neural Information Processing Systems. 2016.
[Figure label: average pooling]

Region based methods - Mask R-CNN
He, Kaiming, et al. "Mask R-CNN." arXiv preprint arXiv:1703.06870 (2017).
Object instance segmentation:
- Extend Faster R-CNN by adding a branch for predicting segmentation masks on each RoI
- Running at 5 fps
- Without tricks, outperforms all existing, single-model entries on every task in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection!

Single shot based method - YOLO
Redmon, Joseph, et al. "You only look once: Unified, real-time object detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
1. Resize the input image to 448*448.
2. Run a single convolutional network.
   For each of the S*S grid cells, it predicts B bounding boxes (4 coordinates + confidence) and C class probabilities, encoded as an S*S*(B*5+C) tensor.
3. Apply non-maximum suppression.
   The network yields S*S*B candidate bounding boxes per image; each box is scored for all C classes by combining its confidence with the class probabilities of its grid cell.
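The tensor size and the suppression step above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the helper names (`yolo_output_shape`, `iou`, `nms`), the (x_min, y_min, x_max, y_max) box convention, and the 0.5 IoU threshold are assumptions. The values S=7, B=2, C=20 are the PASCAL VOC settings reported in the YOLO paper.

```python
from typing import List, Tuple

# Assumed convention: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def yolo_output_shape(S: int, B: int, C: int) -> Tuple[int, int, int]:
    """Prediction tensor shape: S*S cells, each with B boxes of
    5 numbers (4 coordinates + confidence) plus C class probabilities."""
    return (S, S, B * 5 + C)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], scores: List[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy non-maximum suppression; returns indices of kept boxes.

    Boxes are visited from highest to lowest score, and a box is kept
    only if it does not overlap an already kept box by more than the
    threshold (0.5 here is an assumed default).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


shape = yolo_output_shape(S=7, B=2, C=20)  # (7, 7, 30)
num_candidate_boxes = 7 * 7 * 2            # 98 boxes before suppression

# Two heavily overlapping boxes and one separate box (illustrative values).
example_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
example_scores = [0.9, 0.8, 0.7]
kept = nms(example_boxes, example_scores)  # [0, 2]
```

In practice, suppression is usually run separately for each class, so that overlapping objects of different classes do not suppress each other.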

Single shot based method - YOLOv2
Redmon, Joseph, and Ali Farhadi. "YOLO9000: Better, Faster, Stronger." arXiv preprint arXiv:1612.08242 (2016).
Problems with YOLO:
1. A significant number of localization errors.
2. Low recall compared to region proposal based methods.
Improvements:

Single shot based method - SSD
Liu, Wei, et al. "SSD: Single shot multibox detector." European Conference on Computer Vision. Springer International Publishing, 2016.
Improvements:
1. Use a small convolutional filter to predict object categories and offsets in bounding box locations.
2. Use multiple layers for prediction at different scales.
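To see what predicting from multiple layers means in numbers, the sketch below counts the default boxes of the SSD300 configuration. The feature map sizes and boxes per location are the ones listed in the SSD paper; the helper names (`count_default_boxes`, `head_channels`) are hypothetical, and 21 classes assumes PASCAL VOC with a background class.

```python
from typing import List, Tuple

# (feature map side length, default boxes per location) for SSD300,
# as listed in the SSD paper.
SSD300_LAYERS: List[Tuple[int, int]] = [
    (38, 4), (19, 6), (10, 6), (5, 6), (3, 4), (1, 4),
]


def count_default_boxes(layers: List[Tuple[int, int]]) -> int:
    """Total default boxes across all prediction layers."""
    return sum(size * size * k for size, k in layers)


def head_channels(boxes_per_location: int, num_classes: int) -> int:
    """Output channels of the small convolutional predictor on one layer:
    for every default box, num_classes scores plus 4 box offsets."""
    return boxes_per_location * (num_classes + 4)


total_boxes = count_default_boxes(SSD300_LAYERS)  # 8732
# Assumption: 21 classes = 20 PASCAL VOC classes + background.
channels_38x38 = head_channels(boxes_per_location=4, num_classes=21)  # 100
```

The large 38*38 map contributes most of the boxes and covers small objects, while the coarse 3*3 and 1*1 maps cover large ones.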

Comparison
[Figure: comparison tables taken from the YOLOv2 and SSD papers, annotated with R-FCN: 83.6% mAP, 5.8 fps]

PASCAL VOC 2012
http://host.robots.ox.ac.uk:8080/leaderboard/displaylb.php?challengeid=11&compid=4

Comparison
Speed: single shot > region based
Accuracy: region based > single shot
Complexity: YOLO < SSD ≤ Faster R-CNN < R-FCN < YOLOv2(?)



Editor's notes (YOLOv2 improvements)

1. Batch normalization: 2% more mAP.
2. High resolution classifier: 4% more mAP.
3. Convolutional with anchor boxes: from 69.5 mAP at 81% recall to 69.2 mAP at 88% recall.
4. Dimension clusters: better anchor box priors, 60.9% to 67.2% average IOU.
5. Direct location prediction: solves model instability.
6. Fine-grained features: 1% more mAP.
7. Multi-scale training.
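The direct location prediction item can be illustrated with the box decoding equations from the YOLO9000 paper: b_x = sigmoid(t_x) + c_x, b_y = sigmoid(t_y) + c_y, b_w = p_w * exp(t_w), b_h = p_h * exp(t_h). The sketch below is a minimal version of that decoding. The function name `decode_box` and the example numbers are hypothetical, and all quantities are assumed to be in grid cell units.

```python
import math
from typing import Tuple


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def decode_box(tx: float, ty: float, tw: float, th: float,
               cx: float, cy: float,
               pw: float, ph: float) -> Tuple[float, float, float, float]:
    """Decode raw network outputs into a box (center x, center y, w, h).

    (cx, cy) is the top-left corner of the grid cell and (pw, ph) is the
    size of the anchor box prior. The sigmoid keeps the predicted center
    inside its own cell, which is what stabilizes training.
    """
    bx = sigmoid(tx) + cx
    by = sigmoid(ty) + cy
    bw = pw * math.exp(tw)
    bh = ph * math.exp(th)
    return bx, by, bw, bh


# Zero outputs place the box at the cell center with the prior's size.
box = decode_box(0.0, 0.0, 0.0, 0.0, cx=3.0, cy=4.0, pw=1.5, ph=2.0)
# box == (3.5, 4.5, 1.5, 2.0)
```

However large the raw offsets get, the decoded center stays inside the cell that predicted it, unlike an unconstrained anchor offset that can land anywhere in the image.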
