SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
An accurate retrieval through R-MAC+
descriptors for landmark recognition
Federico Magliani, Andrea Prati
ICDSC 2018 – Eindhoven, Netherlands – 3-4 September 2018
Agenda
2
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
Motivations
3
Landmark Recognition problem
➢ Try to understand what’s is in front
of you and retrieve similar images.
➢ Semantic gap: for a human, this task
is pretty simple thanks to personal
experience, but a computer can use
only the info available in the images.
➢ It is far from being solved
(viewpoint, illumination conditions,
image resolution, ...).
Motivations
4
➢ Challenges
○ High accuracy retrieval (precision)
○ Fast research (response to query)
○ Reduced memory occupied (mobile friendly)
○ Work well with big data (>1M data)
➢ Possible applications
○ Augmented reality (tourism)
○ Person Re-ID (video-surveillance)
○ Online clothes search (fashion)
Agenda
5
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions and Future Works
Summary of contributions
6
➢ a new region detector for CNN feature maps implemented through grids, that respect
the aspect ratio of the images.
➢ an improvement on the effectiveness of the multi-resolution approach for R-MAC
descriptors.
➢ a novel retrieval method for checking the similarities between query descriptors and
regions of database R-MAC descriptors. It allows to outperform the results of R-MAC
descriptors on Oxford5k and Paris6k by +7% and +3%.
Agenda
7
➢ Motivations
➢ Summary of contribution
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
Related works
8
➢ Bag of Words (BoW): first method for solving the problem (different
techniques: vocabulary tree, …).
➢ VLAD: similar to BoW, but using the residual of the descriptors
(=feature descriptor - closest centers in the vocabulary).
➢ CNN based: extract features from intermediate layers of CNN
architectures and then apply previous embedding techniques (BLCF, ...).
➢ MAC: max pooling applied on CNN features
➢ R-MAC: regional MAC descriptors created through the application of a
rigid-grid mechanism
Agenda
9
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
R-MAC (Regional MAC) descriptors
10
Considering a rectangular region R ⊆ Ω = (1,W) x (1,H), and define the regional feature vector:
fR
= (fR,1
...fR,i
...fR,K
)T
where fR,i
= max Xi
(p) is the maximum activation of the ith
channel on the considered
region.
Then we calculate the feature vector associated with each region, and post-process it with
l2
-normalization, PCA-whitening and l2
-normalization. We combine the collection of regional feature
vectors into a single image vector by summing them and l2
-normalizing in the end.
We define the response maps and sample square regions at
L different scales
➢ at the largest scale (l=1), the region size is determined
to be as large as possible (height = width = min(W,H))
➢ at every other scale l, we uniformly sample l x (l+m-1)
regions of width 2min(W,H)/(l+1). (with m=2)
R-MAC (Regional MAC) descriptors
11
Settings:
➢ Fully convolutional off-the-shelf VGG16
➢ Pool5
➢ Spatial Max pooling
➢ High Resolution images
➢ Global descriptor based on aggregating region vectors
➢ Sliding window approach
Tolias et al. Particular object retrieval with integral max-pooling of CNN activations. arXiv 2015.
Agenda
12
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
Proposed approach: R-MAC+
New multi-resolution approach: the images are resized of +25%,-25%, 0% on the largest
size, respecting the aspect ratio of the image.
➢ This strategy is an alternative of the first multi-resolution approach, that resized the
image to a fixed size: 550px, 800px and 1050 on the largest size, retaining the aspect
ratio of the image.
➢ This strategy should allow to augment the dimensions of the feature maps in order to
have more features and therefore local maxima than the previous multi-resolution
R-MAC. This approach is connected to the new region detector, that detects a
reduced number of regions (15) instead of the 20 of the original one.
13
Proposed approach: R-MAC+
14
A new mechanism for region detection in the CNN feature maps (15 regions)
● l=0 → 1 region covering entirely the image;
● l=1 → 2 square regions (widthRegion = heightRegion = min(H,W));
● l=2 → 6 rect regions (widthRegion = heightRegion =⌈2*min(W,H)/(l+1))⌉, arranged along the
horizontal axis (width and height of the regions are adapted to cover all the image);
● l=3 → 6 rect regions (widthRegion = heightRegion= ⌈2*min(W,H)/(l+2))⌉, arranged along the
vertical axis (width and height of the regions are adapted to cover all the image).
Proposed approach: R-MAC+
15
A new retrieval method based on db regions (MAC descriptors of the database images) and the
R-MAC descriptors of the query images (+7% on Oxford5k and +4% on Paris6k than previous results)
Agenda
16
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
Datasets and evaluation metric
Datasets:
➢ Holidays (1491 images: 500 classes, 500 queries).
➢ Oxford5k (5063 images, 11 classes, 55 queries).
➢ Paris6k (6412 images, 11 classes, 55 queries).
Evaluation metric:
➢ mAP (mean Average Precision) → mean of Average Precision scores (correct results)
for each query, based on the position in the ranking.
17
Results
18
Method Network Holidays
(original/rotated)
Oxf5k Paris6k
MAC VGG19 76.26 % 57.44 % 73.15 %
R-MAC VGG19 87.65 % 65.56 % 82.80 %
R-MAC ResNet50 92.55 % 71.77 % 83.31 %
M-R R-MAC+ ResNet50 94.63 % / 95.58 % 78.88 % 88.63 %
M-R R-MAC+ with retrieval
based on db regions
ResNet50 94.37 % / 95.87 % 85.39 % 91.90 %
Results after QE application
19
Method Network Holidays
(original/rotated)
Oxf5k Paris6k
M-R R-MAC+ ResNet50 94.97 % / 95.97 % 86.45 % 92.01 %
M-R R-MAC+ with retrieval
based on db regions
ResNet50 94.42 % / 96.05 % 87.92 % 93.64 %
M-R R-MAC+ with retrieval
based on db regions and query
expansion based on db regions
ResNet50 94.28 % / 95.91 % 88.78 % 92.30 %
Comparison with the state of the art
20
Agenda
21
➢ Motivations
➢ Summary of contributions
➢ Related works
➢ Introduction to R-MAC descriptors
➢ Proposed approach (R-MAC+)
➢ Experimental results
➢ Conclusions
Conclusions
➢ We propose different improvements on R-MAC descriptors in order to make the
retrieval very accurate.
○ A multi-resolution approach, that uses bigger feature maps than the previous one.
○ A new region detector with the use of adaptable grids allows to catch more local
maxima.
○ A novel retrieval method based on db regions that highly boosts the performance on
Oxford5k and Paris6k.
➢ The proposed method outperforms the state of the art on Holidays, both on the
original and rotated version. Also it outperforms the state-of-the-art results on
some other public benchmarks without the fine-tuning application.
22
Thank you for your attention!
questions?
http://implab.ce.unipr.it
23

Weitere ähnliche Inhalte

Was ist angesagt?

Aerial detection part3
Aerial detection part3Aerial detection part3
Aerial detection part3ssuser456ad6
 
Feature pyramid networks for object detection
Feature pyramid networks for object detection Feature pyramid networks for object detection
Feature pyramid networks for object detection heedaeKwon
 
Automatic road environment classification 20121002
Automatic road environment classification 20121002Automatic road environment classification 20121002
Automatic road environment classification 20121002es712
 
NetVLAD: CNN architecture for weakly supervised place recognition
NetVLAD:  CNN architecture for weakly supervised place recognitionNetVLAD:  CNN architecture for weakly supervised place recognition
NetVLAD: CNN architecture for weakly supervised place recognitionGeunhee Cho
 
Path Planning And Navigation
Path Planning And NavigationPath Planning And Navigation
Path Planning And Navigationguest90654fd
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Sunando Sengupta
 
object detection paper review
object detection paper reviewobject detection paper review
object detection paper reviewYoonho Na
 
presentazione_IGARSS2011.ppt
presentazione_IGARSS2011.pptpresentazione_IGARSS2011.ppt
presentazione_IGARSS2011.pptgrssieee
 
computervision project
computervision projectcomputervision project
computervision projectLianli Liu
 
Implementation of a lane-tracking system for autonomous driving using Kalman ...
Implementation of a lane-tracking system for autonomous driving using Kalman ...Implementation of a lane-tracking system for autonomous driving using Kalman ...
Implementation of a lane-tracking system for autonomous driving using Kalman ...Francesco Corucci
 
Automatic Dense Semantic Mapping From Visual Street-level Imagery
Automatic Dense Semantic Mapping From Visual Street-level ImageryAutomatic Dense Semantic Mapping From Visual Street-level Imagery
Automatic Dense Semantic Mapping From Visual Street-level ImagerySunando Sengupta
 
Report bep thomas_blanken
Report bep thomas_blankenReport bep thomas_blanken
Report bep thomas_blankenxepost
 
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...Integrated Carbon Observation System (ICOS)
 
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...ijcsit
 

Was ist angesagt? (20)

Densebox
DenseboxDensebox
Densebox
 
Aerial detection part3
Aerial detection part3Aerial detection part3
Aerial detection part3
 
Feature pyramid networks for object detection
Feature pyramid networks for object detection Feature pyramid networks for object detection
Feature pyramid networks for object detection
 
Automatic road environment classification 20121002
Automatic road environment classification 20121002Automatic road environment classification 20121002
Automatic road environment classification 20121002
 
NetVLAD: CNN architecture for weakly supervised place recognition
NetVLAD:  CNN architecture for weakly supervised place recognitionNetVLAD:  CNN architecture for weakly supervised place recognition
NetVLAD: CNN architecture for weakly supervised place recognition
 
Path Planning And Navigation
Path Planning And NavigationPath Planning And Navigation
Path Planning And Navigation
 
06466595
0646659506466595
06466595
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
 
K-BestMatch
K-BestMatchK-BestMatch
K-BestMatch
 
object detection paper review
object detection paper reviewobject detection paper review
object detection paper review
 
presentazione_IGARSS2011.ppt
presentazione_IGARSS2011.pptpresentazione_IGARSS2011.ppt
presentazione_IGARSS2011.ppt
 
mid_presentation
mid_presentationmid_presentation
mid_presentation
 
computervision project
computervision projectcomputervision project
computervision project
 
Implementation of a lane-tracking system for autonomous driving using Kalman ...
Implementation of a lane-tracking system for autonomous driving using Kalman ...Implementation of a lane-tracking system for autonomous driving using Kalman ...
Implementation of a lane-tracking system for autonomous driving using Kalman ...
 
Automatic Dense Semantic Mapping From Visual Street-level Imagery
Automatic Dense Semantic Mapping From Visual Street-level ImageryAutomatic Dense Semantic Mapping From Visual Street-level Imagery
Automatic Dense Semantic Mapping From Visual Street-level Imagery
 
Report bep thomas_blanken
Report bep thomas_blankenReport bep thomas_blanken
Report bep thomas_blanken
 
MS Project
MS ProjectMS Project
MS Project
 
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...
Prunet, Pascal: Plume detection and characterization from XCO2 imagery: Evalu...
 
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...
REVIEW OF LANE DETECTION AND TRACKING ALGORITHMS IN ADVANCED DRIVER ASSISTANC...
 
Mmclass5b
Mmclass5bMmclass5b
Mmclass5b
 

Ähnlich wie An accurate retrieval through R-MAC+ descriptors for landmark recognition

Module-5-1_230523_171754 (1).pdf
Module-5-1_230523_171754 (1).pdfModule-5-1_230523_171754 (1).pdf
Module-5-1_230523_171754 (1).pdfvikasmittal92
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf
 
A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...Aboul Ella Hassanien
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...grssieee
 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTIRJET Journal
 
Convolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionConvolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionDarian Frajberg
 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis taeseon ryu
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithmallyn joy calcaben
 
Landmark Retrieval & Recognition
Landmark Retrieval & RecognitionLandmark Retrieval & Recognition
Landmark Retrieval & Recognitionkenluck2001
 
Deep image retrieval learning global representations for image search
Deep image retrieval  learning global representations for image searchDeep image retrieval  learning global representations for image search
Deep image retrieval learning global representations for image searchUniversitat Politècnica de Catalunya
 
Towards better performance: phase congruency based face recognition
Towards better performance: phase congruency based face recognitionTowards better performance: phase congruency based face recognition
Towards better performance: phase congruency based face recognitionTELKOMNIKA JOURNAL
 

Ähnlich wie An accurate retrieval through R-MAC+ descriptors for landmark recognition (20)

Module-5-1_230523_171754 (1).pdf
Module-5-1_230523_171754 (1).pdfModule-5-1_230523_171754 (1).pdf
Module-5-1_230523_171754 (1).pdf
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...
FR3.L09 - MULTIBASELINE GRADIENT AMBIGUITY RESOLUTION TO SUPPORT MINIMUM COST...
 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFT
 
All projects
All projectsAll projects
All projects
 
ICRA Nathan Piasco
ICRA Nathan PiascoICRA Nathan Piasco
ICRA Nathan Piasco
 
Convolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionConvolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detection
 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
 
Real time traffic management - challenges and solutions
Real time traffic management - challenges and solutionsReal time traffic management - challenges and solutions
Real time traffic management - challenges and solutions
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithm
 
crowd counting.pptx
crowd counting.pptxcrowd counting.pptx
crowd counting.pptx
 
Landmark Retrieval & Recognition
Landmark Retrieval & RecognitionLandmark Retrieval & Recognition
Landmark Retrieval & Recognition
 
Deep image retrieval learning global representations for image search
Deep image retrieval  learning global representations for image searchDeep image retrieval  learning global representations for image search
Deep image retrieval learning global representations for image search
 
Towards better performance: phase congruency based face recognition
Towards better performance: phase congruency based face recognitionTowards better performance: phase congruency based face recognition
Towards better performance: phase congruency based face recognition
 
20210226 esa-science-coffee-v2.0
20210226 esa-science-coffee-v2.020210226 esa-science-coffee-v2.0
20210226 esa-science-coffee-v2.0
 

Kürzlich hochgeladen

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 

Kürzlich hochgeladen (20)

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 

An accurate retrieval through R-MAC+ descriptors for landmark recognition

  • 1. An accurate retrieval through R-MAC+ descriptors for landmark recognition Federico Magliani, Andrea Prati ICDSC 2018 – Eindhoven, Netherlands – 3-4 September 2018
  • 2. Agenda 2 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 3. Motivations 3 Landmark Recognition problem ➢ Try to understand what’s is in front of you and retrieve similar images. ➢ Semantic gap: for a human, this task is pretty simple thanks to personal experience, but a computer can use only the info available in the images. ➢ It is far from being solved (viewpoint, illumination conditions, image resolution, ...).
  • 4. Motivations 4 ➢ Challenges ○ High accuracy retrieval (precision) ○ Fast research (response to query) ○ Reduced memory occupied (mobile friendly) ○ Work well with big data (>1M data) ➢ Possible applications ○ Augmented reality (tourism) ○ Person Re-ID (video-surveillance) ○ Online clothes search (fashion)
  • 5. Agenda 5 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions and Future Works
  • 6. Summary of contributions 6 ➢ a new region detector for CNN feature maps implemented through grids, that respect the aspect ratio of the images. ➢ an improvement on the effectiveness of the multi-resolution approach for R-MAC descriptors. ➢ a novel retrieval method for checking the similarities between query descriptors and regions of database R-MAC descriptors. It allows to outperform the results of R-MAC descriptors on Oxford5k and Paris6k by +7% and +3%.
  • 7. Agenda 7 ➢ Motivations ➢ Summary of contribution ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 8. Related works 8 ➢ Bag of Words (BoW): first method for solving the problem (different techniques: vocabulary tree, …). ➢ VLAD: similar to BoW, but using the residual of the descriptors (=feature descriptor - closest centers in the vocabulary). ➢ CNN based: extract features from intermediate layers of CNN architectures and then apply previous embedding techniques (BLCF, ...). ➢ MAC: max pooling applied on CNN features ➢ R-MAC: regional MAC descriptors created through the application of a rigid-grid mechanism
  • 9. Agenda 9 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 10. R-MAC (Regional MAC) descriptors 10 Considering a rectangular region R ⊆ Ω = (1,W) x (1,H), and define the regional feature vector: fR = (fR,1 ...fR,i ...fR,K )T where fR,i = max Xi (p) is the maximum activation of the ith channel on the considered region. Then we calculate the feature vector associated with each region, and post-process it with l2 -normalization, PCA-whitening and l2 -normalization. We combine the collection of regional feature vectors into a single image vector by summing them and l2 -normalizing in the end. We define the response maps and sample square regions at L different scales ➢ at the largest scale (l=1), the region size is determined to be as large as possible (height = width = min(W,H)) ➢ at every other scale l, we uniformly sample l x (l+m-1) regions of width 2min(W,H)/(l+1). (with m=2)
  • 11. R-MAC (Regional MAC) descriptors 11 Settings: ➢ Fully convolutional off-the-shelf VGG16 ➢ Pool5 ➢ Spatial Max pooling ➢ High Resolution images ➢ Global descriptor based on aggregating region vectors ➢ Sliding window approach Tolias et al. Particular object retrieval with integral max-pooling of CNN activations. arXiv 2015.
  • 12. Agenda 12 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 13. Proposed approach: R-MAC+ New multi-resolution approach: the images are resized of +25%,-25%, 0% on the largest size, respecting the aspect ratio of the image. ➢ This strategy is an alternative of the first multi-resolution approach, that resized the image to a fixed size: 550px, 800px and 1050 on the largest size, retaining the aspect ratio of the image. ➢ This strategy should allow to augment the dimensions of the feature maps in order to have more features and therefore local maxima than the previous multi-resolution R-MAC. This approach is connected to the new region detector, that detects a reduced number of regions (15) instead of the 20 of the original one. 13
  • 14. Proposed approach: R-MAC+ 14 A new mechanism for region detection in the CNN feature maps (15 regions) ● l=0 → 1 region covering entirely the image; ● l=1 → 2 square regions (widthRegion = heightRegion = min(H,W)); ● l=2 → 6 rect regions (widthRegion = heightRegion =⌈2*min(W,H)/(l+1))⌉, arranged along the horizontal axis (width and height of the regions are adapted to cover all the image); ● l=3 → 6 rect regions (widthRegion = heightRegion= ⌈2*min(W,H)/(l+2))⌉, arranged along the vertical axis (width and height of the regions are adapted to cover all the image).
  • 15. Proposed approach: R-MAC+ 15 A new retrieval method based on db regions (MAC descriptors of the database images) and the R-MAC descriptors of the query images (+7% on Oxford5k and +4% on Paris6k than previous results)
  • 16. Agenda 16 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 17. Datasets and evaluation metric Datasets: ➢ Holidays (1491 images: 500 classes, 500 queries). ➢ Oxford5k (5063 images, 11 classes, 55 queries). ➢ Paris6k (6412 images, 11 classes, 55 queries). Evaluation metric: ➢ mAP (mean Average Precision) → mean of Average Precision scores (correct results) for each query, based on the position in the ranking. 17
  • 18. Results 18 Method Network Holidays (original/rotated) Oxf5k Paris6k MAC VGG19 76.26 % 57.44 % 73.15 % R-MAC VGG19 87.65 % 65.56 % 82.80 % R-MAC ResNet50 92.55 % 71.77 % 83.31 % M-R R-MAC+ ResNet50 94.63 % / 95.58 % 78.88 % 88.63 % M-R R-MAC+ with retrieval based on db regions ResNet50 94.37 % / 95.87 % 85.39 % 91.90 %
  • 19. Results after QE application 19 Method Network Holidays (original/rotated) Oxf5k Paris6k M-R R-MAC+ ResNet50 94.97 % / 95.97 % 86.45 % 92.01 % M-R R-MAC+ with retrieval based on db regions ResNet50 94.42 % / 96.05 % 87.92 % 93.64 % M-R R-MAC+ with retrieval based on db regions and query expansion based on db regions ResNet50 94.28 % / 95.91 % 88.78 % 92.30 %
  • 20. Comparison with the state of the art 20
  • 21. Agenda 21 ➢ Motivations ➢ Summary of contributions ➢ Related works ➢ Introduction to R-MAC descriptors ➢ Proposed approach (R-MAC+) ➢ Experimental results ➢ Conclusions
  • 22. Conclusions ➢ We propose different improvements on R-MAC descriptors in order to make the retrieval very accurate. ○ A multi-resolution approach, that uses bigger feature maps than the previous one. ○ A new region detector with the use of adaptable grids allows to catch more local maxima. ○ A novel retrieval method based on db regions that highly boosts the performance on Oxford5k and Paris6k. ➢ The proposed method outperforms the state of the art on Holidays, both on the original and rotated version. Also it outperforms the state-of-the-art results on some other public benchmarks without the fine-tuning application. 22
  • 23. Thank you for your attention! questions? http://implab.ce.unipr.it 23