SlideShare ist ein Scribd-Unternehmen logo
1 von 62
Efficient architecture to
condensate visual information
driven by attention processes
Mª Sara Granados Cabeza Supervisors:
Javier Díaz Alonso
Sonia Mota Fernández
Alberto Prieto Espinosa
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
1
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
2
Human Visual System
• Eyes do more than
capturing images:
▫ Extract spatio-temporal
transition
▫ Efficient communication
using an event driven
schema
• More than just the eyes
▫ Visual cortex also
important
3
Human Visual System
• Main characteristics we try to emulate in
artificial systems:
▫ Retina pre-processing capabilities
▫ Adaptation to changing environments
▫ Limited resources
▫ Active vision:
 Gazing control
 Attention
 Bottom-up (saliency)
 Top-down (target driven)
4
Artificial Visual Systems
• Robotics
5
• Vehicular
applications
• Low Vision
Aids
DRIVSCO Project
6
DRIVSCO Project
7
FPGA
DRIVSCO Bandwidth Constraints
8
PCI Express
PCI
DRIVSCO Memory Constraints
9
XIRCA
Possible Solutions
• Retinomorphic grabbing systems
• Other hardware devices: Optimized PC
implementations (SSE, MMX, IPP), DSP, GPU,
latest FPGA, ASIC
▫ Brute-force solution
• Compression
▫ Only group and/or reduce color components
▫ No feedback integration
▫ Transfer the problem to higher-level stages
10
Possible Solutions
• Multi-modal descriptors
▫ Pugeault et al. (JVCI , 2011)
11
Our solution
• Novel representation map
▫ Avoid sending unnecessary information
 “Smart” compression = Condensation
 Without losing uniform-region data
▫ Versatile
 Generic algorithm for several visual features
 Suitable for commodity and embedded platforms
▫ Create a feedback channel
 Bottom-up saliency
 Target-driven selection (top-down attention)
▫ Easy integration with higher-level algorithms 12
Tools and Methods
13
Model
1
High performance
implementation.
PC based (basic
approaches in java, c,
c++, etc..)
with accelerator (GPU
and FPGA) or optimized
code (c/c++)
3
Low
performance
implementation
PC based (Matlab)
2
Stand alone
platform.
DSPs or FPGA-
based,
prototyping
board
4
Specific
purpose
system.
FPGA-ASIC based,
specific purpose
board with
certification
capabilities
5
Complexity
Time
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
14
System Visual Features
15
Semidense representation map
• What?
▫ Condenses dense visual features
▫ Highlights relevant information
▫ Keeps uniform-region information
• How?
▫ Using sparse visual features as relevance enhancer
▫ Applying a regular grid in the uniform regions
16
Overview
17
Relevant Point Extractors
• Saliency maps:
▫ Itti and Koch (Nature Reviews Neuroscience, 2010)
18
Relevant Point Extractors
• Descriptors
▫ SIFT (IJCV, 2005), SURF (CVIU,2008), etc.
19
SIFT SURF
Relevant Point Extractors
• Structure-based edge and corner detectors
▫ Canny (PAMI,1986), Sobel (Journal of Microscopy, 1988),
Intrinsic Dimension (BMVC,2003)
20
Relevant Points: Selection mask
21
Canny LIP-Sobel
Intrinsic Dimension 1 (id1) Intrinsic Dimension 2 (id2)
Disparity Benchmark Dataset
• Middlebury
• % Bad error (ΔDisparity >1)
22
Relevant Point Extractors
23
Uniform Regions
• Neighborhood:
▫ Size and shape
• Representative point:
▫ Subsampling? Filtering?
24
Uniform Regions: Window Size
• Depends on
▫ The condensation ratio:
 5x5 window  4% of the dense points
 7x7 window  2%
 9x9 window  1%
 Less points imply less subsampling operations
▫ the image resolution
• We need to keep enough information
▫ Dense features not so dense (NaN problem)
25
Uniform Regions: Filter
• Instead of subsampling, filtering:
▫ Error smoothing
▫ Information spreading
• Filters assessed:
▫ Median
▫ Bilateral
▫ Anisotropic
• Benchmark:
▫ Middlebury
26
Plain Regions
27
Summary
28
Decondensation
29
Semidense feature
Original input
Extracted Disparity Decondensed disparity
Real-time approaches
• Low-cost hardware is
noisy
30
Ground Truth
Noisy Hw
Inherent Regularization
31
33
Noisy Hardware
Affine-based regularization
Semidense representation
Decondensed map
Semidense representation map
• Trade-off configuration:
▫ Canny-based relevant point extractor
▫ 5x5 grid based on bilateral filter
• Results
▫ Reduces memory and bandwidth requirements
▫ Extracts relevant information
▫ Incorporates uniform-region information
▫ Inherently regularizes
▫ Easily integrates in higher-level algorithms
34
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
35
Applications
1. Attention integration:
▫ Bottom-up: Saliency maps as relevant-point
extractor
▫ Top-down: Independently Moving Objects (IMOs)
2. High-level algorithm application:
▫ Using only semidense visual features
▫ First, we extract the ground-plane
▫ Then, we detect obstacles
36
Attention Processes: Bottom-Up
37
Original
Zero-Threshold RP
Saliency maps
RP mask
Attention-Driven Condensation
38
Attention Processes: Top-Down
• IMOs extraction:
▫ Using Pauwels et al. (Journal of Vision, 2010)
• Integration with Semidense Maps
▫ IMOs from frame N is integrated in the semidense
representation map of frame N+1
• Integration with other processes:
▫ such as Time-To-Contact (TTC)
39
Target-Driven Condensation
40
Obstacle Detection Algorithm
 Chumerin (PhD Dissertation, 2011)
41
Ground-Plane Detection
42
• Compare Original vs Semidense:
▫ Similar response
▫ Grid contains needed information
▫ Structure in the RP introduces noise
• Same accuracy with 8% of input resolution
Obstacle Detection Algorithm
• Elevation map
43
Original (dense) Semidense
Obstacle Detection Algorithm
• Obstacle map
44
Original (dense) Semidense
Obstacle Detection Algorithm
• Final output
45
Original (dense) Semidense
• Equivalent output with 8% of input resolution
Obstacle Detection Algorithm
• Integration in several stages of a high-level
algorithm
• Similar response
▫ Uniform region relevance
• Workload reduction:
4611x 1.5x
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
47
FPGAs
• High performance
▫ Parallel processing
• Low power consumption
• Reduced size
• Reconfigurable
▫ Application adaptation
▫ Real-time reconfiguration
• Industrial applications (robots, vehicles,
inspection, surveillance, …)
• Certification capabilities (Safety standards) 48
External interface
Warping
OF Core
Stereo Core
Multi-scale
Extension
Multi-scale
Extension
Rectification
Embedded
Processors
Memory
Controller Unit
LF Core
Interface controller
49
Condensation
Module
Condensation Architecture
• Hardware trade-off
configuration:
▫ Canny
▫ Median filter
• Fine-grain pipeline
▫ Submodules
 Logical functionality
▫ Stages
 Number of simple operations
▫ Scalar units
 Number of features to
process
• One processed datum per clock
cycle
[#Operations, #Scalar]
OF E & O D
Low-Level Vision System
Hysteresis +
nonmax suppr.
RP + Grid RP + Grid
Condensation Condensation Condensation
Pyramid
CO - PROCESSOR
MCU
Vy RP
Vy Grid
Vx RP
Vx. Grid
Disp RP
Disp. Grid
Grid mask
Memory
Grid extractor
size
FIFO
Grid
RP
Gridfeedback
RPfeedback
Vx VyRPGrid RPGrid
D
Storage Storage Storage
Vx
RP
Vy
RP
Vx’ Grid Vy’ Grid D
RP
D’ Grid
Efficient Communication Protocol
• Grid is regular and known
beforehand
▫ Store only the values
▫ Grid binary mask :
 computed
 sent (more versatile)
• RP are non regular:
▫ Under 4% of data:
 Address Event Representation
(AER)
▫ Boahen et al. (IEEE, 2004)
Vx, Vy and D codified with 12 bits
Hardware utilization (XCV4FX100)
• Integration with existing system:
▫ Tomasi et al. (IEEE,2011)  ~90% of slices used
▫ Barranco et al. (DSP,2012)  ~75% of slices used
• JPEG Compression Core:
▫ 17% per visual feature  51% (D, Vx, Vy) 53
DRIVSCO Bandwidth Constraints
54
PCI Express
PCI
Semidense Maps Bandwidth
55
> 20 x
Semidense Maps Memory
56
• Memory needs:
▫ 90 MB (highest resolution)
▫ 20 MB (lowest resolution)
• Semidense memory use
▫ Lowest:
 1 feature: < 87 KB
 Whole system: < 2MB
▫ Highest:
 1 feature: < 350 KB
 Whole system: < 6 MB
> 15 x
Versatile Architecture
• Feedback integration in real-time
▫ Reconfigurable RP and Grid masks
 Programmable or computed on real-time
▫ RP Feedback (Objects, TTC, IMOs)
▫ Grid Feedback (Ground-plane, Adaptive grid)
• Task-driven configuration
57
Summary
• Introduction and Motivation
• Semidense Representation Map for Visual
Features
• Method Validation: Experimental Results and
Applications
• Implementation on Reconfigurable Hardware
• Conclusions and Future Work
58
Conclusions
• Novel semidense representation map:
▫ Relevant data treated with higher priority
▫ Keeping uniform-region information
• Versatile
• Create a feedback channel
▫ Bottom-up saliency
▫ Target-driven transfer (top-down attention)
• Easy integration with higher-level algorithms
• Efficiently implemented in hardware
59
Future Work
• Integrate other enhancing signals
▫ Descriptors (SIFT, SURF, GLOH,…)
• SoC integration using latest platforms
• Incorporate different feedback signals:
▫ TTC estimations
▫ Adapt grid dynamically
• Explore new applications:
▫ Tracking
▫ Video surveillance
▫ Multi-camera systems
60
Main Contributions
• Semidense representation map
• Assessed several enhancing signals
• Evaluated different filters and window size
• Regularization capabilities
• Integration in multiple applications
• Efficient FGPA implementation
▫ 1 datum per clock cycle
• Framework to integrate different signals:
▫ signal-to-symbol loop
61
Questions?
62

Weitere ähnliche Inhalte

Was ist angesagt?

Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformFadwa Fouad
 
Video Object Segmentation in Videos
Video Object Segmentation in VideosVideo Object Segmentation in Videos
Video Object Segmentation in VideosNAVER Engineering
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkNAVER Engineering
 
Semantic Line Detection and Its Applications
Semantic Line Detection and Its ApplicationsSemantic Line Detection and Its Applications
Semantic Line Detection and Its ApplicationsNAVER Engineering
 
10cs65-cgnotes
10cs65-cgnotes10cs65-cgnotes
10cs65-cgnotesmushtaqdm
 
Moving object detection using background subtraction algorithm using simulink
Moving object detection using background subtraction algorithm using simulinkMoving object detection using background subtraction algorithm using simulink
Moving object detection using background subtraction algorithm using simulinkeSAT Publishing House
 
unrban-building-damage-detection-by-PJLi.ppt
unrban-building-damage-detection-by-PJLi.pptunrban-building-damage-detection-by-PJLi.ppt
unrban-building-damage-detection-by-PJLi.pptgrssieee
 
Sobel Edge Detection Using FPGA
Sobel Edge Detection Using FPGASobel Edge Detection Using FPGA
Sobel Edge Detection Using FPGAghanshyam zambare
 
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Lionel Briand
 
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D streamColor and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D streamNAVER Engineering
 
Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...NAVER Engineering
 
HiPEAC 2019 Workshop - Use Cases
HiPEAC 2019 Workshop - Use CasesHiPEAC 2019 Workshop - Use Cases
HiPEAC 2019 Workshop - Use CasesTulipp. Eu
 

Was ist angesagt? (15)

Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
 
Video Object Segmentation in Videos
Video Object Segmentation in VideosVideo Object Segmentation in Videos
Video Object Segmentation in Videos
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident network
 
Semantic Line Detection and Its Applications
Semantic Line Detection and Its ApplicationsSemantic Line Detection and Its Applications
Semantic Line Detection and Its Applications
 
10cs65-cgnotes
10cs65-cgnotes10cs65-cgnotes
10cs65-cgnotes
 
Moving object detection using background subtraction algorithm using simulink
Moving object detection using background subtraction algorithm using simulinkMoving object detection using background subtraction algorithm using simulink
Moving object detection using background subtraction algorithm using simulink
 
unrban-building-damage-detection-by-PJLi.ppt
unrban-building-damage-detection-by-PJLi.pptunrban-building-damage-detection-by-PJLi.ppt
unrban-building-damage-detection-by-PJLi.ppt
 
Sobel Edge Detection Using FPGA
Sobel Edge Detection Using FPGASobel Edge Detection Using FPGA
Sobel Edge Detection Using FPGA
 
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
 
ANPR FPGA Workshop
ANPR FPGA WorkshopANPR FPGA Workshop
ANPR FPGA Workshop
 
[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp
 
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D streamColor and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
 
Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...
 
HiPEAC 2019 Workshop - Use Cases
HiPEAC 2019 Workshop - Use CasesHiPEAC 2019 Workshop - Use Cases
HiPEAC 2019 Workshop - Use Cases
 

Andere mochten auch

Telefonul mobil, viitorul card de credit
Telefonul mobil, viitorul card de creditTelefonul mobil, viitorul card de credit
Telefonul mobil, viitorul card de creditAntonio Eram
 
Power point aplicacion interactiva
Power point aplicacion interactivaPower point aplicacion interactiva
Power point aplicacion interactivaveroniquesa
 
Presentation from the NBI Launch Event in Denmark February 2015
Presentation from the NBI Launch Event in Denmark February 2015Presentation from the NBI Launch Event in Denmark February 2015
Presentation from the NBI Launch Event in Denmark February 2015Networked Busniess Initiative
 
Cableado estructurado-de-red-1204886726520124-4
Cableado estructurado-de-red-1204886726520124-4Cableado estructurado-de-red-1204886726520124-4
Cableado estructurado-de-red-1204886726520124-4CVG Inversiones S.R.L
 
Digital Marketing Forum 2009
Digital Marketing Forum 2009Digital Marketing Forum 2009
Digital Marketing Forum 2009Antonio Eram
 
Modaclubmx Intermedio Oto/Inv 2015
Modaclubmx Intermedio Oto/Inv 2015Modaclubmx Intermedio Oto/Inv 2015
Modaclubmx Intermedio Oto/Inv 2015Ph Multimarcas
 
What is the RTI Community?
What is the RTI Community?What is the RTI Community?
What is the RTI Community?Lacey Trebaol
 
Sociología - Comte y el Positivismo.
Sociología - Comte y el Positivismo.Sociología - Comte y el Positivismo.
Sociología - Comte y el Positivismo.myartslides
 
Pedagogía tradicional
Pedagogía tradicionalPedagogía tradicional
Pedagogía tradicionalVYANKA PEREZ
 

Andere mochten auch (14)

Telefonul mobil, viitorul card de credit
Telefonul mobil, viitorul card de creditTelefonul mobil, viitorul card de credit
Telefonul mobil, viitorul card de credit
 
Power point aplicacion interactiva
Power point aplicacion interactivaPower point aplicacion interactiva
Power point aplicacion interactiva
 
Presentation from the NBI Launch Event in Denmark February 2015
Presentation from the NBI Launch Event in Denmark February 2015Presentation from the NBI Launch Event in Denmark February 2015
Presentation from the NBI Launch Event in Denmark February 2015
 
Cableado estructurado-de-red-1204886726520124-4
Cableado estructurado-de-red-1204886726520124-4Cableado estructurado-de-red-1204886726520124-4
Cableado estructurado-de-red-1204886726520124-4
 
cover of cowboy con man
cover of cowboy con mancover of cowboy con man
cover of cowboy con man
 
SRE Osseo
SRE OsseoSRE Osseo
SRE Osseo
 
Digital Marketing Forum 2009
Digital Marketing Forum 2009Digital Marketing Forum 2009
Digital Marketing Forum 2009
 
Nemesis Corp. Profile
Nemesis Corp. ProfileNemesis Corp. Profile
Nemesis Corp. Profile
 
Gaza
GazaGaza
Gaza
 
Modaclubmx Intermedio Oto/Inv 2015
Modaclubmx Intermedio Oto/Inv 2015Modaclubmx Intermedio Oto/Inv 2015
Modaclubmx Intermedio Oto/Inv 2015
 
What is the RTI Community?
What is the RTI Community?What is the RTI Community?
What is the RTI Community?
 
Karl marx presentacion
Karl marx presentacionKarl marx presentacion
Karl marx presentacion
 
Sociología - Comte y el Positivismo.
Sociología - Comte y el Positivismo.Sociología - Comte y el Positivismo.
Sociología - Comte y el Positivismo.
 
Pedagogía tradicional
Pedagogía tradicionalPedagogía tradicional
Pedagogía tradicional
 

Ähnlich wie Efficient architecture to condensate visual information driven by attention processes (PhD thesis presentation)

Data quality evaluation &amp; orbit identification from scatterometer
Data quality evaluation &amp; orbit identification from scatterometerData quality evaluation &amp; orbit identification from scatterometer
Data quality evaluation &amp; orbit identification from scatterometerMudit Dholakia
 
Cahall Final Intern Presentation
Cahall Final Intern PresentationCahall Final Intern Presentation
Cahall Final Intern PresentationDaniel Cahall
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...South Tyrol Free Software Conference
 
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Operationalizing Machine Learning Using GPU-accelerated, In-database AnalyticsOperationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Operationalizing Machine Learning Using GPU-accelerated, In-database AnalyticsKinetica
 
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and videoCONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and videoCristiano Rafael Steffens
 
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
for "Parallelizing Multiple Group-by Queries using MapReduce"
for "Parallelizing Multiple Group-by Queries using MapReduce"for "Parallelizing Multiple Group-by Queries using MapReduce"
for "Parallelizing Multiple Group-by Queries using MapReduce"Yun-Yan Chi
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...ActiveEon
 
Computer vision for transportation
Computer vision for transportationComputer vision for transportation
Computer vision for transportationWanjin Yu
 
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...Aritra Sarkar
 
Combining out - of - band monitoring with AI and big data for datacenter aut...
Combining out - of - band monitoring with AI and big data  for datacenter aut...Combining out - of - band monitoring with AI and big data  for datacenter aut...
Combining out - of - band monitoring with AI and big data for datacenter aut...Ganesan Narayanasamy
 
FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020RohanLekhwani
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...Tulipp. Eu
 
Fundamentals of Computer Graphics.pdf
Fundamentals of Computer Graphics.pdfFundamentals of Computer Graphics.pdf
Fundamentals of Computer Graphics.pdfMattupallipardhu
 
Deterministic and high throughput data processing for CubeSats
Deterministic and high throughput data processing for CubeSatsDeterministic and high throughput data processing for CubeSats
Deterministic and high throughput data processing for CubeSatsPablo Ghiglino
 

Ähnlich wie Efficient architecture to condensate visual information driven by attention processes (PhD thesis presentation) (20)

Data quality evaluation &amp; orbit identification from scatterometer
Data quality evaluation &amp; orbit identification from scatterometerData quality evaluation &amp; orbit identification from scatterometer
Data quality evaluation &amp; orbit identification from scatterometer
 
TARDEC Presentation 2
TARDEC Presentation 2TARDEC Presentation 2
TARDEC Presentation 2
 
Cahall Final Intern Presentation
Cahall Final Intern PresentationCahall Final Intern Presentation
Cahall Final Intern Presentation
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Operationalizing Machine Learning Using GPU-accelerated, In-database AnalyticsOperationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
 
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and videoCONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
 
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
 
OBDPC 2022
OBDPC 2022OBDPC 2022
OBDPC 2022
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
for "Parallelizing Multiple Group-by Queries using MapReduce"
for "Parallelizing Multiple Group-by Queries using MapReduce"for "Parallelizing Multiple Group-by Queries using MapReduce"
for "Parallelizing Multiple Group-by Queries using MapReduce"
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...
 
Computer vision for transportation
Computer vision for transportationComputer vision for transportation
Computer vision for transportation
 
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...
Elevation mapping using stereo vision enabled heterogeneous multi-agent robot...
 
Combining out - of - band monitoring with AI and big data for datacenter aut...
Combining out - of - band monitoring with AI and big data  for datacenter aut...Combining out - of - band monitoring with AI and big data  for datacenter aut...
Combining out - of - band monitoring with AI and big data for datacenter aut...
 
FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
 
Fundamentals of Computer Graphics.pdf
Fundamentals of Computer Graphics.pdfFundamentals of Computer Graphics.pdf
Fundamentals of Computer Graphics.pdf
 
Deterministic and high throughput data processing for CubeSats
Deterministic and high throughput data processing for CubeSatsDeterministic and high throughput data processing for CubeSats
Deterministic and high throughput data processing for CubeSats
 

Kürzlich hochgeladen

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Efficient architecture to condensate visual information driven by attention processes (PhD thesis presentation)

  • 1. Efficient architecture to condensate visual information driven by attention processes Mª Sara Granados Cabeza Supervisors: Javier Díaz Alonso Sonia Mota Fernández Alberto Prieto Espinosa
  • 2. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 1
  • 3. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 2
  • 4. Human Visual System • Eyes do more than capturing images: ▫ Extract spatio-temporal transition ▫ Efficient communication using an event driven schema • More than just the eyes ▫ Visual cortex also important 3
  • 5. Human Visual System • Main characteristics we try to emulate in artificial systems: ▫ Retina pre-processing capabilities ▫ Adaptation to changing environments ▫ Limited resources ▫ Active vision:  Gazing control  Attention  Bottom-up (saliency)  Top-down (target driven) 4
  • 6. Artificial Visual Systems • Robotics 5 • Vehicular applications • Low Vision Aids
  • 11. Possible Solutions • Retinomorphic grabbing systems • Other hardware devices: Optimized PC implementations (SSE, MMX, IPP), DSP, GPU, latest FPGA, ASIC ▫ Brute-force solution • Compression ▫ Only group and/or reduce color components ▫ No feedback integration ▫ Transfer the problem to higher-level stages 10
  • 12. Possible Solutions • Multi-modal descriptors ▫ Pugeault et al. (JVCI , 2011) 11
  • 13. Our solution • Novel representation map ▫ Avoid sending unnecessary information  “Smart” compression = Condensation  Without losing uniform-region data ▫ Versatile  Generic algorithm for several visual features  Suitable for commodity and embedded platforms ▫ Create a feedback channel  Bottom-up saliency  Target-driven selection (top-down attention) ▫ Easy integration with higher-level algorithms 12
  • 14. Tools and Methods 13 Model 1 High performance implementation. PC based (basic approaches in java, c, c++, etc..) with accelerator (GPU and FPGA) or optimized code (c/c++) 3 Low performance implementation PC based (Matlab) 2 Stand alone platform. DSPs or FPGA- based, prototyping board 4 Specific purpose system. FPGA-ASIC based, specific purpose board with certification capabilities 5 Complexity Time
  • 15. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 14
  • 17. Semidense representation map • What? ▫ Condenses dense visual features ▫ Highlights relevant information ▫ Keeps uniform-region information • How? ▫ Using sparse visual features as relevance enhancer ▫ Applying a regular grid in the uniform regions 16
  • 19. Relevant Point Extractors • Saliency maps: ▫ Itti and Koch (Nature Reviews Neuroscience, 2010) 18
  • 20. Relevant Point Extractors • Descriptors ▫ SIFT (IJCV, 2005), SURF (CVIU,2008), etc. 19 SIFT SURF
  • 21. Relevant Point Extractors • Structure-based edge and corner detectors ▫ Canny (PAMI,1986), Sobel (Journal of Microscopy, 1988), Intrinsic Dimension (BMVC,2003) 20
  • 22. Relevant Points: Selection mask 21 Canny LIP-Sobel Intrinsic Dimension 1 (id1) Intrinsic Dimension 2 (id2)
  • 23. Disparity Benchmark Dataset • Middlebury • % Bad error (ΔDisparity >1) 22
  • 25. Uniform Regions • Neighborhood: ▫ Size and shape • Representative point: ▫ Subsampling? Filtering? 24
  • 26. Uniform Regions: Window Size • Depends on ▫ The condensation ratio:  5x5 window  4% of the dense points  7x7 window  2%  9x9 window  1%  Less points imply less subsampling operations ▫ the image resolution • We need to keep enough information ▫ Dense features not so dense (NaN problem) 25
  • 27. Uniform Regions: Filter • Instead of subsampling, filtering: ▫ Error smoothing ▫ Information spreading • Filters assessed: ▫ Median ▫ Bilateral ▫ Anisotropic • Benchmark: ▫ Middlebury 26
  • 31. Real-time approaches • Low-cost hardware is noisy 30 Ground Truth Noisy Hw
  • 34. Semidense representation map • Trade-off configuration: ▫ Canny-based relevant point extractor ▫ 5x5 grid based on bilateral filter • Results ▫ Reduces memory and bandwidth requirements ▫ Extracts relevant information ▫ Incorporates uniform-region information ▫ Inherently regularizes ▫ Easily integrates in higher-level algorithms 34
  • 35. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 35
  • 36. Applications 1. Attention integration: ▫ Bottom-up: Saliency maps as relevant-point extractor ▫ Top-down: Independently Moving Objects (IMOs) 2. High-level algorithm application: ▫ Using only semidense visual features ▫ First, we extract the ground-plane ▫ Then, we detect obstacles 36
  • 39. Attention Processes: Top-Down • IMOs extraction: ▫ Using Pauwels et al. (Journal of Vision, 2010) • Integration with Semidense Maps ▫ IMOs from frame N is integrated in the semidense representation map of frame N+1 • Integration with other processes: ▫ such as Time-To-Contact (TTC) 39
  • 41. Obstacle Detection Algorithm  Chumerin (PhD Dissertation, 2011) 41
  • 42. Ground-Plane Detection 42 • Compare Original vs Semidense: ▫ Similar response ▫ Grid contains needed information ▫ Structure in the RP introduces noise • Same accuracy with 8% of input resolution
  • 43. Obstacle Detection Algorithm • Elevation map 43 Original (dense) Semidense
  • 44. Obstacle Detection Algorithm • Obstacle map 44 Original (dense) Semidense
  • 45. Obstacle Detection Algorithm • Final output 45 Original (dense) Semidense • Equivalent output with 8% of input resolution
  • 46. Obstacle Detection Algorithm • Integration in several stages of a high-level algorithm • Similar response ▫ Uniform region relevance • Workload reduction: 4611x 1.5x
  • 47. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 47
  • 48. FPGAs • High performance ▫ Parallel processing • Low power consumption • Reduced size • Reconfigurable ▫ Application adaptation ▫ Real-time reconfiguration • Industrial applications (robots, vehicles, inspection, surveillance, …) • Certification capabilities (Safety standards) 48
  • 49. External interface Warping OF Core Stereo Core Multi-scale Extension Multi-scale Extension Rectification Embedded Processors Memory Controller Unit LF Core Interface controller 49 Condensation Module
  • 50. Condensation Architecture • Hardware trade-off configuration: ▫ Canny ▫ Median filter • Fine-grain pipeline ▫ Submodules  Logical functionality ▫ Stages  Number of simple operations ▫ Scalar units  Number of features to process • One processed datum per clock cycle [#Operations, #Scalar]
  • 51. OF E & O D Low-Level Vision System Hysteresis + nonmax suppr. RP + Grid RP + Grid Condensation Condensation Condensation Pyramid CO - PROCESSOR MCU Vy RP Vy Grid Vx RP Vx. Grid Disp RP Disp. Grid Grid mask Memory Grid extractor size FIFO Grid RP Gridfeedback RPfeedback Vx VyRPGrid RPGrid D Storage Storage Storage Vx RP Vy RP Vx’ Grid Vy’ Grid D RP D’ Grid
  • 52. Efficient Communication Protocol • Grid is regular and known beforehand ▫ Store only the values ▫ Grid binary mask :  computed  sent (more versatile) • RP are non regular: ▫ Under 4% of data:  Address Event Representation (AER) ▫ Boahen et al. (IEEE, 2004) Vx, Vy and D codified with 12 bits
  • 53. Hardware utilization (XCV4FX100) • Integration with existing system: ▫ Tomasi et al. (IEEE,2011)  ~90% of slices used ▫ Barranco et al. (DSP,2012)  ~75% of slices used • JPEG Compression Core: ▫ 17% per visual feature  51% (D, Vx, Vy) 53
  • 56. Semidense Maps Memory 56 • Memory needs: ▫ 90 MB (highest resolution) ▫ 20 MB (lowest resolution) • Semidense memory use ▫ Lowest:  1 feature: < 87 KB  Whole system: < 2MB ▫ Highest:  1 feature: < 350 KB  Whole system: < 6 MB > 15 x
  • 57. Versatile Architecture • Feedback integration in real-time ▫ Reconfigurable RP and Grid masks  Programmable or computed on real-time ▫ RP Feedback (Objects, TTC, IMOs) ▫ Grid Feedback (Ground-plane, Adaptive grid) • Task-driven configuration 57
  • 58. Summary • Introduction and Motivation • Semidense Representation Map for Visual Features • Method Validation: Experimental Results and Applications • Implementation on Reconfigurable Hardware • Conclusions and Future Work 58
  • 59. Conclusions • Novel semidense representation map: ▫ Relevant data treated with higher priority ▫ Keeping uniform-region information • Versatile • Create a feedback channel ▫ Bottom-up saliency ▫ Target-driven transfer (top-down attention) • Easy integration with higher-level algorithms • Efficiently implemented in hardware 59
  • 60. Future Work • Integrate other enhancing signals ▫ Descriptors (SIFT, SURF, GLOH,…) • SoC integration using latest platforms • Incorporate different feedback signals: ▫ TTC estimations ▫ Adapt grid dynamically • Explore new applications: ▫ Tracking ▫ Video surveillance ▫ Multi-camera systems 60
  • 61. Main Contributions • Semidense representation map • Assessed several enhancing signals • Evaluated different filters and window size • Regularization capabilities • Integration in multiple applications • Efficient FGPA implementation ▫ 1 datum per clock cycle • Framework to integrate different signals: ▫ signal-to-symbol loop 61