SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Visual Information Retrieval in
Endoscopic Video Archives
Jennifer Roldan Carlos, Mathias Lux, Xavier Giro-i-Nieto, Pia Munoz
& Nektarios Anagnostopoulos
Motivation
•  Surgery videos are taken every day
•  Operations rooms are fully booked
•  Many procedures already involve video
•  Storing videos is / will be req. by law
Amount of Videos
•  8-10 h operations / room and day
•  say 6 hours excluding set ups, etc.
•  5-6 days a week
•  1,560 h video / year & OR
Use Case of Re-finding Frames
•  Surgeons take „shots“
•  documentation, for patients, discussion
•  Shots are intentionally framed
•  and make for excellent
representative images
Approach
•  Temporal sampling: every 5th frame
•  Indexing and search based on
•  a set of global features
•  or a localized global features
Late Fusion for Global Features
Features Employed
•  Pyramid HOG
•  extensive and large texture feature
•  Color and Edge Directivity Descriptor
•  compact and well performing joint histogram
•  SIMPLE
•  CEDD descriptors of patches at SURF key points
Data Set
•  33 hours of video
•  from actual procedures focusing on laporoscopy
•  1,276 videos in total
•  593,446 frames after temporal sampling
Example Results - SIMPLE
Evaluation – Re-Finding in Numbers
•  Randomly selected more than 700 shots
•  Excluding tests, white balance and out-of-patient
•  Resulting in 600 sample queries
Evaluation – Re-Finding in Numbers
•  Hypothesis I: every 5th frame is enough to re-find
images.
•  Hypothesis II: There is a noticeable difference
between global and local features.
Evaluation – Re-Finding in Numbers
Evaluation – User Study
•  Exploratory study, thinking aloud test
•  Interactive web page presented to users
•  ten cases with all available shots as queries
•  three non-labeled search engines
Evaluation – User Study
Evaluation – User Study
•  Population drawn from our projects
•  experts in processing endoscopic videos
•  well-aware of the requirements surgeons registered
•  Task was to ...
•  browse diverse results and
•  voice drawbacks and benefits
Findings
•  Sampling every 5th frame works (with headroom)
•  Study participants noted that
•  late fusion works as expected and yields
interesting results besides near duplicates
•  SIMPLE works better for semantically similar
content, ie. translated instruments, etc.
Conclusions
•  The system does not utilize
•  domain dependent methods and heuristics
•  run-time and storage demanding methods
•  Still, it works out for the use case as a
•  candidate support system for surgeons
•  baseline to start on interactive video retrieval for
laporoscopy.
Future Work
•  Salient contours of images
•  focus on being robust against lighting and noise
Future Work
credits for feature & images: Chryssanthi Iakovidou
Future Work
credits for feature & images: Chryssanthi Iakovidou
Time for questions?
Mathias Lux
± Associate Professor @ Klagenfurt University, Austria
mlux@itec.aau.at
Thanks go to Jennifer Roldan Carlos, Xavier Giro-i-Nieto,
Pia Munoz & Nektarios Anagnostopoulos

Weitere ähnliche Inhalte

Ähnlich wie Visual Information Retrieval in Endoscopic Video Archives

Requirements engineering scenario based software requirement specification
Requirements engineering scenario based software requirement specificationRequirements engineering scenario based software requirement specification
Requirements engineering scenario based software requirement specification
Wolfgang Kuchinke
 
UX Research in an Agile World
UX Research in an Agile WorldUX Research in an Agile World
UX Research in an Agile World
Hirajaved10
 
evaluation technique uni 2
evaluation technique uni 2evaluation technique uni 2
evaluation technique uni 2
vrgokila
 
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptxROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
aditisikarwar2
 

Ähnlich wie Visual Information Retrieval in Endoscopic Video Archives (20)

Chapter 8 Evaluation Techniques
Chapter 8 Evaluation  TechniquesChapter 8 Evaluation  Techniques
Chapter 8 Evaluation Techniques
 
Research Proposal Presentation Pitch
Research Proposal Presentation PitchResearch Proposal Presentation Pitch
Research Proposal Presentation Pitch
 
Conducting Remote Unmoderated Usability Testing: Part 1 - RemoteUX Training W...
Conducting Remote Unmoderated Usability Testing: Part 1 - RemoteUX Training W...Conducting Remote Unmoderated Usability Testing: Part 1 - RemoteUX Training W...
Conducting Remote Unmoderated Usability Testing: Part 1 - RemoteUX Training W...
 
Visual Search for Musical Performances and Endoscopic Videos
Visual Search for Musical Performances and Endoscopic VideosVisual Search for Musical Performances and Endoscopic Videos
Visual Search for Musical Performances and Endoscopic Videos
 
Chapter 8 eval. tech. lesson 2
Chapter 8 eval. tech. lesson 2Chapter 8 eval. tech. lesson 2
Chapter 8 eval. tech. lesson 2
 
Human Computer Interaction Evaluation
Human Computer Interaction EvaluationHuman Computer Interaction Evaluation
Human Computer Interaction Evaluation
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Requirements engineering scenario based software requirement specification
Requirements engineering scenario based software requirement specificationRequirements engineering scenario based software requirement specification
Requirements engineering scenario based software requirement specification
 
Website Usability & Eye-tracking by Marco Pretorious (Certified Usability Ana...
Website Usability & Eye-tracking by Marco Pretorious (Certified Usability Ana...Website Usability & Eye-tracking by Marco Pretorious (Certified Usability Ana...
Website Usability & Eye-tracking by Marco Pretorious (Certified Usability Ana...
 
UX Research in an Agile World
UX Research in an Agile WorldUX Research in an Agile World
UX Research in an Agile World
 
E3 chap-09
E3 chap-09E3 chap-09
E3 chap-09
 
e3-chap-09.ppt
e3-chap-09.ppte3-chap-09.ppt
e3-chap-09.ppt
 
Evaluation techniques
Evaluation techniquesEvaluation techniques
Evaluation techniques
 
Towards an Agile approach to building application profiles
Towards an Agile approach to building application profilesTowards an Agile approach to building application profiles
Towards an Agile approach to building application profiles
 
evaluation technique uni 2
evaluation technique uni 2evaluation technique uni 2
evaluation technique uni 2
 
How to Conduct Usability Studies: A Librarian Primer
How to Conduct Usability Studies: A Librarian PrimerHow to Conduct Usability Studies: A Librarian Primer
How to Conduct Usability Studies: A Librarian Primer
 
Log Analysis to Understand Medical Professionals' Image Searching Behaviour
Log Analysis to Understand Medical Professionals' Image Searching BehaviourLog Analysis to Understand Medical Professionals' Image Searching Behaviour
Log Analysis to Understand Medical Professionals' Image Searching Behaviour
 
N=10^9: Automated Experimentation at Scale
N=10^9: Automated Experimentation at ScaleN=10^9: Automated Experimentation at Scale
N=10^9: Automated Experimentation at Scale
 
Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)
 
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptxROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
 

Mehr von Universitat Politècnica de Catalunya

Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Universitat Politècnica de Catalunya
 

Mehr von Universitat Politècnica de Catalunya (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Deep Generative Learning for All
Deep Generative Learning for AllDeep Generative Learning for All
Deep Generative Learning for All
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
 
The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
 
Open challenges in sign language translation and production
Open challenges in sign language translation and productionOpen challenges in sign language translation and production
Open challenges in sign language translation and production
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
 
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in MinecraftDiscovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in Minecraft
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...
 
Intepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural NetworksIntepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural Networks
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
 
Curriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object SegmentationCurriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object Segmentation
 
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

Visual Information Retrieval in Endoscopic Video Archives

  • 1. Visual Information Retrieval in Endoscopic Video Archives Jennifer Roldan Carlos, Mathias Lux, Xavier Giro-i-Nieto, Pia Munoz & Nektarios Anagnostopoulos
  • 2. Motivation •  Surgery videos are taken every day •  Operations rooms are fully booked •  Many procedures already involve video •  Storing videos is / will be req. by law
  • 3. Amount of Videos •  8-10 h operations / room and day •  say 6 hours excluding set ups, etc. •  5-6 days a week •  1,560 h video / year & OR
  • 4. Use Case of Re-finding Frames •  Surgeons take „shots“ •  documentation, for patients, discussion •  Shots are intentionally framed •  and make for excellent representative images
  • 5. Approach •  Temporal sampling: every 5th frame •  Indexing and search based on •  a set of global features •  or a localized global features
  • 6. Late Fusion for Global Features
  • 7. Features Employed •  Pyramid HOG •  extensive and large texture feature •  Color and Edge Directivity Descriptor •  compact and well performing joint histogram •  SIMPLE •  CEDD descriptors of patches at SURF key points
  • 8. Data Set •  33 hours of video •  from actual procedures focusing on laporoscopy •  1,276 videos in total •  593,446 frames after temporal sampling
  • 10. Evaluation – Re-Finding in Numbers •  Randomly selected more than 700 shots •  Excluding tests, white balance and out-of-patient •  Resulting in 600 sample queries
  • 11. Evaluation – Re-Finding in Numbers •  Hypothesis I: every 5th frame is enough to re-find images. •  Hypothesis II: There is a noticeable difference between global and local features.
  • 13. Evaluation – User Study •  Exploratory study, thinking aloud test •  Interactive web page presented to users •  ten cases with all available shots as queries •  three non-labeled search engines
  • 15. Evaluation – User Study •  Population drawn from our projects •  experts in processing endoscopic videos •  well-aware of the requirements surgeons registered •  Task was to ... •  browse diverse results and •  voice drawbacks and benefits
  • 16. Findings •  Sampling every 5th frame works (with headroom) •  Study participants noted that •  late fusion works as expected and yields interesting results besides near duplicates •  SIMPLE works better for semantically similar content, ie. translated instruments, etc.
  • 17. Conclusions •  The system does not utilize •  domain dependent methods and heuristics •  run-time and storage demanding methods •  Still, it works out for the use case as a •  candidate support system for surgeons •  baseline to start on interactive video retrieval for laporoscopy.
  • 18. Future Work •  Salient contours of images •  focus on being robust against lighting and noise
  • 19. Future Work credits for feature & images: Chryssanthi Iakovidou
  • 20. Future Work credits for feature & images: Chryssanthi Iakovidou
  • 21. Time for questions? Mathias Lux ± Associate Professor @ Klagenfurt University, Austria mlux@itec.aau.at Thanks go to Jennifer Roldan Carlos, Xavier Giro-i-Nieto, Pia Munoz & Nektarios Anagnostopoulos