Multimedia Content Based Retrieval GovindarajuHujigal govin.tech1@gmail.com
Content-based retrieval in multimedia: an important research area and a challenging problem, since multimedia data needs detailed interpretation from pixel values. Different strategies exist in terms of syntactic and semantic indexing for retrieval.
Why do we need MCBR ? How do I find what I’m looking for?!
Multimedia content retrieval: multimedia and storage technology has led to the building of large repositories of digital image, video, and audio data. Compared to text search, assigning text labels is a massively labor-intensive effort. The focus is on calculating statistics that can be approximately correlated with the content features without costly human interaction.
Multimedia content retrieval: search based on syntactic features (shape, texture, color histogram) is relatively undemanding. Search based on semantic features relies on human perception, for example “list all dogs that look like cats”, “city”, “landscape”, “cricket”.
Syntactic indexing: use syntactic features as the basis for matching and employ either a query-through-dialog or a query-by-example box to interface with the user. In query-through-dialog, the user enters words describing the image. Query-through-dialog is not convenient, as the user needs to know the exact details of attributes such as shape, color and texture.
Image descriptors – Color: apples are red, but tomatoes are too!
Image descriptors – Texture: texture differentiates between a lawn and a forest.
Syntactic indexing, query by example: the user is shown example images and chooses the closest. Various features of the chosen image, such as color, shape, texture and spatial distribution, are evaluated and matched against the images in the database using a similarity or distance metric. For video, key frames of the video clips that are close to the user query are shown.
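The query-by-example matching described above can be sketched with a color histogram and a distance metric. This is a minimal illustration, not the method of any particular system: the function names, the 4-bins-per-channel quantization, the L1 distance and the toy pixel data are all assumptions made for the example.

```python
from collections import Counter

def color_histogram(pixels, bins_per_channel=4):
    """Normalized, coarsely quantized RGB histogram of a list of (r, g, b) pixels."""
    step = 256 // bins_per_channel
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = len(pixels)
    return {bin_id: n / total for bin_id, n in counts.items()}

def histogram_distance(h1, h2):
    """L1 distance between two normalized histograms (0 = identical, 2 = disjoint)."""
    return sum(abs(h1.get(k, 0.0) - h2.get(k, 0.0)) for k in set(h1) | set(h2))

def query_by_example(example_pixels, database, top_k=3):
    """Rank database images (name -> pixel list) by distance to the example image."""
    query = color_histogram(example_pixels)
    scored = [(histogram_distance(query, color_histogram(px)), name)
              for name, px in database.items()]
    return sorted(scored)[:top_k]

# Toy data (hypothetical): each "image" is a flat list of RGB pixels.
database = {
    "apple":  [(200, 20, 30)] * 90 + [(40, 120, 40)] * 10,
    "tomato": [(210, 30, 25)] * 95 + [(30, 110, 30)] * 5,
    "lawn":   [(30, 160, 40)] * 100,
}
example = [(205, 25, 20)] * 100  # a mostly red query image
ranking = query_by_example(example, database)
```

With this toy data the red query ranks both the apple and the tomato far ahead of the lawn, which illustrates why color alone cannot tell the two red objects apart.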
Syntactic indexing, query by example, limitations: an image can be annotated and interpreted in many ways. For example, one user may be interested in a waterfall, another in a mountain and yet another in the sky, although all of them may be present in the same image. The user may wonder "why do these two images look similar?" or "what specific parts of these images are contributing to the similarity?". The user is required to know the search structure and other details to search the database efficiently. The search requires many comparisons, and depending on the threshold there may be too many results.
Semantic indexing
Semantic content contains high-level concepts such as objects and events.
As humans think in terms of events and remember different events and objects after watching a video, these high-level concepts are the most important cues in content-based retrieval. Take a soccer game as an example: humans usually remember goals, interesting actions, red cards, etc.
Motion feature as indexing cue: spatial scene analysis on video can be fully transferred from CBIR, but temporal analysis is what is unique to video. Temporal information induces the concept of motion for the objects present in the document.
Motion feature as indexing cue: levels of video analysis.
Frame level: each frame is treated separately. There is no temporal analysis at this level.
Shot level: a shot is a set of contiguous frames all acquired through a continuous camera recording. Only the temporal information is used.
Scene level: a scene is a set of contiguous shots having a common semantic significance.
Video level: the complete video object is treated as a whole.
Motion feature as indexing cue: the three types of shot transition are as follows.
Cut: a sharp boundary between shots. This generally implies a peak in the difference between the color or motion histograms of the two frames surrounding the cut.
Dissolve: the content of the last images of the first shot is continuously mixed with that of the first images of the second shot.
Wipe: the images of the second shot continuously cover, or push out of the display, those of the first shot.
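The cut criterion above, a peak in the histogram difference between neighboring frames, can be sketched as follows. This is a simplified illustration under stated assumptions: frames are flat lists of 0-255 intensity values, `gray_histogram` and `detect_cuts` are hypothetical names, and the fixed threshold of 0.5 is an arbitrary choice rather than a recommended value.

```python
def gray_histogram(frame, bins=8):
    """Normalized intensity histogram of a frame given as a flat list of 0-255 values."""
    step = 256 // bins
    hist = [0.0] * bins
    for value in frame:
        hist[value // step] += 1.0 / len(frame)
    return hist

def detect_cuts(frames, threshold=0.5):
    """Return each index i where a cut is assumed between frame i-1 and frame i."""
    hists = [gray_histogram(f) for f in frames]
    cuts = []
    for i in range(1, len(hists)):
        # L1 difference between the histograms of two consecutive frames.
        diff = sum(abs(a - b) for a, b in zip(hists[i - 1], hists[i]))
        if diff > threshold:  # treated as a peak, i.e. a sharp boundary
            cuts.append(i)
    return cuts

# Toy video (hypothetical): three dark frames followed by three bright frames.
dark = [20, 25, 30, 35] * 16
bright = [220, 225, 230, 235] * 16
frames = [dark, dark, dark, bright, bright, bright]
cuts = detect_cuts(frames)
```

A dissolve or wipe spreads the change over many frames, so a per-pair threshold like this one would likely miss it; detecting gradual transitions needs a comparison over a longer window.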
Motion feature as indexing cue: it is often through motion that the content of a video is expressed and the attention of viewers captivated. Query techniques: a set of motion vector trajectories is mapped to a set of objects, and the visual query can be ‘player’ [Dimitrova]. An animated sketch can be used to formulate queries; motion and temporal duration are the key attributes assigned to each object in the sketch, in addition to the usual attributes such as shape, color and texture [VideoQ].
Matching techniques: a method of finding similarity between two sets of multimedia data, which can be either images or videos. Search is based on features like location, colors and concepts, examples of which are ‘mostly red’, ‘sunset’, ‘yellow flowers’, etc. The user specifies the relative weights of the features or assigns equal weights. Automatically identifying the relevance of the features is under active research.
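The effect of user-specified feature weights can be sketched with a weighted distance. This is an assumed formulation for illustration only: the feature names, the scores in [0, 1] and the weighted sum of absolute differences are hypothetical and are not taken from a specific system.

```python
def weighted_distance(query, candidate, weights):
    """Weighted sum of per-feature absolute differences; weights are normalized to sum to 1."""
    total = sum(weights.values())
    return sum(weights[name] / total * abs(query[name] - candidate[name])
               for name in weights)

def rank(query, database, weights=None):
    """Sort database entries by weighted distance; equal weights are used if none are given."""
    if weights is None:
        weights = {name: 1.0 for name in query}
    return sorted(database,
                  key=lambda item: weighted_distance(query, database[item], weights))

# Hypothetical per-image feature scores in [0, 1].
database = {
    "sunset":         {"redness": 0.9, "texture": 0.2, "brightness": 0.8},
    "yellow_flowers": {"redness": 0.3, "texture": 0.7, "brightness": 0.4},
    "brick_wall":     {"redness": 0.8, "texture": 0.9, "brightness": 0.5},
}
query = {"redness": 0.9, "texture": 0.85, "brightness": 0.5}

equal = rank(query, database)
color_first = rank(query, database,
                   {"redness": 8.0, "texture": 1.0, "brightness": 1.0})
```

With equal weights the textured yellow flowers rank ahead of the sunset; once the user weights redness heavily, the sunset moves up, which shows how the choice of weights changes the result list.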
Learning methods in retrieval: the user generates both positive and negative retrieval examples (relevance feedback). Each image can represent multiple concepts. To resolve this ambiguity, each image is modeled as a bag of instances (sub-blocks of the image). A bag is labeled as a positive example of a concept if at least one of its instances represents the concept, which could be a car or a waterfall scene. If no instance represents the concept, the bag is labeled as a negative example. The concept is learned from a small collection of positive and negative examples and is then used to retrieve images containing a similar concept from the database.
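The bag-of-instances idea can be sketched as below. This is a deliberately crude stand-in for a real multiple-instance learner: the scoring rule in `learn_concept`, the 2-D feature vectors and all names are assumptions made for illustration, not the method the slides refer to.

```python
import math

def bag_distance(bag, concept):
    """Distance from a bag to a concept point: the distance of its closest instance."""
    return min(math.dist(instance, concept) for instance in bag)

def learn_concept(positive_bags, negative_bags):
    """Pick the positive-bag instance that is close to every positive bag
    and far from every negative bag (a crude heuristic, assumed for this sketch)."""
    candidates = [inst for bag in positive_bags for inst in bag]

    def score(candidate):
        near = sum(bag_distance(b, candidate) for b in positive_bags)
        far = sum(bag_distance(b, candidate) for b in negative_bags)
        return near - far

    return min(candidates, key=score)

def retrieve(database, concept, top_k=2):
    """Rank images (name -> bag of instance feature vectors) by bag distance."""
    return sorted(database,
                  key=lambda name: bag_distance(database[name], concept))[:top_k]

# Hypothetical 2-D features per sub-block; "waterfall" blocks cluster near (0.9, 0.9).
positive_bags = [[(0.9, 0.9), (0.1, 0.2)], [(0.88, 0.93), (0.5, 0.1)]]
negative_bags = [[(0.1, 0.25), (0.5, 0.15)], [(0.3, 0.3)]]
concept = learn_concept(positive_bags, negative_bags)

database = {
    "img_a": [(0.2, 0.2), (0.91, 0.9)],
    "img_b": [(0.4, 0.1), (0.3, 0.35)],
    "img_c": [(0.85, 0.95), (0.6, 0.6)],
}
results = retrieve(database, concept)
```

Because a bag's distance is taken from its closest instance, an image is retrieved as soon as one sub-block resembles the concept, regardless of what the other sub-blocks contain.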
Learning methods in retrieval: the ability to infer high-level understanding from multimedia content has proven to be a difficult goal to achieve. Take, for example, the category “John eating ice cream”. Such categories might require sophisticated scene-understanding algorithms along with an understanding of the spatio-temporal relationships between entities (for instance, the behavior of eating can be characterized as repeatedly putting something edible in the mouth).
Structure in multimedia content: to achieve efficiency in content production, and because of the limited number of available resources, standard techniques are employed. The intention of video making is to represent an action or to evoke emotions using various storytelling methods. Figure 1 gives an analysis of the basic shot-transition techniques that are used to convey particular intentions.
Structure in multimedia content: news has a special structure of ‘begin shot’, ‘newscaster shot’, ‘interview’, ‘weather forecast’, etc., from which a video model of news can be built. Car-race video has unusual zoom-in and zoom-out; basketball has left-panning and right-panning that last for a certain maximum duration. The motion activity in interesting sports shots is higher than in the surrounding shots, and so on.
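The observation that interesting sports shots show more motion than their neighbors can be turned into a simple heuristic. The sketch below is only an assumption-laden illustration: the per-shot activity values, the function name and the ratio of 1.5 are hypothetical.

```python
def interesting_shots(motion_activity, ratio=1.5):
    """Return indices of shots whose motion activity exceeds the mean of their
    immediate neighbours by the given ratio (an assumed threshold)."""
    picked = []
    for i, activity in enumerate(motion_activity):
        # Previous shot (if any) plus next shot (if any).
        neighbours = motion_activity[max(0, i - 1):i] + motion_activity[i + 1:i + 2]
        if neighbours and activity > ratio * (sum(neighbours) / len(neighbours)):
            picked.append(i)
    return picked

# Hypothetical per-shot motion activity (e.g. mean motion-vector magnitude).
activity = [1.0, 1.2, 4.0, 1.1, 0.9, 3.5, 1.0]
highlights = interesting_shots(activity)
```

Comparing each shot with its surroundings rather than with a global threshold keeps the heuristic usable across videos whose overall motion levels differ.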
Future of CBR systems: there is ambiguity in making such conclusions; for example, a dissolve can be due either to a ‘flashback’ or to a ‘time lapse’. If the number of dissolves is two, it is most probably a ‘flashback’. MPEG-7, the “Multimedia Content Description Interface”, specifies a standard set of descriptors that can be used to describe various types of multimedia information. Make a collaborative effort to tag multimedia.
Commercial systems – Like.com
Conclusions: systematic exploration of the construction of high-level indexes is lacking. None of the work has considered exploring features close to human perception. In summary, there is a great need to extract semantic indices to make CBR systems serviceable to the user. Though extracting all such indices might not be possible, there is great scope for furnishing the semantic indices with a certain well-established structure.

Conclusions: content-based video indexing and retrieval is an active area of research with continuing contributions from several domains, including image processing, computer vision, database systems and artificial intelligence.