Adaptive Video and Metadata Display using Multimedia Documents
Cyril Concolato
ACM MM 2010 / SAPMIA Workshop, 29/10/2010
Personalized Video Viewing with ROI: Related Works
  • Previous works
      • “The big picture on small screens delivering acceptable video quality in mobile TV”, Knoche et al., TOMCCAP 2009: discusses the best zooming factor depending on the content
      • “Adding dynamic visual manipulations to declarative multimedia documents”, Kuijk et al., DocEng 2009: zooming onto pictures and creating animated camera motions
      • “Animated Picture Presentation Steered by Natural Language”, Reiterer et al., UCMedia 2009: virtual camera motion driven by ROI and textual description
  • More recent works @ ACM MM 2010
      • “Crowd-sourced Automatic Zoom and Scroll for Video Retargeting”, Carlier et al.: learning the ROI from user interaction and creating a retargeted video based on the ROI
      • “Impact of Zooming and Enhancing Region of Interests for Optimizing User Experience on Mobile Sports Video”, Song et al.: user study on the usefulness of ROI for improving the user experience
      • “Video Retargeting for Aesthetic Enhancement”, Xiang et al.: automatic ROI detection and video creation
Our approach vs. related works
  • Automatic ROI detection (RWTH Aachen)
      • Similar to existing works with specific detection
  • Differentiated H.264|AVC encoding (IBBT-MMLAB)
      • Balanced encoding between background and ROIs
  • Use of a rich media document
      • To display video
      • To let the user select a ROI and zoom or not
      • To show additional metadata with adaptation features
  Reference: “Annotation based personalized adaptation and presentation of videos for mobile applications”, S. De Bruyne, P. Hosten, C. Concolato, M. Asbach, J. De Cock, M. Unger, J. Le Feuvre and R. Van de Walle, Multimedia Tools and Applications, 2011, DOI: 10.1007/s11042-010-0575-2.
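
As a rough illustration of the "select a ROI and zoom" step, the sketch below computes the scale and offset that make a chosen ROI fill the screen while keeping its aspect ratio. The function names (`zoomToRoi`, `toScreen`) and the rectangle shape `{x, y, width, height}` are assumptions made for this example; the slides do not describe how the zoom is actually computed in the BIFS scene.

```javascript
// Hypothetical helper, not taken from the presented system:
// compute the transform that makes a ROI fill the screen.
// roi:    {x, y, width, height} in video pixels
// screen: {width, height} in display pixels
function zoomToRoi(roi, screen) {
  // Use the smaller ratio so the whole ROI stays visible (aspect ratio preserved).
  const scale = Math.min(screen.width / roi.width, screen.height / roi.height);
  // Center the scaled ROI on the screen.
  const offsetX = (screen.width - roi.width * scale) / 2 - roi.x * scale;
  const offsetY = (screen.height - roi.height * scale) / 2 - roi.y * scale;
  return { scale, offsetX, offsetY };
}

// Map a point from video coordinates to screen coordinates.
function toScreen(point, t) {
  return { x: point.x * t.scale + t.offsetX, y: point.y * t.scale + t.offsetY };
}

// Example: a 160x120 ROI shown on a 320x240 screen.
const t = zoomToRoi({ x: 100, y: 50, width: 160, height: 120 }, { width: 320, height: 240 });
console.log(t, toScreen({ x: 100, y: 50 }, t));
```

Leaving the transform at scale 1 with zero offsets corresponds to the "or not" case, where the user keeps the full video.
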
Our System Principles
  • Generate rich media documents from video annotations
      • Based on semi-automatic annotations
      • Based on templates
  • Hierarchical Rich Media Documents
      • MPEG-4 BIFS for synchronized & interactive ROI
      • W3C SVG & JavaScript for adaptive metadata layout & interaction
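
To make the template idea concrete, here is a minimal sketch of generating an SVG fragment from a list of annotations. The annotation fields (`id`, `title`) and the functions `escapeXml`, `renderItem` and `renderMetadata` are hypothetical; the actual annotation format and templates of the system are not given in the slides.

```javascript
// Hypothetical template-based generation; the annotation shape is assumed.
// Escape text so it can be embedded safely in XML content and attributes.
function escapeXml(text) {
  return String(text)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Template for one metadata item: a text line inside an identified group.
function renderItem(annotation, index) {
  const y = 20 + index * 20; // fixed line spacing, chosen only for the example
  return (
    `<g id="${escapeXml(annotation.id)}">` +
    `<text x="10" y="${y}">${escapeXml(annotation.title)}</text>` +
    `</g>`
  );
}

// Template for the whole metadata document.
function renderMetadata(annotations, width, height) {
  const body = annotations.map(renderItem).join("");
  return (
    `<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 ${width} ${height}">` +
    body +
    `</svg>`
  );
}

console.log(renderMetadata([{ id: "roi1", title: "Player A & B" }], 320, 240));
```
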
Adaptive Rich Media Documents
  • Part of a global problem of media adaptation (e.g. MPEG-21 DIA)
  • Specificities of documents
      • Structured information (e.g. XML)
      • The use of media
      • The spatial organization (2D/3D, …)
      • The temporal aspects (animations, synchronization, …)
      • The interactive behavior (events, modifications)
  • Existing methods for document adaptation
      • Alternatives/switch between document branches
      • Constraint solving
      • Interpolation between key scenes (e.g. automatic layout, “artistic resizing”)
      • Scalable documents
Example of spatial adaptation of Rich Media Documents
Our choices in this work
  • Adaptation based on constraint solving
      • Screen size, video size, quantity/type of metadata to display
      • Author directives, e.g. priority of text over images, relative positioning of elements, …
  • Compiled into a JavaScript algorithm
      • Included in the rich media document
      • Executed at runtime
  • Results
      • Size and positions of metadata, font size, split of metadata over several pages, …
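
The following sketch shows how such a runtime layout step could look. It is a simple greedy procedure written for illustration, not the algorithm used in the presented system. It assumes that each metadata item occupies one line, that the metadata panel takes the larger free strip beside or below the video, and that the author directive is "text before images". The function `layoutMetadata` and the options `minFont`, `maxFont` and `maxPages` are hypothetical names.

```javascript
// Hypothetical greedy layout, for illustration only.
// screen, video: {width, height}; items: [{id, type: "text" | "image"}]
function layoutMetadata(screen, video, items, { minFont = 10, maxFont = 18, maxPages = 1 } = {}) {
  // Constraint 1: place the panel in the larger free strip next to the video.
  const right = { x: video.width, y: 0, width: screen.width - video.width, height: screen.height };
  const below = { x: 0, y: video.height, width: screen.width, height: screen.height - video.height };
  const area = (r) => Math.max(r.width, 0) * Math.max(r.height, 0);
  const panel = area(right) >= area(below) ? right : below;

  // Constraint 2 (author directive): text items have priority over images.
  const rank = (item) => (item.type === "text" ? 0 : 1);
  const ordered = [...items].sort((a, b) => rank(a) - rank(b));

  // Split the ordered items into pages for a given font size (one line per item).
  const paginate = (font) => {
    const lineHeight = Math.ceil(font * 1.2);
    const perPage = Math.floor(panel.height / lineHeight);
    if (perPage < 1) return null; // not even one line fits
    const pages = [];
    ordered.forEach((item, i) => {
      const page = Math.floor(i / perPage);
      if (!pages[page]) pages[page] = [];
      pages[page].push({ id: item.id, x: panel.x, y: panel.y + (i % perPage) * lineHeight });
    });
    return { font, lineHeight, pages };
  };

  // Constraint 3: prefer the largest font that fits within maxPages pages.
  for (let font = maxFont; font >= minFont; font--) {
    const result = paginate(font);
    if (result && result.pages.length <= maxPages) return { panel, ...result };
  }
  // Otherwise keep the minimum font and split over as many pages as needed.
  const fallback = paginate(minFont);
  return fallback ? { panel, ...fallback } : { panel, font: minFont, lineHeight: 0, pages: [] };
}

const items = [
  { id: "a", type: "image" },
  { id: "b", type: "text" },
  { id: "c", type: "text" },
  { id: "d", type: "image" },
];
console.log(JSON.stringify(layoutMetadata({ width: 480, height: 320 }, { width: 320, height: 240 }, items)));
```

The output has the same kinds of results as listed on the slide: a position per item, a font size, and a split over pages when the panel is too small.
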
Video and Metadata Display Results
  Reference: Le Feuvre, J., Concolato, C., and Moissinac, J. 2007. GPAC: open source multimedia framework. In Proceedings of the 15th International Conference on Multimedia (Augsburg, Germany, September 25-29, 2007). MULTIMEDIA '07. ACM, New York, NY, 1009-1012. DOI: http://doi.acm.org/10.1145/1291233.1291452
Demonstrations
Conclusions and Future Work
  • Functional proof of concept
      • How media annotations can leverage document adaptation
      • How different rich media languages can be mixed
      • How user preferences expressed by interactions can drive the adaptation
  • Many aspects can be improved
      • Add more constraints: pixel density, screen orientation, …
      • Improve the algorithm for constraint solving: better use of screen space
      • Work on the user interface
          • When ROIs don’t last long enough to be clicked
          • When many ROIs are present on the screen at the same time
          • When the font size is too small
      • User studies
  • Future work
      • Authoring of adaptive documents
Thank you for your attention!
Questions? Suggestions?
cyril.concolato@telecom-paristech.fr
