SlideShare ist ein Scribd-Unternehmen logo
1 von 22
RELIVING ON DEMAND: A TOTAL VIEWER
EXPERIENCE

           Vivek K. Singh1*, Jiebo Luo2, Dhiraj Joshi2,
          Phoury Lei2, Madirakshi Das2, Peter Stubler2


                              1 University
                              of California, Irvine,
            2 Kodak Research Laboratories, Rochester, NY,




            ACM International Conference on Multimedia – ACMM 2011

1             * Work was done when the author was interning at Kodak Research Laboratories, Eastman Kodak Company, Rochester, NY, USA.
Why do people take pictures?

 1. Digital re-living




 2. Sharing it with
   family and friends
What’s available today?

• Commercial Slideshows (Picasa, iPhoto, ACDsee):
  • Focus on visual appearance only.
  • Don’t understand/utilize semantics (except “FaceMovie”)
• Research efforts: Semantic analysis
  • No interaction
  • Interaction on demand
• Allow different users to dynamically re-direct the flow of
  media reliving experience
Platforms
   Desktop
   Digital frame
   HDTV
   Kodak Gallery
   Mobile
   Kiosk
Preview
 • Re-living of events in user’s life, based on WHO,
   WHERE, and WHEN .
Outline
 • Preview
 • Design principles
 • System design
 • Under the hood (sneak peek)
 • Evaluations
Design principles
 1. User controllable:
    • Responsive to user demand (overcoming intent gap)
 2. Semantically drivable:
    • Events as organizing units
    • Who, when, where; what
 3. Aesthetically pleasing:
    • Dynamic presentation
    • Multimodal (songs, images, videos)
Retrieval vs. Browsing vs. Reliving

• Media by itself is uninteresting unless it performs a
  function (e.g. reliving, sharing) for the human user
• Retrieval
  • Fetching data. Strong intent (e.g. search)
• Browsing
  • Piecemeal reliving. Weak intent (e.g. youtube)
• Reliving
  • Valuable middle ground.
  • Semantically re-direct the flow if desired.
System overview
System overview: Approach
Media data structure

                                                          Media
                               URL                      properties
                   Type              Height, width

                                                        Aesthetic
                                        Aesthetic IVI   properties
        location

                    subjects         dateTime           Semantic
                                                        properties


                                          Score         Suitability
                                                        properties
Pre-processing
                                       Media
                                      Collection




       Date and Time          Aesthetics Value           Face Detection    Location Information
        Extraction              Extraction                                      Extraction



                                                         Face Clustering


      Event Clustering                             Face Labeling
                                                                               Geographic
                                                                               Clustering



                         Metadata
                         Repository
Reordering of event list

• Basic idea


• Time


• People


• Location
Choosing layout
 • Default:




i=     2          3   4   5
Choose transitions

• If (criteria=time || criteria=loc)
   • Slide In/Out
• If (criteria=personi)
   • Face2Face transition



  Transform(θ1, trans.X                Transform(θ2, trans.X
  1,                                   2,
  trans.Y 1, scale 1)                  trans.Y2, scale 2)
Choose song

• If (criteria=time)
   • Select seasonal songs (easily extensible to finer grain)
• If (criteria=loc)
   • Select regional songs
• If (criteria=personi)
   • Select age-based songs (easily extensible to gender)
• Taken from a library of available songs
Show images
 • In time order
 • Higher score => more display time
 • Auto-zoom-crop
   • Find center to focus on
   • Match the aspect ratio required
 • Multiple Holes in transitions
   • Token passing amongst holes
   • Representative image as background
Logging user sessions
     <Interaction>
                      <Click>
                                    <GlobalEventID>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</GlobalEventID>
                                    <SortedEventID>0</SortedEventID
                                    <TimeStamp>10:17:47 AM</TimeStamp>
                                    <Criteria_type>gps</Criteria_type>
                                    <Criteria_value>61.2175937710438 , -149.898739309764</Criteria_value>
                                    <HotSpotClick>False</HotSpotClick>
                      </Click>
                      <Snapshot>
                                    <Locations>
                                                      <loc>-149.898739309764,61.2175937710438</loc>
                                                      <loc>-73.508556462585,40.5956603174603</loc>
                                                      <loc>102.757525301205,25.1018832329317</loc>
                                                      <loc>104.195397,35.86166</loc>
                                                      <loc>6.09306585111111,52.7236709366667</loc>
                                    </Locations>
                                    <People>
                                                      <peo>Jiebo</peo>
                                                      <peo>Joyce</peo>
                                                      <peo>Xinping</peo>
                                                      <peo></peo>
                                                      <peo></peo>
                                    </People>
                                    <SortedEvents>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve></eve>
                                    </SortedEvents>
                                    <PicsShown>
                                                      <pic>c:datajiebocvpr2008103_5972.jpg</pic>
                                                      <pic>c:datajiebocvpr2008103_5973.jpg</pic>
                                                      <pic>c:datajiebolijiang-shangrila-day2108_0043.jpg</pic>
                                                      <pic>c:datajiebolijiang-shangrila-day2108_0044.jpg</pic>
                                    </PicsShown>
                      </Snapshot>
     </Interaction>
Evaluations
• Experiments with 11 families
• 35 user interaction sessions logged
          Age of contributing photographers         23 to 56

          No. of images/ videos in the collection   2,091 to 10,522

          No. of calendar years in time span        3 to 10

          No. of tagged people in the collection    26 to 137

          No. of places in the collection           19 to 45

• Roles
  • 1st person (owner)
  • 2nd person (immediate family)
  • 3rd person (friends, cousins )
Experiment 1: Comparison with commercially available
options
6.2 Experiment 2: Use of different features across
different user demographics
         Females   1.14      1.49            1.13         1.01
         Males     1.41      1.25            2.08         1.43
          Both     1.30      1.27            1.28         1.35
                   All     1st party       2nd party    3rd party
                                                       Active Vs Passive?




         Clicks per axis               Stickiness :Time spent after clicks
Future work
 • Choosing songs more generically/smartly
 • Choosing optimal spatio-temporal placement of
   images in the slide show
   • Choosing layout
   • Choosing transition time?
 • Supporting multiple axes simultaneously
 • Previews

Weitere ähnliche Inhalte

Ähnlich wie Reliving on demand a total viewer experience

Geo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extractionGeo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extraction
Conor Mc Elhinney
 

Ähnlich wie Reliving on demand a total viewer experience (19)

A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
 
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research ProgramFast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
 
Geospatial for Java
Geospatial for JavaGeospatial for Java
Geospatial for Java
 
The Seven Wastes of Software Development
The Seven Wastes of Software DevelopmentThe Seven Wastes of Software Development
The Seven Wastes of Software Development
 
3D打印:從想像到現實
3D打印:從想像到現實3D打印:從想像到現實
3D打印:從想像到現實
 
Understand Your Festival
Understand Your FestivalUnderstand Your Festival
Understand Your Festival
 
Class 5
Class 5Class 5
Class 5
 
Android Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applicationsAndroid Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applications
 
UMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group PresentationUMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group Presentation
 
Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...
 
Presentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes LabPresentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes Lab
 
Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )
 
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
 
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
 
Human Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesHuman Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD Features
 
7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility
 
Geo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extractionGeo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extraction
 
EBImage - Short Overview
EBImage - Short OverviewEBImage - Short Overview
EBImage - Short Overview
 
Capital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial RecognitionCapital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial Recognition
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Reliving on demand a total viewer experience

  • 1. RELIVING ON DEMAND: A TOTAL VIEWER EXPERIENCE Vivek K. Singh1*, Jiebo Luo2, Dhiraj Joshi2, Phoury Lei2, Madirakshi Das2, Peter Stubler2 1 University of California, Irvine, 2 Kodak Research Laboratories, Rochester, NY, ACM International Conference on Multimedia – ACMM 2011 1 * Work was done when the author was interning at Kodak Research Laboratories, Eastman Kodak Company, Rochester, NY, USA.
  • 2. Why do people take pictures? 1. Digital re-living 2. Sharing it with family and friends
  • 3. What’s available today? • Commercial Slideshows (Picasa, iPhoto, ACDsee): • Focus on visual appearance only. • Don’t understand/utilize semantics (except “FaceMovie”) • Research efforts: Semantic analysis • No interaction • Interaction on demand • Allow different users to dynamically re-direct the flow of media reliving experience
  • 4. Platforms  Desktop  Digital frame  HDTV  Kodak Gallery  Mobile  Kiosk
  • 5. Preview • Re-living of events in user’s life, based on WHO, WHERE, and WHEN .
  • 6. Outline • Preview • Design principles • System design • Under the hood (sneak peek) • Evaluations
  • 7. Design principles 1. User controllable: • Responsive to user demand (overcoming intent gap) 2. Semantically drivable: • Events as organizing units • Who, when, where; what 3. Aesthetically pleasing: • Dynamic presentation • Multimodal (songs, images, videos)
  • 8. Retrieval vs. Browsing vs. Reliving • Media by itself is uninteresting unless it performs a function (e.g. reliving, sharing) for the human user • Retrieval • Fetching data. Strong intent (e.g. search) • Browsing • Piecemeal reliving. Weak intent (e.g. youtube) • Reliving • Valuable middle ground. • Semantically re-direct the flow if desired.
  • 11. Media data structure Media URL properties Type Height, width Aesthetic Aesthetic IVI properties location subjects dateTime Semantic properties Score Suitability properties
  • 12. Pre-processing Media Collection Date and Time Aesthetics Value Face Detection Location Information Extraction Extraction Extraction Face Clustering Event Clustering Face Labeling Geographic Clustering Metadata Repository
  • 13. Reordering of event list • Basic idea • Time • People • Location
  • 14. Choosing layout • Default: i= 2 3 4 5
  • 15. Choose transitions • If (criteria=time || criteria=loc) • Slide In/Out • If (criteria=personi) • Face2Face transition Transform(θ1, trans.X Transform(θ2, trans.X 1, 2, trans.Y 1, scale 1) trans.Y2, scale 2)
  • 16. Choose song • If (criteria=time) • Select seasonal songs (easily extensible to finer grain) • If (criteria=loc) • Select regional songs • If (criteria=personi) • Select age-based songs (easily extensible to gender) • Taken from a library of available songs
  • 17. Show images • In time order • Higher score => more display time • Auto-zoom-crop • Find center to focus on • Match the aspect ratio required • Multiple Holes in transitions • Token passing amongst holes • Representative image as background
  • 18. Logging user sessions <Interaction> <Click> <GlobalEventID>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</GlobalEventID> <SortedEventID>0</SortedEventID <TimeStamp>10:17:47 AM</TimeStamp> <Criteria_type>gps</Criteria_type> <Criteria_value>61.2175937710438 , -149.898739309764</Criteria_value> <HotSpotClick>False</HotSpotClick> </Click> <Snapshot> <Locations> <loc>-149.898739309764,61.2175937710438</loc> <loc>-73.508556462585,40.5956603174603</loc> <loc>102.757525301205,25.1018832329317</loc> <loc>104.195397,35.86166</loc> <loc>6.09306585111111,52.7236709366667</loc> </Locations> <People> <peo>Jiebo</peo> <peo>Joyce</peo> <peo>Xinping</peo> <peo></peo> <peo></peo> </People> <SortedEvents> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve></eve> </SortedEvents> <PicsShown> <pic>c:datajiebocvpr2008103_5972.jpg</pic> <pic>c:datajiebocvpr2008103_5973.jpg</pic> <pic>c:datajiebolijiang-shangrila-day2108_0043.jpg</pic> <pic>c:datajiebolijiang-shangrila-day2108_0044.jpg</pic> </PicsShown> </Snapshot> </Interaction>
  • 19. Evaluations • Experiments with 11 families • 35 user interaction sessions logged Age of contributing photographers 23 to 56 No. of images/ videos in the collection 2,091 to 10,522 No. of calendar years in time span 3 to 10 No. of tagged people in the collection 26 to 137 No. of places in the collection 19 to 45 • Roles • 1st person (owner) • 2nd person (immediate family) • 3rd person (friends, cousins )
  • 20. Experiment 1: Comparison with commercially available options
  • 21. 6.2 Experiment 2: Use of different features across different user demographics Females 1.14 1.49 1.13 1.01 Males 1.41 1.25 2.08 1.43 Both 1.30 1.27 1.28 1.35 All 1st party 2nd party 3rd party Active Vs Passive? Clicks per axis Stickiness :Time spent after clicks
  • 22. Future work • Choosing songs more generically/smartly • Choosing optimal spatio-temporal placement of images in the slide show • Choosing layout • Choosing transition time? • Supporting multiple axes simultaneously • Previews

Hinweis der Redaktion

  1. People don’t want to see images, they want to re-live and share the experiences