SlideShare ist ein Scribd-Unternehmen logo
1 von 69
Multimedia Semantics:
 Metadata, Analysis and Interaction

Raphael Troncy <raphael.troncy@eurecom.fr>
Multimedia Semantics, EURECOM
Some BIG numbers
 User Generated Content (Jul'09)
    3.7+ billion photos
    10+ billion photos
    110+ million videos
             20 hours uploaded / min ≈ 75 000 full length movies / week

 Archived TV content
    1.5 million hours ≈ 120 km of shelves
    300000 hours | 1 petabyte / year

 News content


 Content difficult to search and reuse
    Barely invisible for the search engines
   04/08/2009 -        Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -2
Image/Video indexing

 Techniques used by mainstream search engines
    search term occurs in the filename or in the caption or in user tags
    no semantics
 Image indexing: main problem
    an image is not alphabetic: there is no countable discrete units, that, in
    combination will provide the meaning of the image
    image descriptors are not given with the image: one needs to extract or
    interpret them
 Video indexing: additional problem
    a video has additionally a temporal dimension to take into account
    a video has a priori no discrete units neither (i.e. frames, shots, sequences
    cannot be absolutely defined)




   04/08/2009 -    Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -3
Why is it so difficult to find
 appropriate multimedia content, to
    reuse and repurpose content
previously published and to present
 this content in interfaces that vary
          with user needs?
Sounds Familiar?
                                                                          [Arnold Smeulders,
                                                                          PAMI, 2000]
                                                                          The semantic gap is the
                                                                          lack of coincidence
                                                                          between the information
                                                                          that one can extract from
                                                                          the sensory data and the
                                                                          interpretation that the
                                                                          same data has for a user
                                                                          in a given situation




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -5
a little drop of semantics goes a
                    long way
                    Jim Hendler [1997]
Agenda
1.       Semantics in multimedia analysis
     •          Detecting concepts for video indexing
     •          Evaluating interactive search tasks

2.       Semantics in metadata
     •          Multimedia metadata interoperability
     •          Expose your data following 4 basic principles
     •          Re-use a growing amount of publicly open datasets

3.       Semantics in user interfaces
     •          Provide meaningful presentation of underlying data
     •          Explore large knowledge bases powered by linked data




         04/08/2009 -     Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -7
The science of labeling

 Automatically detecting the presence of a
 concept in a video stream


                                          airplane


 Naming visual information




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -8
The Computer Vision Approach

 Building detectors one-at-the-time

                                                                  a face detector for
                                                                frontal faces

                                                                                               3 years later

                                                                   a face detector for
                                                                  non-frontal faces

                                                                     One (or more) PhD for
                                                                      every new concept


  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   -9
So how about these?




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 10
A Simple Concept Detector




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 11
K-nearest neighbor




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 12
Linear Classification




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 13
Support Vector Machine




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 14
Supervised Learner




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 15
NIST TRECVID Evaluation

 Until 2001, everybody defined his own concepts
   Using specific and small data sets
   Hard to compare methodologies

 Since 2001, worldwide evaluation by NIST
   Promote progress in video retrieval search
   Provide common datasets (shots, ASR, key frames)
   Use open, metrics-based evaluation

                               Large-Scale Concept
                              Ontology for Multimedia


  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 16
Success and Criticism

 More and more concept detectors available:
    TRECVID 2005: 101 concept lexicon
    TRECVID 2006: 491 concept lexicon
    MediaMill Challenge 2007: 572 concept lexicon

 ... but focus is on the final result
    relative merit of indexing methods: ignore intermediary
    steps while systems become more complex (several
    features and learning methods)

 ... but concept detectors developed mismatch
 user information needs

   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 17
TRECVID Interactive Video Search Task
 Query selection:
    by keyword,
    by concept,
    by example

 Topics unknown
 Test set
    English (2004)
    Chinese (2005-6)
    Dutch (2007-8-9)




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 18
VideOlympics
 Benchmark performance cannot be sole criterion
    Experience of searcher counts
    Usability of systems matters

 VideoOlympics: live interactive search task
    Simultaneous exposure
    of video retrieval systems
    Showcase that goes
    beyond a regular demo
    session
    Fun to do (participants)
    & Fun to watch (audience)




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 19
VideOlympics Setup




                 One display
                        TRECVID like queries
                        Results pushed by searchers
  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 20
Agenda
1.       Semantics in multimedia analysis
     •          Detecting concepts for video indexing
     •          Evaluating interactive search tasks

2.       Semantics in metadata
     •          Multimedia metadata interoperability
     •          Expose your data following 4 basic principles
     •          Re-use a growing amount of publicly open datasets

3.       Semantics in user interfaces
     •          Provide meaningful presentation of underlying data
     •          Explore large knowledge bases powered by linked data




         04/08/2009 -     Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 21
Multimedia: Description methods


                  MPEG-21


                  MPEG-7

                  MPEG-4

                  MPEG-2


                  MPEG-1

                   ISO                                                                            W3C




   04/08/2009 -          Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009         - 22
MPEG-7: a multimedia description language?

 ISO standard
 since December
 of 2001      Content organization
                                                                Collections                   Models                   User
                                                                                                                    interaction

 Main
 components:                                         Creation &
                                                                                                     Navigation &      User
                                                                                                       Access       Preferences
     Descriptors                                     Production
                                                                                                     Summaries
     (Ds) and               Media                                                   Usage
                                               Content management                                                     User
     Description                                                                                       Views         History
     Schemes                                    Content description

     (DSs)                          Structural
                                     aspects
                                                                        Semantic
                                                                         aspects
                                                                                                      Variations

     DDL (XML
     Schema +
                     Basic elements
     extensions)          Schema                       Basic                 Links & media            Basic
                           Tools                     datatypes                localization            Tools
 Concern all
 types of media                                      Part 5 – MDS
                                            Multimedia Description Schemes
    04/08/2009 -     Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 23
MPEG-7 and the Semantic Web
 MDS Upper Layer represented in RDFS
   2001: Hunter
   Later on: link to the ABC upper ontology

 MDS fully represented in OWL-DL
   2004: Tsinaraki et al., DS-MIRF model

 MPEG-7 fully represented in OWL-DL
   2005: Garcia and Celma, Rhizomik model
   Fully automatic translation of the whole standard

 MDS and Visual parts represented in OWL-DL
   2007: Arndt et al., COMM model
   Re-engineering MPEG-7 using DOLCE design patterns


  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 24
04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 25
Example 1: Region Annotation

                                                                                                             http://en.wikipedia.org/wiki/
                                                                                                            Image:Yalta_Conference.jpg




                                                                                         dns:realized-by


                                                               dns:setting
                                                                                        core:semantic-
                                 core:image-data
                                                                                          annotation

                               dns:plays                                                               dns:defines               foaf:Person

      loc:region-                           loc:spatial-mask-                           core:semantic-label-
   locator-descriptor                              role                                        role
                           dns:played-by
                                                                                                                                   rdf:type
dns:defines                                                                                                dns:played-by

                                                                                                               http://en.wikipedia.org/wiki/
   loc:bounding-box                  5 25 10 20 15 15 10 10 5 15"^^xsd:string
                                                                                                                         Churchill
                      data:has-rectangle


       04/08/2009 -               Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009          - 26
Example 2: Sequence Annotation


                                                                                             http://www.reuters.com/news/video/
                                                                                                 summitVideo?videoId=56114




                                                                                   dns:realized-by



                                                             dns:setting
                                                                                      core:semantic-
                               core:image-data
                                                                                        annotation

                            dns:plays                                                                dns:defines                tgn:Sweden

    loc:media-time-                          loc:temporal-                            core:semantic-label-
       descriptor                              mask-role                                     role
                        dns:played-by
                                                                                                                                 skos:broader
dns:defines                                                                                              dns:played-by

    loc:media-time-
                                   "1:21"^^xsd:time                                                                    tgn:Gothenburg
         point
                      data:has-time
       04/08/2009 -             Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009          - 27
04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 28
Image Annotation with Linked Data
Reg1
                                                                  The "Big Three" at the Yalta
                                                                  Conference (Wikipedia)



   Localize a region (bounding box)
   Annotate the content (interpretation)
      Tag: Winston Churchill, UK Prime Minister, Allied Forces, WWII
      Link to knowledge on the Web
   :Reg1 foaf:depicts dbpedia:Winston_Churchill
 ----------------------------------------------
 dbpedia:Winston_Churchill dbpedia:spouse
                       dbpedia:Clementine_Churchill
 dbpedia:Winston_Churchill owl:sameAs
                       fbase:Winston_Churchill
       04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 29
Video Annotation with Linked Data
                                                                                                      Seq4

                                                                Seq1
   A history of G8 violence (video)
   (© Reuters)




      Localize a region
      Annotate the content
        Tag: G8 Summit, Heiligendamn, 2007
        Link to knowledge on the Web       EU Summit, Gothenburg, 2001
  :Seq1 foaf:depicts dbpedia:34th_G8_Summit
----------------------------------------------
dbpedia:33rd_G8_Summit foaf:based_near geo:Heilegendamn
geo:Heilegendamn skos:broader geo:Germany
       04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 30
What is linked data?
  URIs, possibly identifying
  media fragments                                                      wp:2006_FIFA_World_Cup#Final

  + annotations (tags)
                                                                        events:id
  + links among fragments
  & annotations

geonames:2950159
                                                                                            nar:subject

                   nar:location                                                                           nc:15054000

                                                            foaf:depicts
                                                                                       dbpedia:Zidane

    04/08/2009 -        Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009    - 31
                                                                                                                        31
Linked Data Principles

  Tim Berners Lee [2006] (Design Issues)
  1. Use URIs to identify things
     (anything, not just documents);
  2. Use HTTP URIs – globally unique names, distributed
     ownership –
     so that people can look up those names;
  3. Provide useful information in RDF –
     when someone looks up a URI;
  4. Include RDF links to other URIs –
     to enable discovery of related information




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 32
An Example: DBpedia

 DBpedia is a community effort to:
   extract structured "infobox" information from Wikipedia
   interlink DBpedia with other datasets on the Web




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 33
Scraping infobox data




http://dbpedia.org/resource/Bogotá

     04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 34
Automatic Links Among Open Datasets

<http://dbpedia.org/resource/Bogotá>
  owl:sameAs <http://sws.geonames.org/3688689/>
  owl:sameAs
<http://rdf.freebase.com/ns/guid.9202a8c04000641f                                                      DBpedia
8000000000167bab>
  dbpedia:population "6776009"
  ...


                   <http://sws.geonames.org/3688689/>
                     owl:sameAs <http://dbpedia.org/resource/Bogotá>
                     wgs84_pos:lat "4.6"
Geonames             wgs84_pos:long "-74.0833333"
                     geo:population "7102602"
                     ...




    04/08/2009 -       Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 35
sameAs.org




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 36
Bogotá on Freebase




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 37
Bogotá on Geonames




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 38
How Much Linked Data is there ?




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 39
Linked Data Cloud – August 2007




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 40
Linked Data Cloud – March 2008




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 41
Linked Data Cloud – September 2008




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 42
Linked Data Cloud – March 2009




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 43
The Web of Data

 Expose open datasets in RDF
 Set RDF links among the data items for
 different datasets
 Over 4.5 billion triples, 5 millions links
 (March 2009)
 ... still counting




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 44
Who are the users?
               Why would they use the cloud?
               What tasks can be supported?
               How will the semantics help?




04/08/2009 -      Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 45
Agenda
1.       Semantics in multimedia analysis
     •          Detecting concepts for video indexing
     •          Evaluating interactive search tasks

2.       Semantics in metadata
     •          Multimedia metadata interoperability
     •          Expose your data following 4 basic principles
     •          Re-use a growing amount of publicly open datasets

3.       Semantics in user interfaces
     •          Provide meaningful presentation of underlying data
     •          Explore large knowledge bases powered by linked data




         04/08/2009 -     Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 46
Provide meaningful presentation of data




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 47
... and behind the scene




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 48
... link an artist to more data




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 49
... myspace




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 50
... last.fm




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 51
... IMDb




   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 52
Going through the Walled Gardens




David Simonds: Everywhere and nowhere. 19 May 2008, The Economist.
     04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 53
How can semantics help?

Query construction
    disambiguate input (auto-completion)
    selection of available terms (grouping and ranking algorithms)

(Semantic) search algorithm
    graph traversal
    query expansion
    RDFS/OWL reasoning

Presentation of search results
    grouping by property
    visualization on timeline, map, etc.

  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 54
                                                                                                 54
News Workflow Interoperability

 No integration of media (stories, photo, animation, video)
 Little (or no) context in the news presentation
 Lack of interoperability in the current workflow




 NAR Schema                                       Broadcaster Schema
                                                                                                    User
 NewsCodes                               Controlled Vocabularies                                  Vocabulary

   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 55
                                                                                                           55
Exploratory Search

 (Ultimate) Goal:
     Provide an environment for searching and browsing
     contextualized multimedia news information

 Required integration:
     Data: various media, different forms, various sources
     Metadata: schema integration, semantic models

 Influence and implications of UI:
     How to represent semantic multimedia metadata
     to facilitate presenting information?
     in other words ... What constraints do end-user
     interfaces put on the modeling of the metadata?
   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 56
                                                                                                  56
News and Multimedia Formats


NewsML         EventsML      SportsML
  G2                    G2      G2




       News Architecture
               (NAR)




         04/08/2009 -          Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 57
Modeling the News + Media Ontology


                                                                         dc:Subject ≈
                                                                         nar:Subject
                   foaf:Person ≈
                    nar:Person


                                                                                          sioc:Item ≈
                                              +                                            nar:Item



                     geo:lat
                    geo:long


  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009    - 58
Enriching the News Metadata

                                                   Concepts/Entities that
                                                   are subject of news
                                                           Thematic categories
                                                           People
                                                           Organizations
                                                           Geopolitical Areas
                                                           Points of Interest
                                                           Events
                                                           Products or artefacts



  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 59
Enriching the News Metadata


                                                    Named Entity
                                                    Recognition



                                              Domain Ontologies




NAR Ontology
 NewsCodes
 Thesaurus




     04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 60
Enriching the News Metadata


                                                     Concept
                                                     Detectors



                                              Domain Ontologies




NAR Ontology
 NewsCodes
 Thesaurus




     04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 61
Presenting News Information




  Dimensions used for searching news items
       When      time                              10/07/2006
       Where     location                          Paris
       What      is depicted                       J. Chirac, Z. Zidane                            Metadata
       Why       event                             WC 2006
       Who       photographer                      Bertrand Guay, AFP


  04/08/2009 -     Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 62
Semantic Search of Multimedia News
                    Description                                                   Number of RDF Triples
General Ontologies: NAR, DC, FOAF                                                                           7,336

Domain Specific Ontologies: football                                                                      104,358

Thesauri: newscodes                                                                                        34,903

DBpedia, Geonames                                                                                          53,468

AFP News Feed (June/July 2006)                                                                            804,446

AFP Photos (June/July 2006)                                                                                61,311
                                                                                              a
INA Broadcast Video (June/July 2006)
                                                                                        P atri              1,932
                                                                                   Cl io
                                                                                by
Total                                                                       r ed lpha 3                  1,067,754
                                                                       P owe 1.0 a

     04/08/2009 -        Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 63
04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 64
04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 65
04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 66
Provide New Dimensions for Exploring




  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 67
Take Home Message
Concept detection challenges: machine learning and IR
   Features can be extracted and used to describe multimedia content
   Show generality of approach, dynamic nature of video (event)
   Show that an ontology can help

Semantic metadata representation challenges: KR
   Media and metadata can be passed around and among systems
   Reuse what is there
   Expose what you make

Interaction challenges: CHI
   Users can be given much richer
   and more flexible access to (semantically annotated) content
   ... but we are still figuring out how to do this!


   04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 68
Credits

 Many people
   Cees Snoek, Alex Hauptmann, Alan Smeaton,
   Ivan Herman, Krishna Chandramouli, David Simonds,
   Laurent Le Meur
   Colleagues from the Interactive Information Access
   Group, CWI Amsterdam

 Datasets



  http://www.slideshare.net/troncy

  04/08/2009 -   Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009   - 69

Weitere ähnliche Inhalte

Ähnlich wie Multimedia Semantics: Metadata, Analysis and Interaction

Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization ijcga
 
A Personalized Audio Server using MPEG-7 and MPEG-21 standards
A Personalized Audio Server using MPEG-7 and MPEG-21 standardsA Personalized Audio Server using MPEG-7 and MPEG-21 standards
A Personalized Audio Server using MPEG-7 and MPEG-21 standardsUniversity of Piraeus
 
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Aline Custodio
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataDhaval Thakker
 
A Mobile Audio Server enhanced with Semantic Personalization Capabilities
A Mobile Audio Server enhanced with Semantic Personalization CapabilitiesA Mobile Audio Server enhanced with Semantic Personalization Capabilities
A Mobile Audio Server enhanced with Semantic Personalization CapabilitiesUniversity of Piraeus
 
Speaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesSpeaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesGihan Wikramanayake
 
Speaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesSpeaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesGihan Wikramanayake
 
Semantic MDM systems design concept
Semantic MDM systems design conceptSemantic MDM systems design concept
Semantic MDM systems design conceptAndrey Andrichenko
 
System analysis and design for multimedia retrieval systems
System analysis and design for multimedia retrieval systemsSystem analysis and design for multimedia retrieval systems
System analysis and design for multimedia retrieval systemsijma
 
Read Between The Lines: an Annotation Tool for Multimodal Data
Read Between The Lines: an Annotation Tool for Multimodal DataRead Between The Lines: an Annotation Tool for Multimodal Data
Read Between The Lines: an Annotation Tool for Multimodal DataDaniele Di Mitri
 
Computing for Human Experience and Wellness
Computing for Human Experience and WellnessComputing for Human Experience and Wellness
Computing for Human Experience and WellnessAmit Sheth
 
Osservatorio mobile social networks final report
Osservatorio mobile social networks final reportOsservatorio mobile social networks final report
Osservatorio mobile social networks final reportLaura Cavallaro
 
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standards
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standardsA Personalized Audio Web Service using MPEG-7 and MPEG-21 standards
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standardsUniversity of Piraeus
 
Profile based Video segmentation system to support E-learning
Profile based Video segmentation system to support E-learningProfile based Video segmentation system to support E-learning
Profile based Video segmentation system to support E-learningGihan Wikramanayake
 
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...University of Piraeus
 
Overview of Video Concept Detection using (CNN) Convolutional Neural Network
Overview of Video Concept Detection using (CNN) Convolutional Neural NetworkOverview of Video Concept Detection using (CNN) Convolutional Neural Network
Overview of Video Concept Detection using (CNN) Convolutional Neural NetworkIRJET Journal
 
MPEG-7 Services in Community Engines
MPEG-7 Services in Community EnginesMPEG-7 Services in Community Engines
MPEG-7 Services in Community EnginesRalf Klamma
 

Ähnlich wie Multimedia Semantics: Metadata, Analysis and Interaction (20)

Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization  Video Data Visualization System : Semantic Classification and Personalization
Video Data Visualization System : Semantic Classification and Personalization
 
WP2 1st Review
WP2 1st ReviewWP2 1st Review
WP2 1st Review
 
A Personalized Audio Server using MPEG-7 and MPEG-21 standards
A Personalized Audio Server using MPEG-7 and MPEG-21 standardsA Personalized Audio Server using MPEG-7 and MPEG-21 standards
A Personalized Audio Server using MPEG-7 and MPEG-21 standards
 
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched data
 
A Mobile Audio Server enhanced with Semantic Personalization Capabilities
A Mobile Audio Server enhanced with Semantic Personalization CapabilitiesA Mobile Audio Server enhanced with Semantic Personalization Capabilities
A Mobile Audio Server enhanced with Semantic Personalization Capabilities
 
Speaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesSpeaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia Databases
 
Speaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia DatabasesSpeaker Search and Indexing for Multimedia Databases
Speaker Search and Indexing for Multimedia Databases
 
Semantic MDM systems design concept
Semantic MDM systems design conceptSemantic MDM systems design concept
Semantic MDM systems design concept
 
System analysis and design for multimedia retrieval systems
System analysis and design for multimedia retrieval systemsSystem analysis and design for multimedia retrieval systems
System analysis and design for multimedia retrieval systems
 
Read Between The Lines: an Annotation Tool for Multimodal Data
Read Between The Lines: an Annotation Tool for Multimodal DataRead Between The Lines: an Annotation Tool for Multimodal Data
Read Between The Lines: an Annotation Tool for Multimodal Data
 
Computing for Human Experience and Wellness
Computing for Human Experience and WellnessComputing for Human Experience and Wellness
Computing for Human Experience and Wellness
 
Osservatorio mobile social networks final report
Osservatorio mobile social networks final reportOsservatorio mobile social networks final report
Osservatorio mobile social networks final report
 
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standards
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standardsA Personalized Audio Web Service using MPEG-7 and MPEG-21 standards
A Personalized Audio Web Service using MPEG-7 and MPEG-21 standards
 
Profile based Video segmentation system to support E-learning
Profile based Video segmentation system to support E-learningProfile based Video segmentation system to support E-learning
Profile based Video segmentation system to support E-learning
 
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...
Personalized Multimedia Web Services in Peer to Peer Networks Using MPEG-7 an...
 
Overview of Video Concept Detection using (CNN) Convolutional Neural Network
Overview of Video Concept Detection using (CNN) Convolutional Neural NetworkOverview of Video Concept Detection using (CNN) Convolutional Neural Network
Overview of Video Concept Detection using (CNN) Convolutional Neural Network
 
Media Pick
Media PickMedia Pick
Media Pick
 
MPEG-7 Services in Community Engines
MPEG-7 Services in Community EnginesMPEG-7 Services in Community Engines
MPEG-7 Services in Community Engines
 

Mehr von Raphael Troncy

K CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyK CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyRaphael Troncy
 
Semantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentSemantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentRaphael Troncy
 
HyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningHyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningRaphael Troncy
 
Location Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationLocation Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationRaphael Troncy
 
A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...Raphael Troncy
 
Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Raphael Troncy
 
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Raphael Troncy
 
NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...Raphael Troncy
 
Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Raphael Troncy
 
Describing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionDescribing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionRaphael Troncy
 
Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Raphael Troncy
 
Semantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webSemantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webRaphael Troncy
 
Live topic generation from event streams
Live topic generation from event streamsLive topic generation from event streams
Live topic generation from event streamsRaphael Troncy
 
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdMediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdRaphael Troncy
 
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentEventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentRaphael Troncy
 
Extracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksExtracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksRaphael Troncy
 
Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Raphael Troncy
 
MediaEval 2012 SED Opening
MediaEval 2012 SED OpeningMediaEval 2012 SED Opening
MediaEval 2012 SED OpeningRaphael Troncy
 
DeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingDeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingRaphael Troncy
 
MediaEval 2011 SED Opening
MediaEval 2011 SED OpeningMediaEval 2011 SED Opening
MediaEval 2011 SED OpeningRaphael Troncy
 

Mehr von Raphael Troncy (20)

K CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyK CAP 2019 Opening Ceremony
K CAP 2019 Opening Ceremony
 
Semantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentSemantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things Environment
 
HyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningHyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learning
 
Location Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationLocation Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip Recommendation
 
A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...
 
Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014
 
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
 
NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...
 
Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013
 
Describing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionDescribing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and Description
 
Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013
 
Semantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webSemantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social web
 
Live topic generation from event streams
Live topic generation from event streamsLive topic generation from event streams
Live topic generation from event streams
 
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdMediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
 
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentEventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
 
Extracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksExtracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social Networks
 
Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...
 
MediaEval 2012 SED Opening
MediaEval 2012 SED OpeningMediaEval 2012 SED Opening
MediaEval 2012 SED Opening
 
DeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingDeRiVE 2011 workshop opening
DeRiVE 2011 workshop opening
 
MediaEval 2011 SED Opening
MediaEval 2011 SED OpeningMediaEval 2011 SED Opening
MediaEval 2011 SED Opening
 

Kürzlich hochgeladen

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Kürzlich hochgeladen (20)

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Multimedia Semantics: Metadata, Analysis and Interaction

  • 1. Multimedia Semantics: Metadata, Analysis and Interaction Raphael Troncy <raphael.troncy@eurecom.fr> Multimedia Semantics, EURECOM
  • 2. Some BIG numbers User Generated Content (Jul'09) 3.7+ billion photos 10+ billion photos 110+ million videos 20 hours uploaded / min ≈ 75 000 full length movies / week Archived TV content 1.5 million hours ≈ 120 km of shelves 300000 hours | 1 petabyte / year News content Content difficult to search and reuse Barely invisible for the search engines 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -2
  • 3. Image/Video indexing Techniques used by mainstream search engines search term occurs in the filename or in the caption or in user tags no semantics Image indexing: main problem an image is not alphabetic: there is no countable discrete units, that, in combination will provide the meaning of the image image descriptors are not given with the image: one needs to extract or interpret them Video indexing: additional problem a video has additionally a temporal dimension to take into account a video has a priori no discrete units neither (i.e. frames, shots, sequences cannot be absolutely defined) 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -3
  • 4. Why is it so difficult to find appropriate multimedia content, to reuse and repurpose content previously published and to present this content in interfaces that vary with user needs?
  • 5. Sounds Familiar? [Arnold Smeulders, PAMI, 2000] The semantic gap is the lack of coincidence between the information that one can extract from the sensory data and the interpretation that the same data has for a user in a given situation 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -5
  • 6. a little drop of semantics goes a long way Jim Hendler [1997]
  • 7. Agenda 1. Semantics in multimedia analysis • Detecting concepts for video indexing • Evaluating interactive search tasks 2. Semantics in metadata • Multimedia metadata interoperability • Expose your data following 4 basic principles • Re-use a growing amount of publicly open datasets 3. Semantics in user interfaces • Provide meaningful presentation of underlying data • Explore large knowledge bases powered by linked data 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -7
  • 8. The science of labeling Automatically detecting the presence of a concept in a video stream airplane Naming visual information 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -8
  • 9. The Computer Vision Approach Building detectors one-at-the-time a face detector for frontal faces 3 years later a face detector for non-frontal faces One (or more) PhD for every new concept 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 -9
  • 10. So how about these? 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 10
  • 11. A Simple Concept Detector 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 11
  • 12. K-nearest neighbor 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 12
  • 13. Linear Classification 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 13
  • 14. Support Vector Machine 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 14
  • 15. Supervised Learner 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 15
  • 16. NIST TRECVID Evaluation Until 2001, everybody defined his own concepts Using specific and small data sets Hard to compare methodologies Since 2001, worldwide evaluation by NIST Promote progress in video retrieval search Provide common datasets (shots, ASR, key frames) Use open, metrics-based evaluation Large-Scale Concept Ontology for Multimedia 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 16
  • 17. Success and Criticism More and more concept detectors available: TRECVID 2005: 101 concept lexicon TRECVID 2006: 491 concept lexicon MediaMill Challenge 2007: 572 concept lexicon ... but focus is on the final result relative merit of indexing methods: ignore intermediary steps while systems become more complex (several features and learning methods) ... but concept detectors developed mismatch user information needs 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 17
  • 18. TRECVID Interactive Video Search Task Query selection: by keyword, by concept, by example Topics unknown Test set English (2004) Chinese (2005-6) Dutch (2007-8-9) 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 18
  • 19. VideOlympics Benchmark performance cannot be sole criterion Experience of searcher counts Usability of systems matters VideoOlympics: live interactive search task Simultaneous exposure of video retrieval systems Showcase that goes beyond a regular demo session Fun to do (participants) & Fun to watch (audience) 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 19
  • 20. VideOlympics Setup One display TRECVID like queries Results pushed by searchers 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 20
  • 21. Agenda 1. Semantics in multimedia analysis • Detecting concepts for video indexing • Evaluating interactive search tasks 2. Semantics in metadata • Multimedia metadata interoperability • Expose your data following 4 basic principles • Re-use a growing amount of publicly open datasets 3. Semantics in user interfaces • Provide meaningful presentation of underlying data • Explore large knowledge bases powered by linked data 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 21
  • 22. Multimedia: Description methods MPEG-21 MPEG-7 MPEG-4 MPEG-2 MPEG-1 ISO W3C 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 22
  • 23. MPEG-7: a multimedia description language? ISO standard since December of 2001 Content organization Collections Models User interaction Main components: Creation & Navigation & User Access Preferences Descriptors Production Summaries (Ds) and Media Usage Content management User Description Views History Schemes Content description (DSs) Structural aspects Semantic aspects Variations DDL (XML Schema + Basic elements extensions) Schema Basic Links & media Basic Tools datatypes localization Tools Concern all types of media Part 5 – MDS Multimedia Description Schemes 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 23
  • 24. MPEG-7 and the Semantic Web MDS Upper Layer represented in RDFS 2001: Hunter Later on: link to the ABC upper ontology MDS fully represented in OWL-DL 2004: Tsinaraki et al., DS-MIRF model MPEG-7 fully represented in OWL-DL 2005: Garcia and Celma, Rhizomik model Fully automatic translation of the whole standard MDS and Visual parts represented in OWL-DL 2007: Arndt et al., COMM model Re-engineering MPEG-7 using DOLCE design patterns 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 24
  • 25. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 25
  • 26. Example 1: Region Annotation http://en.wikipedia.org/wiki/ Image:Yalta_Conference.jpg dns:realized-by dns:setting core:semantic- core:image-data annotation dns:plays dns:defines foaf:Person loc:region- loc:spatial-mask- core:semantic-label- locator-descriptor role role dns:played-by rdf:type dns:defines dns:played-by http://en.wikipedia.org/wiki/ loc:bounding-box 5 25 10 20 15 15 10 10 5 15"^^xsd:string Churchill data:has-rectangle 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 26
  • 27. Example 2: Sequence Annotation http://www.reuters.com/news/video/ summitVideo?videoId=56114 dns:realized-by dns:setting core:semantic- core:image-data annotation dns:plays dns:defines tgn:Sweden loc:media-time- loc:temporal- core:semantic-label- descriptor mask-role role dns:played-by skos:broader dns:defines dns:played-by loc:media-time- "1:21"^^xsd:time tgn:Gothenburg point data:has-time 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 27
  • 28. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 28
  • 29. Image Annotation with Linked Data Reg1 The "Big Three" at the Yalta Conference (Wikipedia) Localize a region (bounding box) Annotate the content (interpretation) Tag: Winston Churchill, UK Prime Minister, Allied Forces, WWII Link to knowledge on the Web :Reg1 foaf:depicts dbpedia:Winston_Churchill ---------------------------------------------- dbpedia:Winston_Churchill dbpedia:spouse dbpedia:Clementine_Churchill dbpedia:Winston_Churchill owl:sameAs fbase:Winston_Churchill 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 29
  • 30. Video Annotation with Linked Data Seq4 Seq1 A history of G8 violence (video) (© Reuters) Localize a region Annotate the content Tag: G8 Summit, Heiligendamn, 2007 Link to knowledge on the Web EU Summit, Gothenburg, 2001 :Seq1 foaf:depicts dbpedia:34th_G8_Summit ---------------------------------------------- dbpedia:33rd_G8_Summit foaf:based_near geo:Heilegendamn geo:Heilegendamn skos:broader geo:Germany 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 30
  • 31. What is linked data? URIs, possibly identifying media fragments wp:2006_FIFA_World_Cup#Final + annotations (tags) events:id + links among fragments & annotations geonames:2950159 nar:subject nar:location nc:15054000 foaf:depicts dbpedia:Zidane 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 31 31
  • 32. Linked Data Principles Tim Berners Lee [2006] (Design Issues) 1. Use URIs to identify things (anything, not just documents); 2. Use HTTP URIs – globally unique names, distributed ownership – so that people can look up those names; 3. Provide useful information in RDF – when someone looks up a URI; 4. Include RDF links to other URIs – to enable discovery of related information 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 32
  • 33. An Example: DBpedia DBpedia is a community effort to: extract structured "infobox" information from Wikipedia interlink DBpedia with other datasets on the Web 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 33
  • 34. Scraping infobox data http://dbpedia.org/resource/Bogotá 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 34
  • 35. Automatic Links Among Open Datasets <http://dbpedia.org/resource/Bogotá> owl:sameAs <http://sws.geonames.org/3688689/> owl:sameAs <http://rdf.freebase.com/ns/guid.9202a8c04000641f DBpedia 8000000000167bab> dbpedia:population "6776009" ... <http://sws.geonames.org/3688689/> owl:sameAs <http://dbpedia.org/resource/Bogotá> wgs84_pos:lat "4.6" Geonames wgs84_pos:long "-74.0833333" geo:population "7102602" ... 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 35
  • 36. sameAs.org 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 36
  • 37. Bogotá on Freebase 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 37
  • 38. Bogotá on Geonames 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 38
  • 39. How Much Linked Data is there ? 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 39
  • 40. Linked Data Cloud – August 2007 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 40
  • 41. Linked Data Cloud – March 2008 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 41
  • 42. Linked Data Cloud – September 2008 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 42
  • 43. Linked Data Cloud – March 2009 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 43
  • 44. The Web of Data Expose open datasets in RDF Set RDF links among the data items for different datasets Over 4.5 billion triples, 5 millions links (March 2009) ... still counting 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 44
  • 45. Who are the users? Why would they use the cloud? What tasks can be supported? How will the semantics help? 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 45
  • 46. Agenda 1. Semantics in multimedia analysis • Detecting concepts for video indexing • Evaluating interactive search tasks 2. Semantics in metadata • Multimedia metadata interoperability • Expose your data following 4 basic principles • Re-use a growing amount of publicly open datasets 3. Semantics in user interfaces • Provide meaningful presentation of underlying data • Explore large knowledge bases powered by linked data 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 46
  • 47. Provide meaningful presentation of data 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 47
  • 48. ... and behind the scene 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 48
  • 49. ... link an artist to more data 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 49
  • 50. ... myspace 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 50
  • 51. ... last.fm 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 51
  • 52. ... IMDb 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 52
  • 53. Going through the Walled Gardens David Simonds: Everywhere and nowhere. 19 May 2008, The Economist. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 53
  • 54. How can semantics help? Query construction disambiguate input (auto-completion) selection of available terms (grouping and ranking algorithms) (Semantic) search algorithm graph traversal query expansion RDFS/OWL reasoning Presentation of search results grouping by property visualization on timeline, map, etc. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 54 54
  • 55. News Workflow Interoperability No integration of media (stories, photo, animation, video) Little (or no) context in the news presentation Lack of interoperability in the current workflow NAR Schema Broadcaster Schema User NewsCodes Controlled Vocabularies Vocabulary 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 55 55
  • 56. Exploratory Search (Ultimate) Goal: Provide an environment for searching and browsing contextualized multimedia news information Required integration: Data: various media, different forms, various sources Metadata: schema integration, semantic models Influence and implications of UI: How to represent semantic multimedia metadata to facilitate presenting information? in other words ... What constraints do end-user interfaces put on the modeling of the metadata? 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 56 56
  • 57. News and Multimedia Formats NewsML EventsML SportsML G2 G2 G2 News Architecture (NAR) 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 57
  • 58. Modeling the News + Media Ontology dc:Subject ≈ nar:Subject foaf:Person ≈ nar:Person sioc:Item ≈ + nar:Item geo:lat geo:long 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 58
  • 59. Enriching the News Metadata Concepts/Entities that are subject of news Thematic categories People Organizations Geopolitical Areas Points of Interest Events Products or artefacts 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 59
  • 60. Enriching the News Metadata Named Entity Recognition Domain Ontologies NAR Ontology NewsCodes Thesaurus 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 60
  • 61. Enriching the News Metadata Concept Detectors Domain Ontologies NAR Ontology NewsCodes Thesaurus 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 61
  • 62. Presenting News Information Dimensions used for searching news items When time 10/07/2006 Where location Paris What is depicted J. Chirac, Z. Zidane Metadata Why event WC 2006 Who photographer Bertrand Guay, AFP 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 62
  • 63. Semantic Search of Multimedia News Description Number of RDF Triples General Ontologies: NAR, DC, FOAF 7,336 Domain Specific Ontologies: football 104,358 Thesauri: newscodes 34,903 DBpedia, Geonames 53,468 AFP News Feed (June/July 2006) 804,446 AFP Photos (June/July 2006) 61,311 a INA Broadcast Video (June/July 2006) P atri 1,932 Cl io by Total r ed lpha 3 1,067,754 P owe 1.0 a 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 63
  • 64. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 64
  • 65. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 65
  • 66. 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 66
  • 67. Provide New Dimensions for Exploring 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 67
  • 68. Take Home Message Concept detection challenges: machine learning and IR Features can be extracted and used to describe multimedia content Show generality of approach, dynamic nature of video (event) Show that an ontology can help Semantic metadata representation challenges: KR Media and metadata can be passed around and among systems Reuse what is there Expose what you make Interaction challenges: CHI Users can be given much richer and more flexible access to (semantically annotated) content ... but we are still figuring out how to do this! 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 68
  • 69. Credits Many people Cees Snoek, Alex Hauptmann, Alan Smeaton, Ivan Herman, Krishna Chandramouli, David Simonds, Laurent Le Meur Colleagues from the Interactive Information Access Group, CWI Amsterdam Datasets http://www.slideshare.net/troncy 04/08/2009 - Multimedia Semantics: Metadata, Analysis and Interaction - LACNEM 2009 - 69

Hinweis der Redaktion

  1. Diagram is messy. Try to show largest part of MPEG-7 in one slide. From Alia and Michiel: MPEG-7 so far???
  2. Who? experts, lay persons Why? information searching, annotation tasks, How? entering query, finding items of interest, displaying results What? fact finding, information gathering, sensemaking, location-based mobile search (Pub Canary Wharf)