SlideShare ist ein Scribd-Unternehmen logo
1 von 2
Downloaden Sie, um offline zu lesen
Sirio: an Ontology-based Web Search Engine for Videos
Thomas Alisi, Marco Bertini, Gianpaolo D’Amico, Alberto Del Bimbo,
Andrea Ferracani, Federico Pernici and Giuseppe Serra
Media Integration and Communication Center, University of Florence, Italy
{alisi, bertini, damico, delbimbo, ferracani, pernici, serra}@dsi.unifi.it
http://www.micc.unifi.it/vim
ABSTRACT
In this technical demonstration we show a web video search
engine based on ontologies, the Sirio1
system, that has been
developed within the EU VidiVideo project. The goal of
the system is to provide a search engine for videos for both
technical and non-technical users. In fact, the system has
different interfaces that permit different query modalities:
free-text, natural language, graphical composition of con-
cepts using boolean and temporal relations and query by
visual example. In addition, the ontology structure is ex-
ploited to encode semantic relations between concepts per-
mitting, for example, to expand queries to synonyms and
concept specializations.
Categories and Subject Descriptors
H.3.3 [Information Storage and Retrieval]: Information
Search and Retrieval—Search process; H.3.5 [Information
Storage and Retrieval]: Online Information Services—
Web-based services
General Terms
Algorithms, Experimentation
Keywords
Video retrieval, ontologies, web services
1. INTRODUCTION
Video search engines are the product of progress in many
technologies: visual and audio analysis, machine learning
techniques, as well as visualization and interaction. The cur-
rent video search engines are based on lexicons of semantic
concepts and perform keyword-based queries [1]. These sys-
tems are generally desktop applications or have simple web
interfaces that show the results of the query as a ranked list
1
Sirio was the hound of the mythical hunter Orion. It was
a dog so swift that no prey could escape it.
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwise, to
republish, to post on servers or to redistribute to lists, requires prior specific
permission and/or a fee.
MM’09, October 19–24, 2009, Beijing, China.
Copyright 2009 ACM X-XXXXX-XX-X/XX/XX ...$10.00.
of keyframes [2, 3]. These systems do not let users to per-
form composite queries that can include temporal relations
between concepts and do not allow to look for concepts that
are not in the lexicon. In addition, desktop applications re-
quire installation on the end-user computer and can not be
used in a distributed environment.
In this demonstration we present the Sirio system, a web
video search engine that allows semantic retrieval by content
for different domains (broadcast news, surveillance, cultural
heritage documentaries) with query interaction and visual-
ization. The system permits different query modalities (free
text, natural language, graphical composition of concepts
using boolean and temporal relations and query by visual
example) and visualizations, resulting in an advanced tool
for retrieval and exploration of video archives for both tech-
nical and non-technical users. In addition the use of ontolo-
gies permits to exploit semantic relations between concepts
through reasoning. Finally our web system, using the Rich
Internet Application paradigm (RIA), does not require any
installation and provides a responsive user interface.
2. THE SYSTEM
The Sirio system2
is composed by three different inter-
faces: a GUI to build composite queries that may include
boolean/temporal operators and visual examples, a natural
language interface for simpler queries with boolean/temporal
operators, a free-text interface for Google-like searches. In
all the interfaces it is possible to extend queries adding syn-
onyms and concept specializations through ontology reason-
ing and the use of WordNet. Consider, for instance, a query
“Find shots with animal”: the concept specializations ex-
pansion through ontology structure permits to retrieve not
only the shots annotated with animal, but also those anno-
tated with its specializations (dogs, cats, etc.). In particu-
lar, WordNet query expansion, using synonyms, is required
when using natural language and free-text queries, since it is
not possible to force the user to formulate a query selecting
terms from a lexicon, as is done using the GUI interface.
The search engine uses an ontology that has been created
automatically from a flat lexicon, using WordNet to cre-
ate concept relations (is a, is part of and has part). The
ontology is modelled following the Dynamic Pictorially En-
riched Ontology model [4], that includes both concepts and
visual concept prototypes. These prototypes represent the
different visual modalities in which a concept can manifest;
they can be selected by the users to perform query by exam-
2
http://deckard.micc.unifi.it/sirio/
Figure 1: Search interfaces: natural language search; Google-like search; GUI query builder.
ple. Concepts, concepts relations, video annotations and vi-
sual concept prototypes are defined using the standard Web
Ontology Language (OWL) so that the ontology can be eas-
ily reused and shared. The queries created in each interface
are translated by the search engine into SPARQL, the W3C
standard ontology query language.
The system is based on the Rich Internet Application
paradigm, using a client side Flash virtual machine which
can execute instructions on the client computer. RIAs can
avoid the usual slow and synchronous loop for user interac-
tions, typical of web based environments that use only the
HTML widgets available to standard browsers. This allows
to implement a visual querying mechanism that exhibits a
look and feel approaching that of a desktop environment,
with the fast response that is expected by users. With this
solution the application installation is not required, since
the system is updated on the server, and run anywhere re-
gardless of what operating system is used.
The system backend is currently based on open source
tools (i.e. Apache Tomcat and Red 5 video streaming server)
or freely available commercial tools (Adobe Media Server
has a free developer edition). The RTMP video stream-
ing protocol is used. The search engine is developed in Java
and supports multiple ontologies and ontology reasoning ser-
vices. Audio-visual concepts are automatically annotated
using the VidiVideo annotation engine [2]. The search re-
sults are in RSS 2.0 XML format with paging, so that they
can be treated as RSS feeds. Results of the query are shown
in the interface and for each video clip of the result set is
shown the first frame. These frames are obtained from the
video streaming server, and are shown within a small video
player. Users can then play the video sequence and, if in-
terested, zoom in each result displaying it in a larger player,
that provides more details on the video metadata and al-
lows better video browsing. The user interface is written
in Adobe Flex and Action Script 3. All the modules of the
system are connected using HTTP POST, XML and SOAP
web services.
3. DEMONSTRATION
We demonstrate the search modalities of the system in
three different video domains: broadcast news, video surveil-
lance and cultural heritage documentaries. We show how
each interface is suitable for different users: the GUI in-
terface allows to build composite queries that take into ac-
count also metadata, as required by professional archivists,
the natural language interface allows to build simple queries
with boolean and temporal relations between concepts, the
free-text interface provides the popular Google-like search.
Acknowledgments.
This work is partially supported by the EU IST VidiVideo
project (www.vidivideo.info - contract FP6-045547).
4. REFERENCES
[1] A. F. Smeaton, P. Over and W. Kraai. High-Level
Feature Detection from Video in TRECVid: a 5-Year
Retrospective of Achievements, Multimedia Content
Analysis, Theory and Applications, 151–174, 2009,
Springer Verlag.
[2] C. G. M. Snoek et al. The MediaMill TRECVID 2008
Semantic Video Search Engine, In Proceedings of the
6th TRECVID Workshop, 2008.
[3] A. Natsev, J. R. Smith, J. Teˇsi´c, L. Xie, R. Yan,
W. Jiang, M. Merler IBM Research TRECVID-2008
Video Retrieval System, In Proceedings of the 6th
TRECVID Workshop, 2008.
[4] M. Bertini, R. Cucchiara, A. Del Bimbo, C. Grana,
G. Serra, C. Torniai and R. Vezzani. Dynamic
Pictorially Enriched Ontologies for Video Digital
Libraries. In IEEE Multimedia, to appear, 2009.

Weitere ähnliche Inhalte

Ähnlich wie Sirio

Automatic semantic content extraction in videos using a fuzzy ontology and ru...
Automatic semantic content extraction in videos using a fuzzy ontology and ru...Automatic semantic content extraction in videos using a fuzzy ontology and ru...
Automatic semantic content extraction in videos using a fuzzy ontology and ru...IEEEFINALYEARPROJECTS
 
Tell Me Quality Documentation
Tell Me Quality DocumentationTell Me Quality Documentation
Tell Me Quality DocumentationMarco Berlot
 
Smart India Hackathon Idea Submission
Smart India Hackathon Idea SubmissionSmart India Hackathon Idea Submission
Smart India Hackathon Idea SubmissionGaurav Ganna
 
Development Tools - Abhijeet
Development Tools - AbhijeetDevelopment Tools - Abhijeet
Development Tools - AbhijeetAbhijeet Kalsi
 
Review on content based video lecture retrieval
Review on content based video lecture retrievalReview on content based video lecture retrieval
Review on content based video lecture retrievaleSAT Journals
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Sandro D'Elia
 
Scalable architectures for phenotype libraries
Scalable architectures for phenotype librariesScalable architectures for phenotype libraries
Scalable architectures for phenotype librariesMartin Chapman
 
OGC Web Service Shibboleth Interoperability Experiment
OGC Web Service Shibboleth Interoperability ExperimentOGC Web Service Shibboleth Interoperability Experiment
OGC Web Service Shibboleth Interoperability ExperimentEDINA, University of Edinburgh
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOpen Science Fair
 
ConnectME: connecting content for future TV & video
ConnectME: connecting content for future TV & videoConnectME: connecting content for future TV & video
ConnectME: connecting content for future TV & videoconnectme_project
 
Pacify based video retrieval system
Pacify based video retrieval systemPacify based video retrieval system
Pacify based video retrieval systemeSAT Journals
 
ICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionMarco Brambilla
 

Ähnlich wie Sirio (20)

Sirio Orione and Pan
Sirio Orione and PanSirio Orione and Pan
Sirio Orione and Pan
 
Semantic browsing
Semantic browsingSemantic browsing
Semantic browsing
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing Systems
 
Arneb
ArnebArneb
Arneb
 
On Linked Open Data (LOD)-based Semantic Video Annotation Systems
On Linked Open Data (LOD)-based  Semantic Video Annotation SystemsOn Linked Open Data (LOD)-based  Semantic Video Annotation Systems
On Linked Open Data (LOD)-based Semantic Video Annotation Systems
 
Automatic semantic content extraction in videos using a fuzzy ontology and ru...
Automatic semantic content extraction in videos using a fuzzy ontology and ru...Automatic semantic content extraction in videos using a fuzzy ontology and ru...
Automatic semantic content extraction in videos using a fuzzy ontology and ru...
 
Tell Me Quality Documentation
Tell Me Quality DocumentationTell Me Quality Documentation
Tell Me Quality Documentation
 
On Annotation of Video Content for Multimedia Retrieval and Sharing
On Annotation of Video Content for Multimedia  Retrieval and SharingOn Annotation of Video Content for Multimedia  Retrieval and Sharing
On Annotation of Video Content for Multimedia Retrieval and Sharing
 
Smart India Hackathon Idea Submission
Smart India Hackathon Idea SubmissionSmart India Hackathon Idea Submission
Smart India Hackathon Idea Submission
 
Development Tools - Abhijeet
Development Tools - AbhijeetDevelopment Tools - Abhijeet
Development Tools - Abhijeet
 
Review on content based video lecture retrieval
Review on content based video lecture retrievalReview on content based video lecture retrieval
Review on content based video lecture retrieval
 
Media Pick
Media PickMedia Pick
Media Pick
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708
 
Scalable architectures for phenotype libraries
Scalable architectures for phenotype librariesScalable architectures for phenotype libraries
Scalable architectures for phenotype libraries
 
Shibboleth Federations and Secure SDI
Shibboleth Federations and Secure SDIShibboleth Federations and Secure SDI
Shibboleth Federations and Secure SDI
 
OGC Web Service Shibboleth Interoperability Experiment
OGC Web Service Shibboleth Interoperability ExperimentOGC Web Service Shibboleth Interoperability Experiment
OGC Web Service Shibboleth Interoperability Experiment
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
ConnectME: connecting content for future TV & video
ConnectME: connecting content for future TV & videoConnectME: connecting content for future TV & video
ConnectME: connecting content for future TV & video
 
Pacify based video retrieval system
Pacify based video retrieval systemPacify based video retrieval system
Pacify based video retrieval system
 
ICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch session
 

Kürzlich hochgeladen

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Kürzlich hochgeladen (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Sirio

  • 1. Sirio: an Ontology-based Web Search Engine for Videos Thomas Alisi, Marco Bertini, Gianpaolo D’Amico, Alberto Del Bimbo, Andrea Ferracani, Federico Pernici and Giuseppe Serra Media Integration and Communication Center, University of Florence, Italy {alisi, bertini, damico, delbimbo, ferracani, pernici, serra}@dsi.unifi.it http://www.micc.unifi.it/vim ABSTRACT In this technical demonstration we show a web video search engine based on ontologies, the Sirio1 system, that has been developed within the EU VidiVideo project. The goal of the system is to provide a search engine for videos for both technical and non-technical users. In fact, the system has different interfaces that permit different query modalities: free-text, natural language, graphical composition of con- cepts using boolean and temporal relations and query by visual example. In addition, the ontology structure is ex- ploited to encode semantic relations between concepts per- mitting, for example, to expand queries to synonyms and concept specializations. Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval—Search process; H.3.5 [Information Storage and Retrieval]: Online Information Services— Web-based services General Terms Algorithms, Experimentation Keywords Video retrieval, ontologies, web services 1. INTRODUCTION Video search engines are the product of progress in many technologies: visual and audio analysis, machine learning techniques, as well as visualization and interaction. The cur- rent video search engines are based on lexicons of semantic concepts and perform keyword-based queries [1]. These sys- tems are generally desktop applications or have simple web interfaces that show the results of the query as a ranked list 1 Sirio was the hound of the mythical hunter Orion. It was a dog so swift that no prey could escape it. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. MM’09, October 19–24, 2009, Beijing, China. Copyright 2009 ACM X-XXXXX-XX-X/XX/XX ...$10.00. of keyframes [2, 3]. These systems do not let users to per- form composite queries that can include temporal relations between concepts and do not allow to look for concepts that are not in the lexicon. In addition, desktop applications re- quire installation on the end-user computer and can not be used in a distributed environment. In this demonstration we present the Sirio system, a web video search engine that allows semantic retrieval by content for different domains (broadcast news, surveillance, cultural heritage documentaries) with query interaction and visual- ization. The system permits different query modalities (free text, natural language, graphical composition of concepts using boolean and temporal relations and query by visual example) and visualizations, resulting in an advanced tool for retrieval and exploration of video archives for both tech- nical and non-technical users. In addition the use of ontolo- gies permits to exploit semantic relations between concepts through reasoning. Finally our web system, using the Rich Internet Application paradigm (RIA), does not require any installation and provides a responsive user interface. 2. THE SYSTEM The Sirio system2 is composed by three different inter- faces: a GUI to build composite queries that may include boolean/temporal operators and visual examples, a natural language interface for simpler queries with boolean/temporal operators, a free-text interface for Google-like searches. In all the interfaces it is possible to extend queries adding syn- onyms and concept specializations through ontology reason- ing and the use of WordNet. Consider, for instance, a query “Find shots with animal”: the concept specializations ex- pansion through ontology structure permits to retrieve not only the shots annotated with animal, but also those anno- tated with its specializations (dogs, cats, etc.). In particu- lar, WordNet query expansion, using synonyms, is required when using natural language and free-text queries, since it is not possible to force the user to formulate a query selecting terms from a lexicon, as is done using the GUI interface. The search engine uses an ontology that has been created automatically from a flat lexicon, using WordNet to cre- ate concept relations (is a, is part of and has part). The ontology is modelled following the Dynamic Pictorially En- riched Ontology model [4], that includes both concepts and visual concept prototypes. These prototypes represent the different visual modalities in which a concept can manifest; they can be selected by the users to perform query by exam- 2 http://deckard.micc.unifi.it/sirio/
  • 2. Figure 1: Search interfaces: natural language search; Google-like search; GUI query builder. ple. Concepts, concepts relations, video annotations and vi- sual concept prototypes are defined using the standard Web Ontology Language (OWL) so that the ontology can be eas- ily reused and shared. The queries created in each interface are translated by the search engine into SPARQL, the W3C standard ontology query language. The system is based on the Rich Internet Application paradigm, using a client side Flash virtual machine which can execute instructions on the client computer. RIAs can avoid the usual slow and synchronous loop for user interac- tions, typical of web based environments that use only the HTML widgets available to standard browsers. This allows to implement a visual querying mechanism that exhibits a look and feel approaching that of a desktop environment, with the fast response that is expected by users. With this solution the application installation is not required, since the system is updated on the server, and run anywhere re- gardless of what operating system is used. The system backend is currently based on open source tools (i.e. Apache Tomcat and Red 5 video streaming server) or freely available commercial tools (Adobe Media Server has a free developer edition). The RTMP video stream- ing protocol is used. The search engine is developed in Java and supports multiple ontologies and ontology reasoning ser- vices. Audio-visual concepts are automatically annotated using the VidiVideo annotation engine [2]. The search re- sults are in RSS 2.0 XML format with paging, so that they can be treated as RSS feeds. Results of the query are shown in the interface and for each video clip of the result set is shown the first frame. These frames are obtained from the video streaming server, and are shown within a small video player. Users can then play the video sequence and, if in- terested, zoom in each result displaying it in a larger player, that provides more details on the video metadata and al- lows better video browsing. The user interface is written in Adobe Flex and Action Script 3. All the modules of the system are connected using HTTP POST, XML and SOAP web services. 3. DEMONSTRATION We demonstrate the search modalities of the system in three different video domains: broadcast news, video surveil- lance and cultural heritage documentaries. We show how each interface is suitable for different users: the GUI in- terface allows to build composite queries that take into ac- count also metadata, as required by professional archivists, the natural language interface allows to build simple queries with boolean and temporal relations between concepts, the free-text interface provides the popular Google-like search. Acknowledgments. This work is partially supported by the EU IST VidiVideo project (www.vidivideo.info - contract FP6-045547). 4. REFERENCES [1] A. F. Smeaton, P. Over and W. Kraai. High-Level Feature Detection from Video in TRECVid: a 5-Year Retrospective of Achievements, Multimedia Content Analysis, Theory and Applications, 151–174, 2009, Springer Verlag. [2] C. G. M. Snoek et al. The MediaMill TRECVID 2008 Semantic Video Search Engine, In Proceedings of the 6th TRECVID Workshop, 2008. [3] A. Natsev, J. R. Smith, J. Teˇsi´c, L. Xie, R. Yan, W. Jiang, M. Merler IBM Research TRECVID-2008 Video Retrieval System, In Proceedings of the 6th TRECVID Workshop, 2008. [4] M. Bertini, R. Cucchiara, A. Del Bimbo, C. Grana, G. Serra, C. Torniai and R. Vezzani. Dynamic Pictorially Enriched Ontologies for Video Digital Libraries. In IEEE Multimedia, to appear, 2009.