Weitere ähnliche Inhalte Ähnlich wie Presentation 16 may morning casestudy 2 xavier jacques jourion (20) Presentation 16 may morning casestudy 2 xavier jacques jourion2. GEMS
The future is now
Semantics for [audiovisual] dummies
Xavier Jacques-Jourion
FIAT-IFTA Media Management Seminar
Beeld & Geluid, Hilversum, May 16th, 2013
3. ©
2013
RTBF
-‐
DGTE
-‐
Agenda
• Introduction
• Semantics 101
• Linked Data
• Demonstration
• Conclusion
3
5. ©
2013
RTBF
-‐
DGTE
-‐
• Public broadcaster
• French-speaking
• 3 TV stations
6 Radio stations
Internet portals
• Around 200.000 hours of archives (radio & TV)
• Digitisation in progress (SONUMA)
5
8. ©
2013
RTBF
-‐
DGTE
-‐
From data to knowledge
8
1315730760
11/09/2001 - 08:46 EST
First plane hits the World Trade Center
North Tower in New York
9. ©
2013
RTBF
-‐
DGTE
-‐
From data to knowledge
9
Raw data
Information / Content
Knowledge
10. ©
2013
RTBF
-‐
DGTE
-‐
Data triplets
• Data inside the system is qualified
• Model: subject - predicate - object
• Examples:
§ Steve is Peter’s son.
§ Peter is John’s brother.
10
has the colourthe sky blue
Subject ObjectPredicate
11. ©
2013
RTBF
-‐
DGTE
-‐
From searching to knowing
11
12. As of September 2011
Music
Brainz
(zitgist)
P20
Turismo
de
Zaragoza
yovisto
Yahoo!
Geo
Planet
YAGO
World
Fact-
book
El
Viajero
Tourism
WordNet
(W3C)
WordNet
(VUA)
VIVO UF
VIVO
Indiana
VIVO
Cornell
VIAF
URI
Burner
Sussex
Reading
Lists
Plymouth
Reading
Lists
UniRef
UniProt
UMBEL
UK Post-
codes
legislation
data.gov.uk
Uberblic
UB
Mann-
heim
TWC LOGD
Twarql
transport
data.gov.
uk
Traffic
Scotland
theses.
fr
Thesau-
rus W
totl.net
Tele-
graphis
TCM
Gene
DIT
Taxon
Concept
Open
Library
(Talis)
tags2con
delicious
t4gm
info
Swedish
Open
Cultural
Heritage
Surge
Radio
Sudoc
STW
RAMEAU
SH
statistics
data.gov.
uk
St.
Andrews
Resource
Lists
ECS
South-
ampton
EPrints
SSW
Thesaur
us
Smart
Link
Slideshare
2RDF
semantic
web.org
Semantic
Tweet
Semantic
XBRL
SW
Dog
Food
Source Code
Ecosystem
Linked Data
US SEC
(rdfabout)
Sears
Scotland
Geo-
graphy
Scotland
Pupils &
Exams
Scholaro-
meter
WordNet
(RKB
Explorer)
Wiki
UN/
LOCODE
Ulm
ECS
(RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-
castle
LAAS
KISTI
JISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints dotAC
DEPLOY
DBLP
(RKB
Explorer)
Crime
Reports
UK
Course-
ware
CORDIS
(RKB
Explorer)
CiteSeer
Budapest
ACM
riese
Revyu
research
data.gov.
ukRen.
Energy
Genera-
tors
reference
data.gov.
uk
Recht-
spraak.
nl
RDF
ohloh
Last.FM
(rdfize)
RDF
Book
Mashup
Rådata
nå!
PSH
Product
Types
Ontology
Product
DB
PBAC
Poké-
pédia
patents
data.go
v.uk
Ox
Points
Ord-
nance
Survey
Openly
Local
Open
Library
Open
Cyc
Open
Corpo-
rates
Open
Calais
OpenEI
Open
Election
Data
Project
Open
Data
Thesau-
rus
Ontos
News
Portal
OGOLOD
Janus
AMP
Ocean
Drilling
Codices
New
York
Times
NVD
ntnusc
NTU
Resource
Lists
Norwe-
gian
MeSH
NDL
subjects
ndlna
my
Experi-
ment
Italian
Museums
medu-
cator
MARC
Codes
List
Man-
chester
Reading
Lists
Lotico
Weather
Stations
London
Gazette
LOIUS
Linked
Open
Colors
lobid
Resources
lobid
Organi-
sations
LEM
Linked
MDB
LinkedL
CCN
Linked
GeoData
LinkedCT
Linked
User
Feedback
LOV
Linked
Open
Numbers
LODE
Eurostat
(Ontology
Central)
Linked
EDGAR
(Ontology
Central)
Linked
Crunch-
base
lingvoj
Lichfield
Spen-
ding
LIBRIS
Lexvo
LCSH
DBLP
(L3S)
Linked
Sensor Data
(Kno.e.sis)
Klapp-
stuhl-
club
Good-
win
Family
National
Radio-
activity
JP
Jamendo
(DBtune)
Italian
public
schools
ISTAT
Immi-
gration
iServe
IdRef
Sudoc
NSZL
Catalog
Hellenic
PD
Hellenic
FBD
Piedmont
Accomo-
dations
GovTrack
GovWILD
Google
Art
wrapper
gnoss
GESIS
GeoWord
Net
Geo
Species
Geo
Names
Geo
Linked
Data
GEMET
GTAA
STITCH
SIDER
Project
Guten-
berg
Medi
Care
Euro-
stat
(FUB)
EURES
Drug
Bank
Disea-
some
DBLP
(FU
Berlin)
Daily
Med
CORDIS
(FUB)
Freebase
flickr
wrappr
Fishes
of Texas
Finnish
Munici-
palities
ChEMBL
FanHubz
Event
Media
EUTC
Produc-
tions
Eurostat
Europeana
EUNIS
EU
Insti-
tutions
ESD
stan-
dards
EARTh
Enipedia
Popula-
tion (En-
AKTing)
NHS
(En-
AKTing) Mortality
(En-
AKTing)
Energy
(En-
AKTing)
Crime
(En-
AKTing)
CO2
Emission
(En-
AKTing)
EEA
SISVU
educatio
n.data.g
ov.uk
ECS
South-
ampton
ECCO-
TCP
GND
Didactal
ia
DDC Deutsche
Bio-
graphie
data
dcs
Music
Brainz
(DBTune)
Magna-
tune
John
Peel
(DBTune)
Classical
(DB
Tune)
Audio
Scrobbler
(DBTune)
Last.FM
artists
(DBTune)
DB
Tropes
Portu-
guese
DBpedia
dbpedia
lite
Greek
DBpedia
DBpedia
data-
open-
ac-uk
SMC
Journals
Pokedex
Airports
NASA
(Data
Incu-
bator)
Music
Brainz
(Data
Incubator)
Moseley
Folk
Metoffice
Weather
Forecasts
Discogs
(Data
Incubator)
Climbing
data.gov.uk
intervals
Data
Gov.ie
data
bnf.fr
Cornetto
reegle
Chronic-
ling
America
Chem2
Bio2RDF
Calames
business
data.gov.
uk
Bricklink
Brazilian
Poli-
ticians
BNB
UniSTS
UniPath
way
UniParc
Taxono
my
UniProt
(Bio2RDF)
SGD
Reactome
PubMed
Pub
Chem
PRO-
SITE
ProDom
Pfam
PDB
OMIM
MGI
KEGG
Reaction
KEGG
Pathway
KEGG
Glycan
KEGG
Enzyme
KEGG
Drug
KEGG
Com-
pound
InterPro
Homolo
Gene
HGNC
Gene
Ontology
GeneID
Affy-
metrix
bible
ontology
BibBase
FTS
BBC
Wildlife
Finder
BBC
Program
mes BBC
Music
Alpine
Ski
Austria
LOCAH
Amster-
dam
Museum
AGROV
OC
AEMET
US Census
(rdfabout)
Media
Geographic
Publications
Government
Cross-domain
Life sciences
User-generated content
©
2013
RTBF
-‐
DGTE
-‐
Linked Open Data (LOD)
12
14. ©
2013
RTBF
-‐
DGTE
-‐
Do not read this.
Linked Data is about using the Web to connect
related data that wasn't previously linked, or
using the Web to lower the barriers to linking
data currently linked using other methods. More
specifically, Wikipedia defines Linked Data as "a
term used to describe a recommended best
practice for exposing, sharing, and connecting
pieces of data, information, and knowledge on
the Semantic Web using URIs and RDF."
14
19. ©
2013
RTBF
-‐
DGTE
-‐
GEMS
• Goal: build a proof of concept for a semantic-
based multimedia browser interface, using raw
extracts from our media databases.
• De-mystify the field of semantics.
• Developed with two external partners:
19
20. ©
2013
RTBF
-‐
DGTE
-‐
Project intentions
20
• Use semantics to assemble the knowledge
previously spread across multiple databases.
• Connect to public data sources using LOD.
• Propose a new research tool for journalists and
production assistants.
• Cross-media searches.
• Speech-to-text engine.
• Ideally: change the way research is done by giving
access to the knowledge harvested from the
different media collection(s).
21. ©
2013
RTBF
-‐
DGTE
-‐
Principle
21
Nétia
Tramontane
Radio
Dalet
Tramontane
TV
GEMS
22. ©
2013
RTBF
-‐
DGTE
-‐
Content
22
• Medias linked to the end of the Belgian
government crisis in July 2011
§ 4 “JT 19h30”, week starting July 4th, 2011
§ 3 “Invités de Matin Première” (Radio show)
§ 1 “Mise au point” (August 28, 2011)
§ Metadata linked to the above medias