SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
RTÉ Content Discovery Project
Christophe Debruyne
c.debruyne@ria.ie
christophe.debruyne@insight-centre.orgchristophe.debruyne@insight-centre.org
MDN Workshop -- 4th of June 2014
Outline
• Context
• Goal and Challenges of the RTÉ Content Discovery Project
• Tasks and Data Annotation• Tasks and Data Annotation
• EBU Core – Identification of problems
• Addressing the issues
• Using the ontology
• Conclusions and Recommendations
Context
RTÉ, Ireland's National
Television and Radio
Broadcaster
National trusted digital
repository for Ireland's
social and cultural data.
Centre for Data Analytics
Documents
Television
Radio
Stills
Linking and preserving data
held by Irish Institutions with
central internet access point.
• Standards
• Cataloguing
• Archiving
• Preservation
• Insight @ NUIG = DERI
• Semantic Technologies
• Linked Data
• Data Analytics Platform
Goal of the RTÉ Content
Discovery Project
• Discover implicit knowledge
• across the different archives
• and the Web of Data
• To facilitate internal workflows (e.g., search)
• For wider reuse and repackaging RTÉ’s
Documents
Television
Radio
Stills
• For wider reuse and repackaging RTÉ’s
information
• Challenges
• Heterogeneous databases
• Different guidelines and practices
• Legacy data (from previous systems)
• … “Linking Open Data cloud diagram,
by R. Cyganiak and A. Jentzsch.
http://lod-cloud.net/”
Part of a wider ambition …
OUTCOMES FOR RTÉ
RTÉ Content Discovery
In this presentation we focus on Television and Radio archives
Documents Television Radio Stills
• In this presentation we focus on Television and Radio archives
• The Television and Radio archives
• Are maintained on two different instances of the same system
• A system that is EBU Core “compatible”
• Different content, different guidelines, …
Three main tasks
• Annotate the data.
• Using relevant standards, ontologies and vocabularies.
• Resource Description Framework (RDF).
• Obtain an integrated view of the different archives by
creating links between the RDF representations of RTÉ’s
archival assets across the different archives.
• Apply advanced methods for discovering related data for a
given subject in external sources such as the Linked Data
Cloud.
Data annotation
Relational
Database
D2RQ RDF Dump Triplestore
Television
Radio
Map symbols of
database to
predicates
(relations and
concepts) in
chosen ontologies
/ vocabularies
Use D2RQ to
generate RDF
dump
Store RDF dump in
adequate triple
store (Jena TDB)
Which ontologies?
• Dublin Core, DC Terms
• Foaf
• EBU Core OWL
• …
EBU Core OWL
• The RTÉ Content Discovery platform will rely on Semantic
Web technologies to reason. Ontologies will therefore need
to be correct.
• But … while adopting the EBU Core OWL ontology, several
problems where identified.
• We contacted EBU to resolve these issues.
• We provide an overview of some of these problems.
Problems
• (1) Forgotten concept unions
• The property ebucore:description has multiple domain axioms.
<rdfs:domain rdf:resource="&ebu;BusinessObject"/>
<rdfs:domain rdf:resource="&ebu;MediaResource"/>
• Unintentionally the wrong implicit information can be inferred.• Unintentionally the wrong implicit information can be inferred.
• (2a) Property unsatisfiability – via class axioms
<owl:Class rdf:about="&ebu;BusinessObject">
… <owl:disjointWith rdf:resource="&ebu;Resource"/> …
</owl:Class>
• Because of (1) and (2), the property description could not be
used
Problems
• (2b) Property unsatisfiability – role hierarchies and datatypes
• Duration has the range xsd:string
• The subproperties of duration have other ranges (e.g., double in
the case of duration in edit units)
• Because each subproperty also inherits the range of the• Because each subproperty also inherits the range of the
superproperty, all instances in the object of that property must
be at the same time a string, and a double. This type conflict
results in a contradiction.
• With (2a) and (2b) we identified 40 properties that lead to
problems.
Problems
• (3) Inconsistencies between formal and informal definitions
• BusinessObject is defined as: "An image, a document, an annotation
[…], a tag […], or an audiovisual media resource […]. Other types of
BusinessObjects may be defined as subclasses.“BusinessObjects may be defined as subclasses.“
• Resource is defined as: "A manifestation of a BusinessObject." and
disjoint with BusinessObjects. Meaning no individual can be an
element of BusinessObjects and Resources at the same time.
• The domain of a title is BusinessObject, yet, it’s definition is:
"Specifies the title or name given to the resource. […]"
Problems
• (4) User readable labels
• Many different properties have the same human readable label,
which could confuse the end user – e.g., when generating an
Interface.
• E.g., there were 11 properties with the label “Name”• E.g., there were 11 properties with the label “Name”
• Some properties had empty labels
• (5) Roles – Loss of context
• Agents were related to Business Objects (BO)
• Agents were related to a Role
• But … a role did not relate to agents in relationship with a BO
• This lead to a loss of context.
Addressing the issues
• Problems were addressed over email.
• The discussions are “lost”, traces are only known to us …
• The ontology-engineering activities of EBU Core should adopt
appropriate methods and tools for collaboration.
• Participation of others
• Traceability (!)
• The ontology is still being developed as we go along, and we
have been able to make (parts of it) work…
Using the ontology
Conclusions and Recommendations
• RTÉ Archives aims at a wider reuse and repackaging of their
archival content on digital platforms through the innovative
use of Semantic and Linked Data technologies.
• We adopted the EBU Core OWL ontology for annotating the
television and radio archives, yet identified some issues.
• We adopted the EBU Core OWL ontology for annotating the
television and radio archives, yet identified some issues.
• We collaborated on resolving those issues together with EBU
• However, we feel that appropriate collaborative methods and
tools should be adopted to facilitate the ontology-
engineering process and – more importantly – enable other
to participate AND have visible traceability of the decisions.
References
• D2RQ, http://d2rq.org/
• Digital Repository of Ireland, http://www.dri.ie/
• Insight, http://www.insight-centre.org/• Insight, http://www.insight-centre.org/
• Jena TDB, http://jena.apache.org/documentation/tdb/
• RTÉ Archives, http://www.rte.ie/archives

Weitere ähnliche Inhalte

Ähnlich wie RTÉ Content Discovery Project - Christophe Debruyne

What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataMinerva Lin
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked DataAdrian Stevenson
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowTerry Reese
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesDorothea Salo
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsMelanie Courtot
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)Vladimir Alexiev, PhD, PMP
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model Valentine Charles
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623Dov Winer
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
ArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolMark Matienzo
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksRaul Palma
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 

Ähnlich wie RTÉ Content Discovery Project - Christophe Debruyne (20)

What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural data
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage Workflow
 
Digital Content Management
Digital Content ManagementDigital Content Management
Digital Content Management
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web tools
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
ArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management Tool
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacks
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
The future of the DCC
The future of the DCCThe future of the DCC
The future of the DCC
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 

Mehr von dri_ireland

NORFest 2023 Lightning Talks Session Two
NORFest 2023 Lightning Talks Session TwoNORFest 2023 Lightning Talks Session Two
NORFest 2023 Lightning Talks Session Twodri_ireland
 
NORFest 2023: Early Career Researcher Panel on Research Assessment
NORFest 2023: Early Career Researcher Panel on Research AssessmentNORFest 2023: Early Career Researcher Panel on Research Assessment
NORFest 2023: Early Career Researcher Panel on Research Assessmentdri_ireland
 
NORFest 2023: National Open Research Fund 2023, Projects Launch
NORFest 2023: National Open Research Fund 2023, Projects LaunchNORFest 2023: National Open Research Fund 2023, Projects Launch
NORFest 2023: National Open Research Fund 2023, Projects Launchdri_ireland
 
NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three dri_ireland
 
NORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session OneNORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session Onedri_ireland
 
NORFest2023 Keynote address: Chelle Gentemann (NASA)
NORFest2023 Keynote address: Chelle Gentemann (NASA)NORFest2023 Keynote address: Chelle Gentemann (NASA)
NORFest2023 Keynote address: Chelle Gentemann (NASA)dri_ireland
 
The Archiving Reproductive Health project as a FAIR data resource for humanit...
The Archiving Reproductive Health project as a FAIR data resource for humanit...The Archiving Reproductive Health project as a FAIR data resource for humanit...
The Archiving Reproductive Health project as a FAIR data resource for humanit...dri_ireland
 
Developing a self-care protocol for working with potentially traumatic data: ...
Developing a self-care protocol for working with potentially traumatic data: ...Developing a self-care protocol for working with potentially traumatic data: ...
Developing a self-care protocol for working with potentially traumatic data: ...dri_ireland
 
An Introduction to the Digital Repository of Ireland
An Introduction to the Digital Repository of Ireland An Introduction to the Digital Repository of Ireland
An Introduction to the Digital Repository of Ireland dri_ireland
 
DRI Copyright and Licencing_UCC_Mar23.pptx
DRI Copyright and Licencing_UCC_Mar23.pptxDRI Copyright and Licencing_UCC_Mar23.pptx
DRI Copyright and Licencing_UCC_Mar23.pptxdri_ireland
 
The Digital Repository of Ireland Digital Preservation and Research Sustainab...
The Digital Repository of Ireland Digital Preservation and Research Sustainab...The Digital Repository of Ireland Digital Preservation and Research Sustainab...
The Digital Repository of Ireland Digital Preservation and Research Sustainab...dri_ireland
 
DRI's role in WorldFAIR: Cultural Heritage / Image Sharing
DRI's role in WorldFAIR: Cultural Heritage / Image SharingDRI's role in WorldFAIR: Cultural Heritage / Image Sharing
DRI's role in WorldFAIR: Cultural Heritage / Image Sharingdri_ireland
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementdri_ireland
 
Archiving Ports, Ports as Archives
Archiving Ports, Ports as ArchivesArchiving Ports, Ports as Archives
Archiving Ports, Ports as Archivesdri_ireland
 
Preservation, Access, Discovery
Preservation, Access, DiscoveryPreservation, Access, Discovery
Preservation, Access, Discoverydri_ireland
 
Dublin in the Fingal Archives
Dublin in the Fingal ArchivesDublin in the Fingal Archives
Dublin in the Fingal Archivesdri_ireland
 
Dublin Ghost Signs
Dublin Ghost SignsDublin Ghost Signs
Dublin Ghost Signsdri_ireland
 
Mapping Memories: Participatory Media, Place-Based Stories, Refugee Youth
Mapping Memories: Participatory Media, Place-Based Stories, Refugee YouthMapping Memories: Participatory Media, Place-Based Stories, Refugee Youth
Mapping Memories: Participatory Media, Place-Based Stories, Refugee Youthdri_ireland
 
Supporting Activists to Preserve Video Documentation
Supporting Activists to Preserve Video Documentation Supporting Activists to Preserve Video Documentation
Supporting Activists to Preserve Video Documentation dri_ireland
 
Making the Future
Making the FutureMaking the Future
Making the Futuredri_ireland
 

Mehr von dri_ireland (20)

NORFest 2023 Lightning Talks Session Two
NORFest 2023 Lightning Talks Session TwoNORFest 2023 Lightning Talks Session Two
NORFest 2023 Lightning Talks Session Two
 
NORFest 2023: Early Career Researcher Panel on Research Assessment
NORFest 2023: Early Career Researcher Panel on Research AssessmentNORFest 2023: Early Career Researcher Panel on Research Assessment
NORFest 2023: Early Career Researcher Panel on Research Assessment
 
NORFest 2023: National Open Research Fund 2023, Projects Launch
NORFest 2023: National Open Research Fund 2023, Projects LaunchNORFest 2023: National Open Research Fund 2023, Projects Launch
NORFest 2023: National Open Research Fund 2023, Projects Launch
 
NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three
 
NORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session OneNORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session One
 
NORFest2023 Keynote address: Chelle Gentemann (NASA)
NORFest2023 Keynote address: Chelle Gentemann (NASA)NORFest2023 Keynote address: Chelle Gentemann (NASA)
NORFest2023 Keynote address: Chelle Gentemann (NASA)
 
The Archiving Reproductive Health project as a FAIR data resource for humanit...
The Archiving Reproductive Health project as a FAIR data resource for humanit...The Archiving Reproductive Health project as a FAIR data resource for humanit...
The Archiving Reproductive Health project as a FAIR data resource for humanit...
 
Developing a self-care protocol for working with potentially traumatic data: ...
Developing a self-care protocol for working with potentially traumatic data: ...Developing a self-care protocol for working with potentially traumatic data: ...
Developing a self-care protocol for working with potentially traumatic data: ...
 
An Introduction to the Digital Repository of Ireland
An Introduction to the Digital Repository of Ireland An Introduction to the Digital Repository of Ireland
An Introduction to the Digital Repository of Ireland
 
DRI Copyright and Licencing_UCC_Mar23.pptx
DRI Copyright and Licencing_UCC_Mar23.pptxDRI Copyright and Licencing_UCC_Mar23.pptx
DRI Copyright and Licencing_UCC_Mar23.pptx
 
The Digital Repository of Ireland Digital Preservation and Research Sustainab...
The Digital Repository of Ireland Digital Preservation and Research Sustainab...The Digital Repository of Ireland Digital Preservation and Research Sustainab...
The Digital Repository of Ireland Digital Preservation and Research Sustainab...
 
DRI's role in WorldFAIR: Cultural Heritage / Image Sharing
DRI's role in WorldFAIR: Cultural Heritage / Image SharingDRI's role in WorldFAIR: Cultural Heritage / Image Sharing
DRI's role in WorldFAIR: Cultural Heritage / Image Sharing
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Archiving Ports, Ports as Archives
Archiving Ports, Ports as ArchivesArchiving Ports, Ports as Archives
Archiving Ports, Ports as Archives
 
Preservation, Access, Discovery
Preservation, Access, DiscoveryPreservation, Access, Discovery
Preservation, Access, Discovery
 
Dublin in the Fingal Archives
Dublin in the Fingal ArchivesDublin in the Fingal Archives
Dublin in the Fingal Archives
 
Dublin Ghost Signs
Dublin Ghost SignsDublin Ghost Signs
Dublin Ghost Signs
 
Mapping Memories: Participatory Media, Place-Based Stories, Refugee Youth
Mapping Memories: Participatory Media, Place-Based Stories, Refugee YouthMapping Memories: Participatory Media, Place-Based Stories, Refugee Youth
Mapping Memories: Participatory Media, Place-Based Stories, Refugee Youth
 
Supporting Activists to Preserve Video Documentation
Supporting Activists to Preserve Video Documentation Supporting Activists to Preserve Video Documentation
Supporting Activists to Preserve Video Documentation
 
Making the Future
Making the FutureMaking the Future
Making the Future
 

Kürzlich hochgeladen

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 

RTÉ Content Discovery Project - Christophe Debruyne

  • 1. RTÉ Content Discovery Project Christophe Debruyne c.debruyne@ria.ie christophe.debruyne@insight-centre.orgchristophe.debruyne@insight-centre.org MDN Workshop -- 4th of June 2014
  • 2. Outline • Context • Goal and Challenges of the RTÉ Content Discovery Project • Tasks and Data Annotation• Tasks and Data Annotation • EBU Core – Identification of problems • Addressing the issues • Using the ontology • Conclusions and Recommendations
  • 3. Context RTÉ, Ireland's National Television and Radio Broadcaster National trusted digital repository for Ireland's social and cultural data. Centre for Data Analytics Documents Television Radio Stills Linking and preserving data held by Irish Institutions with central internet access point. • Standards • Cataloguing • Archiving • Preservation • Insight @ NUIG = DERI • Semantic Technologies • Linked Data • Data Analytics Platform
  • 4. Goal of the RTÉ Content Discovery Project • Discover implicit knowledge • across the different archives • and the Web of Data • To facilitate internal workflows (e.g., search) • For wider reuse and repackaging RTÉ’s Documents Television Radio Stills • For wider reuse and repackaging RTÉ’s information • Challenges • Heterogeneous databases • Different guidelines and practices • Legacy data (from previous systems) • … “Linking Open Data cloud diagram, by R. Cyganiak and A. Jentzsch. http://lod-cloud.net/”
  • 5. Part of a wider ambition …
  • 7. RTÉ Content Discovery In this presentation we focus on Television and Radio archives Documents Television Radio Stills • In this presentation we focus on Television and Radio archives • The Television and Radio archives • Are maintained on two different instances of the same system • A system that is EBU Core “compatible” • Different content, different guidelines, …
  • 8. Three main tasks • Annotate the data. • Using relevant standards, ontologies and vocabularies. • Resource Description Framework (RDF). • Obtain an integrated view of the different archives by creating links between the RDF representations of RTÉ’s archival assets across the different archives. • Apply advanced methods for discovering related data for a given subject in external sources such as the Linked Data Cloud.
  • 9. Data annotation Relational Database D2RQ RDF Dump Triplestore Television Radio Map symbols of database to predicates (relations and concepts) in chosen ontologies / vocabularies Use D2RQ to generate RDF dump Store RDF dump in adequate triple store (Jena TDB) Which ontologies? • Dublin Core, DC Terms • Foaf • EBU Core OWL • …
  • 10. EBU Core OWL • The RTÉ Content Discovery platform will rely on Semantic Web technologies to reason. Ontologies will therefore need to be correct. • But … while adopting the EBU Core OWL ontology, several problems where identified. • We contacted EBU to resolve these issues. • We provide an overview of some of these problems.
  • 11. Problems • (1) Forgotten concept unions • The property ebucore:description has multiple domain axioms. <rdfs:domain rdf:resource="&ebu;BusinessObject"/> <rdfs:domain rdf:resource="&ebu;MediaResource"/> • Unintentionally the wrong implicit information can be inferred.• Unintentionally the wrong implicit information can be inferred. • (2a) Property unsatisfiability – via class axioms <owl:Class rdf:about="&ebu;BusinessObject"> … <owl:disjointWith rdf:resource="&ebu;Resource"/> … </owl:Class> • Because of (1) and (2), the property description could not be used
  • 12. Problems • (2b) Property unsatisfiability – role hierarchies and datatypes • Duration has the range xsd:string • The subproperties of duration have other ranges (e.g., double in the case of duration in edit units) • Because each subproperty also inherits the range of the• Because each subproperty also inherits the range of the superproperty, all instances in the object of that property must be at the same time a string, and a double. This type conflict results in a contradiction. • With (2a) and (2b) we identified 40 properties that lead to problems.
  • 13. Problems • (3) Inconsistencies between formal and informal definitions • BusinessObject is defined as: "An image, a document, an annotation […], a tag […], or an audiovisual media resource […]. Other types of BusinessObjects may be defined as subclasses.“BusinessObjects may be defined as subclasses.“ • Resource is defined as: "A manifestation of a BusinessObject." and disjoint with BusinessObjects. Meaning no individual can be an element of BusinessObjects and Resources at the same time. • The domain of a title is BusinessObject, yet, it’s definition is: "Specifies the title or name given to the resource. […]"
  • 14. Problems • (4) User readable labels • Many different properties have the same human readable label, which could confuse the end user – e.g., when generating an Interface. • E.g., there were 11 properties with the label “Name”• E.g., there were 11 properties with the label “Name” • Some properties had empty labels • (5) Roles – Loss of context • Agents were related to Business Objects (BO) • Agents were related to a Role • But … a role did not relate to agents in relationship with a BO • This lead to a loss of context.
  • 15. Addressing the issues • Problems were addressed over email. • The discussions are “lost”, traces are only known to us … • The ontology-engineering activities of EBU Core should adopt appropriate methods and tools for collaboration. • Participation of others • Traceability (!) • The ontology is still being developed as we go along, and we have been able to make (parts of it) work…
  • 17.
  • 18. Conclusions and Recommendations • RTÉ Archives aims at a wider reuse and repackaging of their archival content on digital platforms through the innovative use of Semantic and Linked Data technologies. • We adopted the EBU Core OWL ontology for annotating the television and radio archives, yet identified some issues. • We adopted the EBU Core OWL ontology for annotating the television and radio archives, yet identified some issues. • We collaborated on resolving those issues together with EBU • However, we feel that appropriate collaborative methods and tools should be adopted to facilitate the ontology- engineering process and – more importantly – enable other to participate AND have visible traceability of the decisions.
  • 19. References • D2RQ, http://d2rq.org/ • Digital Repository of Ireland, http://www.dri.ie/ • Insight, http://www.insight-centre.org/• Insight, http://www.insight-centre.org/ • Jena TDB, http://jena.apache.org/documentation/tdb/ • RTÉ Archives, http://www.rte.ie/archives