SlideShare ist ein Scribd-Unternehmen logo
1 von 10
Europeana and RDF data validation
Antoine Isaac
RDF Validation Workshop
10-11 September 2013
Data validation on the Europeana Data Model
EDM is RDF, but Europeana needs to enforce constraints on the
datasets sent by its providers
 Matching basic Europeana functional requirements, e.g.:
• at most one edm:isShownBy
• at most one edm:isShownAt
• either edm:isShownBy or edm:isShownAt is mandatory
 General data quality, e.g.:
• at least a dc:title or a dc:description
http://pro.europeana.eu/edm-documentation
Note: these constraints are often on re-used classes and properties
EDM “Mapping Guidelines”
 Template-based instructions for Europeana providers
property value type cardinality
ore:Aggregation
Machine-readable specs by OWL ontology?
 We have an OWL version of EDM
 But as we know: OWL is good for writing down constraints, not for validation
 And in OWL some EDM constraints amount to adding semantics to classes
and properties that already exist
an ore:Aggregation should have at least 1 edm:isShownAt or 1 edm:isShownBy
https://github.com/europeana/corelib/blob/master/c
orelib-solr-definitions/src/main/resources/eu/rdf/
(let’s be honest: we were not ready for full RDF/OWL compatibility anyway…)
EDM is implemented as XML Schema (for RDF data!)
<sequence>
[…]
<element ref="edm:dataProvider" maxOccurs="1" minOccurs="1"/>
<element ref="edm:isShownAt" maxOccurs="1" minOccurs="0"/>
<element ref="edm:isShownBy" maxOccurs="1" minOccurs="0"/>
[…]
</sequence>
With Schematron rules:
<sch:pattern>
<sch:rule context="ore:Aggregation">
<sch:assert test="edm:isShownAt or edm:isShownBy">
[Error message]
</sch:assert>
</sch:rule>
</sch:pattern>
Not ideal of course
 Document-centric approach to validation
 Extra constraints, especially order of elements
 2 constraint systems co-existing
Falling back to XML Schema
EDM as a Dublin Core application profile?
[Cf. Karen and Tom tomorrow]
An example in the “Description Set Profiles” constraint language:
DescriptionSet [EDM-Providers]
Description [Aggregation-Providers]
Resource Class
ore:Aggregation
Statement
Property
edm:isShownBy
edm:isShownAt
Min Occurs
1
http://dublincore.org/documents/dc-dsp/
Could be converted to other formalisms
SPIN:
ore:Aggregation
spin:constraint
[ a sp:Ask ;
sp:text """
# either isShownBy or isShownAt must be present
ASK WHERE {
{?this isShownBy ?image } UNION {?this isShownBy ?page }
}"""
] .
Stardog ICV:
Class: ore:Aggregation
SubClassOf: min 1 edm:isShownBy or min 1 edm:isShownAt
Issue: still looks like adding general semantics to ore:Aggregation…
Making our requirements clearer
Level 1: Enabling basic validation
 Expressivity for individual constraints
Needs further testing, but DC AP, “OWL-inspired” and SPARQL seem good
OWL would probably force us to introduce many “technical” classes & properties
 Scalability
?
Level 2: “Packaging data” expressing scope of constraints – datasets!
 Side requirement: constraints should read less like messing up with the
original semantics of classes and properties
DC AP approach provides better hooks for tying constraints to groups of datasets
Making our requirements clearer
Level 3: sharing and re-use of constraints
 For humans: relative ease of understanding. Europeana has a wide
network of partners, not always tech-savvy.
OWL terms are hard, SPARQL seems low-level (even though it’s not)
 For machines: higher-level expressions of all constraint will help
implementation in different frameworks
XML/Schematron bad at making different levels of expression/implementation clear
Level 2: “Packaging data” expressing scope of constraints – datasets!
 Other organizations (esp. cultural aggregators) could make their own
profiles of EDM, with some constraints in common but not all
Importance of “packaging data”
Thank you!
aisaac@few.vu.nl

Weitere ähnliche Inhalte

Andere mochten auch

Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating eventsRaphael Troncy
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology IndustryAntoine Isaac
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in EuropeanaAntoine Isaac
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)bisg
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Antoine Isaac
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide3 Round Stones
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 

Andere mochten auch (20)

NISO Annual Report of 2012 Activities
NISO Annual Report of 2012 ActivitiesNISO Annual Report of 2012 Activities
NISO Annual Report of 2012 Activities
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating events
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
NISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership MeetingNISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership Meeting
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015
 
NASIG 2014: Actions and Updates on the Standards and Best Practices Front
NASIG 2014: Actions and Updates on the Standards and Best Practices FrontNASIG 2014: Actions and Updates on the Standards and Best Practices Front
NASIG 2014: Actions and Updates on the Standards and Best Practices Front
 
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
 
Uncork Your Licenses
Uncork Your LicensesUncork Your Licenses
Uncork Your Licenses
 
The Infrastructure for Alternative Metrics
The Infrastructure for Alternative MetricsThe Infrastructure for Alternative Metrics
The Infrastructure for Alternative Metrics
 
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
 
Uncork Your Licenses!
Uncork Your Licenses! Uncork Your Licenses!
Uncork Your Licenses!
 
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
 
NISO's Altmetrics Initiative
NISO's Altmetrics InitiativeNISO's Altmetrics Initiative
NISO's Altmetrics Initiative
 
Carpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference PresentationCarpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference Presentation
 
ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015
 

Ähnlich wie Europeana and RDF data validation

Ruby on Rails All Hands Meeting
Ruby on Rails All Hands MeetingRuby on Rails All Hands Meeting
Ruby on Rails All Hands MeetingDan Davis
 
Building an aws sdk for Perl - Granada Perl Workshop 2014
Building an aws sdk for Perl - Granada Perl Workshop 2014Building an aws sdk for Perl - Granada Perl Workshop 2014
Building an aws sdk for Perl - Granada Perl Workshop 2014Jose Luis Martínez
 
Entity Framework
Entity FrameworkEntity Framework
Entity Frameworkvrluckyin
 
Entity framework
Entity frameworkEntity framework
Entity frameworkicubesystem
 
java database connectivity for java programming
java database connectivity for java programmingjava database connectivity for java programming
java database connectivity for java programmingrinky1234
 
Ruby on Rails
Ruby on RailsRuby on Rails
Ruby on RailsDelphiCon
 
Chaos Testing with F# and Azure by Rachel Reese at Codemotion Dubai
Chaos Testing with F# and Azure by Rachel Reese at Codemotion DubaiChaos Testing with F# and Azure by Rachel Reese at Codemotion Dubai
Chaos Testing with F# and Azure by Rachel Reese at Codemotion DubaiCodemotion Dubai
 
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWS
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWSCustomer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWS
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWSAmazon Web Services
 
Asp #2
Asp #2Asp #2
Asp #2Joni
 
Fighting Against Chaotically Separated Values with Embulk
Fighting Against Chaotically Separated Values with EmbulkFighting Against Chaotically Separated Values with Embulk
Fighting Against Chaotically Separated Values with EmbulkSadayuki Furuhashi
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic webMarakana Inc.
 
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMot
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMotDelphi ORM SOA MVC SQL NoSQL JSON REST mORMot
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMotArnaud Bouchez
 
High quality ap is with api platform
High quality ap is with api platformHigh quality ap is with api platform
High quality ap is with api platformNelson Kopliku
 

Ähnlich wie Europeana and RDF data validation (20)

Ruby on Rails All Hands Meeting
Ruby on Rails All Hands MeetingRuby on Rails All Hands Meeting
Ruby on Rails All Hands Meeting
 
Jdbc_ravi_2016
Jdbc_ravi_2016Jdbc_ravi_2016
Jdbc_ravi_2016
 
ODF Mashups
ODF MashupsODF Mashups
ODF Mashups
 
Building an aws sdk for Perl - Granada Perl Workshop 2014
Building an aws sdk for Perl - Granada Perl Workshop 2014Building an aws sdk for Perl - Granada Perl Workshop 2014
Building an aws sdk for Perl - Granada Perl Workshop 2014
 
Entity Framework
Entity FrameworkEntity Framework
Entity Framework
 
Entity framework
Entity frameworkEntity framework
Entity framework
 
RoR guide_p1
RoR guide_p1RoR guide_p1
RoR guide_p1
 
java database connectivity for java programming
java database connectivity for java programmingjava database connectivity for java programming
java database connectivity for java programming
 
Ruby on Rails
Ruby on RailsRuby on Rails
Ruby on Rails
 
Chaos Testing with F# and Azure by Rachel Reese at Codemotion Dubai
Chaos Testing with F# and Azure by Rachel Reese at Codemotion DubaiChaos Testing with F# and Azure by Rachel Reese at Codemotion Dubai
Chaos Testing with F# and Azure by Rachel Reese at Codemotion Dubai
 
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWS
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWSCustomer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWS
Customer Presentaion-LittleBigPlanet: Taking an Idea to Market Using AWS
 
Asp #2
Asp #2Asp #2
Asp #2
 
First Day With J Ruby
First Day With J RubyFirst Day With J Ruby
First Day With J Ruby
 
jdbc
jdbcjdbc
jdbc
 
Fighting Against Chaotically Separated Values with Embulk
Fighting Against Chaotically Separated Values with EmbulkFighting Against Chaotically Separated Values with Embulk
Fighting Against Chaotically Separated Values with Embulk
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic web
 
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMot
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMotDelphi ORM SOA MVC SQL NoSQL JSON REST mORMot
Delphi ORM SOA MVC SQL NoSQL JSON REST mORMot
 
High quality ap is with api platform
High quality ap is with api platformHigh quality ap is with api platform
High quality ap is with api platform
 
New AWS Services
New AWS ServicesNew AWS Services
New AWS Services
 
Sadiq786
Sadiq786Sadiq786
Sadiq786
 

Mehr von Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Antoine Isaac
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaAntoine Isaac
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovationAntoine Isaac
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseAntoine Isaac
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansAntoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana missionAntoine Isaac
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaAntoine Isaac
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Antoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Antoine Isaac
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018Antoine Isaac
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaAntoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesAntoine Isaac
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSAntoine Isaac
 

Mehr von Antoine Isaac (20)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 

Kürzlich hochgeladen

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 

Kürzlich hochgeladen (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

Europeana and RDF data validation

  • 1. Europeana and RDF data validation Antoine Isaac RDF Validation Workshop 10-11 September 2013
  • 2. Data validation on the Europeana Data Model EDM is RDF, but Europeana needs to enforce constraints on the datasets sent by its providers  Matching basic Europeana functional requirements, e.g.: • at most one edm:isShownBy • at most one edm:isShownAt • either edm:isShownBy or edm:isShownAt is mandatory  General data quality, e.g.: • at least a dc:title or a dc:description http://pro.europeana.eu/edm-documentation Note: these constraints are often on re-used classes and properties
  • 3. EDM “Mapping Guidelines”  Template-based instructions for Europeana providers property value type cardinality ore:Aggregation
  • 4. Machine-readable specs by OWL ontology?  We have an OWL version of EDM  But as we know: OWL is good for writing down constraints, not for validation  And in OWL some EDM constraints amount to adding semantics to classes and properties that already exist an ore:Aggregation should have at least 1 edm:isShownAt or 1 edm:isShownBy https://github.com/europeana/corelib/blob/master/c orelib-solr-definitions/src/main/resources/eu/rdf/ (let’s be honest: we were not ready for full RDF/OWL compatibility anyway…)
  • 5. EDM is implemented as XML Schema (for RDF data!) <sequence> […] <element ref="edm:dataProvider" maxOccurs="1" minOccurs="1"/> <element ref="edm:isShownAt" maxOccurs="1" minOccurs="0"/> <element ref="edm:isShownBy" maxOccurs="1" minOccurs="0"/> […] </sequence> With Schematron rules: <sch:pattern> <sch:rule context="ore:Aggregation"> <sch:assert test="edm:isShownAt or edm:isShownBy"> [Error message] </sch:assert> </sch:rule> </sch:pattern> Not ideal of course  Document-centric approach to validation  Extra constraints, especially order of elements  2 constraint systems co-existing Falling back to XML Schema
  • 6. EDM as a Dublin Core application profile? [Cf. Karen and Tom tomorrow] An example in the “Description Set Profiles” constraint language: DescriptionSet [EDM-Providers] Description [Aggregation-Providers] Resource Class ore:Aggregation Statement Property edm:isShownBy edm:isShownAt Min Occurs 1 http://dublincore.org/documents/dc-dsp/
  • 7. Could be converted to other formalisms SPIN: ore:Aggregation spin:constraint [ a sp:Ask ; sp:text """ # either isShownBy or isShownAt must be present ASK WHERE { {?this isShownBy ?image } UNION {?this isShownBy ?page } }""" ] . Stardog ICV: Class: ore:Aggregation SubClassOf: min 1 edm:isShownBy or min 1 edm:isShownAt Issue: still looks like adding general semantics to ore:Aggregation…
  • 8. Making our requirements clearer Level 1: Enabling basic validation  Expressivity for individual constraints Needs further testing, but DC AP, “OWL-inspired” and SPARQL seem good OWL would probably force us to introduce many “technical” classes & properties  Scalability ? Level 2: “Packaging data” expressing scope of constraints – datasets!  Side requirement: constraints should read less like messing up with the original semantics of classes and properties DC AP approach provides better hooks for tying constraints to groups of datasets
  • 9. Making our requirements clearer Level 3: sharing and re-use of constraints  For humans: relative ease of understanding. Europeana has a wide network of partners, not always tech-savvy. OWL terms are hard, SPARQL seems low-level (even though it’s not)  For machines: higher-level expressions of all constraint will help implementation in different frameworks XML/Schematron bad at making different levels of expression/implementation clear Level 2: “Packaging data” expressing scope of constraints – datasets!  Other organizations (esp. cultural aggregators) could make their own profiles of EDM, with some constraints in common but not all Importance of “packaging data”

Hinweis der Redaktion

  1. Workshop page: http://www.w3.org/2012/12/rdf-val/
  2. Technical classes, with URIs or blank nodes
  3. Technical classes, with URIs or blank nodes