Validata: An online tool for testing RDF data conformance

Alasdair Gray
Alasdair GrayLecturer in Computer Science at Heriot-Watt University um Heriot-Watt University
Validata: An online tool for
testing RDF data conformance
Alasdair J G Gray
A.J.G.Gray@hw.ac.uk
alasdairjggray.co.uk
@gray_alasdair
Acknowledgements
 Co-authors: Heriot-Watt MEng students
 Jacob Baungard Hansent
 Andrew Beveridge
 Roisin Farmer
 Leif Gehrmann
 Sunil Khutan
 Tomas Robertson
 Johnny Val
 W3C HCLS Interest Group
 Eric Prud’hommeaux
9 December 2015 Validata – SWAT4LS 2015 2
Motivation: HCLS Dataset
Descriptions
W3C Interest Group Note
http://www.w3.org/TR/hcls-dataset/
9 December 2015 Validata – SWAT4LS 2015 3
Goal
 Provide an online validation tool
 Deployable on W3C server
 API
 Support multiple constraints
 Enable different requirement levels
 Different levels of user messages:
 Error
 Warning
 Information
9 December 2015 Validata – SWAT4LS 2015 4
Approach: Data Shapes
Shape
9 December 2015 Validata – SWAT4LS 2015 5
Data
<Dataset> rdf:langString
.
✗
<Dataset> {
`MUST` rdf:type (dctypes:Dataset),
`MUST` dct:title rdf:langString,
`MAY` dct:alternative rdf:langString+,
`MUST` !dct:created .
}
Shape Expressions (ShEx)
Shape
9 December 2015 Validata – SWAT4LS 2015 6
<Dataset> rdf:langString
.
✗
Requirement Levels
Shape
9 December 2015 Validata – SWAT4LS 2015 7
<Dataset> rdf:langString
.
✗
<Dataset> {
`MUST` rdf:type (dctypes:Dataset),
`MUST` dct:title rdf:langString,
`MAY` dct:alternative rdf:langString+,
`MUST` !dct:created .
}
http://www.w3.org/2015/03/ShExValidata/
VALIDATA DEMO
Validata
https://github.com/HW-SWeL/Validata
 RDF constraint validation tool
 Shape Expression (ShEx) constraints
 Open source javascript implementation
9 December 2015 Validata – SWAT4LS 2015 9
www.alasdairjggray.co.uk
A.J.G.Gray@hw.ac.uk
@gray_alasdair
1 von 9

Recomendados

The HCLS Community Profile: Describing Datasets, Versions, and Distributions von
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsThe HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsAlasdair Gray
1.5K views17 Folien
Semantic interoperability courses training module 2 - core vocabularies v0.11 von
Semantic interoperability courses   training module 2 - core vocabularies v0.11Semantic interoperability courses   training module 2 - core vocabularies v0.11
Semantic interoperability courses training module 2 - core vocabularies v0.11Semic.eu
7.9K views45 Folien
Introduction to RDF & SPARQL von
Introduction to RDF & SPARQLIntroduction to RDF & SPARQL
Introduction to RDF & SPARQLOpen Data Support
12.2K views43 Folien
Validata: A tool for testing profile conformance von
Validata: A tool for testing profile conformanceValidata: A tool for testing profile conformance
Validata: A tool for testing profile conformanceAlasdair Gray
1.8K views13 Folien
Using a Jupyter Notebook to perform a reproducible scientific analysis over s... von
Using a Jupyter Notebook to perform a reproducible scientific analysis over s...Using a Jupyter Notebook to perform a reproducible scientific analysis over s...
Using a Jupyter Notebook to perform a reproducible scientific analysis over s...Alasdair Gray
1.4K views9 Folien
Bioschemas Community: Developing profiles over Schema.org to make life scienc... von
Bioschemas Community: Developing profiles over Schema.org to make life scienc...Bioschemas Community: Developing profiles over Schema.org to make life scienc...
Bioschemas Community: Developing profiles over Schema.org to make life scienc...Alasdair Gray
1.4K views12 Folien

Más contenido relacionado

Más de Alasdair Gray

Tutorial: Describing Datasets with the Health Care and Life Sciences Communit... von
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Alasdair Gray
1.5K views65 Folien
Open PHACTS: The Data Today von
Open PHACTS: The Data TodayOpen PHACTS: The Data Today
Open PHACTS: The Data TodayAlasdair Gray
2.1K views22 Folien
Project X von
Project XProject X
Project XAlasdair Gray
1.8K views21 Folien
Data Integration in a Big Data Context: An Open PHACTS Case Study von
Data Integration in a Big Data Context: An Open PHACTS Case StudyData Integration in a Big Data Context: An Open PHACTS Case Study
Data Integration in a Big Data Context: An Open PHACTS Case StudyAlasdair Gray
1.7K views30 Folien
Data Integration in a Big Data Context von
Data Integration in a Big Data ContextData Integration in a Big Data Context
Data Integration in a Big Data ContextAlasdair Gray
2.5K views37 Folien
Data Linkage von
Data LinkageData Linkage
Data LinkageAlasdair Gray
1.8K views19 Folien

Más de Alasdair Gray(18)

Tutorial: Describing Datasets with the Health Care and Life Sciences Communit... von Alasdair Gray
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Alasdair Gray1.5K views
Open PHACTS: The Data Today von Alasdair Gray
Open PHACTS: The Data TodayOpen PHACTS: The Data Today
Open PHACTS: The Data Today
Alasdair Gray2.1K views
Data Integration in a Big Data Context: An Open PHACTS Case Study von Alasdair Gray
Data Integration in a Big Data Context: An Open PHACTS Case StudyData Integration in a Big Data Context: An Open PHACTS Case Study
Data Integration in a Big Data Context: An Open PHACTS Case Study
Alasdair Gray1.7K views
Data Integration in a Big Data Context von Alasdair Gray
Data Integration in a Big Data ContextData Integration in a Big Data Context
Data Integration in a Big Data Context
Alasdair Gray2.5K views
Scientific lenses to support multiple views over linked chemistry data von Alasdair Gray
Scientific lenses to support multiple views over linked chemistry dataScientific lenses to support multiple views over linked chemistry data
Scientific lenses to support multiple views over linked chemistry data
Alasdair Gray3.7K views
Scientific Lenses over Linked Data An approach to support multiple integrate... von Alasdair Gray
Scientific Lenses over Linked Data An approach to support multiple integrate...Scientific Lenses over Linked Data An approach to support multiple integrate...
Scientific Lenses over Linked Data An approach to support multiple integrate...
Alasdair Gray935 views
Describing Scientific Datasets: The HCLS Community Profile von Alasdair Gray
Describing Scientific Datasets: The HCLS Community ProfileDescribing Scientific Datasets: The HCLS Community Profile
Describing Scientific Datasets: The HCLS Community Profile
Alasdair Gray1.8K views
Data Science meets Linked Data von Alasdair Gray
Data Science meets Linked DataData Science meets Linked Data
Data Science meets Linked Data
Alasdair Gray1.2K views
Sensors and Big Data for Health and Well-being von Alasdair Gray
Sensors and Big Data for Health and Well-beingSensors and Big Data for Health and Well-being
Sensors and Big Data for Health and Well-being
Alasdair Gray1.4K views
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p... von Alasdair Gray
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Alasdair Gray577 views
Dataset Descriptions in Open PHACTS and HCLS von Alasdair Gray
Dataset Descriptions in Open PHACTS and HCLSDataset Descriptions in Open PHACTS and HCLS
Dataset Descriptions in Open PHACTS and HCLS
Alasdair Gray587 views
Computing Identity Co-Reference Across Drug Discovery Datasets von Alasdair Gray
Computing Identity Co-Reference Across Drug Discovery DatasetsComputing Identity Co-Reference Across Drug Discovery Datasets
Computing Identity Co-Reference Across Drug Discovery Datasets
Alasdair Gray5.2K views
Incorporating Commercial and Private Data into an Open Linked Data Platform f... von Alasdair Gray
Incorporating Commercial and Private Data into an Open Linked Data Platform f...Incorporating Commercial and Private Data into an Open Linked Data Platform f...
Incorporating Commercial and Private Data into an Open Linked Data Platform f...
Alasdair Gray1.3K views
Including Co-Referent URIs in a SPARQL Query von Alasdair Gray
Including Co-Referent URIs in a SPARQL QueryIncluding Co-Referent URIs in a SPARQL Query
Including Co-Referent URIs in a SPARQL Query
Alasdair Gray1.3K views
2013 01-14 ops-dataset_descriptions von Alasdair Gray
2013 01-14 ops-dataset_descriptions2013 01-14 ops-dataset_descriptions
2013 01-14 ops-dataset_descriptions
Alasdair Gray602 views

Último

Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT von
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITShapeBlue
91 views8 Folien
Network Source of Truth and Infrastructure as Code revisited von
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisitedNetwork Automation Forum
42 views45 Folien
PharoJS - Zürich Smalltalk Group Meetup November 2023 von
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023Noury Bouraqadi
141 views17 Folien
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive von
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveNetwork Automation Forum
46 views35 Folien
Five Things You SHOULD Know About Postman von
Five Things You SHOULD Know About PostmanFive Things You SHOULD Know About Postman
Five Things You SHOULD Know About PostmanPostman
40 views43 Folien
Business Analyst Series 2023 - Week 3 Session 5 von
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5DianaGray10
369 views20 Folien

Último(20)

Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT von ShapeBlue
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
ShapeBlue91 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 von Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi141 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive von Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Five Things You SHOULD Know About Postman von Postman
Five Things You SHOULD Know About PostmanFive Things You SHOULD Know About Postman
Five Things You SHOULD Know About Postman
Postman40 views
Business Analyst Series 2023 - Week 3 Session 5 von DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10369 views
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue von ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue85 views
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... von ShapeBlue
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
ShapeBlue82 views
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De... von Moses Kemibaro
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...
Moses Kemibaro29 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue von ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue46 views
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 von IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
HTTP headers that make your website go faster - devs.gent November 2023 von Thijs Feryn
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023
Thijs Feryn28 views
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T von ShapeBlue
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
ShapeBlue56 views
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... von ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue65 views
NTGapps NTG LowCode Platform von Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu141 views
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... von ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue46 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... von ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue74 views
Data Integrity for Banking and Financial Services von Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely56 views
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue von ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
ShapeBlue46 views

Validata: An online tool for testing RDF data conformance

  • 1. Validata: An online tool for testing RDF data conformance Alasdair J G Gray A.J.G.Gray@hw.ac.uk alasdairjggray.co.uk @gray_alasdair
  • 2. Acknowledgements  Co-authors: Heriot-Watt MEng students  Jacob Baungard Hansent  Andrew Beveridge  Roisin Farmer  Leif Gehrmann  Sunil Khutan  Tomas Robertson  Johnny Val  W3C HCLS Interest Group  Eric Prud’hommeaux 9 December 2015 Validata – SWAT4LS 2015 2
  • 3. Motivation: HCLS Dataset Descriptions W3C Interest Group Note http://www.w3.org/TR/hcls-dataset/ 9 December 2015 Validata – SWAT4LS 2015 3
  • 4. Goal  Provide an online validation tool  Deployable on W3C server  API  Support multiple constraints  Enable different requirement levels  Different levels of user messages:  Error  Warning  Information 9 December 2015 Validata – SWAT4LS 2015 4
  • 5. Approach: Data Shapes Shape 9 December 2015 Validata – SWAT4LS 2015 5 Data <Dataset> rdf:langString . ✗
  • 6. <Dataset> { `MUST` rdf:type (dctypes:Dataset), `MUST` dct:title rdf:langString, `MAY` dct:alternative rdf:langString+, `MUST` !dct:created . } Shape Expressions (ShEx) Shape 9 December 2015 Validata – SWAT4LS 2015 6 <Dataset> rdf:langString . ✗
  • 7. Requirement Levels Shape 9 December 2015 Validata – SWAT4LS 2015 7 <Dataset> rdf:langString . ✗ <Dataset> { `MUST` rdf:type (dctypes:Dataset), `MUST` dct:title rdf:langString, `MAY` dct:alternative rdf:langString+, `MUST` !dct:created . }
  • 9. Validata https://github.com/HW-SWeL/Validata  RDF constraint validation tool  Shape Expression (ShEx) constraints  Open source javascript implementation 9 December 2015 Validata – SWAT4LS 2015 9 www.alasdairjggray.co.uk A.J.G.Gray@hw.ac.uk @gray_alasdair

Hinweis der Redaktion

  1. Work grown out of hackathon from 2 years ago
  2. MEng are fifth year UG students Top students Group design and code project
  3. Motivation: how do we check descriptions conform? Summary level: time unchanging information, e.g. name, description, publisher Version level: version specific information, e.g. version number, creator, etc Distribution level: file specific information, e.g. file location and format, number of triples 18 vocabularies: DCTerms, DCAT, VoID, FOAF, … 61 prescribed properties: MUST, SHOULD, MAY, MUST NOT for each level
  4. Link into data publishing pipeline via API Not tied to HCLS, only a motivation No existing tool meets these needs
  5. How do we validate that our example data conforms to a certain shape Express expected shape as ShEx Toy example, what about for real
  6. ShEx: Concise notation regex based
  7. Extended to allow arbitrary hierarchies Toy example, what about for real