SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Downloaden Sie, um offline zu lesen
iLastic:
Linked Data Generation
Workflow & User Interface
for iMinds Scholarly Data
SAVE-SD 2017
Anastasia Dimou, Gerald Haesendonck, Martin Vanbrabant,
Laurens De Vocht, Ruben Verborgh, Steven Latré, Erik Mannens
Anastasia.Dimou@ugent.be ● @natadimou
Ghent University – IDLab – imec
Publication is archived & published by the
event organizers where it was presented
publisher who publishes the proceedings
authors who co-edited it
organization(s) the authors are affiliated with
Publication
Dimou A. et al. (2015)
Assessing & Refining Mappings to RDF to Improve Dataset Quality
In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015
Lecture Notes in Computer Science, vol 9367. Springer, Cham
Publication is archived & published by the
event ISWC2015
http://iswc2015.semanticweb.org/sites/iswc2015.semanticweb.org/files/93670111.pdf
Publication is archived & published by the
event ISWC2015
publisher LNCS, Springer
https://link.springer.com/chapter/10.1007/978-3-319-25010-6_8
Publication is archived & published by the
event ISWC2015
publisher LNCS, Springer
authors multiple by 8
https://ruben.verborgh.org/publications/dimou_iswc_2015a/
http://jens-lehmann.org/files/2015/iswc_rml_rdfunit.pdf
Publication is archived & published by the
event ISWC2015
publisher LNCS, Springer
authors multiple by 8
organization(s) multiple by 5
https://biblio.ugent.be/publication/8030828
Publication is archived & published 15 times!!
Dimou A. et al. (2015)
Assessing & Refining Mappings to RDF to Improve Dataset Quality
In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015
Lecture Notes in Computer Science, vol 9367. Springer, Cham
Publication is published 15 times...
… if all agents publish its scholarly data as Linked (Open) Data
Publication is published N times...
… if N agents publish its scholarly data as Linked (Open) Data
Linked (Open) Data is generated with N different ways
Semantic Publishing
enhances the meaning of publications
by enriching them with metadata
Semantic Publishing: ad-hoc solutions
different agents own
overlapping or complementary scholarly data
use their own ad-hoc solutions
to generate and publish their own Linked (Open) Data
Semantic Publishing: fragmented datasets
different agents own
overlapping or complementary scholarly data
focus on metadata or content, rarely on both
content annotations are rarely published as datasets
Semantic Publishing: currently leading to..
duplicate efforts for Linked (Open) Data generation:
(re-)implementing from scratch
non-negligible implementation & maintenance costs
Semantic Publishing: current
effort for Linked (Open) Data generation:
implementation & maintenance ↗
Semantic Publishing: our approach
effort for Linked (Open) Data generation:
implementation & maintenance ↘
model, semantic annotations, integration & cleansing ↗
How can we
reduce implementation costs
increase Linked Data quality?
Semantic Publishing: our approach
general-purpose Linked (Open) Data
generation and publication workflow
adjusted to each agent’s scholarly data
integrates metadata & content annotations
Semantic Publishing: iLastic
general-purpose Linked (Open) Data
generation and publication workflow
based on our modular RML tool chain
adjusted to iMinds & Ghent university repository
overlapping and complementary scholarly data
integrates metadata & content annotations
based on the RML tool chain & text enricher alignment
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Workflow
RDF generation & publication service
general purpose tool:
distinct mapping rules definition & execution
Enrichment service
Mapping Module
Processor
Extraction Module
mapping
rules
iLastic Workflow
RDF generation & publication service
general purpose tool:
distinct mapping rules definition & execution
execution: RML Processor
Enrichment service
https://github.com/RMLio/RML-Processor
iLastic Workflow
RDF generation & publication service
general purpose tool:
distinct mapping rules definition & execution
execution: RML Processor
definition
Enrichment service
iLastic Workflow
RDF generation & publication service
general purpose tool:
distinct mapping rules definition & execution
execution: RML Processor
definition: RML language
Enrichment service
A. Dimou et al. (2014) RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data. In
Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), Seoul, Korea.
http://rml.io
iLastic Workflow
RDF generation & publication service
general purpose tool:
distinct mapping rules definition & execution
execution: RML Processor
definition: RML Editor
Enrichment service
Heyvaert P. et al. (2016) RMLEditor: A Graph-Based Mapping Editor for Linked Data Mappings. In The
Semantic Web. Latest Advances and New Domains. ESWC 2016. LNCS, vol 9678. Springer, Cham
https://www.youtube.com/watch?v=0lPDaghlZoQ
iLastic Workflow
RDF generation & publication service
general purpose tool:
execution: RML Processor
definition: RML Editor
validation
Enrichment service
iLastic Workflow
RDF generation & publication service
general purpose tool:
execution: RML Processor
definition: RML Editor
validation: RML Validator
Enrichment service
Dimou A. et al. (2015) Assessing and Refining Mappingsto RDF to Improve Dataset Quality. In: Arenas M. et
al. (eds) The Semantic Web - ISWC 2015. Lecture Notes in Computer Science, vol 9367. Springer, Cham
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Workflow
RDF generation & publication service
Enrichment service
PDF Extraction: CERMINE
http://cermine.ceon.pl/
iLastic Workflow
RDF generation & publication service
Enrichment service
PDF Extraction: CERMINE
NER: DBpedia Spotlight
https://github.com/dbpedia-spotlight/dbpedia-spotlight
iLastic Workflow
RDF generation & publication service
Enrichment service
iLastic Dataset
59,462 entities
12,472 researchers
22,728 publications
81 organizations
3,295 projects
765,603 triples
iLastic Workflow
RDF generation & publication service
data dumps
Linked Data Fragments
Enrichment service
http://linkeddatafragments.org/
iLastic Workflow
RDF generation & publication service
data dumps
Linked Data Fragments
SPARQL endpoint - Virtuoso
Enrichment service
https://github.com/openlink/virtuoso-opensource
iLastic Workflow
RDF generation & publication service
data dumps
Linked Data Fragments
SPARQL endpoint - Virtuoso
The DataTank
Enrichment service
http://thedatatank.com/
iLastic User Interface
iLastic User Interface
iLastic User Interface
iLastic User Interface
https://www.youtube.com/watch?v=ZxGrHnOuSvw
iLastic:
Linked Data Generation
Workflow & User Interface
for iMinds Scholarly Data
Anastasia.Dimou@ugent.be ● @natadimou

Weitere ähnliche Inhalte

Was ist angesagt?

Iterative data discovery and transformation with open refine
Iterative data discovery and transformation with open refineIterative data discovery and transformation with open refine
Iterative data discovery and transformation with open refineMartin Magdinier
 
Linked Data Experiences at Springer Nature
Linked Data Experiences at Springer NatureLinked Data Experiences at Springer Nature
Linked Data Experiences at Springer NatureMichele Pasin
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected Data World
 
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationSigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationRichard Cyganiak
 
Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...Michele Pasin
 
Publishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case studyPublishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case studyOscar Corcho
 
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
 
ODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureMichele Pasin
 
Modeling employees relationships with Apache Spark
Modeling employees relationships with Apache SparkModeling employees relationships with Apache Spark
Modeling employees relationships with Apache SparkWassim TRIFI
 
Scalable Web Data Management using RDF
Scalable Web Data Management using RDF  Scalable Web Data Management using RDF
Scalable Web Data Management using RDF Navid Sedighpour
 
Stream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsStream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsRomanaPernischov
 
Strata sf - Amundsen presentation
Strata sf - Amundsen presentationStrata sf - Amundsen presentation
Strata sf - Amundsen presentationTao Feng
 
Semantika Introduction
Semantika IntroductionSemantika Introduction
Semantika IntroductionJosef Hardi
 
An introduction to multi-model databases
An introduction to multi-model databasesAn introduction to multi-model databases
An introduction to multi-model databasesBerta Hermida Plaza
 
Introduction to Microsoft R Services
Introduction to Microsoft R ServicesIntroduction to Microsoft R Services
Introduction to Microsoft R ServicesGregg Barrett
 
GDBinSV_Meetup_DBMS_Trends_10062016
GDBinSV_Meetup_DBMS_Trends_10062016GDBinSV_Meetup_DBMS_Trends_10062016
GDBinSV_Meetup_DBMS_Trends_10062016Joshua Bae
 
The LINQ Between XML and Database
The LINQ Between XML and DatabaseThe LINQ Between XML and Database
The LINQ Between XML and DatabaseIRJET Journal
 
Graph Analytics for big data
Graph Analytics for big dataGraph Analytics for big data
Graph Analytics for big dataSigmoid
 
GraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesGraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesLinkurious
 
20181123 dn2018 graph_analytics_k_patenge
20181123 dn2018 graph_analytics_k_patenge20181123 dn2018 graph_analytics_k_patenge
20181123 dn2018 graph_analytics_k_patengeKarin Patenge
 

Was ist angesagt? (20)

Iterative data discovery and transformation with open refine
Iterative data discovery and transformation with open refineIterative data discovery and transformation with open refine
Iterative data discovery and transformation with open refine
 
Linked Data Experiences at Springer Nature
Linked Data Experiences at Springer NatureLinked Data Experiences at Springer Nature
Linked Data Experiences at Springer Nature
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scope
 
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationSigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
 
Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...
 
Publishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case studyPublishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case study
 
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
ODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer Nature
 
Modeling employees relationships with Apache Spark
Modeling employees relationships with Apache SparkModeling employees relationships with Apache Spark
Modeling employees relationships with Apache Spark
 
Scalable Web Data Management using RDF
Scalable Web Data Management using RDF  Scalable Web Data Management using RDF
Scalable Web Data Management using RDF
 
Stream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsStream processing: The Matrix Revolutions
Stream processing: The Matrix Revolutions
 
Strata sf - Amundsen presentation
Strata sf - Amundsen presentationStrata sf - Amundsen presentation
Strata sf - Amundsen presentation
 
Semantika Introduction
Semantika IntroductionSemantika Introduction
Semantika Introduction
 
An introduction to multi-model databases
An introduction to multi-model databasesAn introduction to multi-model databases
An introduction to multi-model databases
 
Introduction to Microsoft R Services
Introduction to Microsoft R ServicesIntroduction to Microsoft R Services
Introduction to Microsoft R Services
 
GDBinSV_Meetup_DBMS_Trends_10062016
GDBinSV_Meetup_DBMS_Trends_10062016GDBinSV_Meetup_DBMS_Trends_10062016
GDBinSV_Meetup_DBMS_Trends_10062016
 
The LINQ Between XML and Database
The LINQ Between XML and DatabaseThe LINQ Between XML and Database
The LINQ Between XML and Database
 
Graph Analytics for big data
Graph Analytics for big dataGraph Analytics for big data
Graph Analytics for big data
 
GraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesGraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph Databases
 
20181123 dn2018 graph_analytics_k_patenge
20181123 dn2018 graph_analytics_k_patenge20181123 dn2018 graph_analytics_k_patenge
20181123 dn2018 graph_analytics_k_patenge
 

Ähnlich wie iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.
UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.
UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.tomasknap
 
SEMLIB Final Conference | DERI presentation
SEMLIB Final Conference | DERI presentationSEMLIB Final Conference | DERI presentation
SEMLIB Final Conference | DERI presentationSemLib Project
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKAbhi Jit
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...Gezim Sejdiu
 
Apprendre Via les Objets Xin Chen
Apprendre Via les Objets  Xin ChenApprendre Via les Objets  Xin Chen
Apprendre Via les Objets Xin Chencecilechen85
 
Semantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningSemantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningEditor IJCATR
 
State and future of linked data in learning analytics
State and future of linked data in learning analyticsState and future of linked data in learning analytics
State and future of linked data in learning analyticsMathieu d'Aquin
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTourismFastForward
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityOpen Cyber University of Korea
 
Linda newsletter issue 1 dec2014
Linda newsletter issue 1 dec2014Linda newsletter issue 1 dec2014
Linda newsletter issue 1 dec2014LinDa_FP7
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
Resource Discovery Landscape
Resource Discovery LandscapeResource Discovery Landscape
Resource Discovery LandscapeAndy Powell
 
Linking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process DescriptionsLinking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process DescriptionsChristoph Lange
 
aRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RaRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RGraphRM
 
Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiOllieShoresna
 
Datapedia Analysis Report
Datapedia Analysis ReportDatapedia Analysis Report
Datapedia Analysis ReportAbanoub Amgad
 

Ähnlich wie iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data (20)

UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.
UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.
UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.
 
SEMLIB Final Conference | DERI presentation
SEMLIB Final Conference | DERI presentationSEMLIB Final Conference | DERI presentation
SEMLIB Final Conference | DERI presentation
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
 
Apprendre Via les Objets Xin Chen
Apprendre Via les Objets  Xin ChenApprendre Via les Objets  Xin Chen
Apprendre Via les Objets Xin Chen
 
Semantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningSemantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data Mining
 
State and future of linked data in learning analytics
State and future of linked data in learning analyticsState and future of linked data in learning analytics
State and future of linked data in learning analytics
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
 
Linda newsletter issue 1 dec2014
Linda newsletter issue 1 dec2014Linda newsletter issue 1 dec2014
Linda newsletter issue 1 dec2014
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
Resource Discovery Landscape
Resource Discovery LandscapeResource Discovery Landscape
Resource Discovery Landscape
 
Linking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process DescriptionsLinking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process Descriptions
 
aRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RaRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con R
 
Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wi
 
Datapedia Analysis Report
Datapedia Analysis ReportDatapedia Analysis Report
Datapedia Analysis Report
 

Mehr von andimou

DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessmentandimou
 
Towards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation AdministrationTowards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation Administrationandimou
 
Mappings Validation
Mappings ValidationMappings Validation
Mappings Validationandimou
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Qualityandimou
 
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality andimou
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...andimou
 
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RML
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RMLExtraction and Semantic Annotation of Workshop Proceedings in HTML using RML
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RMLandimou
 
Mapping Hierarchical Sources into RDF using the RML Mapping Language
Mapping Hierarchical Sources into RDF using the RML Mapping LanguageMapping Hierarchical Sources into RDF using the RML Mapping Language
Mapping Hierarchical Sources into RDF using the RML Mapping Languageandimou
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Dataandimou
 
Visualizing the information of a Linked Open Data enabled Research Informatio...
Visualizing the information of a Linked Open Data enabled Research Informatio...Visualizing the information of a Linked Open Data enabled Research Informatio...
Visualizing the information of a Linked Open Data enabled Research Informatio...andimou
 

Mehr von andimou (10)

DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessment
 
Towards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation AdministrationTowards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation Administration
 
Mappings Validation
Mappings ValidationMappings Validation
Mappings Validation
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Quality
 
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
 
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RML
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RMLExtraction and Semantic Annotation of Workshop Proceedings in HTML using RML
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RML
 
Mapping Hierarchical Sources into RDF using the RML Mapping Language
Mapping Hierarchical Sources into RDF using the RML Mapping LanguageMapping Hierarchical Sources into RDF using the RML Mapping Language
Mapping Hierarchical Sources into RDF using the RML Mapping Language
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
 
Visualizing the information of a Linked Open Data enabled Research Informatio...
Visualizing the information of a Linked Open Data enabled Research Informatio...Visualizing the information of a Linked Open Data enabled Research Informatio...
Visualizing the information of a Linked Open Data enabled Research Informatio...
 

Kürzlich hochgeladen

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 

Kürzlich hochgeladen (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 

iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

  • 1. iLastic: Linked Data Generation Workflow & User Interface for iMinds Scholarly Data SAVE-SD 2017 Anastasia Dimou, Gerald Haesendonck, Martin Vanbrabant, Laurens De Vocht, Ruben Verborgh, Steven Latré, Erik Mannens Anastasia.Dimou@ugent.be ● @natadimou Ghent University – IDLab – imec
  • 2. Publication is archived & published by the event organizers where it was presented publisher who publishes the proceedings authors who co-edited it organization(s) the authors are affiliated with
  • 3. Publication Dimou A. et al. (2015) Assessing & Refining Mappings to RDF to Improve Dataset Quality In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015 Lecture Notes in Computer Science, vol 9367. Springer, Cham
  • 4. Publication is archived & published by the event ISWC2015 http://iswc2015.semanticweb.org/sites/iswc2015.semanticweb.org/files/93670111.pdf
  • 5. Publication is archived & published by the event ISWC2015 publisher LNCS, Springer https://link.springer.com/chapter/10.1007/978-3-319-25010-6_8
  • 6. Publication is archived & published by the event ISWC2015 publisher LNCS, Springer authors multiple by 8 https://ruben.verborgh.org/publications/dimou_iswc_2015a/ http://jens-lehmann.org/files/2015/iswc_rml_rdfunit.pdf
  • 7. Publication is archived & published by the event ISWC2015 publisher LNCS, Springer authors multiple by 8 organization(s) multiple by 5 https://biblio.ugent.be/publication/8030828
  • 8. Publication is archived & published 15 times!! Dimou A. et al. (2015) Assessing & Refining Mappings to RDF to Improve Dataset Quality In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015 Lecture Notes in Computer Science, vol 9367. Springer, Cham
  • 9. Publication is published 15 times... … if all agents publish its scholarly data as Linked (Open) Data
  • 10. Publication is published N times... … if N agents publish its scholarly data as Linked (Open) Data Linked (Open) Data is generated with N different ways
  • 11. Semantic Publishing enhances the meaning of publications by enriching them with metadata
  • 12. Semantic Publishing: ad-hoc solutions different agents own overlapping or complementary scholarly data use their own ad-hoc solutions to generate and publish their own Linked (Open) Data
  • 13. Semantic Publishing: fragmented datasets different agents own overlapping or complementary scholarly data focus on metadata or content, rarely on both content annotations are rarely published as datasets
  • 14. Semantic Publishing: currently leading to.. duplicate efforts for Linked (Open) Data generation: (re-)implementing from scratch non-negligible implementation & maintenance costs
  • 15. Semantic Publishing: current effort for Linked (Open) Data generation: implementation & maintenance ↗
  • 16. Semantic Publishing: our approach effort for Linked (Open) Data generation: implementation & maintenance ↘ model, semantic annotations, integration & cleansing ↗
  • 17. How can we reduce implementation costs increase Linked Data quality?
  • 18. Semantic Publishing: our approach general-purpose Linked (Open) Data generation and publication workflow adjusted to each agent’s scholarly data integrates metadata & content annotations
  • 19. Semantic Publishing: iLastic general-purpose Linked (Open) Data generation and publication workflow based on our modular RML tool chain adjusted to iMinds & Ghent university repository overlapping and complementary scholarly data integrates metadata & content annotations based on the RML tool chain & text enricher alignment
  • 20. iLastic Workflow RDF generation & publication service Enrichment service
  • 21.
  • 22.
  • 23. iLastic Workflow RDF generation & publication service Enrichment service
  • 24. iLastic Workflow RDF generation & publication service Enrichment service
  • 25. iLastic Workflow RDF generation & publication service general purpose tool: distinct mapping rules definition & execution Enrichment service
  • 27. iLastic Workflow RDF generation & publication service general purpose tool: distinct mapping rules definition & execution execution: RML Processor Enrichment service https://github.com/RMLio/RML-Processor
  • 28.
  • 29. iLastic Workflow RDF generation & publication service general purpose tool: distinct mapping rules definition & execution execution: RML Processor definition Enrichment service
  • 30. iLastic Workflow RDF generation & publication service general purpose tool: distinct mapping rules definition & execution execution: RML Processor definition: RML language Enrichment service A. Dimou et al. (2014) RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data. In Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), Seoul, Korea. http://rml.io
  • 31. iLastic Workflow RDF generation & publication service general purpose tool: distinct mapping rules definition & execution execution: RML Processor definition: RML Editor Enrichment service Heyvaert P. et al. (2016) RMLEditor: A Graph-Based Mapping Editor for Linked Data Mappings. In The Semantic Web. Latest Advances and New Domains. ESWC 2016. LNCS, vol 9678. Springer, Cham https://www.youtube.com/watch?v=0lPDaghlZoQ
  • 32.
  • 33. iLastic Workflow RDF generation & publication service general purpose tool: execution: RML Processor definition: RML Editor validation Enrichment service
  • 34. iLastic Workflow RDF generation & publication service general purpose tool: execution: RML Processor definition: RML Editor validation: RML Validator Enrichment service Dimou A. et al. (2015) Assessing and Refining Mappingsto RDF to Improve Dataset Quality. In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015. Lecture Notes in Computer Science, vol 9367. Springer, Cham
  • 35.
  • 36. iLastic Workflow RDF generation & publication service Enrichment service
  • 37. iLastic Workflow RDF generation & publication service Enrichment service
  • 38.
  • 39. iLastic Workflow RDF generation & publication service Enrichment service PDF Extraction: CERMINE http://cermine.ceon.pl/
  • 40.
  • 41. iLastic Workflow RDF generation & publication service Enrichment service PDF Extraction: CERMINE NER: DBpedia Spotlight https://github.com/dbpedia-spotlight/dbpedia-spotlight
  • 42.
  • 43. iLastic Workflow RDF generation & publication service Enrichment service
  • 44. iLastic Dataset 59,462 entities 12,472 researchers 22,728 publications 81 organizations 3,295 projects 765,603 triples
  • 45. iLastic Workflow RDF generation & publication service data dumps Linked Data Fragments Enrichment service http://linkeddatafragments.org/
  • 46. iLastic Workflow RDF generation & publication service data dumps Linked Data Fragments SPARQL endpoint - Virtuoso Enrichment service https://github.com/openlink/virtuoso-opensource
  • 47.
  • 48. iLastic Workflow RDF generation & publication service data dumps Linked Data Fragments SPARQL endpoint - Virtuoso The DataTank Enrichment service http://thedatatank.com/
  • 53. iLastic: Linked Data Generation Workflow & User Interface for iMinds Scholarly Data Anastasia.Dimou@ugent.be ● @natadimou