SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Sigma EE: Reaping low-hanging fruits in RDF-based data integration Richard Cyganiak I-Semantics 2010, Graz
Intro Semantic Technologies conferences In-use Tracks Applications session D2RQ Expose contents of relational databases as RDF/SPARQL Just a format converter; what do people use it for?
The common theme … Integration of data across the organization/project 3 of XYZ
The RDF-based data integration project
The RDF-based data integration project Probably limited budget … Otherwise would buy from SAP or Oracle
Where next after “Hello World”?
Sigma EE Originally not built for enterprise data but for web data Sindice, search engine for the Web of Data Microformats, RDFa, Linked Data on the Web For building apps on top of data search API http://sindice.com/ How to show the richness of all that data? http://sig.ma/
sig.ma demo
Off-the-shelf UI for the RDF Bus
Background The problem: How to provide uniform access to heterogeneous data sources? Value-added services: Search Browsing Recommendations of related items Reporting Dashboarding Notifications …
Solutions? Data Warehousing Enterprise Information Integration Enterprise Search A middle ground in-between?
Data Warehousing, EII Integrate enterprise data sources into a new data source Data Warehouse: materialized (new DB) Enterprise Information Integration: virtual (distrib. queries) Focus on data Tight integration High up-front cost
Enterprise Search Provides the most sought-after service (search) Focus on documents full-text search Lower up-front cost (no schema alignment) Providing value-added services on top is difficult
A middle ground Start by providing access to data on a per-business-object basis without prior schema alignment Services: Browsing of the catalog of objects; search Align, link and reconciliate as required to enable more services, e.g., expressive queries
A middle ground No accepted term yet Data Spaces? Pay-as-you-go Data Integration? Linked Enterprise Data?
The RDF technology stack A standards-based “data-first” approach RDF, SPARQL, OWL – W3C standards Off-the-shelf components Integrates well with web data sources
The “RDF Bus” Various implementation strategies ETL + One Big Triple Store with SPARQL endpoint Several SPARQL endpoint (SPARQL 1.1 SERVICE feature?) Linked Data style (resolvable URIs) Bus details determine what services can be provided Can you do high-performance SPARQL? Can you do full-text search? Real-time up-to-date information or significant delay? Where is alignment handled? Who can hook in new data sources?
Sigma EE Services: search, browsing Strengths Minimal requirements for the RDF bus Strong support for provenance Dynamic UI Bus has to provide Search and Entity descriptions E.g., SPARQL endpoint with full-text search E.g., Solr E.g., Sindice + (part of) the Web E.g., custom Java classes Or multiple of the above
Architecture
Sigma UI Full-text search On-the-fly fuzzy merge of data sources Empower user to evaluate provenance, reject and accept data sources Show/hide/rearrange properties and values Browse to related entities Permalinks, embeddable widgets
Summary Sigma EE: front-end for your RDF Bus E.g., for your triple store Off-the-shelf UI with minimum configuration Available under GPL or other licenses at http://sig.ma/?page=help Running at http://sig.ma/

Weitere ähnliche Inhalte

Was ist angesagt?

BigData-Architecture
BigData-ArchitectureBigData-Architecture
BigData-Architecture
Narayana B
 
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
Shawn Jones
 

Was ist angesagt? (20)

GraphDB Cloud: Enterprise Ready RDF Database on Demand
GraphDB Cloud: Enterprise Ready RDF Database on DemandGraphDB Cloud: Enterprise Ready RDF Database on Demand
GraphDB Cloud: Enterprise Ready RDF Database on Demand
 
Solution architecture for big data projects
Solution architecture for big data projectsSolution architecture for big data projects
Solution architecture for big data projects
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
 
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
 
Manage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repositoryManage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repository
 
Solution architecture
Solution architectureSolution architecture
Solution architecture
 
BigData-Architecture
BigData-ArchitectureBigData-Architecture
BigData-Architecture
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 
Enterprise architecture for big data projects
Enterprise architecture for big data projectsEnterprise architecture for big data projects
Enterprise architecture for big data projects
 
PoolParty Search Server
PoolParty Search ServerPoolParty Search Server
PoolParty Search Server
 
The Kasabi Information Marketplace
The Kasabi Information MarketplaceThe Kasabi Information Marketplace
The Kasabi Information Marketplace
 
Solution Architecture - AWS
Solution Architecture - AWSSolution Architecture - AWS
Solution Architecture - AWS
 
Solution Architecture Cassandra
Solution Architecture CassandraSolution Architecture Cassandra
Solution Architecture Cassandra
 
Ontos NLP Stack, Sep. 2016
Ontos NLP Stack, Sep. 2016Ontos NLP Stack, Sep. 2016
Ontos NLP Stack, Sep. 2016
 
Building, and communicating, a knowledge graph in Zalando
Building, and communicating, a knowledge graph in ZalandoBuilding, and communicating, a knowledge graph in Zalando
Building, and communicating, a knowledge graph in Zalando
 
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
 
Intro to GraphQL
Intro to GraphQLIntro to GraphQL
Intro to GraphQL
 
Building next generation data warehouses
Building next generation data warehousesBuilding next generation data warehouses
Building next generation data warehouses
 
Master Data Management Using AI
Master Data Management Using AIMaster Data Management Using AI
Master Data Management Using AI
 
schema.org, Linked Data's Gateway Drug
schema.org, Linked Data's Gateway Drugschema.org, Linked Data's Gateway Drug
schema.org, Linked Data's Gateway Drug
 

Ähnlich wie Sigma EE: Reaping low-hanging fruits in RDF-based data integration

Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
eswcsummerschool
 

Ähnlich wie Sigma EE: Reaping low-hanging fruits in RDF-based data integration (20)

Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Why rdfa
Why rdfaWhy rdfa
Why rdfa
 
RDFa Introductory Course Session 3/4 Why RDFa
RDFa Introductory Course Session 3/4 Why RDFaRDFa Introductory Course Session 3/4 Why RDFa
RDFa Introductory Course Session 3/4 Why RDFa
 
Talis Platform: A Linked Data Engine
Talis Platform: A Linked Data EngineTalis Platform: A Linked Data Engine
Talis Platform: A Linked Data Engine
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
RDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an updateRDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an update
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
 
Linked Data
Linked DataLinked Data
Linked Data
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
Corrib.org - OpenSource and Research
Corrib.org - OpenSource and ResearchCorrib.org - OpenSource and Research
Corrib.org - OpenSource and Research
 
Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDF
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 
Site Interoperability Projects at DERI Galway's SW Cluster
Site Interoperability Projects at DERI Galway's SW ClusterSite Interoperability Projects at DERI Galway's SW Cluster
Site Interoperability Projects at DERI Galway's SW Cluster
 
Semantic Web, e-commerce
Semantic Web, e-commerceSemantic Web, e-commerce
Semantic Web, e-commerce
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus Platform
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 

Mehr von Richard Cyganiak

EDF2012: The Web of Data and its Five Stars
EDF2012: The Web of Data and its Five StarsEDF2012: The Web of Data and its Five Stars
EDF2012: The Web of Data and its Five Stars
Richard Cyganiak
 
VoID: Metadata for RDF Datasets
VoID: Metadata for RDF DatasetsVoID: Metadata for RDF Datasets
VoID: Metadata for RDF Datasets
Richard Cyganiak
 
How to get your data into Sindice and Google with sitemap4rdf
How to get your data into Sindice and Google with sitemap4rdfHow to get your data into Sindice and Google with sitemap4rdf
How to get your data into Sindice and Google with sitemap4rdf
Richard Cyganiak
 
Self-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and GridworksSelf-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and Gridworks
Richard Cyganiak
 

Mehr von Richard Cyganiak (12)

SHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data MudSHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data Mud
 
What's New in RDF 1.1?
What's New in RDF 1.1?What's New in RDF 1.1?
What's New in RDF 1.1?
 
EDF2012: The Web of Data and its Five Stars
EDF2012: The Web of Data and its Five StarsEDF2012: The Web of Data and its Five Stars
EDF2012: The Web of Data and its Five Stars
 
VoID: Metadata for RDF Datasets
VoID: Metadata for RDF DatasetsVoID: Metadata for RDF Datasets
VoID: Metadata for RDF Datasets
 
Practical Cross-Dataset Queries with SPARQL (Introduction)
Practical Cross-Dataset Queries with SPARQL (Introduction)Practical Cross-Dataset Queries with SPARQL (Introduction)
Practical Cross-Dataset Queries with SPARQL (Introduction)
 
How to Publish Open Data
How to Publish Open DataHow to Publish Open Data
How to Publish Open Data
 
Investigating Community Implementation of the GoodRelations Ontology
Investigating Community Implementation of the GoodRelations OntologyInvestigating Community Implementation of the GoodRelations Ontology
Investigating Community Implementation of the GoodRelations Ontology
 
How to get your data into Sindice and Google with sitemap4rdf
How to get your data into Sindice and Google with sitemap4rdfHow to get your data into Sindice and Google with sitemap4rdf
How to get your data into Sindice and Google with sitemap4rdf
 
Self-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and GridworksSelf-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and Gridworks
 
The State of Linked Government Data
The State of Linked Government DataThe State of Linked Government Data
The State of Linked Government Data
 
What is SDMX-RDF?
What is SDMX-RDF?What is SDMX-RDF?
What is SDMX-RDF?
 
dcat: An RDF vocabulary for interoperability of data catalogues
dcat: An RDF vocabulary for interoperability of data cataloguesdcat: An RDF vocabulary for interoperability of data catalogues
dcat: An RDF vocabulary for interoperability of data catalogues
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Sigma EE: Reaping low-hanging fruits in RDF-based data integration

  • 1. Sigma EE: Reaping low-hanging fruits in RDF-based data integration Richard Cyganiak I-Semantics 2010, Graz
  • 2. Intro Semantic Technologies conferences In-use Tracks Applications session D2RQ Expose contents of relational databases as RDF/SPARQL Just a format converter; what do people use it for?
  • 3. The common theme … Integration of data across the organization/project 3 of XYZ
  • 4. The RDF-based data integration project
  • 5. The RDF-based data integration project Probably limited budget … Otherwise would buy from SAP or Oracle
  • 6. Where next after “Hello World”?
  • 7. Sigma EE Originally not built for enterprise data but for web data Sindice, search engine for the Web of Data Microformats, RDFa, Linked Data on the Web For building apps on top of data search API http://sindice.com/ How to show the richness of all that data? http://sig.ma/
  • 9. Off-the-shelf UI for the RDF Bus
  • 10. Background The problem: How to provide uniform access to heterogeneous data sources? Value-added services: Search Browsing Recommendations of related items Reporting Dashboarding Notifications …
  • 11. Solutions? Data Warehousing Enterprise Information Integration Enterprise Search A middle ground in-between?
  • 12. Data Warehousing, EII Integrate enterprise data sources into a new data source Data Warehouse: materialized (new DB) Enterprise Information Integration: virtual (distrib. queries) Focus on data Tight integration High up-front cost
  • 13. Enterprise Search Provides the most sought-after service (search) Focus on documents full-text search Lower up-front cost (no schema alignment) Providing value-added services on top is difficult
  • 14. A middle ground Start by providing access to data on a per-business-object basis without prior schema alignment Services: Browsing of the catalog of objects; search Align, link and reconciliate as required to enable more services, e.g., expressive queries
  • 15. A middle ground No accepted term yet Data Spaces? Pay-as-you-go Data Integration? Linked Enterprise Data?
  • 16. The RDF technology stack A standards-based “data-first” approach RDF, SPARQL, OWL – W3C standards Off-the-shelf components Integrates well with web data sources
  • 17. The “RDF Bus” Various implementation strategies ETL + One Big Triple Store with SPARQL endpoint Several SPARQL endpoint (SPARQL 1.1 SERVICE feature?) Linked Data style (resolvable URIs) Bus details determine what services can be provided Can you do high-performance SPARQL? Can you do full-text search? Real-time up-to-date information or significant delay? Where is alignment handled? Who can hook in new data sources?
  • 18. Sigma EE Services: search, browsing Strengths Minimal requirements for the RDF bus Strong support for provenance Dynamic UI Bus has to provide Search and Entity descriptions E.g., SPARQL endpoint with full-text search E.g., Solr E.g., Sindice + (part of) the Web E.g., custom Java classes Or multiple of the above
  • 20. Sigma UI Full-text search On-the-fly fuzzy merge of data sources Empower user to evaluate provenance, reject and accept data sources Show/hide/rearrange properties and values Browse to related entities Permalinks, embeddable widgets
  • 21. Summary Sigma EE: front-end for your RDF Bus E.g., for your triple store Off-the-shelf UI with minimum configuration Available under GPL or other licenses at http://sig.ma/?page=help Running at http://sig.ma/