SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
FactForge: Data Service or the
Diversity of Inferred Knowledge
            over LOD
   Mariana Damova, PhD, Kiril Simov, Zdravko Tashev, Atanas Kiryakov


                             AIMSA’2012
                           September 2012
Ontotext
   – Top-5 provider of core Semantic Technology
   – Established in year 2000; offices in Bulgaria, UK, USA
   – Active both in research and commercial projects (FP7 funding for 10 years)

• 360° semantic technology – unique portfolio:
   – Semantic Databases: high-performance RDF DBMS, scalable reasoning
   – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR)
   – Web Mining: focused crawling, screen scraping, data fusion
   – Linked Data Management and Data Integration

   Good recognition in the SemTech community
   – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at
     GYM, #3 for “linked data management” at Google

   Several joint ventures and subsidiaries
   – Innovantage: leading online recruitment intelligence provider in UK
Ontotext Clients (selected)

          British Broadcasting Corporation (BBC)
                – Run its World Cup 2010 sites on top of OWLIM
                – Since Mar’12 BBC Sports
                – 2012 Olympics sections are driven
                  by OWLIM and a Concept Extraction service developed by Ontotext
          Press Association (UK)
                – Analysis of Sports news
                – Concept extraction
                – Linked data generation
          Top-3 USA media (not allowed to name)
          The National Archives (UK) contracted Ontotext to implement
          semantic KB and semantic search for the Government Web Archive
          British Museum (UK) Ontotext leads the development of Phase 3 of
          ResearchSpace project on collaborative research in cultural heritage;
          British Museum’s public SPARQL end-point is powered by OWLIM
          de Bibliothek (Holland) aggregation of data from 150 library databases
Semantic Web and Linked Open Data
• Semantic Web
  a set of standards that enable computers to interpret the
  semantics of data on the web
• Linked Open Data
  a set of principles for publishing structured data and interlinking
  them so that they can be browsed in a way HTML pages are
  browsable
               - Use URIs to identify things.
               - Use HTTP URIs so that these things can be referred to and looked up
                 ("dereferenced") by people and user agents.
               - Provide useful information about the thing when its URI is dereferenced,
                 using standard formats such as RDF/XML.
               - Include links to other, related URIs in the exposed data to improve discovery
                 of other related information on the Web.
                                       AIMSA’2012                          September 2012 #4
Linked Open Data cloud


                            2008




                         2011
                     295 datasets                         2009
               more than 30 billion triples




                         AIMSA’2012           July 2011          #5
Linked Open Data is maturing
  LOD cloud grows by billions of triples yearly
Technologies and guidelines about
  how to produce linked data fast
  how to assure their quality
  how to provide vertical oriented data services
                                             LOD2, LATC, baseKB



                            AIMSA’2012             September 2012   #6
This talk is about
       reasoning
               and
                     coping with diversity of the data on the web of data




                                    AIMSA’2012             September 2012   #7
Outline

• FactForge (beta)
• Reference Layer
• Access Modes
• Querying
   – Airports around London
   – US city – a subject of a Novel
   – US city – contactInformation

• Challenges
• Conclusion



                                      AIMSA’2012   September 2012
FactForge (beta)




the largest body of heterogeneous general knowledge on which inference has been performed

– powered by OWLIM 5.2                                           – supporting SPARQL 1.1
                                       AIMSA’2012
                                                                        September 2012
Datasets

                         REASON-ABLE VIEW
                           of LOD datasets
                  Number of explicit statements: 1,686,804,539
                      Implicit statements: 1,264,199,839
                    Retrievable statements: 12,646,674,554


                                                      CIA FactBook
   DBpedia 3.7
                     Freebase
                                     NY Times
                                                                     Lexvo



    Wordnet 3.0          Geonames                                Lingvoj
                                           MusicBrainz




materialization is performed with respect to the semantics of OWL-Horst optimized

                                      AIMSA’2012
                                                                             September 2012
Reference Layer




                                                                   PROTON – light weight upper level ontology
                                                                             ~500 classes, ~150 properties
                                                                   http://www.ontotext.com/proton-ontology

Linking at schema level:
(1) using rdfs:subClassOf and rdfs:subPropertyOf statements;
(2) using OWL expressions where there is a difference in the conceptualization
(3) using inference rules if additional individuals are necessary in the repository to support the mapping

                                                    AIMSA’2012                           September 2012 #11
Access modes

RDF Search - retrieve ranked list of URIs related to literals, which contain specific keywords




                                          AIMSA’2012                      September 2012 #12
Access modes (condt)

 Exploration - traversing the data, one resource at a time




                            AIMSA’2012                 September 2012
Access modes (condt)

    Exploration - traversing the data, one resource at a time,
                   inspecting inferred knowledge


- locatedIn – Bulgaria, Eastern Europe
- Geonames types/FearureCodes (dc:type P.PPL)
- parentFeature – Bulgaria, Europe
-containsLocation – Cherno More Sports Complex,
                      Varna Archeological Museum
- isBirthPlaceOf – Aleksander Kraev, Martin Hristov
…




                                                 AIMSA’2012      September 2012   #14
Access modes (condt)
   Exploration - traversing the data, one resource at a time,
                 inspecting inferred knowledge




- locatedIn - Europe
- subRegionOf - Europe
- hasContactInfo –
       website via Freebase
-containsLocation
- partOf
  …




                                         AIMSA’2012             September 2012   #15
Access modes (condt)

SPARQL endpoint




                       AIMSA’2012   September 2012   #16
Access modes (condt)

RelFinder




                       European Data Forum   September 2012   #17
Querying
Using LOD concepts




 SELECT * WHERE {
  ?Person dbp-ont:birthPlace ?BirthPlace ;
       rdf:type dbp-ont:Politician ;
 ?BirthPlace geo-ont:parentFeature dbpedia:Germany .
 }




Using the intermediary layer




 SELECT * WHERE {
   ?Person prot:birthPlace ?BirthPlace ;
        rdf:type prot:Politicianr ;
   ?BirthPlace prot:subRegionOf dbpedia:Germany .
 }




                                                       AIMSA’2012   September 2012
Find Airports near London

                                   Standard LOD vs. PROTON query
                                   13 vs. 20 results
                                   DBpedia vs. DBpedia and Geonames




                      AIMSA’2012              September 2012   #19
Find airports near London - Results comparison




 Using Geospatial index of OWLIM



                                   AIMSA’2012   September 2012   #20
City – a subject of a science fiction author




                         AIMSA’2012            September 2012   #21
OWLIM 5.0 and SPARQL 1.1

Exemplary queries :
GROUP BY, min
   — Minimal and maximal population counts of European countries
Federated Query between FactForge and LinkedLifeData
    — Drugs that cure the disease from which died Alexandre Graham Bell
Literal index over dates
     – World governors in office between 1980 and 2005
Literal index over digits
     ― European countries with population above 20 MLN
Geospatial index
    — Show the distance from London of airports located at most 50 miles away from it




                                    AIMSA’2012                   September 2012   #22
Challenges and usage

• Clean data
   – Clean up input data

• At model level
   – Contradiction detection
   – Consistency checking

• Curation and upgrading methodology



         FactForge has been used as data layer infrastructure in FP7 projects, like RENDER
         FactForge has been used in tasks of
                   linked data generation from unstructured data,
                   metadata enrichment of structured data
                             providing linkage to the entire LOD cloud
                                     for example The National Archive of UK
                                                  EDAMAM - food recommendation app
                                      AIMSA’2012                  September 2012       #23
Acknowledgements

  Partial funding




Colleagues
Ivan Peikov, Ontotext
Rouslan Velkov, Ontotext
Barry Bishop, Ontotext
Barry Norton, Ontotext
Marin Dimitrov, Ontotext
Alex Simov, Ontotext
Jordan Dichev, Ontotext
Konstantin Penchev, Ontotext

                                            Links
                                            http://ff-dev.ontotext.com
                                            http://www.ontotext.com/owlim
                                            http://www.ontotext.com/factforge
                                            Email:
                                            info@factforge.net
                               AIMSA’2012                September 2012     #24
Thank you for your attention!




mariana.damova@ontotext.com

Weitere ähnliche Inhalte

Was ist angesagt?

Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect matchLinked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect matchChristoph Lange
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-tonvitucci
 
A Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataA Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataKai Eckert
 
The Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataThe Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataRichard Urban
 
Pal gov.tutorial2.session13 3.data integration and fusion using rdf
Pal gov.tutorial2.session13 3.data integration and fusion using rdfPal gov.tutorial2.session13 3.data integration and fusion using rdf
Pal gov.tutorial2.session13 3.data integration and fusion using rdfMustafa Jarrar
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminologyTenforce
 
Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesIJwest
 
Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsVito Ostuni
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In PracticeMarcia Zeng
 
From Exploratory Search to Web Search and back - PIKM 2010
From Exploratory Search to Web Search and back - PIKM 2010From Exploratory Search to Web Search and back - PIKM 2010
From Exploratory Search to Web Search and back - PIKM 2010Roku
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...Marta Villegas
 
Pal gov.tutorial2.session5 2.rdfs_jarrar
Pal gov.tutorial2.session5 2.rdfs_jarrarPal gov.tutorial2.session5 2.rdfs_jarrar
Pal gov.tutorial2.session5 2.rdfs_jarrarMustafa Jarrar
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachhorvadam
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Nikolaos Konstantinou
 
Exposing relational database as rdf
Exposing relational database as rdfExposing relational database as rdf
Exposing relational database as rdfShakil Ahmed
 

Was ist angesagt? (20)

Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect matchLinked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-to
 
A Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataA Unified Approach for Representing Metametadata
A Unified Approach for Representing Metametadata
 
The Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataThe Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked Data
 
Pal gov.tutorial2.session13 3.data integration and fusion using rdf
Pal gov.tutorial2.session13 3.data integration and fusion using rdfPal gov.tutorial2.session13 3.data integration and fusion using rdf
Pal gov.tutorial2.session13 3.data integration and fusion using rdf
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminology
 
Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
 
Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender Systems
 
Dublin Core Intro
Dublin Core IntroDublin Core Intro
Dublin Core Intro
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In Practice
 
From Exploratory Search to Web Search and back - PIKM 2010
From Exploratory Search to Web Search and back - PIKM 2010From Exploratory Search to Web Search and back - PIKM 2010
From Exploratory Search to Web Search and back - PIKM 2010
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
 
Pal gov.tutorial2.session5 2.rdfs_jarrar
Pal gov.tutorial2.session5 2.rdfs_jarrarPal gov.tutorial2.session5 2.rdfs_jarrar
Pal gov.tutorial2.session5 2.rdfs_jarrar
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approach
 
Web Spa
Web SpaWeb Spa
Web Spa
 
Semantic Web in Action
Semantic Web in ActionSemantic Web in Action
Semantic Web in Action
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
 
Exposing relational database as rdf
Exposing relational database as rdfExposing relational database as rdf
Exposing relational database as rdf
 
Jesús Barrasa
Jesús BarrasaJesús Barrasa
Jesús Barrasa
 

Andere mochten auch

Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Mariana Damova, Ph.D
 
A Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic WebA Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic WebMariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 

Andere mochten auch (8)

Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011
 
A Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic WebA Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic Web
 
Europeana datainowlim oct2012
Europeana datainowlim oct2012Europeana datainowlim oct2012
Europeana datainowlim oct2012
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Ontologies Fmi 042010
Ontologies Fmi 042010Ontologies Fmi 042010
Ontologies Fmi 042010
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
 

Ähnlich wie Fact forge aimsa2012

Charleston 2012 - The Future of Serials in a Linked Data World
Charleston 2012 - The Future of Serials in a Linked Data WorldCharleston 2012 - The Future of Serials in a Linked Data World
Charleston 2012 - The Future of Serials in a Linked Data WorldProQuest
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupalemmanuel_jamin
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1manujam
 
Web Data Management in RDF Age
Web Data Management in RDF AgeWeb Data Management in RDF Age
Web Data Management in RDF AgeINRIA-OAK
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3IPLODProject
 
Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big DataMarin Dimitrov
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityunivTope Omitola
 
Linked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale IntegrationLinked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale Integrationrumito
 
Aggregating Social Media for Enhancing Conference Experiences
Aggregating Social Media for Enhancing Conference ExperiencesAggregating Social Media for Enhancing Conference Experiences
Aggregating Social Media for Enhancing Conference ExperiencesHouda khrouf
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data VisualizationLaura Po
 
Linked data driven EPCIS Event-based Traceability across Supply chain busine...
Linked data driven EPCIS Event-based Traceability across  Supply chain busine...Linked data driven EPCIS Event-based Traceability across  Supply chain busine...
Linked data driven EPCIS Event-based Traceability across Supply chain busine...Monika Solanki
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologySteven Miller
 

Ähnlich wie Fact forge aimsa2012 (20)

OpenAIRE schirrwagen
OpenAIRE schirrwagenOpenAIRE schirrwagen
OpenAIRE schirrwagen
 
Charleston 2012 - The Future of Serials in a Linked Data World
Charleston 2012 - The Future of Serials in a Linked Data WorldCharleston 2012 - The Future of Serials in a Linked Data World
Charleston 2012 - The Future of Serials in a Linked Data World
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupal
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
Going for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked MetadataGoing for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked Metadata
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1
 
Lod2
Lod2Lod2
Lod2
 
Web Data Management in RDF Age
Web Data Management in RDF AgeWeb Data Management in RDF Age
Web Data Management in RDF Age
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3
 
LODAC Museum -- Connecting Museums with LOD --
LODAC Museum -- Connecting Museums with LOD --LODAC Museum -- Connecting Museums with LOD --
LODAC Museum -- Connecting Museums with LOD --
 
Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big Data
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityuniv
 
20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
 
Linked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale IntegrationLinked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale Integration
 
Aggregating Social Media for Enhancing Conference Experiences
Aggregating Social Media for Enhancing Conference ExperiencesAggregating Social Media for Enhancing Conference Experiences
Aggregating Social Media for Enhancing Conference Experiences
 
Linked sensor data
Linked sensor dataLinked sensor data
Linked sensor data
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
Linked data driven EPCIS Event-based Traceability across Supply chain busine...
Linked data driven EPCIS Event-based Traceability across  Supply chain busine...Linked data driven EPCIS Event-based Traceability across  Supply chain busine...
Linked data driven EPCIS Event-based Traceability across Supply chain busine...
 
2011 07 keynote-ktw
2011 07 keynote-ktw2011 07 keynote-ktw
2011 07 keynote-ktw
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
 

Mehr von Mariana Damova, Ph.D

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваMariana Damova, Ph.D
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryMariana Damova, Ph.D
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionMariana Damova, Ph.D
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи Mariana Damova, Ph.D
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introductionMariana Damova, Ph.D
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital marketMariana Damova, Ph.D
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 нимMariana Damova, Ph.D
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...Mariana Damova, Ph.D
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологииMariana Damova, Ph.D
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основиMariana Damova, Ph.D
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Mariana Damova, Ph.D
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Mariana Damova, Ph.D
 
National aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaNational aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaMariana Damova, Ph.D
 
National aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaNational aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaMariana Damova, Ph.D
 

Mehr von Mariana Damova, Ph.D (20)

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
 
National aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaNational aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamova
 
National aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamovaNational aggregatorvarna032013 marianadamova
National aggregatorvarna032013 marianadamova
 
Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 

Kürzlich hochgeladen

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 

Fact forge aimsa2012

  • 1. FactForge: Data Service or the Diversity of Inferred Knowledge over LOD Mariana Damova, PhD, Kiril Simov, Zdravko Tashev, Atanas Kiryakov AIMSA’2012 September 2012
  • 2. Ontotext – Top-5 provider of core Semantic Technology – Established in year 2000; offices in Bulgaria, UK, USA – Active both in research and commercial projects (FP7 funding for 10 years) • 360° semantic technology – unique portfolio: – Semantic Databases: high-performance RDF DBMS, scalable reasoning – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR) – Web Mining: focused crawling, screen scraping, data fusion – Linked Data Management and Data Integration Good recognition in the SemTech community – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at GYM, #3 for “linked data management” at Google Several joint ventures and subsidiaries – Innovantage: leading online recruitment intelligence provider in UK
  • 3. Ontotext Clients (selected) British Broadcasting Corporation (BBC) – Run its World Cup 2010 sites on top of OWLIM – Since Mar’12 BBC Sports – 2012 Olympics sections are driven by OWLIM and a Concept Extraction service developed by Ontotext Press Association (UK) – Analysis of Sports news – Concept extraction – Linked data generation Top-3 USA media (not allowed to name) The National Archives (UK) contracted Ontotext to implement semantic KB and semantic search for the Government Web Archive British Museum (UK) Ontotext leads the development of Phase 3 of ResearchSpace project on collaborative research in cultural heritage; British Museum’s public SPARQL end-point is powered by OWLIM de Bibliothek (Holland) aggregation of data from 150 library databases
  • 4. Semantic Web and Linked Open Data • Semantic Web a set of standards that enable computers to interpret the semantics of data on the web • Linked Open Data a set of principles for publishing structured data and interlinking them so that they can be browsed in a way HTML pages are browsable - Use URIs to identify things. - Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents. - Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML. - Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web. AIMSA’2012 September 2012 #4
  • 5. Linked Open Data cloud 2008 2011 295 datasets 2009 more than 30 billion triples AIMSA’2012 July 2011 #5
  • 6. Linked Open Data is maturing LOD cloud grows by billions of triples yearly Technologies and guidelines about how to produce linked data fast how to assure their quality how to provide vertical oriented data services LOD2, LATC, baseKB AIMSA’2012 September 2012 #6
  • 7. This talk is about reasoning and coping with diversity of the data on the web of data AIMSA’2012 September 2012 #7
  • 8. Outline • FactForge (beta) • Reference Layer • Access Modes • Querying – Airports around London – US city – a subject of a Novel – US city – contactInformation • Challenges • Conclusion AIMSA’2012 September 2012
  • 9. FactForge (beta) the largest body of heterogeneous general knowledge on which inference has been performed – powered by OWLIM 5.2 – supporting SPARQL 1.1 AIMSA’2012 September 2012
  • 10. Datasets REASON-ABLE VIEW of LOD datasets Number of explicit statements: 1,686,804,539 Implicit statements: 1,264,199,839 Retrievable statements: 12,646,674,554 CIA FactBook DBpedia 3.7 Freebase NY Times Lexvo Wordnet 3.0 Geonames Lingvoj MusicBrainz materialization is performed with respect to the semantics of OWL-Horst optimized AIMSA’2012 September 2012
  • 11. Reference Layer PROTON – light weight upper level ontology ~500 classes, ~150 properties http://www.ontotext.com/proton-ontology Linking at schema level: (1) using rdfs:subClassOf and rdfs:subPropertyOf statements; (2) using OWL expressions where there is a difference in the conceptualization (3) using inference rules if additional individuals are necessary in the repository to support the mapping AIMSA’2012 September 2012 #11
  • 12. Access modes RDF Search - retrieve ranked list of URIs related to literals, which contain specific keywords AIMSA’2012 September 2012 #12
  • 13. Access modes (condt) Exploration - traversing the data, one resource at a time AIMSA’2012 September 2012
  • 14. Access modes (condt) Exploration - traversing the data, one resource at a time, inspecting inferred knowledge - locatedIn – Bulgaria, Eastern Europe - Geonames types/FearureCodes (dc:type P.PPL) - parentFeature – Bulgaria, Europe -containsLocation – Cherno More Sports Complex, Varna Archeological Museum - isBirthPlaceOf – Aleksander Kraev, Martin Hristov … AIMSA’2012 September 2012 #14
  • 15. Access modes (condt) Exploration - traversing the data, one resource at a time, inspecting inferred knowledge - locatedIn - Europe - subRegionOf - Europe - hasContactInfo – website via Freebase -containsLocation - partOf … AIMSA’2012 September 2012 #15
  • 16. Access modes (condt) SPARQL endpoint AIMSA’2012 September 2012 #16
  • 17. Access modes (condt) RelFinder European Data Forum September 2012 #17
  • 18. Querying Using LOD concepts SELECT * WHERE { ?Person dbp-ont:birthPlace ?BirthPlace ; rdf:type dbp-ont:Politician ; ?BirthPlace geo-ont:parentFeature dbpedia:Germany . } Using the intermediary layer SELECT * WHERE { ?Person prot:birthPlace ?BirthPlace ; rdf:type prot:Politicianr ; ?BirthPlace prot:subRegionOf dbpedia:Germany . } AIMSA’2012 September 2012
  • 19. Find Airports near London Standard LOD vs. PROTON query 13 vs. 20 results DBpedia vs. DBpedia and Geonames AIMSA’2012 September 2012 #19
  • 20. Find airports near London - Results comparison Using Geospatial index of OWLIM AIMSA’2012 September 2012 #20
  • 21. City – a subject of a science fiction author AIMSA’2012 September 2012 #21
  • 22. OWLIM 5.0 and SPARQL 1.1 Exemplary queries : GROUP BY, min — Minimal and maximal population counts of European countries Federated Query between FactForge and LinkedLifeData — Drugs that cure the disease from which died Alexandre Graham Bell Literal index over dates – World governors in office between 1980 and 2005 Literal index over digits ― European countries with population above 20 MLN Geospatial index — Show the distance from London of airports located at most 50 miles away from it AIMSA’2012 September 2012 #22
  • 23. Challenges and usage • Clean data – Clean up input data • At model level – Contradiction detection – Consistency checking • Curation and upgrading methodology FactForge has been used as data layer infrastructure in FP7 projects, like RENDER FactForge has been used in tasks of linked data generation from unstructured data, metadata enrichment of structured data providing linkage to the entire LOD cloud for example The National Archive of UK EDAMAM - food recommendation app AIMSA’2012 September 2012 #23
  • 24. Acknowledgements Partial funding Colleagues Ivan Peikov, Ontotext Rouslan Velkov, Ontotext Barry Bishop, Ontotext Barry Norton, Ontotext Marin Dimitrov, Ontotext Alex Simov, Ontotext Jordan Dichev, Ontotext Konstantin Penchev, Ontotext Links http://ff-dev.ontotext.com http://www.ontotext.com/owlim http://www.ontotext.com/factforge Email: info@factforge.net AIMSA’2012 September 2012 #24
  • 25. Thank you for your attention! mariana.damova@ontotext.com