SlideShare a Scribd company logo
1 of 15
Download to read offline
Assembling and Applying an
Education Graph based on Learning
    Resources in Universities

  Tom Heath, Ross Singer, Nadeem Shabir,
     Chris Clarke and Justin Leavesley

             Talis Education Ltd

       LiLe2012, Lyon, 17th April 2012
What do we mean by an
             'Education Graph'?
●   The Web is a graph of documents

●   Facebook, LinkedIn, etc. capture elements of a
    'social graph'

●   The Web of Data is one big, heterogeneous graph
    encoded in RDF

●   The 'education graph' is a portion of that graph
    concerned with learning and teaching
Overview

●   Talis Aspire and the institutional sub-graph

●   Applications of a broader education graph

●   Ongoing and Future Work
Talis Aspire Campus Edition
Talis Aspire Campus Edition
●   ~30 customers in the UK and beyond
●   10,000s of reading lists
●   100,000s of learning resources
●   Loads of users every day!

●   Backed by a hosted triplestore
●   Linked Data views available on the public Web
●   A real, live Linked Data application that people pay for
●   (Probably) the most heavily used Linked Data application
    in the education domain
The slightly more technical bits...
From Plain Text to a
               'Biblio-graph-ic' Record
●   Problem
    ●   Only some data is entered in structured form
    ●   Legacy data is typically plain text citations

●   Our Approach
    ●   Pre-process citation text with regex
    ●   Pass through heavily modified version of FreeCite
    ●   Clean output again with regex
    ●   Return as JSON object
    ●   Pass through entity reconciliation process...
Enhancing Data Quality with
             Entity Reconciliation
●   Validate the accuracy of the record by matching
    against high-quality reference data sources

●   Data sources
    ●   OpenLibrary, OpenKB (serials/journals), CrossRef

●   Process
    ●   Books: match on a precise edition
    ●   Articles: enrich the graph describing the resource using
        OpenKB, search CrossRef using enriched description
    ●   Map record to canonical resource
A Happy By-Product




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Unifying the Institutional Sub-Graphs

●   Goal
     ●   Create a cross-institution (portion of) the education
         graph, centred around learning resources

●   Process
     ●   Harvest the data from each Campus Edition triplestore
     ●   Repeat the entity reconciliation process
         –   Retain the mapping of canonical resources to those on
             institutional lists
Applications:
Talis Aspire Community Edition
Applications:
Recommending Learning Resources
Ongoing and Future Work
●   Evaluation of recommendation quality
    ●   Role/importance of list length, list position, list sections,
        section ordering

●   Linked Data-based data warehousing infrastructure
    (for analytics and prototyping)

●   Alternative approaches to triple-storage

●   Integration of other portions of the education graph
Questions?




        Web: talisaspire.com
        Twitter: @talisaspire
YouTube: youtube.com/user/TalisAspire
 Facebook: facebook.com/talisaspire

More Related Content

What's hot

Exploratory querying of the Dutch GeoRegisters
Exploratory querying of the Dutch GeoRegistersExploratory querying of the Dutch GeoRegisters
Exploratory querying of the Dutch GeoRegisters
Stanislav Ronzhin
 

What's hot (18)

Proposal for open government data
Proposal for open government dataProposal for open government data
Proposal for open government data
 
The e-depot for Dutch Archaeology: Archiving and publication of archaeologica...
The e-depot for Dutch Archaeology: Archiving and publication of archaeologica...The e-depot for Dutch Archaeology: Archiving and publication of archaeologica...
The e-depot for Dutch Archaeology: Archiving and publication of archaeologica...
 
Data management and the online e-depot for Dutch Archaeology at DANS
Data management and the online e-depot for Dutch Archaeology at DANSData management and the online e-depot for Dutch Archaeology at DANS
Data management and the online e-depot for Dutch Archaeology at DANS
 
Ariadne: Interoperability
Ariadne: InteroperabilityAriadne: Interoperability
Ariadne: Interoperability
 
Connecting Heterogeneous Collections using Linked Data
Connecting Heterogeneous Collections using Linked DataConnecting Heterogeneous Collections using Linked Data
Connecting Heterogeneous Collections using Linked Data
 
Athanassios Hatzis
Athanassios HatzisAthanassios Hatzis
Athanassios Hatzis
 
ProteomeXchange update
ProteomeXchange updateProteomeXchange update
ProteomeXchange update
 
Open Data in Archaeology, Julian D. Richards
Open Data in Archaeology, Julian D. RichardsOpen Data in Archaeology, Julian D. Richards
Open Data in Archaeology, Julian D. Richards
 
Towards INSPIRE environmental 5* Open Data
Towards INSPIRE environmental 5* Open Data Towards INSPIRE environmental 5* Open Data
Towards INSPIRE environmental 5* Open Data
 
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UKEaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
 
Exploratory querying of the Dutch GeoRegisters
Exploratory querying of the Dutch GeoRegistersExploratory querying of the Dutch GeoRegisters
Exploratory querying of the Dutch GeoRegisters
 
Instutional repositories and data
Instutional repositories and dataInstutional repositories and data
Instutional repositories and data
 
Open data and linked data
Open data and linked dataOpen data and linked data
Open data and linked data
 
2016 SDMX Experts meeting, IMF Implementing SDMX in Low Income and Emerging E...
2016 SDMX Experts meeting, IMF Implementing SDMX in Low Income and Emerging E...2016 SDMX Experts meeting, IMF Implementing SDMX in Low Income and Emerging E...
2016 SDMX Experts meeting, IMF Implementing SDMX in Low Income and Emerging E...
 
Crossing Borders: International Interoperability at the ADS
Crossing Borders: International Interoperability at the ADSCrossing Borders: International Interoperability at the ADS
Crossing Borders: International Interoperability at the ADS
 
Session 03 acquiring data
Session 03 acquiring dataSession 03 acquiring data
Session 03 acquiring data
 
HPC at the University of Michigan: A Multi‐Tenant, Multi‐Science Campus Servi...
HPC at the University of Michigan: A Multi‐Tenant, Multi‐Science Campus Servi...HPC at the University of Michigan: A Multi‐Tenant, Multi‐Science Campus Servi...
HPC at the University of Michigan: A Multi‐Tenant, Multi‐Science Campus Servi...
 
What's So Unique About a Columnar Database?
What's So Unique About a Columnar Database?What's So Unique About a Columnar Database?
What's So Unique About a Columnar Database?
 

Viewers also liked (6)

Яндекс.Почта
Яндекс.ПочтаЯндекс.Почта
Яндекс.Почта
 
Мадонна на Яндекс.Музыке
Мадонна на Яндекс.МузыкеМадонна на Яндекс.Музыке
Мадонна на Яндекс.Музыке
 
Новый год на Яндексе
Новый год на ЯндексеНовый год на Яндексе
Новый год на Яндексе
 
Яндекс.Премьеры
Яндекс.ПремьерыЯндекс.Премьеры
Яндекс.Премьеры
 
Eder sunum fuar_30mayis12
Eder sunum fuar_30mayis12Eder sunum fuar_30mayis12
Eder sunum fuar_30mayis12
 
Myhyv
MyhyvMyhyv
Myhyv
 

Similar to Assembling and Applying an Education Graph based on Learning Resources in Universities

Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
Lucy McKenna
 
Andrew Cox Research data management
Andrew Cox Research data managementAndrew Cox Research data management
Andrew Cox Research data management
Incisive_Events
 

Similar to Assembling and Applying an Education Graph based on Learning Resources in Universities (20)

Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Open University Data
Open University DataOpen University Data
Open University Data
 
Staffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of EdinburghStaffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of Edinburgh
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Data
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Andrew Cox Research data management
Andrew Cox Research data managementAndrew Cox Research data management
Andrew Cox Research data management
 
Academic Innovation Data Showcase 2-14-19
Academic Innovation Data Showcase 2-14-19Academic Innovation Data Showcase 2-14-19
Academic Innovation Data Showcase 2-14-19
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning Strategy
 
Prospect for learning analytics to achieve adaptive learning model
Prospect for learning analytics to achieve  adaptive learning modelProspect for learning analytics to achieve  adaptive learning model
Prospect for learning analytics to achieve adaptive learning model
 
Retooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioRetooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.io
 
Describing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.orgDescribing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.org
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
Relationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusRelationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the Campus
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
IEEE TAG xAPI Webinar Series: Improving the Learner Experience Through an xAP...
IEEE TAG xAPI Webinar Series: Improving the Learner Experience Through an xAP...IEEE TAG xAPI Webinar Series: Improving the Learner Experience Through an xAP...
IEEE TAG xAPI Webinar Series: Improving the Learner Experience Through an xAP...
 
Introduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfIntroduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdf
 
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

Assembling and Applying an Education Graph based on Learning Resources in Universities

  • 1. Assembling and Applying an Education Graph based on Learning Resources in Universities Tom Heath, Ross Singer, Nadeem Shabir, Chris Clarke and Justin Leavesley Talis Education Ltd LiLe2012, Lyon, 17th April 2012
  • 2. What do we mean by an 'Education Graph'? ● The Web is a graph of documents ● Facebook, LinkedIn, etc. capture elements of a 'social graph' ● The Web of Data is one big, heterogeneous graph encoded in RDF ● The 'education graph' is a portion of that graph concerned with learning and teaching
  • 3. Overview ● Talis Aspire and the institutional sub-graph ● Applications of a broader education graph ● Ongoing and Future Work
  • 5. Talis Aspire Campus Edition ● ~30 customers in the UK and beyond ● 10,000s of reading lists ● 100,000s of learning resources ● Loads of users every day! ● Backed by a hosted triplestore ● Linked Data views available on the public Web ● A real, live Linked Data application that people pay for ● (Probably) the most heavily used Linked Data application in the education domain
  • 6. The slightly more technical bits...
  • 7. From Plain Text to a 'Biblio-graph-ic' Record ● Problem ● Only some data is entered in structured form ● Legacy data is typically plain text citations ● Our Approach ● Pre-process citation text with regex ● Pass through heavily modified version of FreeCite ● Clean output again with regex ● Return as JSON object ● Pass through entity reconciliation process...
  • 8. Enhancing Data Quality with Entity Reconciliation ● Validate the accuracy of the record by matching against high-quality reference data sources ● Data sources ● OpenLibrary, OpenKB (serials/journals), CrossRef ● Process ● Books: match on a precise edition ● Articles: enrich the graph describing the resource using OpenKB, search CrossRef using enriched description ● Map record to canonical resource
  • 9. A Happy By-Product Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 10. Unifying the Institutional Sub-Graphs ● Goal ● Create a cross-institution (portion of) the education graph, centred around learning resources ● Process ● Harvest the data from each Campus Edition triplestore ● Repeat the entity reconciliation process – Retain the mapping of canonical resources to those on institutional lists
  • 12.
  • 14. Ongoing and Future Work ● Evaluation of recommendation quality ● Role/importance of list length, list position, list sections, section ordering ● Linked Data-based data warehousing infrastructure (for analytics and prototyping) ● Alternative approaches to triple-storage ● Integration of other portions of the education graph
  • 15. Questions? Web: talisaspire.com Twitter: @talisaspire YouTube: youtube.com/user/TalisAspire Facebook: facebook.com/talisaspire