Open Health Knowledge Graphs
#hdpalooza 2013
Technical Development Track
6/3, 1:30-2pm, Congressional Room
George Thomas, HHS OCIO
agenda (as advertised)
1. This session introduces the business value of Linked Data,
demonstrating how Linked Data enables the network effect
through a process of "collaboration without coordination,"
resulting in the integration of information systems across
disparate open government data publishers.
2. This session will describe healthdata.gov
platform components, including new functionality that
programmatically exposes tabular and graph-oriented data.
3. We will set the context for follow-up sessions that review
recent winning submissions for developer challenges and a
final panel discussion about technical opportunities and
upcoming developer challenges contributing to the
maturation of healthdata.gov as a useful knowledge graph.
1.1 collaboration without coordination
cross-correlation, graph-merging, reconciliation
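As a rough illustration of graph-merging, the sketch below treats each publisher's data as a set of (subject, predicate, object) triples. Every URI and value in it is a made-up placeholder, not healthdata.gov data. It assumes both publishers already use the same HTTP URI for the hospital, which is what lets a plain set union combine their statements with no coordination between them.

```python
# Hypothetical triples from two independent publishers; all URIs are made up.
HOSPITAL = "http://example.org/id/hospital/123"

publisher_a = {
    (HOSPITAL, "http://example.org/def/name", "General Hospital"),
    (HOSPITAL, "http://example.org/def/state", "MD"),
}
publisher_b = {
    (HOSPITAL, "http://example.org/def/name", "General Hospital"),
    (HOSPITAL, "http://example.org/def/bedCount", "250"),
}


def merge_graphs(*graphs):
    """Merge graphs by set union; identical triples collapse into one."""
    merged = set()
    for graph in graphs:
        merged |= graph
    return merged


def describe(graph, subject):
    """Return the sorted (predicate, object) pairs known about one subject."""
    return sorted((p, o) for s, p, o in graph if s == subject)


merged = merge_graphs(publisher_a, publisher_b)
for predicate, value in describe(merged, HOSPITAL):
    print(predicate, value)
```

The shared name triple appears once in the merged graph, and the state and bed count from the two publishers end up attached to the same subject.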
1.2 linked data integration framework
[diagram labels: GKG/Watson/Siri/…, healthdata.gov, HKG, PCAST DEAS, Health Data Actor; Variety, Volume, Velocity]
1.3 achieving liquidity
[diagram labels: refine, value, combine, http uri, provenance, global identity, reconciliation, correlation mapping, ohkg, automation curation]
2.1 these keys are strings, not things
2.2 but these keys are things, not strings
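The contrast between string keys and thing keys might be sketched as follows. The two CSV extracts, their column names, and the example.org URI are all invented for illustration. The sketch assumes each publisher can attach the same HTTP URI to its row.

```python
import csv
import io

# Hypothetical CSV extracts from two publishers; columns and values are made up.
csv_a = "provider,state\nGeneral Hospital,MD\n"
csv_b = "facility_name,st\nGENERAL HOSP.,Maryland\n"

rows_a = list(csv.DictReader(io.StringIO(csv_a)))
rows_b = list(csv.DictReader(io.StringIO(csv_b)))

# String keys: the labels differ, so a plain comparison cannot tell
# that both rows describe the same hospital.
strings_match = rows_a[0]["provider"] == rows_b[0]["facility_name"]

# Thing keys: both publishers key the row by the same (hypothetical) HTTP URI,
# so the rows can be joined without interpreting the labels at all.
BASE = "http://example.org/id/hospital/"
things_a = {BASE + "123": rows_a[0]}
things_b = {BASE + "123": rows_b[0]}
shared = things_a.keys() & things_b.keys()

print(strings_match)
print(sorted(shared))
```

The string comparison fails even though a human reader sees the same hospital, while the URI keys intersect directly.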
3.1 first domain developer challenge
• Metadata
– requests the application of existing voluntary
consensus standards for metadata common to all
open government data
– and invites new designs for health domain specific
metadata to classify datasets in our growing catalog,
creating entities, attributes and relations
– that form the foundations for better discovery,
integration and liquidity.
• results page
3.2 second domain developer challenge
• Mapping, Reconciliation and Correlation
– builds on the Metadata domain challenge
– begins by acknowledging disparate open government publishing
practices
– and seeks the demonstration of an innovative and automated
solution for transforming semi-structured data into structured data,
– reconciles decentralized distributions about the same data entity
against the master identity of an authoritative source,
– and correlates these master identities when multiple authoritative
sources exist,
– enabling the network effect by introducing strong identity resolution
techniques that ease the ability to aggregate different data about
the same entities from independent publishers.
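The reconciliation and correlation steps in this challenge could look roughly like the sketch below. It is not a challenge submission or a healthdata.gov component. The authority tables, URIs, and records are hypothetical, and exact matching on normalized labels stands in for the stronger identity resolution techniques the challenge asks for.

```python
# All identifiers, names, and records below are hypothetical.
AUTHORITY_A = {"general hospital": "http://example.org/a/hospital/123"}
AUTHORITY_B = {"general hospital": "http://example.net/b/facility/9"}


def normalize(label):
    """Lower-case, drop periods, and collapse runs of whitespace."""
    return " ".join(label.lower().replace(".", "").split())


def reconcile(records, authority):
    """Attach the authority's master URI to each record whose name matches."""
    reconciled = []
    for record in records:
        master_id = authority.get(normalize(record["name"]))
        reconciled.append({**record, "id": master_id})
    return reconciled


def correlate(authority_a, authority_b):
    """Pair master URIs from two authorities that share a normalized label."""
    common = authority_a.keys() & authority_b.keys()
    return {authority_a[label]: authority_b[label] for label in common}


records = [
    {"name": "General  Hospital", "beds": 250},
    {"name": "GENERAL HOSPITAL.", "er_visits": 1200},
    {"name": "Unknown Clinic"},
]

reconciled = reconcile(records, AUTHORITY_A)
same_as = correlate(AUTHORITY_A, AUTHORITY_B)
for record in reconciled:
    print(record["name"], "->", record["id"])
print(same_as)
```

Records that resolve to the same master URI can then be aggregated, and the correlation table links that URI to its counterpart at the second authority. Unmatched records keep an id of None rather than being guessed.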
thanks!
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix drm: <http://vocab.data.gov/def/drm#> .
@prefix sdo: <http://schema.org/> .
@prefix vcard: <http://www.w3.org/2006/vcard/ns#> .
@prefix dc: <http://purl.org/dc/terms/> .
<http://hhs.gov/staff/georgethomas#>
    rdf:type drm:DataSteward , sdo:Person ;
    vcard:email "george dot thomas 1 at hhs dot gov" ;
    dc:contributor <healthdata.gov> , <data.gov/semantic> .

Health Datapalooza 2013: Open Health Knowledge Graphs - George Thomas