SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Standardizing scholarly output
Melissa Haendel
haendel@ohsu.edu
@ontowonka
VIVO 2014
Austin
The Research Life Cycle
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
The Research Life Cycle: Funding
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
FundRef
NIH Reporter
ScienCV
Biosketches
The Research Life Cycle: Experiment
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
The Research Life Cycle: Collaborate
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
Expertise
SciTS
Mentoring
Research trending
The Research Life Cycle: Publish
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
University
publishers
Blogs
The Research Life Cycle: Deposit Data
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
Data repositories
Metadata
The Research Life Cycle
EXPERIMENT
COLLABORATE
PUBLISHDEPOSIT DATA
FUND
VIVO-ISF
Goal:
Create a semantic representation of scholarly
activities and products that would enable
identification of potential collaborators,
relevant resources, and expertise across
scientific disciplines
net w o r k
VIVO-ISF Content and modularization
eagle-i
Research resources
VIVO
Person profiling
CTSA ShareCenter
Discussions, requests,
share documents
VIVO-ISF
Person
Contact
Organizations
Affiliations
Roles
Events
Services
Clinical
Expertise
Reagents
Organisms
Credentials
Inclusion or referencing of domain-
specific vocabularies in VIVO-ISF
Either utilize external services with stable URIs (e.g. UMLS) or
import classes/instances
VIVO-ISF for data integration
The Research Life Cycle: Funding
Three harmonization stories
‘s data
Integrating clinical and basic research
expertise data
The Research Life Cycle: Funding
Most collaboration suggestion tools
are based on publication and
sometimes awarded grant data.
But this often misses clinician
collaborators who don’t publish or
write grants much
Collecting and publishing expertise by
connecting clinical and and research
activities and resources
Step 1
Aggregate
Data
Step 2
Map Data to
ISF
Step 4
Publish Linked
Data
Step 3
Compute
Expertise
Step 1
Aggregate
Clinical Data
Step 2
Map Data to
ISF
Step 4
Publish Linked
Data
Step 3
Compute
Expertise
Provider ID ICD Code Value Code Count
Unique Patient
Count Code Label
1234567 552.00 1 1
Unilateral or unspecified femoral
hernia with obstruction (ICD9CM
552.00)
1234567 553.02 8 6
Bilateral femoral hernia without
mention of obstruction or gangrene
(ICD9CM 553.02)
1234567 555.1 4 1
Regional enteritis of large intestine
(ICD9CM 555.1)
1234568 745.12 10 5
Corrected transposition of great
vessels (ICD9CM 745.12)
Aggregate data
Step 1
Aggregate
Clinical Data
Step 2
Map Data to
VIVO-ISF
Step 4
Publish Linked
Data
Step 3
Compute
Expertise
Provider ID ICD Code Value
Code
Count
Unique
Patient
Count Code Label
1234567 552.00 1 1
Unilateral or
unspecified femoral
hernia with obstruction
(ICD9CM 552.00)
1234567 553.02 8 6
Bilateral femoral hernia
without mention of
obstruction or gangrene
(ICD9CM 553.02)
1234567 555.1 4 1
Regional enteritis of
large intestine (ICD9CM
555.1)
1234568 745.12 10 5
Corrected transposition
of great vessels
(ICD9CM 745.12)
Aggregated
Clinical Data
VIVO-ISF
RDF
triples
Java scripts
OWL API
Map Data to VIVO-ISF
Step 1
Aggregate
Clinical Data
Step 2
Map Data to
ISF
Step 4
Publish Linked
Data
Step 3
Compute
Expertise
Compute Expertise
Step 1
Aggregate
Clinical Data
Step 2
Map Data to
ISF
Step 4
Publish Linked
Data
Step 3
Compute
Expertise
Linked Data
cloud
SPARQL
Endpoints
OtherAPIs
…
Triple Stores
Several means
to access and
query data
Publish Linked data
Integrating public and private research
profile data
The Research Life Cycle: Funding
Most collaboration suggestion tools
are based on publication and
sometimes awarded grant data.
But this is old news for Research
Administration who wants to plan for
what is happening at their institution
NOW.
=> Clinical and Translational Activity Reporting tool (CTAR)
Clinical and Translational Activity
Reporting tool
The Research Life Cycle: Funding
Funding
proposals
Grants &
awards
Publications People Institutions
IRB
protocols
Clinical and Translational Activity
Reporting tool
The Research Life Cycle: Funding
See Robin Champieux and our poster entitled:
Ferrets Ontology
Ferrets
OR
Ontology
=> At inter-institutional
level can see interaction
between previously
unconnected groups via
intervening persons/groups
at another institution
Integrating research data across
institutions
David Eichmann
http://research.icts.uiowa.edu/polyglot/
Integrating data from 40+ institutions
VIVO, SciVal, LOKI, Profiles, etc.
Mapping all the classes and properties to VIVO-ISF and making the integrated data
set available
Classes from:
VIVO sites: 480 unique classes
Profile sites: 31 unique classes
Domains:
vivoweb.org
purl.org
www.w3.org
xmlns.com
www.findanexpert.unimelb.edu.au
vivo.libr.tue.nl
purl.obolibrary.org
griffith.edu.au
Etc.....
Integrating research data across
institutions
Mapping predicates
http://vivoweb.org/ontology/core#hasSubjectArea
8455029
http://vivoweb.org/ontology/core#authorInAuthorship
1444239
http://orng.info/ontology/orng#hasYouTube
402
Also helps us understand what
extensions exist that should be
implmeneted centrally
Integrating data from different
profiling systems
The Research Life Cycle: Funding
What kinds of questions can we answer?
Who in the southeast has expertise in sleep and does work on
mice?
How much collaboration goes on intra versus inter-
institutionally based upon all scholarly activities and products?
How can we identify external advisors for an interdisciplinary
training program?
What gaps exist in research funding topics across institutions
that an institutions may have expertise in?
@ontowonka #vivoisf – tweet me your ideas
 We can profile people based on the diversity of their
activities and products of research
 VIVO-ISF can be used as a standard to integrate research
profiling and scholarly contributions across different
domains, sources, and systems
 Applications such as VIVO, eagle-i, LOKI, Profiles, SciVal/Pure,
Symplectic, and ScienCV can exchange data using VIVO-ISF
 Realizing these goals is the result of wide community
participation and feedback (THANK YOU!)
And… the moral(s) of the stories are:
Working with others
We have an opportunity to engage other communities.
Some new activities:
 HCLS W3C dataset working group working to describe roles and relationships
between people and data (e.g. producer, curator, maintainer, analysis, etc.)
 CASRAI-XI contributor roles WG defining roles for people on publications
 Converis and CASRAI effort to evaluate how to best use VIVO-ISF to aid CV
creation and provide content back to the institutions (and beyond).
 ScienCV data model alignment to support data integration
 Integration of research data with biological data in the Monarch Initiative and
the Neuroscience Information Framework
What are some other opportunities for VIVO-ISF to aid data
integration?

Weitere ähnliche Inhalte

Was ist angesagt?

NIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery IndexNIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery IndexSusanna-Assunta Sansone
 
THOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEATHOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEAMaaike Duine
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierMaaike Duine
 
Collaboration Through Interoperability: FundRef and Other Metadata
Collaboration Through Interoperability: FundRef and Other Metadata Collaboration Through Interoperability: FundRef and Other Metadata
Collaboration Through Interoperability: FundRef and Other Metadata Crossref
 
Collaboration Through Interoperability
Collaboration Through InteroperabilityCollaboration Through Interoperability
Collaboration Through InteroperabilityCarol Anne Meyer
 
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...ASIS&T
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...FAIRDOM
 
THOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSTHOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSMaaike Duine
 
COAR: SHARE UPDATE
COAR: SHARE UPDATECOAR: SHARE UPDATE
COAR: SHARE UPDATECASRAI
 
Funding data for research
Funding data for researchFunding data for research
Funding data for researchCrossref
 
Managing and sharing confidential data in Australian social science
Managing and sharing confidential data	in Australian social scienceManaging and sharing confidential data	in Australian social science
Managing and sharing confidential data in Australian social scienceARDC
 
Article Impact (ALM and altmetrics)
Article Impact (ALM and altmetrics)Article Impact (ALM and altmetrics)
Article Impact (ALM and altmetrics)Richard Cave
 
Why, What & How: The role of ORCID in Research Management (M. Buys)
Why, What & How: The role of ORCID in Research Management (M. Buys)Why, What & How: The role of ORCID in Research Management (M. Buys)
Why, What & How: The role of ORCID in Research Management (M. Buys)ORCID, Inc
 
Your Work is Distinctive: What About Your Name? (M. Buys)
Your Work is Distinctive: What About Your Name? (M. Buys)Your Work is Distinctive: What About Your Name? (M. Buys)
Your Work is Distinctive: What About Your Name? (M. Buys)ORCID, Inc
 
THOR Workshop - Data Publishing
THOR Workshop - Data PublishingTHOR Workshop - Data Publishing
THOR Workshop - Data PublishingMaaike Duine
 
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...Crossref
 

Was ist angesagt? (20)

NIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery IndexNIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery Index
 
THOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEATHOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEA
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
Collaboration Through Interoperability: FundRef and Other Metadata
Collaboration Through Interoperability: FundRef and Other Metadata Collaboration Through Interoperability: FundRef and Other Metadata
Collaboration Through Interoperability: FundRef and Other Metadata
 
Collaboration Through Interoperability
Collaboration Through InteroperabilityCollaboration Through Interoperability
Collaboration Through Interoperability
 
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
 
Attribution From Res Lib Perspective - Micah Altman, MIT
Attribution From Res Lib Perspective - Micah Altman, MITAttribution From Res Lib Perspective - Micah Altman, MIT
Attribution From Res Lib Perspective - Micah Altman, MIT
 
THOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSTHOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOS
 
Citation Metrics
Citation MetricsCitation Metrics
Citation Metrics
 
Erdmann apr28-2
Erdmann apr28-2Erdmann apr28-2
Erdmann apr28-2
 
Carpenter - Privacy Implications Research Data - Intro
Carpenter - Privacy Implications Research Data - IntroCarpenter - Privacy Implications Research Data - Intro
Carpenter - Privacy Implications Research Data - Intro
 
COAR: SHARE UPDATE
COAR: SHARE UPDATECOAR: SHARE UPDATE
COAR: SHARE UPDATE
 
Funding data for research
Funding data for researchFunding data for research
Funding data for research
 
Managing and sharing confidential data in Australian social science
Managing and sharing confidential data	in Australian social scienceManaging and sharing confidential data	in Australian social science
Managing and sharing confidential data in Australian social science
 
Article Impact (ALM and altmetrics)
Article Impact (ALM and altmetrics)Article Impact (ALM and altmetrics)
Article Impact (ALM and altmetrics)
 
Why, What & How: The role of ORCID in Research Management (M. Buys)
Why, What & How: The role of ORCID in Research Management (M. Buys)Why, What & How: The role of ORCID in Research Management (M. Buys)
Why, What & How: The role of ORCID in Research Management (M. Buys)
 
Your Work is Distinctive: What About Your Name? (M. Buys)
Your Work is Distinctive: What About Your Name? (M. Buys)Your Work is Distinctive: What About Your Name? (M. Buys)
Your Work is Distinctive: What About Your Name? (M. Buys)
 
THOR Workshop - Data Publishing
THOR Workshop - Data PublishingTHOR Workshop - Data Publishing
THOR Workshop - Data Publishing
 
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...
2013 CrossRef Annual Meeting, How CrossRef has Accelerated Science and Its Pr...
 

Ähnlich wie Standardizing scholarly output with the VIVO ontology

Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Robert H. McDonald
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...DuraSpace
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARECASRAI
 
SHARE Update for CNI, Fall 2014
SHARE Update for CNI, Fall 2014SHARE Update for CNI, Fall 2014
SHARE Update for CNI, Fall 2014SHARE
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFSimeon Warner
 
VIVO: enabling the discovery of research and scholarship
VIVO: enabling the discovery of research and scholarshipVIVO: enabling the discovery of research and scholarship
VIVO: enabling the discovery of research and scholarshipPaul Albert
 
Project Credit: Melissa Haendel - On the Nature of Credit
Project Credit: Melissa Haendel - On the Nature of CreditProject Credit: Melissa Haendel - On the Nature of Credit
Project Credit: Melissa Haendel - On the Nature of CreditCASRAI
 
On the nature of Credit
On the nature of CreditOn the nature of Credit
On the nature of Creditmhaendel
 
5-14-13 An Introduction to VIVO Presentation Slides
5-14-13 An Introduction to VIVO Presentation Slides5-14-13 An Introduction to VIVO Presentation Slides
5-14-13 An Introduction to VIVO Presentation SlidesDuraSpace
 
Integrating ORCID, Funding, and Institutional Identifiers
Integrating ORCID, Funding, and Institutional IdentifiersIntegrating ORCID, Funding, and Institutional Identifiers
Integrating ORCID, Funding, and Institutional Identifiers Micah Altman
 
Starting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryStarting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryVioleta Ilik
 
Tracking research and research systems
Tracking research and research systemsTracking research and research systems
Tracking research and research systemsJisc
 
Persistent Identifiers in Research Management: People, Places and Things
Persistent Identifiers in Research Management: People, Places and ThingsPersistent Identifiers in Research Management: People, Places and Things
Persistent Identifiers in Research Management: People, Places and ThingsORCID, Inc
 
Overview of open access progress globally
Overview of open access progress globallyOverview of open access progress globally
Overview of open access progress globallyIryna Kuchma
 
12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”DuraSpace
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...DeVonne Parks, CEM
 

Ähnlich wie Standardizing scholarly output with the VIVO ontology (20)

Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
SHARE Update for CNI, Fall 2014
SHARE Update for CNI, Fall 2014SHARE Update for CNI, Fall 2014
SHARE Update for CNI, Fall 2014
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
 
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
 
VIVO: enabling the discovery of research and scholarship
VIVO: enabling the discovery of research and scholarshipVIVO: enabling the discovery of research and scholarship
VIVO: enabling the discovery of research and scholarship
 
Project Credit: Melissa Haendel - On the Nature of Credit
Project Credit: Melissa Haendel - On the Nature of CreditProject Credit: Melissa Haendel - On the Nature of Credit
Project Credit: Melissa Haendel - On the Nature of Credit
 
On the nature of Credit
On the nature of CreditOn the nature of Credit
On the nature of Credit
 
5-14-13 An Introduction to VIVO Presentation Slides
5-14-13 An Introduction to VIVO Presentation Slides5-14-13 An Introduction to VIVO Presentation Slides
5-14-13 An Introduction to VIVO Presentation Slides
 
Integrating ORCID, Funding, and Institutional Identifiers
Integrating ORCID, Funding, and Institutional IdentifiersIntegrating ORCID, Funding, and Institutional Identifiers
Integrating ORCID, Funding, and Institutional Identifiers
 
Starting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryStarting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repository
 
Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"
 
Tracking research and research systems
Tracking research and research systemsTracking research and research systems
Tracking research and research systems
 
Persistent Identifiers in Research Management: People, Places and Things
Persistent Identifiers in Research Management: People, Places and ThingsPersistent Identifiers in Research Management: People, Places and Things
Persistent Identifiers in Research Management: People, Places and Things
 
Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
Overview of open access progress globally
Overview of open access progress globallyOverview of open access progress globally
Overview of open access progress globally
 
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
 
12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
 

Mehr von mhaendel

Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...mhaendel
 
Semantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discoverySemantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discoverymhaendel
 
The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA mhaendel
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholdermhaendel
 
Building (and traveling) the data-brick road: A report from the front lines ...
Building (and traveling) the data-brick road:  A report from the front lines ...Building (and traveling) the data-brick road:  A report from the front lines ...
Building (and traveling) the data-brick road: A report from the front lines ...mhaendel
 
GA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project IntroductionGA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project Introductionmhaendel
 
GA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team updateGA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team updatemhaendel
 
Reusable data for biomedicine: A data licensing odyssey
Reusable data for biomedicine:  A data licensing odysseyReusable data for biomedicine:  A data licensing odyssey
Reusable data for biomedicine: A data licensing odysseymhaendel
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discoverymhaendel
 
Global phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoveryGlobal phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoverymhaendel
 
How open is open? An evaluation rubric for public knowledgebases
How open is open?  An evaluation rubric for public knowledgebasesHow open is open?  An evaluation rubric for public knowledgebases
How open is open? An evaluation rubric for public knowledgebasesmhaendel
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...mhaendel
 
Science in the open, what does it take?
Science in the open, what does it take?Science in the open, what does it take?
Science in the open, what does it take?mhaendel
 
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...mhaendel
 
Phenopackets as applied to variant interpretation
Phenopackets as applied to variant interpretation Phenopackets as applied to variant interpretation
Phenopackets as applied to variant interpretation mhaendel
 
Credit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributionsCredit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributionsmhaendel
 
Deep phenotyping for everyone
Deep phenotyping for everyoneDeep phenotyping for everyone
Deep phenotyping for everyonemhaendel
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be onemhaendel
 
On the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integrationOn the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integrationmhaendel
 
The Monarch Initiative: A semantic phenomics approach to disease discovery
The Monarch Initiative: A semantic phenomics approach to disease discoveryThe Monarch Initiative: A semantic phenomics approach to disease discovery
The Monarch Initiative: A semantic phenomics approach to disease discoverymhaendel
 

Mehr von mhaendel (20)

Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
 
Semantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discoverySemantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discovery
 
The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Building (and traveling) the data-brick road: A report from the front lines ...
Building (and traveling) the data-brick road:  A report from the front lines ...Building (and traveling) the data-brick road:  A report from the front lines ...
Building (and traveling) the data-brick road: A report from the front lines ...
 
GA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project IntroductionGA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project Introduction
 
GA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team updateGA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team update
 
Reusable data for biomedicine: A data licensing odyssey
Reusable data for biomedicine:  A data licensing odysseyReusable data for biomedicine:  A data licensing odyssey
Reusable data for biomedicine: A data licensing odyssey
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
 
Global phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoveryGlobal phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discovery
 
How open is open? An evaluation rubric for public knowledgebases
How open is open?  An evaluation rubric for public knowledgebasesHow open is open?  An evaluation rubric for public knowledgebases
How open is open? An evaluation rubric for public knowledgebases
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...
 
Science in the open, what does it take?
Science in the open, what does it take?Science in the open, what does it take?
Science in the open, what does it take?
 
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...
Global Phenotypic Data Sharing Standards to Maximize Diagnostics and Mechanis...
 
Phenopackets as applied to variant interpretation
Phenopackets as applied to variant interpretation Phenopackets as applied to variant interpretation
Phenopackets as applied to variant interpretation
 
Credit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributionsCredit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributions
 
Deep phenotyping for everyone
Deep phenotyping for everyoneDeep phenotyping for everyone
Deep phenotyping for everyone
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
 
On the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integrationOn the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integration
 
The Monarch Initiative: A semantic phenomics approach to disease discovery
The Monarch Initiative: A semantic phenomics approach to disease discoveryThe Monarch Initiative: A semantic phenomics approach to disease discovery
The Monarch Initiative: A semantic phenomics approach to disease discovery
 

Standardizing scholarly output with the VIVO ontology

  • 1. Standardizing scholarly output Melissa Haendel haendel@ohsu.edu @ontowonka VIVO 2014 Austin
  • 2. The Research Life Cycle EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND
  • 3. The Research Life Cycle: Funding EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND FundRef NIH Reporter ScienCV Biosketches
  • 4. The Research Life Cycle: Experiment EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND
  • 5. The Research Life Cycle: Collaborate EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND Expertise SciTS Mentoring Research trending
  • 6. The Research Life Cycle: Publish EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND University publishers Blogs
  • 7. The Research Life Cycle: Deposit Data EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND Data repositories Metadata
  • 8. The Research Life Cycle EXPERIMENT COLLABORATE PUBLISHDEPOSIT DATA FUND VIVO-ISF
  • 9. Goal: Create a semantic representation of scholarly activities and products that would enable identification of potential collaborators, relevant resources, and expertise across scientific disciplines net w o r k
  • 10. VIVO-ISF Content and modularization eagle-i Research resources VIVO Person profiling CTSA ShareCenter Discussions, requests, share documents VIVO-ISF Person Contact Organizations Affiliations Roles Events Services Clinical Expertise Reagents Organisms Credentials
  • 11. Inclusion or referencing of domain- specific vocabularies in VIVO-ISF Either utilize external services with stable URIs (e.g. UMLS) or import classes/instances
  • 12. VIVO-ISF for data integration The Research Life Cycle: Funding Three harmonization stories ‘s data
  • 13. Integrating clinical and basic research expertise data The Research Life Cycle: Funding Most collaboration suggestion tools are based on publication and sometimes awarded grant data. But this often misses clinician collaborators who don’t publish or write grants much
  • 14. Collecting and publishing expertise by connecting clinical and and research activities and resources Step 1 Aggregate Data Step 2 Map Data to ISF Step 4 Publish Linked Data Step 3 Compute Expertise
  • 15. Step 1 Aggregate Clinical Data Step 2 Map Data to ISF Step 4 Publish Linked Data Step 3 Compute Expertise Provider ID ICD Code Value Code Count Unique Patient Count Code Label 1234567 552.00 1 1 Unilateral or unspecified femoral hernia with obstruction (ICD9CM 552.00) 1234567 553.02 8 6 Bilateral femoral hernia without mention of obstruction or gangrene (ICD9CM 553.02) 1234567 555.1 4 1 Regional enteritis of large intestine (ICD9CM 555.1) 1234568 745.12 10 5 Corrected transposition of great vessels (ICD9CM 745.12) Aggregate data
  • 16. Step 1 Aggregate Clinical Data Step 2 Map Data to VIVO-ISF Step 4 Publish Linked Data Step 3 Compute Expertise Provider ID ICD Code Value Code Count Unique Patient Count Code Label 1234567 552.00 1 1 Unilateral or unspecified femoral hernia with obstruction (ICD9CM 552.00) 1234567 553.02 8 6 Bilateral femoral hernia without mention of obstruction or gangrene (ICD9CM 553.02) 1234567 555.1 4 1 Regional enteritis of large intestine (ICD9CM 555.1) 1234568 745.12 10 5 Corrected transposition of great vessels (ICD9CM 745.12) Aggregated Clinical Data VIVO-ISF RDF triples Java scripts OWL API Map Data to VIVO-ISF
  • 17. Step 1 Aggregate Clinical Data Step 2 Map Data to ISF Step 4 Publish Linked Data Step 3 Compute Expertise Compute Expertise
  • 18. Step 1 Aggregate Clinical Data Step 2 Map Data to ISF Step 4 Publish Linked Data Step 3 Compute Expertise Linked Data cloud SPARQL Endpoints OtherAPIs … Triple Stores Several means to access and query data Publish Linked data
  • 19. Integrating public and private research profile data The Research Life Cycle: Funding Most collaboration suggestion tools are based on publication and sometimes awarded grant data. But this is old news for Research Administration who wants to plan for what is happening at their institution NOW. => Clinical and Translational Activity Reporting tool (CTAR)
  • 20. Clinical and Translational Activity Reporting tool The Research Life Cycle: Funding Funding proposals Grants & awards Publications People Institutions IRB protocols
  • 21. Clinical and Translational Activity Reporting tool The Research Life Cycle: Funding See Robin Champieux and our poster entitled:
  • 22. Ferrets Ontology Ferrets OR Ontology => At inter-institutional level can see interaction between previously unconnected groups via intervening persons/groups at another institution Integrating research data across institutions David Eichmann http://research.icts.uiowa.edu/polyglot/
  • 23. Integrating data from 40+ institutions VIVO, SciVal, LOKI, Profiles, etc. Mapping all the classes and properties to VIVO-ISF and making the integrated data set available Classes from: VIVO sites: 480 unique classes Profile sites: 31 unique classes Domains: vivoweb.org purl.org www.w3.org xmlns.com www.findanexpert.unimelb.edu.au vivo.libr.tue.nl purl.obolibrary.org griffith.edu.au Etc..... Integrating research data across institutions Mapping predicates http://vivoweb.org/ontology/core#hasSubjectArea 8455029 http://vivoweb.org/ontology/core#authorInAuthorship 1444239 http://orng.info/ontology/orng#hasYouTube 402 Also helps us understand what extensions exist that should be implmeneted centrally
  • 24. Integrating data from different profiling systems The Research Life Cycle: Funding What kinds of questions can we answer? Who in the southeast has expertise in sleep and does work on mice? How much collaboration goes on intra versus inter- institutionally based upon all scholarly activities and products? How can we identify external advisors for an interdisciplinary training program? What gaps exist in research funding topics across institutions that an institutions may have expertise in? @ontowonka #vivoisf – tweet me your ideas
  • 25.  We can profile people based on the diversity of their activities and products of research  VIVO-ISF can be used as a standard to integrate research profiling and scholarly contributions across different domains, sources, and systems  Applications such as VIVO, eagle-i, LOKI, Profiles, SciVal/Pure, Symplectic, and ScienCV can exchange data using VIVO-ISF  Realizing these goals is the result of wide community participation and feedback (THANK YOU!) And… the moral(s) of the stories are:
  • 26. Working with others We have an opportunity to engage other communities. Some new activities:  HCLS W3C dataset working group working to describe roles and relationships between people and data (e.g. producer, curator, maintainer, analysis, etc.)  CASRAI-XI contributor roles WG defining roles for people on publications  Converis and CASRAI effort to evaluate how to best use VIVO-ISF to aid CV creation and provide content back to the institutions (and beyond).  ScienCV data model alignment to support data integration  Integration of research data with biological data in the Monarch Initiative and the Neuroscience Information Framework What are some other opportunities for VIVO-ISF to aid data integration?

Hinweis der Redaktion

  1. This shows the use-cases for URIs that don’t fall under the typical OWL class/individual modeling of data. There is a need for an agreed on set of codes, concepts, types, etc. of things in addition to classes and individuals. It is also just another perspective on the domain where there is frequently a need to talk about a whole set (an OWL class) as if it is a single primitive thing (an instance) and SKOS is a formalization of this idea.
  2. These codes come from billing data, and are an example of one kind of data that can be aggregated using the ISF.
  3. Aggregated encounter data are mapped to the ISF clinical module using Java scripts based on OWL API to generate RDF data
  4. Person activities and products of research all can be used to represent expertise and link clinical and basic expertise. Use of ISF will enable integration with multiple datasets to discover useful clinical associations and patterns
  5. The key point here is that the connections we can now see in inter-institutional collaboration, using publications as evidence, can be leveraged to target the ontological coverage at an individual site, establishing joint interests by investigators/communities based upon methods, materials, instruments, etc. – other ways of connecting peopole At interinstituional level we can see interaction between previously unconnected groups via intervening persons/groups at another institution Expanded representation expands connections • currently sites • true payoff – concept coverage expansion