Collaborative Ontology
Development
Natasha Noy
Stanford University
Monday, July 15, 13
The ontology development that we
grew up with
Courtesy of Mark Musen
Lots of databases and sources
The data is in different silos
Need to integrate them
Considerable benefit if you can integrate the data
Ontologies are essential to science
Many ontologies today are large
and there are lots of them
• Gene Ontology: 28K classes
• Foundational Model of Anatomy: >80K classes
• NCI Thesaurus: 80K classes
• SNOMED CT: >300K classes
There are lots of ontologies and more to come
BioPortal has more than 350 ontologies in the field of biomedicine alone
Users uploaded more than 230 ontologies to WebProtégé in the first two months after its release
Scientists have adopted ontologies
To provide a canonical representation of scientific knowledge
To annotate experimental data to enable interpretation, comparison, and discovery across databases
To facilitate knowledge-based applications for decision support, natural-language processing, data integration, and other applications
Ontology development has changed, too
from a lone knowledge engineer
to a few distributed users
or to any number of users anywhere in the world
Courtesy of Mark Musen
Collaborative Ontology
Development
• Collaborative
• Several users contribute to a single developing
ontology
• There are mechanisms to carry out discussions and
to reach consensus
• Ontologies
• From simple taxonomies
• To expressive OWL ontologies
Ontologies That Are Being
Developed Collaboratively
Gene Ontology (GO)
• Developed by the Gene Ontology Consortium
• Goal: create a single terminological resource
for annotating genes and gene function from
different model organisms:
• Drosophila, mouse, E. coli, Homo sapiens, ...
• GO: 38,000 classes
Key Resource: GO Annotations
Manually curated over the past 10 years
Publicly available
345,000 annotations for Homo sapiens
[Figure: a manual GO annotation linking a gene product (TP53) to a GO term (GO:0007569, cell aging), supported by a PubMed article]
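The annotation on this slide can be read as a small record: a gene product, a GO term, and the publication that supports the link. The Python sketch below is only an illustration of that idea. The class name, field names, and the placeholder PubMed identifier are assumptions, not the GO Consortium's annotation file format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GoAnnotation:
    """One manually curated link between a gene product and a GO term."""
    gene_product: str   # e.g. "TP53"
    go_id: str          # e.g. "GO:0007569"
    go_label: str       # e.g. "cell aging"
    reference: str      # supporting publication (placeholder value below)
    curated_manually: bool = True


def terms_for(gene_product, annotations):
    """Return the sorted GO ids attached to one gene product."""
    return sorted({a.go_id for a in annotations if a.gene_product == gene_product})


annotations = [
    # The PubMed id is deliberately left as a placeholder.
    GoAnnotation("TP53", "GO:0007569", "cell aging", "PMID:<placeholder>"),
]
print(terms_for("TP53", annotations))
```

Keeping the reference on the record itself is what lets a reader trace each annotation back to its evidence.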
The Gene Ontology
Terminology for consistent description of gene products
Issue Tracker
Curators of biomedical
databases
GO Curators: 3 full-time curators have access to edit GO
Anyone in the community can
submit an issue or request
The NCI Thesaurus
A reference ontology for cancer biology,
translational science, and clinical oncology
~20 full-time editors making changes
Changes are not immediately visible
A “lead editor” approves the changes and assigns new tasks
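The gatekeeping workflow described on this slide (editors propose changes, nothing becomes visible until the lead editor approves it) can be sketched as a simple queue. This is a minimal illustration in Python. All class and method names are hypothetical and do not describe the tooling actually used for the NCI Thesaurus.

```python
from dataclasses import dataclass, field


@dataclass
class Change:
    editor: str
    description: str
    approved: bool = False


@dataclass
class EditorialQueue:
    """Pending changes stay hidden until the lead editor approves them."""
    lead_editor: str
    pending: list = field(default_factory=list)
    published: list = field(default_factory=list)

    def propose(self, editor, description):
        change = Change(editor, description)
        self.pending.append(change)
        return change

    def approve(self, reviewer, change):
        if reviewer != self.lead_editor:
            raise PermissionError("only the lead editor may approve changes")
        change.approved = True
        self.pending.remove(change)
        self.published.append(change)

    def visible_changes(self):
        """Descriptions of the changes that readers of the release can see."""
        return [c.description for c in self.published]


queue = EditorialQueue(lead_editor="lead")
draft = queue.propose("editor_1", "add a hypothetical class")
print(queue.visible_changes())   # the proposal is not visible yet
queue.approve("lead", draft)
print(queue.visible_changes())
```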
International Classification of
Diseases (ICD)
Have you looked at your medical insurance bill lately?
International Classification of Diseases
ICD – Why should you care?
Certificate of death
Policy making
Medical bills
Developing ICD-10:
Revision process in the 20th century
8 Annual Revision Conferences (1982–89)
17–58 countries participated
1–5 person delegations
Mainly Health Statisticians
Manual curation
List exchange
Index was done later
“Decibel” method of discussion
Output: Paper Copy
Work in English only
Limited testing in the field
ICD-11: the 21st century
• ICD-11 is being developed as an OWL ontology
• Being developed collaboratively, in an open
editing process
• Links to other ontologies, such as SNOMED CT
• 33,000 classes
Over 250 domain experts from around the world
Organized in groups, which edit different parts of the ontology
T. Tudorache, S. Falconer, C. Nyulas, N. F. Noy and M. A. Musen
Will Semantic Web Technologies Work for the Development of ICD-11?
International Semantic Web Conference (ISWC 2010), In-Use Track, Shanghai, China
ICD-11 development process
• Each night a snapshot of the jointly edited ontology is
published on a public platform to encourage feedback from
the larger community: http://apps.who.int/classifications/icd11/browse/f/en
• Editorial workflow
• Centrally overseen by WHO
• Peer-reviewed process for the content and structure
• Experts may add change proposals
• WebProtégé used as the collaborative ontology
development platform
Modeling ICD-11: Different views
Linearization
Foundation: ICD categories with definitions, synonyms, clinical descriptions, diagnostic criteria, causal mechanism, functional impact
Linearizations: primary care, morbidity, mortality
Multi-Linguality
Links to Other Terminologies
Search in
BioPortal
All properties are reified
Multi-linguality
External references
Metadata
Evidence
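Reifying a property means that its value is itself an object, so the value can carry its own language tag, external reference, and evidence instead of being a bare string. The Python sketch below illustrates the idea under stated assumptions: the class names, field names, and the sample category are hypothetical and are not the actual ICD-11 schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ReifiedValue:
    """A property value promoted to an object so it can hold metadata."""
    label: str
    language: str = "en"
    source: Optional[str] = None                  # external reference, if any
    evidence: List[str] = field(default_factory=list)


@dataclass
class Category:
    id: str
    titles: List[ReifiedValue] = field(default_factory=list)

    def title_in(self, language):
        """Return the first title in the requested language, or None."""
        for value in self.titles:
            if value.language == language:
                return value.label
        return None


category = Category(
    id="example-category",  # hypothetical identifier
    titles=[
        ReifiedValue("Tuberculosis", language="en"),
        ReifiedValue("Tuberkulose", language="de"),
    ],
)
print(category.title_in("de"))
```

Because each title is an object, adding a translation or attaching evidence does not require changing the category itself.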
[Class diagram; arrows denote "related to" and "subclass of"; * marks a multi-valued property]
DomainConcept
Term
  id : xsd:string
LinguisticEntity
  label : xsd:string
  language : xsd:string
LanguageTerm
  linguisticEntity : LinguisticEntity
ReferenceTerm
  source : xsd:string
  label : LinguisticEntity ...
ICDCategory
  id : xsd:string
  linearizationSpecification* : LinearizationSpecification
  definition : DefinitionTerm
  synonym* : LanguageTerm
  bodyPart* : BodyPartTerm ...
LinearizationSpecification
  linearizationView : LinearizationValueSet
  linearizationParent : ICDCategoryType ...
Courtesy of Tania Tudorache
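One way to read this model: each ICDCategory carries linearization specifications, and each specification names a view together with the parent the category has in that view. The Python sketch below shows how a single-parent tree per view could be derived from such data. The property names follow the diagram (rendered in snake_case). The simplified types, the string-valued parent ids, and the sample categories are assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class LinearizationSpecification:
    linearization_view: str                # e.g. "Morbidity"
    linearization_parent: Optional[str]    # parent category id in this view


@dataclass
class ICDCategory:
    id: str
    linearization_specifications: List[LinearizationSpecification] = field(
        default_factory=list
    )


def children_by_parent(categories, view):
    """Map each parent id to its child ids within one linearization view."""
    tree: Dict[Optional[str], List[str]] = defaultdict(list)
    for category in categories:
        for spec in category.linearization_specifications:
            if spec.linearization_view == view:
                tree[spec.linearization_parent].append(category.id)
    return dict(tree)


# Hypothetical categories: "c" sits under a different parent in each view.
categories = [
    ICDCategory("a", [LinearizationSpecification("Morbidity", None),
                      LinearizationSpecification("Mortality", None)]),
    ICDCategory("b", [LinearizationSpecification("Morbidity", None),
                      LinearizationSpecification("Mortality", "a")]),
    ICDCategory("c", [LinearizationSpecification("Morbidity", "b"),
                      LinearizationSpecification("Mortality", "a")]),
]
print(children_by_parent(categories, "Morbidity"))
print(children_by_parent(categories, "Mortality"))
```

Storing the parent per view, rather than once per category, is what allows the same category to appear in different places in different linearizations.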
Ontology Development as a
Collaborative Process
• Ontology development is an inherently
collaborative process
• It is also inherently modular, so “stepping on
someone else’s toes” is not a big issue
• Users expect Web 2.0-style interaction:
• feeds, emails
• watched entities
• Web interface
• social-networking features
Dimensions of Collaborative
Workflows
•Ontology size
• from 100s to 10,000s of concepts
•Size of the community
• Contributors (in some form): from 2-3
to dozens
• Editors: from 1-2 to 20
•Control mechanisms
• Variety of roles
• Gatekeepers, etc.
• Client-server editing
•Discussion tools
• mailing lists, message boards
• face-to-face meetings, telecons
• Synchronization and editing
mechanisms
• CVS, SVN
WebProtégé
“Google Docs” for ontologies
Collaboration Features
• Simultaneous editing
• Change tracking
• Threaded discussions for ontology entities and changes
(notes, discussions, proposals, reviews)
• Watching ontology entities and branches and notifications
• Upload and sharing of ontologies
• Download any revision of the ontology
• Access policies
• User interface customization for domain experts
• Change analysis and statistics
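Change tracking and downloading any revision fit together naturally if every edit is stored as an entry in an append-only log: replaying the first n entries rebuilds revision n. The Python sketch below illustrates that idea with a hypothetical log format. It is not a description of WebProtégé's actual storage model or API.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ChangeEntry:
    author: str
    entity: str
    new_label: Optional[str]   # None means the entity was deleted


def revision(log: List[ChangeEntry], number: int) -> Dict[str, str]:
    """Rebuild the entity -> label map as it stood after `number` changes."""
    state: Dict[str, str] = {}
    for entry in log[:number]:
        if entry.new_label is None:
            state.pop(entry.entity, None)
        else:
            state[entry.entity] = entry.new_label
    return state


log = [
    ChangeEntry("alice", "C1", "Disease"),
    ChangeEntry("bob", "C2", "Infection"),
    ChangeEntry("alice", "C2", "Infectious disease"),
    ChangeEntry("bob", "C1", None),
]
print(revision(log, 2))   # state after the first two changes
print(revision(log, 4))   # current state
```

The same log also supports change statistics, for example counting entries per author.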
Notes and discussions
Change tracking
Watching entities and branches
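Watching a branch means being notified about changes to an entity or to anything beneath it. The Python sketch below shows one way this could work, assuming a single-parent hierarchy without cycles. The function names and sample data are hypothetical, and WebProtégé's own notification mechanism may work differently.

```python
from typing import Dict, List, Optional, Set


def ancestors_and_self(entity: str,
                       parent_of: Dict[str, Optional[str]]) -> List[str]:
    """Walk from an entity up to the root of a single-parent hierarchy."""
    chain = []
    current: Optional[str] = entity
    while current is not None:
        chain.append(current)
        current = parent_of.get(current)
    return chain


def watchers_to_notify(changed: str,
                       parent_of: Dict[str, Optional[str]],
                       branch_watches: Dict[str, Set[str]]) -> Set[str]:
    """Users watching the changed entity or any branch that contains it."""
    notified: Set[str] = set()
    for entity in ancestors_and_self(changed, parent_of):
        notified |= branch_watches.get(entity, set())
    return notified


# Hypothetical hierarchy and watch lists.
parent_of = {"Disease": None, "Infection": "Disease", "Cholera": "Infection"}
branch_watches = {"Infection": {"alice"}, "Disease": {"bob"}}
print(sorted(watchers_to_notify("Cholera", parent_of, branch_watches)))
```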
Download any snapshot in time
Research Challenges
• Human-Computer Interaction:
• How do we enable domain experts to contribute effectively?
• What are the minimal sets of constructs necessary?
• Change analysis:
• Are there patterns in how users edit ontologies?
• Can we use these patterns to guide user interfaces?
• Community dynamics:
• What are the dynamics in groups that develop ontologies
collaboratively?
• Are there explicit or implicit roles?
• Do roles change over time?

More Related Content

What's hot

What does the next generation repository look like?
What does the next generation repository look like?What does the next generation repository look like?
What does the next generation repository look like?Paul Walk
 
Profile Locally Network Globally
Profile Locally Network GloballyProfile Locally Network Globally
Profile Locally Network Globallyericmeeks
 
Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking PeoplefereiraJ
 
Karma Data Modeling
Karma Data ModelingKarma Data Modeling
Karma Data ModelingVioleta Ilik
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.FAIRDOM
 
Starting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryStarting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryVioleta Ilik
 
Karma is a tool! Managing your Data
Karma is a tool! Managing your DataKarma is a tool! Managing your Data
Karma is a tool! Managing your DataVioleta Ilik
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.FAIRDOM
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Collaborative Development of Ontologies using BioPortal and WebProtégé
Collaborative Development of Ontologies using  BioPortal and WebProtégé  Collaborative Development of Ontologies using  BioPortal and WebProtégé
Collaborative Development of Ontologies using BioPortal and WebProtégé Trish Whetzel
 
Collaborative Development of Ontologies using BioPortal and WebProtégé
Collaborative Development of Ontologies using  BioPortal and WebProtégé  Collaborative Development of Ontologies using  BioPortal and WebProtégé
Collaborative Development of Ontologies using BioPortal and WebProtégé Trish Whetzel
 
Research Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMCarole Goble
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
A. Rose by any other name
A. Rose by any other nameA. Rose by any other name
A. Rose by any other nameAmanda Hill
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openlyFAIRDOM
 

What's hot (20)

What does the next generation repository look like?
What does the next generation repository look like?What does the next generation repository look like?
What does the next generation repository look like?
 
Profile Locally Network Globally
Profile Locally Network GloballyProfile Locally Network Globally
Profile Locally Network Globally
 
Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking People
 
Karma Data Modeling
Karma Data ModelingKarma Data Modeling
Karma Data Modeling
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
 
Starting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repositoryStarting from scratch – building the perfect digital repository
Starting from scratch – building the perfect digital repository
 
Karma is a tool! Managing your Data
Karma is a tool! Managing your DataKarma is a tool! Managing your Data
Karma is a tool! Managing your Data
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Collaborative Development of Ontologies using BioPortal and WebProtégé
Collaborative Development of Ontologies using  BioPortal and WebProtégé  Collaborative Development of Ontologies using  BioPortal and WebProtégé
Collaborative Development of Ontologies using BioPortal and WebProtégé
 
Collaborative Development of Ontologies using BioPortal and WebProtégé
Collaborative Development of Ontologies using  BioPortal and WebProtégé  Collaborative Development of Ontologies using  BioPortal and WebProtégé
Collaborative Development of Ontologies using BioPortal and WebProtégé
 
Research Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOM
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
A. Rose by any other name
A. Rose by any other nameA. Rose by any other name
A. Rose by any other name
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
Hosting a compound centric community resource for chemistry data
Hosting a compound centric community resource for chemistry dataHosting a compound centric community resource for chemistry data
Hosting a compound centric community resource for chemistry data
 
Ngsp
NgspNgsp
Ngsp
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openly
 

Similar to Collaborative ontology development

Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep Kirsten Thompson
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleepUoLResearchSupport
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Susanna-Assunta Sansone
 
Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Philip Bourne
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Susanna-Assunta Sansone
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introductionMIBBI Checklists
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Susanna-Assunta Sansone
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformUCLDH
 
Altmetrics Day Workshop - Internet Librarian International 2014
Altmetrics Day Workshop - Internet Librarian International 2014Altmetrics Day Workshop - Internet Librarian International 2014
Altmetrics Day Workshop - Internet Librarian International 2014Andy Tattersall
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKpetermurrayrust
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010Philip Bourne
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapeDigital Science
 

Similar to Collaborative ontology development (20)

Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleep
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
 
Maccallum
MaccallumMaccallum
Maccallum
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introduction
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
 
TIDSR
TIDSRTIDSR
TIDSR
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse Platform
 
Data and Research Infrastructures and Open Science
Data and Research Infrastructures and Open ScienceData and Research Infrastructures and Open Science
Data and Research Infrastructures and Open Science
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
Altmetrics Day Workshop - Internet Librarian International 2014
Altmetrics Day Workshop - Internet Librarian International 2014Altmetrics Day Workshop - Internet Librarian International 2014
Altmetrics Day Workshop - Internet Librarian International 2014
 
British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011
 
Patterson2010
Patterson2010Patterson2010
Patterson2010
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UK
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscape
 

More from sssw2012

Semantic Search
Semantic SearchSemantic Search
Semantic Searchsssw2012
 
Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real Worldsssw2012
 
The Web of Data - Tom Heath
The Web of Data - Tom HeathThe Web of Data - Tom Heath
The Web of Data - Tom Heathsssw2012
 
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...sssw2012
 
Valentina Presutti - Ontology Design Patterns: an introduction
Valentina Presutti - Ontology Design Patterns: an introductionValentina Presutti - Ontology Design Patterns: an introduction
Valentina Presutti - Ontology Design Patterns: an introductionsssw2012
 
Ivan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3CIvan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3Csssw2012
 
jerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matchingjerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matchingsssw2012
 
Aldo Gangemi - Meaning on the Web: An Empirical Design Perspective
Aldo Gangemi - Meaning on the Web: An Empirical Design PerspectiveAldo Gangemi - Meaning on the Web: An Empirical Design Perspective
Aldo Gangemi - Meaning on the Web: An Empirical Design Perspectivesssw2012
 

More from sssw2012 (8)

Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real World
 
The Web of Data - Tom Heath
The Web of Data - Tom HeathThe Web of Data - Tom Heath
The Web of Data - Tom Heath
 
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...
Linked Data Applications: There is No-One-Size-Fits-All Formula - Asun Gomez ...
 
Valentina Presutti - Ontology Design Patterns: an introduction
Valentina Presutti - Ontology Design Patterns: an introductionValentina Presutti - Ontology Design Patterns: an introduction
Valentina Presutti - Ontology Design Patterns: an introduction
 
Ivan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3CIvan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3C
 
jerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matchingjerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matching
 
Aldo Gangemi - Meaning on the Web: An Empirical Design Perspective
Aldo Gangemi - Meaning on the Web: An Empirical Design PerspectiveAldo Gangemi - Meaning on the Web: An Empirical Design Perspective
Aldo Gangemi - Meaning on the Web: An Empirical Design Perspective
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Collaborative ontology development

  • 2. The ontology development that we grew up with (courtesy of Mark Musen)
  • 3. Ontologies are essential to science
      • Lots of databases and sources
      • The data is in different silos
      • Need to integrate them
      • Considerable benefit if you can integrate the data
  • 4. Many ontologies today are large and there are lots of them
      • Gene Ontology: 28K classes
      • Foundational Model of Anatomy: >80K classes
      • NCI Thesaurus: 80K classes
      • SNOMED CT: >300K classes
  • 5. There are lots of ontologies and more to come
      • BioPortal has more than 350 ontologies in the field of biomedicine alone
      • Users uploaded more than 230 ontologies to WebProtégé in the first two months after its release
  • 6. Scientists have adopted ontologies
      • To provide a canonical representation of scientific knowledge
      • To annotate experimental data to enable interpretation, comparison, and discovery across databases
      • To facilitate knowledge-based applications for decision support, natural language processing, data integration, and other applications
  • 7. Ontology development has changed, too: from a lone knowledge engineer, to a few distributed users, or to any number of users anywhere in the world
  • 8. Courtesy of Mark Musen
  • 9. Collaborative Ontology Development
      • Collaborative
          • Several users contribute to a single developing ontology
          • There are mechanisms to carry out discussions and to reach consensus
      • Ontologies
          • From simple taxonomies
          • To expressive OWL ontologies
  • 10. Ontologies That Are Being Developed Collaboratively
  • 11. Gene Ontology (GO)
      • Developed by the Gene Ontology Consortium
      • Goal: create a single terminological resource for annotating genes and gene function from different model organisms:
          • Drosophila, mouse, E. coli, Homo sapiens, ...
      • GO: 38,000 classes
  • 13. Key Resource: GO Annotations
      • Manually curated over the past 10 years
      • Publicly available
      • 345,000 annotations for Homo sapiens
      • [Figure: a manual GO annotation linking the gene product TP53 to the GO term GO:0007569 (cell aging), supported by a PubMed article]
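
The annotation on slide 13 can be read as a small record: a gene product, a GO term, and a supporting article. The following is a minimal sketch of such a record, not the Gene Ontology Consortium's actual file format; the class name `GoAnnotation`, its field names, and the helper functions are assumptions made for illustration, and the reference value is a placeholder because the slide does not name the article.

```python
import re
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class GoAnnotation:
    """One manually curated link between a gene product and a GO term (hypothetical shape)."""
    gene_product: str   # e.g. "TP53"
    go_id: str          # e.g. "GO:0007569"
    go_label: str       # e.g. "cell aging"
    reference: str      # supporting article; the value used below is a placeholder


def is_valid_go_id(go_id: str) -> bool:
    """GO identifiers have the form 'GO:' followed by seven digits."""
    return re.fullmatch(r"GO:\d{7}", go_id) is not None


def terms_by_gene_product(annotations):
    """Group GO term ids by the gene product they annotate."""
    grouped = defaultdict(set)
    for ann in annotations:
        if not is_valid_go_id(ann.go_id):
            raise ValueError(f"malformed GO id: {ann.go_id!r}")
        grouped[ann.gene_product].add(ann.go_id)
    return dict(grouped)


# The example from the slide; the article reference is left as a placeholder.
example = GoAnnotation("TP53", "GO:0007569", "cell aging", "PMID:<placeholder>")
print(terms_by_gene_product([example]))
```

Keeping the reference on every record is what lets a curated annotation be traced back to the article that supports it.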
  • 15. The Gene Ontology: terminology for consistent description of gene products
      • [Diagram: curators of biomedical databases, issue tracker, GO curators]
      • 3 full-time curators have access to edit GO
      • Anyone in the community can submit an issue or request
  • 17. The NCI Thesaurus
      • A reference ontology for cancer biology, translational science, and clinical oncology
      • ~20 full-time editors making changes
      • Changes are not immediately visible
      • A “lead editor” approves the changes and assigns new tasks
  • 18. International Classification of Diseases (ICD): Have you looked at your medical insurance bill lately?
  • 19. International Classification of Diseases
  • 20. ICD – Why should you care?
      • Certificate of death
      • Policy making
      • Medical bills
  • 21. Developing ICD-10: Revision process in the 20th century
      • 8 Annual Revision Conferences (1982–89)
      • 17–58 countries participated
      • 1–5 person delegations
      • Mainly health statisticians
      • Manual curation
      • List exchange
      • Index was done later
      • “Decibel” method of discussion
      • Output: paper copy
      • Work in English only
      • Limited testing in the field
  • 22. ICD-11: the 21st century
      • ICD-11 is being developed as an OWL ontology
      • Being developed collaboratively, in an open editing process
      • Links to other ontologies, such as SNOMED CT
      • 33,000 classes
  • 23. Over 250 domain experts from around the world
      • Organized in groups, which edit different parts of the ontology
      • Source: T. Tudorache, S. Falconer, C. Nyulas, N. F. Noy, and M. A. Musen. “Will Semantic Web Technologies Work for the Development of ICD-11?” International Semantic Web Conference (ISWC 2010), In-Use Track, Shanghai, China.
  • 24. ICD-11 development process
      • Each night a snapshot of the jointly edited ontology is published on a public platform to encourage feedback from the larger community: http://apps.who.int/classifications/icd11/browse/f/en
      • Editorial workflow
          • Centrally overseen by WHO
          • Peer-reviewed process for the content and structure
          • Experts may add change proposals
      • WebProtégé used as the collaborative ontology development platform
  • 25. Modeling ICD-11: Different views
  • 26. Linearization
      • Foundation: ICD categories with definitions, synonyms, clinical descriptions, diagnostic criteria, causal mechanism, functional impact
      • Linearizations: primary care, morbidity, mortality
  • 28. Links to Other Terminologies: search in BioPortal
  • 29. All properties are reified
      • Multilinguality
      • External references
      • Metadata
      • Evidence
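
Reifying a property means its value becomes an object rather than a plain string, so each value can carry its own language, external reference, metadata, and evidence. The sketch below illustrates that idea only; it is not the ICD-11 model itself, and the names `ReifiedValue`, `Category`, `add`, and `values`, as well as the sample category and texts, are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ReifiedValue:
    """A property value promoted to an object so it can carry its own data."""
    text: str
    language: str = "en"                           # multilinguality
    source: Optional[str] = None                   # external reference
    evidence: list = field(default_factory=list)   # supporting evidence
    metadata: dict = field(default_factory=dict)   # anything else about the value


@dataclass
class Category:
    id: str
    # property name -> list of ReifiedValue objects
    properties: dict = field(default_factory=dict)

    def add(self, prop: str, value: ReifiedValue) -> None:
        self.properties.setdefault(prop, []).append(value)

    def values(self, prop: str, language: str) -> list:
        """Return the texts of one property in one language."""
        return [v.text for v in self.properties.get(prop, []) if v.language == language]


# Hypothetical category with the same property stated in two languages.
cat = Category(id="example-category")
cat.add("definition", ReifiedValue("An example definition.", source="example-source"))
cat.add("definition", ReifiedValue("Une définition d'exemple.", language="fr"))
print(cat.values("definition", "fr"))
```

With a plain string property, adding a second language or a source would require a new property for each combination; with reified values it is one more object in the list.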
  • 30. [Class diagram, courtesy of Tania Tudorache; relations shown: “subclass of”, “related to”]
      • DomainConcept
      • ICDCategory: id : xsd:string; linearizationSpecification* : LinearizationSpecification; definition : DefinitionTerm; synonym* : LanguageTerm; bodyPart* : BodyPartTerm; ...
      • LinearizationSpecification: linearizationView : LinearizationValueSet; linearizationParent : ICDCategoryType; ...
      • Term: id : xsd:string
      • LanguageTerm: linguisticEntity : LinguisticEntity
      • ReferenceTerm: source : xsd:string; label : LinguisticEntity; ...
      • LinguisticEntity: label : xsd:string; language : xsd:string
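
The `linearizationView` and `linearizationParent` attributes in the diagram suggest how a linearization can be derived: for each view, every category names the single parent it should appear under. The following is a sketch under that assumption, not the WHO tooling; the function `linearize`, the field `category`, and the category names are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class LinearizationSpecification:
    category: str              # the category being placed (hypothetical field)
    linearization_view: str    # e.g. "morbidity" or "mortality"
    linearization_parent: str  # the one parent used in that view


def linearize(specs, view):
    """Build a parent -> children map for one view.

    Raises ValueError if a category is given two parents in the same view,
    since the sketch assumes each view is a single-parent hierarchy.
    """
    children = defaultdict(list)
    placed = set()
    for spec in specs:
        if spec.linearization_view != view:
            continue
        if spec.category in placed:
            raise ValueError(f"{spec.category!r} has two parents in view {view!r}")
        placed.add(spec.category)
        children[spec.linearization_parent].append(spec.category)
    return {parent: sorted(kids) for parent, kids in children.items()}


# Hypothetical categories: X sits under A for morbidity and under B for mortality.
specs = [
    LinearizationSpecification("X", "morbidity", "A"),
    LinearizationSpecification("X", "mortality", "B"),
    LinearizationSpecification("Y", "morbidity", "A"),
]
print(linearize(specs, "morbidity"))
print(linearize(specs, "mortality"))
```

The same category can therefore sit in different places in different views without the shared definitions being duplicated.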
  • 32. Ontology Development as a Collaborative Process
      • Ontology development is an inherently collaborative process
      • It is also inherently modular, so “stepping on someone else’s toes” is not a big issue
      • Users expect Web 2.0-style interaction:
          • feeds, emails
          • watched entities
          • Web interface
          • social-networking features
  • 33. Dimensions of Collaborative Workflows
      • Ontology size
          • from 100s to 10,000s of concepts
      • Size of the community
          • Contributors (in some form): from 2-3 to dozens
          • Editors: from 1-2 to 20
      • Control mechanisms
          • Variety of roles
          • Gatekeepers, etc.
          • Client-server editing
      • Discussion tools
          • mailing lists, message boards
          • face-to-face meetings, telecons
      • Synchronization and editing mechanisms
          • CVS, SVN
  • 36. Collaboration Features
      • Simultaneous editing
      • Change tracking
      • Threaded discussions for ontology entities and changes (notes, discussions, proposals, reviews)
      • Watching ontology entities and branches, with notifications
      • Upload and sharing of ontologies
      • Download any revision of the ontology
      • Access policies
      • User interface customization for domain experts
      • Change analysis and statistics
  • 41. Watching entities and branches
  • 42. Download any snapshot in time
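
Change tracking and "download any snapshot in time" fit together naturally if every edit is kept in an append-only log: any revision can then be rebuilt by replaying the log up to that point. The sketch below shows that general technique and makes no claim about how WebProtégé implements it; the `Change` record, the two change kinds, and the class names in the example are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    revision: int   # position in the log, starting at 1
    author: str
    kind: str       # "add_class" or "remove_class" (hypothetical change kinds)
    entity: str


def snapshot(changes, revision):
    """Replay the log up to and including `revision`; return the set of classes."""
    classes = set()
    for change in sorted(changes, key=lambda c: c.revision):
        if change.revision > revision:
            break
        if change.kind == "add_class":
            classes.add(change.entity)
        elif change.kind == "remove_class":
            classes.discard(change.entity)
        else:
            raise ValueError(f"unknown change kind: {change.kind!r}")
    return classes


def changes_per_author(changes):
    """A simple change statistic: number of edits per author."""
    counts = {}
    for change in changes:
        counts[change.author] = counts.get(change.author, 0) + 1
    return counts


# Hypothetical edit history by two users.
log = [
    Change(1, "alice", "add_class", "Disease"),
    Change(2, "bob", "add_class", "InfectiousDisease"),
    Change(3, "alice", "remove_class", "Disease"),
]
print(snapshot(log, 2))
print(changes_per_author(log))
```

The same log also feeds change analysis and statistics, since it records who made which kind of edit and when.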
  • 43. Research Challenges
      • Human-Computer Interaction:
          • How do we enable domain experts to contribute effectively?
          • What are the minimal sets of constructs necessary?
      • Change analysis:
          • Are there patterns in how users edit ontologies?
          • Can we use these patterns to guide user interfaces?
      • Community dynamics:
          • What are the dynamics in groups that develop ontologies collaboratively?
          • Are there explicit or implicit roles?
          • Do roles change over time?