SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Ontologies in Physical Science

        Peter Murray-Rust,
          University of Cambridge
       & Open Knowledge Foundation




 Onto Workshop, ed.ac.uk 2013-04-11



  An #animalgarden production
PMR and friends
 want us to help build                    Is it an
  a computational                        important
 chemistry ontology                      problem?

                   $1,000,000,000/yr
                     for compchem




                                       They need
                                         OWL

Problem: How to build ontologies when
people are uninterested or antagonistic
even though we have the technology
Perhaps the
chemists could
 use OWL-DL



           Chemists don’t
           use ontologies



   Top-down
 schemas like
 AniML haven’t
 (yet) taken off
Are there any
         ontologies in physical
          science that work?



                                     Crystallo-
                                  graphers build
                                  CIF dictionaries




                                  The IUCr, right? Tell
                                     us about CIF


IUCr: International Union of Crystallography
CIF Core defines
       500 common
         concepts




                     Like the
                  wavelength of
                  the radiation
                       used


                                  Or the volume of
                                   the crystal cell



CIF: http://www.iucr.org/cif
An      Core dictionary (coreCIF) version 2.4.3
 example   _diffrn_ambient_temperature
    ?      Definition: The mean temperature in kelvins at
                which the intensities were measured.
           Range: 0.0 -> infinity Type: numb


              ID    For machines:
                    Constraint + type

                                        For
                                        humans



http://www.iucr.org/__data/iucr/cifdic_html/1/
cif_core.dic/Idiffrn_ambient_temperature.html
Definition: The mean temperature in kelvins at
     which the intensities were measured.


           So everyone
             converts
          temperatures
            to use K?

                   Yes! today I
                  swam at 273K

             But chemists
                                   We MUST
             want to use all
                                  have a units
                 sorts of
                                   ontology
             different units
OWL? Is CIF
               a proper
             ontology? It’s
              not RDF…




                 …but we’ve global URIs, like
                cif:_diffrn_ambient_temperature


Because IUCr controls the
 namespace prefix: cif=
 http://www.iucr.org/cif
CIF had 20 years
 of community
   involvement
  through IUCr

                    But most top-
                   down chemistry
                    projects don’t
                        work




                                     So we’ll do this
                                      bottom-up.
Every compchem
        program uses basically
          the same scientific
               concepts

      We think each should
    build its own dictionary so
    we understand the output



              Won’t that just
               be a mess?



No. It’s the first step to
   interoperability.
The programs
                will use CML* for
                chemical output




                  Hyperchem
                   builds ITS
                   dictionary


                                        NWChem
              Each annotates            builds ITS
                 their own              dictionary
              program output

Chemical Markup Language PMR/Rzepa http://www.xml-cml.org
Alpha-electrons:
      Hyperchem uses
      hchem:e_alpha




                      NWChem has
                   nwchem:_alpha_elec


We agree they are the
   same so create
 compchem:alphae

                     in a communal
                cml:compchem dictionary
                   that everyone uses
What if the
data structure
 or concepts        CML provides
  don’t map        conventions so
                   each group can
                   define their data
                       structure




                 Data can then be
                 machine validated
                   against each
                   convention!
But there are                       We’ve
    over 20                      prototyped with
    program                       many before.
     codes.                         They’ll be
                                   encouraged
                   GULP, DPOL
                  Y, CASTEP, S
                  IESTA, MOPA
                      C…



                                        I think it’s
                                      going to work.
                                        BUT TTT*

TTT: Things Take Time (Piet Hein)
Will it work? It   National labs
 depends on          CSIRO/AU
    people         and PNNL/US
                   are committed




                    And we have
                   companies like
 I wish we had      Hyperchem
      some          and Kitware
   publishers
We’ll need
  tools
                                       We’ve got FoX* for
                                       FORTRAN output




                                       JUMBOTemplates
                                        to parse logfiles
RDF for navigating
  dictionaries

                     FoX*: XML/FORTRAN Toby White, Andrew Walker
Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Perhaps the
   chemists could
    use OWL-DL



Chemists don’t
   use ANY
  ontologies


               Top-down
             schemas like
             AniML haven’t
             (yet) taken off

Weitere ähnliche Inhalte

Was ist angesagt?

Disruptive Communities and Technology
Disruptive Communities and TechnologyDisruptive Communities and Technology
Disruptive Communities and Technologypetermurrayrust
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome Trustpetermurrayrust
 
Can Computers understand the scientific literature (includes compscie material)
Can Computers understand the scientific literature (includes compscie material)Can Computers understand the scientific literature (includes compscie material)
Can Computers understand the scientific literature (includes compscie material)TheContentMine
 
Content Mining of Science in Europe
Content Mining of Science in EuropeContent Mining of Science in Europe
Content Mining of Science in Europepetermurrayrust
 
ContentMine: Open Data and Social Machines
ContentMine: Open Data and Social MachinesContentMine: Open Data and Social Machines
ContentMine: Open Data and Social MachinesTheContentMine
 
ContentMining in Neuroscience
ContentMining in NeuroscienceContentMining in Neuroscience
ContentMining in NeuroscienceTheContentMine
 
ContentMining for Synthetic Biology
ContentMining for Synthetic BiologyContentMining for Synthetic Biology
ContentMining for Synthetic Biologypetermurrayrust
 
ContentMining for Synthetic Biology
ContentMining for Synthetic BiologyContentMining for Synthetic Biology
ContentMining for Synthetic BiologyTheContentMine
 
ContentMine: Open Data and Social Machines
ContentMine: Open Data and Social MachinesContentMine: Open Data and Social Machines
ContentMine: Open Data and Social Machinespetermurrayrust
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Sciencepetermurrayrust
 
Content Mining of Science in Cambridge
Content Mining of Science in CambridgeContent Mining of Science in Cambridge
Content Mining of Science in CambridgeTheContentMine
 
ContentMine (TDM) at JISC Digifest
ContentMine (TDM) at JISC DigifestContentMine (TDM) at JISC Digifest
ContentMine (TDM) at JISC Digifestpetermurrayrust
 
ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika!ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika!petermurrayrust
 
OpenNotebookScience NOW!
OpenNotebookScience NOW!OpenNotebookScience NOW!
OpenNotebookScience NOW!petermurrayrust
 
ContentMining and Clinical Trials
ContentMining and Clinical TrialsContentMining and Clinical Trials
ContentMining and Clinical TrialsTheContentMine
 
ContentMining and Clinical Trials
ContentMining and Clinical TrialsContentMining and Clinical Trials
ContentMining and Clinical Trialspetermurrayrust
 

Was ist angesagt? (20)

Disruptive Communities and Technology
Disruptive Communities and TechnologyDisruptive Communities and Technology
Disruptive Communities and Technology
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome Trust
 
Open Notebook Science
Open Notebook ScienceOpen Notebook Science
Open Notebook Science
 
Can Computers understand the scientific literature (includes compscie material)
Can Computers understand the scientific literature (includes compscie material)Can Computers understand the scientific literature (includes compscie material)
Can Computers understand the scientific literature (includes compscie material)
 
Content Mining of Science in Europe
Content Mining of Science in EuropeContent Mining of Science in Europe
Content Mining of Science in Europe
 
ContentMine: Open Data and Social Machines
ContentMine: Open Data and Social MachinesContentMine: Open Data and Social Machines
ContentMine: Open Data and Social Machines
 
ContentMining in Neuroscience
ContentMining in NeuroscienceContentMining in Neuroscience
ContentMining in Neuroscience
 
ContentMining for Synthetic Biology
ContentMining for Synthetic BiologyContentMining for Synthetic Biology
ContentMining for Synthetic Biology
 
ContentMining for Synthetic Biology
ContentMining for Synthetic BiologyContentMining for Synthetic Biology
ContentMining for Synthetic Biology
 
Csvconf
CsvconfCsvconf
Csvconf
 
ContentMine: Open Data and Social Machines
ContentMine: Open Data and Social MachinesContentMine: Open Data and Social Machines
ContentMine: Open Data and Social Machines
 
Making Theses USEFUL
Making Theses USEFULMaking Theses USEFUL
Making Theses USEFUL
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Science
 
Petermrjisc20141201
Petermrjisc20141201Petermrjisc20141201
Petermrjisc20141201
 
Content Mining of Science in Cambridge
Content Mining of Science in CambridgeContent Mining of Science in Cambridge
Content Mining of Science in Cambridge
 
ContentMine (TDM) at JISC Digifest
ContentMine (TDM) at JISC DigifestContentMine (TDM) at JISC Digifest
ContentMine (TDM) at JISC Digifest
 
ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika!ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika!
 
OpenNotebookScience NOW!
OpenNotebookScience NOW!OpenNotebookScience NOW!
OpenNotebookScience NOW!
 
ContentMining and Clinical Trials
ContentMining and Clinical TrialsContentMining and Clinical Trials
ContentMining and Clinical Trials
 
ContentMining and Clinical Trials
ContentMining and Clinical TrialsContentMining and Clinical Trials
ContentMining and Clinical Trials
 

Andere mochten auch

Blogging and social media for leaders - Brooklyn Center version
Blogging and social media for leaders - Brooklyn Center versionBlogging and social media for leaders - Brooklyn Center version
Blogging and social media for leaders - Brooklyn Center versionWigley and Associates
 
Presentación sobre licencias Creative Commons en Edu-AREA
Presentación sobre licencias Creative Commons en Edu-AREAPresentación sobre licencias Creative Commons en Edu-AREA
Presentación sobre licencias Creative Commons en Edu-AREAManuel Caeiro Rodríguez
 
Proyecto mejora competencias TIC del docente
Proyecto mejora competencias TIC del docenteProyecto mejora competencias TIC del docente
Proyecto mejora competencias TIC del docenteBlanca Fondevila
 
La gran invocacion1
La gran invocacion1La gran invocacion1
La gran invocacion1NorkaMontero
 
Tecnologías de la información y la comunicación en chile
Tecnologías de la información y la comunicación en chileTecnologías de la información y la comunicación en chile
Tecnologías de la información y la comunicación en chileAgencia Exportadora®
 
Evaluacion nacional 40% final diseño de proyectos
Evaluacion nacional 40% final diseño de proyectosEvaluacion nacional 40% final diseño de proyectos
Evaluacion nacional 40% final diseño de proyectosrdmarles
 
Caperucita lb
Caperucita lbCaperucita lb
Caperucita lbJaden126
 
Linda Harper Kamin Resume Word 2003
Linda Harper Kamin Resume Word 2003Linda Harper Kamin Resume Word 2003
Linda Harper Kamin Resume Word 2003kaminl
 
Formato para proyecto_u_1_elect (1)
Formato para proyecto_u_1_elect (1)Formato para proyecto_u_1_elect (1)
Formato para proyecto_u_1_elect (1)Mario Lopez
 
Regolamento fantacalcio
Regolamento fantacalcioRegolamento fantacalcio
Regolamento fantacalcioPhabius
 
La primavera: plantas que anuncian su llegada
La primavera: plantas que anuncian su llegadaLa primavera: plantas que anuncian su llegada
La primavera: plantas que anuncian su llegadaflor_anino
 
El Llenguatge Audiovisual
El Llenguatge AudiovisualEl Llenguatge Audiovisual
El Llenguatge AudiovisualBenito Mendoza
 
APHG Unit 3: Language
APHG Unit 3: LanguageAPHG Unit 3: Language
APHG Unit 3: Languageappleselena
 
estructuras innovadoras
estructuras innovadorasestructuras innovadoras
estructuras innovadorasdaniela HR
 

Andere mochten auch (20)

Blogging and social media for leaders - Brooklyn Center version
Blogging and social media for leaders - Brooklyn Center versionBlogging and social media for leaders - Brooklyn Center version
Blogging and social media for leaders - Brooklyn Center version
 
Ds vdesk
Ds vdeskDs vdesk
Ds vdesk
 
Presentación sobre licencias Creative Commons en Edu-AREA
Presentación sobre licencias Creative Commons en Edu-AREAPresentación sobre licencias Creative Commons en Edu-AREA
Presentación sobre licencias Creative Commons en Edu-AREA
 
Proyecto mejora competencias TIC del docente
Proyecto mejora competencias TIC del docenteProyecto mejora competencias TIC del docente
Proyecto mejora competencias TIC del docente
 
La gran invocacion1
La gran invocacion1La gran invocacion1
La gran invocacion1
 
Hv cesar 2015 breve
Hv cesar 2015 breveHv cesar 2015 breve
Hv cesar 2015 breve
 
Tecnologías de la información y la comunicación en chile
Tecnologías de la información y la comunicación en chileTecnologías de la información y la comunicación en chile
Tecnologías de la información y la comunicación en chile
 
3
33
3
 
Evaluacion nacional 40% final diseño de proyectos
Evaluacion nacional 40% final diseño de proyectosEvaluacion nacional 40% final diseño de proyectos
Evaluacion nacional 40% final diseño de proyectos
 
Caperucita lb
Caperucita lbCaperucita lb
Caperucita lb
 
PATRICK WALKER.C.V 2
PATRICK WALKER.C.V 2PATRICK WALKER.C.V 2
PATRICK WALKER.C.V 2
 
Linda Harper Kamin Resume Word 2003
Linda Harper Kamin Resume Word 2003Linda Harper Kamin Resume Word 2003
Linda Harper Kamin Resume Word 2003
 
Computer Science and Information Science 6th semester(2011 June/July) Questi...
 Computer Science and Information Science 6th semester(2011 June/July) Questi... Computer Science and Information Science 6th semester(2011 June/July) Questi...
Computer Science and Information Science 6th semester(2011 June/July) Questi...
 
Formato para proyecto_u_1_elect (1)
Formato para proyecto_u_1_elect (1)Formato para proyecto_u_1_elect (1)
Formato para proyecto_u_1_elect (1)
 
Certification with DaSoft
Certification with DaSoftCertification with DaSoft
Certification with DaSoft
 
Regolamento fantacalcio
Regolamento fantacalcioRegolamento fantacalcio
Regolamento fantacalcio
 
La primavera: plantas que anuncian su llegada
La primavera: plantas que anuncian su llegadaLa primavera: plantas que anuncian su llegada
La primavera: plantas que anuncian su llegada
 
El Llenguatge Audiovisual
El Llenguatge AudiovisualEl Llenguatge Audiovisual
El Llenguatge Audiovisual
 
APHG Unit 3: Language
APHG Unit 3: LanguageAPHG Unit 3: Language
APHG Unit 3: Language
 
estructuras innovadoras
estructuras innovadorasestructuras innovadoras
estructuras innovadoras
 

Ähnlich wie Ontologies in Physical Science

Building cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and DockerBuilding cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and DockerJacob Feala
 
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinalProf. Wim Van Criekinge
 
OpenSAF Symposium_Python Bindings_9.21.11
OpenSAF Symposium_Python Bindings_9.21.11OpenSAF Symposium_Python Bindings_9.21.11
OpenSAF Symposium_Python Bindings_9.21.11OpenSAF Foundation
 
Cinfony - Bring cheminformatics toolkits into tune
Cinfony - Bring cheminformatics toolkits into tuneCinfony - Bring cheminformatics toolkits into tune
Cinfony - Bring cheminformatics toolkits into tunebaoilleach
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sangerChris Dwan
 
SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)Mike Hucka
 
Leveraging the Eclipse Ecosystem for the Scientific Community
Leveraging the Eclipse Ecosystem for the Scientific CommunityLeveraging the Eclipse Ecosystem for the Scientific Community
Leveraging the Eclipse Ecosystem for the Scientific Communityguestd41014
 
My Open Access papers
My Open Access papersMy Open Access papers
My Open Access papersbaoilleach
 
CLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchCLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchTom Connor
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
Climb stateoftheartintro
Climb stateoftheartintroClimb stateoftheartintro
Climb stateoftheartintrothomasrconnor
 
HyperChem program.pptx
HyperChem program.pptxHyperChem program.pptx
HyperChem program.pptxssuser337e30
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchAri Berman
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibilityc.titus.brown
 

Ähnlich wie Ontologies in Physical Science (20)

Introducing Parallel Pixie Dust
Introducing Parallel Pixie DustIntroducing Parallel Pixie Dust
Introducing Parallel Pixie Dust
 
Building cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and DockerBuilding cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and Docker
 
Cpascoe pimms or2012_
Cpascoe pimms or2012_Cpascoe pimms or2012_
Cpascoe pimms or2012_
 
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
 
OpenSAF Symposium_Python Bindings_9.21.11
OpenSAF Symposium_Python Bindings_9.21.11OpenSAF Symposium_Python Bindings_9.21.11
OpenSAF Symposium_Python Bindings_9.21.11
 
Cinfony - Bring cheminformatics toolkits into tune
Cinfony - Bring cheminformatics toolkits into tuneCinfony - Bring cheminformatics toolkits into tune
Cinfony - Bring cheminformatics toolkits into tune
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sanger
 
SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)
 
Leveraging the Eclipse Ecosystem for the Scientific Community
Leveraging the Eclipse Ecosystem for the Scientific CommunityLeveraging the Eclipse Ecosystem for the Scientific Community
Leveraging the Eclipse Ecosystem for the Scientific Community
 
An Update on Arm HPC
An Update on Arm HPCAn Update on Arm HPC
An Update on Arm HPC
 
My Open Access papers
My Open Access papersMy Open Access papers
My Open Access papers
 
CLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchCLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB Launch
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
Container Mythbusters
Container MythbustersContainer Mythbusters
Container Mythbusters
 
Climb stateoftheartintro
Climb stateoftheartintroClimb stateoftheartintro
Climb stateoftheartintro
 
HyperChem program.pptx
HyperChem program.pptxHyperChem program.pptx
HyperChem program.pptx
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
 
Blue Gene
Blue GeneBlue Gene
Blue Gene
 
The OHF Legacy
The OHF LegacyThe OHF Legacy
The OHF Legacy
 

Mehr von petermurrayrust

Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Agepetermurrayrust
 
Open Science Principles and Practice
Open Science Principles and PracticeOpen Science Principles and Practice
Open Science Principles and Practicepetermurrayrust
 
Open Virus Indian Presentation
Open Virus Indian PresentationOpen Virus Indian Presentation
Open Virus Indian Presentationpetermurrayrust
 
Can machines understand the scientific literature?
Can machines understand the scientific literature?Can machines understand the scientific literature?
Can machines understand the scientific literature?petermurrayrust
 
OpenVirus at OpenPublishingFest
OpenVirus at OpenPublishingFestOpenVirus at OpenPublishingFest
OpenVirus at OpenPublishingFestpetermurrayrust
 
Open Virus Indian Presentation
Open Virus Indian PresentationOpen Virus Indian Presentation
Open Virus Indian Presentationpetermurrayrust
 
Automatic mining of data from materials science literature
Automatic mining of data from materials science literatureAutomatic mining of data from materials science literature
Automatic mining of data from materials science literaturepetermurrayrust
 
Climate Change and Human Migration
Climate Change and Human MigrationClimate Change and Human Migration
Climate Change and Human Migrationpetermurrayrust
 
openVirus - tools for discovering literature on viruses
openVirus - tools for discovering literature on virusesopenVirus - tools for discovering literature on viruses
openVirus - tools for discovering literature on virusespetermurrayrust
 
XML for science; its huge potential; but are pubiishers preventing it?
XML for science; its huge potential; but are pubiishers preventing it?XML for science; its huge potential; but are pubiishers preventing it?
XML for science; its huge potential; but are pubiishers preventing it?petermurrayrust
 
Early Career Reseachers in Science. Start Early, Be Open , Be Brave
Early Career Reseachers in Science. Start Early, Be Open , Be BraveEarly Career Reseachers in Science. Start Early, Be Open , Be Brave
Early Career Reseachers in Science. Start Early, Be Open , Be Bravepetermurrayrust
 
Early Career Reseachers and Open Healthcare
Early Career Reseachers and Open HealthcareEarly Career Reseachers and Open Healthcare
Early Career Reseachers and Open Healthcarepetermurrayrust
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search petermurrayrust
 
Scientific search for everyone
Scientific search for everyoneScientific search for everyone
Scientific search for everyonepetermurrayrust
 
Openplant2018 Poster; Semantic searching
Openplant2018 Poster; Semantic searchingOpenplant2018 Poster; Semantic searching
Openplant2018 Poster; Semantic searchingpetermurrayrust
 
Extracting science from the archive
Extracting science from the archiveExtracting science from the archive
Extracting science from the archivepetermurrayrust
 
WikiFactMine: Ontology for Everybody and Everything
WikiFactMine: Ontology for Everybody and EverythingWikiFactMine: Ontology for Everybody and Everything
WikiFactMine: Ontology for Everybody and Everythingpetermurrayrust
 
Disrupting the Publisher-Academic Complex
Disrupting the Publisher-Academic ComplexDisrupting the Publisher-Academic Complex
Disrupting the Publisher-Academic Complexpetermurrayrust
 
Paradise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to MineParadise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to Minepetermurrayrust
 
Young people in an Age of Knowledge Neocolonialism
Young people in an Age of Knowledge NeocolonialismYoung people in an Age of Knowledge Neocolonialism
Young people in an Age of Knowledge Neocolonialismpetermurrayrust
 

Mehr von petermurrayrust (20)

Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Age
 
Open Science Principles and Practice
Open Science Principles and PracticeOpen Science Principles and Practice
Open Science Principles and Practice
 
Open Virus Indian Presentation
Open Virus Indian PresentationOpen Virus Indian Presentation
Open Virus Indian Presentation
 
Can machines understand the scientific literature?
Can machines understand the scientific literature?Can machines understand the scientific literature?
Can machines understand the scientific literature?
 
OpenVirus at OpenPublishingFest
OpenVirus at OpenPublishingFestOpenVirus at OpenPublishingFest
OpenVirus at OpenPublishingFest
 
Open Virus Indian Presentation
Open Virus Indian PresentationOpen Virus Indian Presentation
Open Virus Indian Presentation
 
Automatic mining of data from materials science literature
Automatic mining of data from materials science literatureAutomatic mining of data from materials science literature
Automatic mining of data from materials science literature
 
Climate Change and Human Migration
Climate Change and Human MigrationClimate Change and Human Migration
Climate Change and Human Migration
 
openVirus - tools for discovering literature on viruses
openVirus - tools for discovering literature on virusesopenVirus - tools for discovering literature on viruses
openVirus - tools for discovering literature on viruses
 
XML for science; its huge potential; but are pubiishers preventing it?
XML for science; its huge potential; but are pubiishers preventing it?XML for science; its huge potential; but are pubiishers preventing it?
XML for science; its huge potential; but are pubiishers preventing it?
 
Early Career Reseachers in Science. Start Early, Be Open , Be Brave
Early Career Reseachers in Science. Start Early, Be Open , Be BraveEarly Career Reseachers in Science. Start Early, Be Open , Be Brave
Early Career Reseachers in Science. Start Early, Be Open , Be Brave
 
Early Career Reseachers and Open Healthcare
Early Career Reseachers and Open HealthcareEarly Career Reseachers and Open Healthcare
Early Career Reseachers and Open Healthcare
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search
 
Scientific search for everyone
Scientific search for everyoneScientific search for everyone
Scientific search for everyone
 
Openplant2018 Poster; Semantic searching
Openplant2018 Poster; Semantic searchingOpenplant2018 Poster; Semantic searching
Openplant2018 Poster; Semantic searching
 
Extracting science from the archive
Extracting science from the archiveExtracting science from the archive
Extracting science from the archive
 
WikiFactMine: Ontology for Everybody and Everything
WikiFactMine: Ontology for Everybody and EverythingWikiFactMine: Ontology for Everybody and Everything
WikiFactMine: Ontology for Everybody and Everything
 
Disrupting the Publisher-Academic Complex
Disrupting the Publisher-Academic ComplexDisrupting the Publisher-Academic Complex
Disrupting the Publisher-Academic Complex
 
Paradise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to MineParadise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to Mine
 
Young people in an Age of Knowledge Neocolonialism
Young people in an Age of Knowledge NeocolonialismYoung people in an Age of Knowledge Neocolonialism
Young people in an Age of Knowledge Neocolonialism
 

Kürzlich hochgeladen

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 

Kürzlich hochgeladen (20)

Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 

Ontologies in Physical Science

  • 1. Ontologies in Physical Science Peter Murray-Rust, University of Cambridge & Open Knowledge Foundation Onto Workshop, ed.ac.uk 2013-04-11 An #animalgarden production
  • 2. PMR and friends want us to help build Is it an a computational important chemistry ontology problem? $1,000,000,000/yr for compchem They need OWL Problem: How to build ontologies when people are uninterested or antagonistic even though we have the technology
  • 3. Perhaps the chemists could use OWL-DL Chemists don’t use ontologies Top-down schemas like AniML haven’t (yet) taken off
  • 4. Are there any ontologies in physical science that work? Crystallo- graphers build CIF dictionaries The IUCr, right? Tell us about CIF IUCr: International Union of Crystallography
  • 5. CIF Core defines 500 common concepts Like the wavelength of the radiation used Or the volume of the crystal cell CIF: http://www.iucr.org/cif
  • 6. An Core dictionary (coreCIF) version 2.4.3 example _diffrn_ambient_temperature ? Definition: The mean temperature in kelvins at which the intensities were measured. Range: 0.0 -> infinity Type: numb ID For machines: Constraint + type For humans http://www.iucr.org/__data/iucr/cifdic_html/1/ cif_core.dic/Idiffrn_ambient_temperature.html
  • 7. Definition: The mean temperature in kelvins at which the intensities were measured. So everyone converts temperatures to use K? Yes! today I swam at 273K But chemists We MUST want to use all have a units sorts of ontology different units
  • 8. OWL? Is CIF a proper ontology? It’s not RDF… …but we’ve global URIs, like cif:_diffrn_ambient_temperature Because IUCr controls the namespace prefix: cif= http://www.iucr.org/cif
  • 9. CIF had 20 years of community involvement through IUCr But most top- down chemistry projects don’t work So we’ll do this bottom-up.
  • 10. Every compchem program uses basically the same scientific concepts We think each should build its own dictionary so we understand the output Won’t that just be a mess? No. It’s the first step to interoperability.
  • 11. The programs will use CML* for chemical output Hyperchem builds ITS dictionary NWChem Each annotates builds ITS their own dictionary program output Chemical Markup Language PMR/Rzepa http://www.xml-cml.org
  • 12. Alpha-electrons: Hyperchem uses hchem:e_alpha NWChem has nwchem:_alpha_elec We agree they are the same so create compchem:alphae in a communal cml:compchem dictionary that everyone uses
  • 13. What if the data structure or concepts CML provides don’t map conventions so each group can define their data structure Data can then be machine validated against each convention!
  • 14. But there are We’ve over 20 prototyped with program many before. codes. They’ll be encouraged GULP, DPOL Y, CASTEP, S IESTA, MOPA C… I think it’s going to work. BUT TTT* TTT: Things Take Time (Piet Hein)
  • 15. Will it work? It National labs depends on CSIRO/AU people and PNNL/US are committed And we have companies like I wish we had Hyperchem some and Kitware publishers
  • 16. We’ll need tools We’ve got FoX* for FORTRAN output JUMBOTemplates to parse logfiles RDF for navigating dictionaries FoX*: XML/FORTRAN Toby White, Andrew Walker
  • 17. Benefits of semantic dictionaries: • FORTRAN logfile can be made semantic • High degree of interoperability in chemistry • Semantic publication (HTML5, CML, MathML) • Interoperates with mainstream Web • Easily scalable to other phys sci. Problems: • Closed code/minds is short-term market advantage • Non-trivial commitment (updates, code revision) • Getting top-down approval (e.g. IUPAC)
  • 18. Benefits of semantic dictionaries: • FORTRAN logfile can be made semantic • High degree of interoperability in chemistry • Semantic publication (HTML5, CML, MathML) • Interoperates with mainstream Web • Easily scalable to other phys sci. Problems: • Closed code/minds is short-term market advantage • Non-trivial commitment (updates, code revision) • Getting top-down approval (e.g. IUPAC)
  • 19. Benefits of semantic dictionaries: • FORTRAN logfile can be made semantic • High degree of interoperability in chemistry • Semantic publication (HTML5, CML, MathML) • Interoperates with mainstream Web • Easily scalable to other phys sci. Problems: • Closed code/minds is short-term market advantage • Non-trivial commitment (updates, code revision) • Getting top-down approval (e.g. IUPAC)
  • 20. Benefits of semantic dictionaries: • FORTRAN logfile can be made semantic • High degree of interoperability in chemistry • Semantic publication (HTML5, CML, MathML) • Interoperates with mainstream Web • Easily scalable to other phys sci. Problems: • Closed code/minds is short-term market advantage • Non-trivial commitment (updates, code revision) • Getting top-down approval (e.g. IUPAC)
  • 21. Benefits of semantic dictionaries: • FORTRAN logfile can be made semantic • High degree of interoperability in chemistry • Semantic publication (HTML5, CML, MathML) • Interoperates with mainstream Web • Easily scalable to other phys sci. Problems: • Closed code/minds is short-term market advantage • Non-trivial commitment (updates, code revision) • Getting top-down approval (e.g. IUPAC)
  • 22. Perhaps the chemists could use OWL-DL Chemists don’t use ANY ontologies Top-down schemas like AniML haven’t (yet) taken off