Ontologies in Physical Science
Peter Murray-Rust, University of Cambridge & Open Knowledge Foundation
Onto Workshop, ed.ac.uk 2013-04-11
An #animalgarden production

PMR and friends want us to help build a computational chemistry ontology
Is it an important problem?
$1,000,000,000/yr for compchem
They need OWL
Problem: How to build ontologies when people are uninterested or antagonistic even though we have the technology

Perhaps the chemists could use OWL-DL
Chemists don’t use ontologies
Top-down schemas like AniML haven’t (yet) taken off

Are there any ontologies in physical science that work?
Crystallographers build CIF dictionaries
The IUCr, right? Tell us about CIF
IUCr: International Union of Crystallography

CIF Core defines 500 common concepts
Like the wavelength of the radiation used
Or the volume of the crystal cell
CIF: http://www.iucr.org/cif

An example?
Core dictionary (coreCIF) version 2.4.3
ID: _diffrn_ambient_temperature
For humans: Definition: The mean temperature in kelvins at which the intensities were measured.
For machines (constraint + type): Range: 0.0 -> infinity Type: numb
http://www.iucr.org/__data/iucr/cifdic_html/1/cif_core.dic/Idiffrn_ambient_temperature.html

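The entry above pairs a definition for humans with a constraint and type for machines. A minimal Python sketch of how a program might use the machine half, assuming the entry is held in a small record. The `DictEntry` class and `check` function are illustrative names, not part of CIF or any IUCr software; only the field values come from the slide.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class DictEntry:
    """One dictionary definition: an ID, text for humans, constraints for machines."""
    name: str
    definition: str
    type_: str
    minimum: float
    maximum: float


# Values copied from the coreCIF entry quoted on the slide.
AMBIENT_TEMPERATURE = DictEntry(
    name="_diffrn_ambient_temperature",
    definition="The mean temperature in kelvins at which the intensities were measured.",
    type_="numb",
    minimum=0.0,
    maximum=math.inf,
)


def check(entry, raw):
    """Return raw as a float if it satisfies the entry's type and range, else raise ValueError."""
    if entry.type_ != "numb":
        raise ValueError(f"unsupported type {entry.type_!r}")
    value = float(raw)  # raises ValueError for non-numeric text
    if math.isnan(value) or not (entry.minimum <= value <= entry.maximum):
        raise ValueError(f"{entry.name}: {raw!r} is outside {entry.minimum} -> {entry.maximum}")
    return value


print(check(AMBIENT_TEMPERATURE, "293"))
```

Real CIF numbers can carry a standard uncertainty in parentheses, such as 293(2); this sketch does not handle that form.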
Definition: The mean temperature in kelvins at which the intensities were measured.
So everyone converts temperatures to use K?
Yes! Today I swam at 273 K
But chemists want to use all sorts of different units
We MUST have a units ontology

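One way to read the units point: if every value carries a unit identifier, a program can convert it to the kelvins that the CIF definition requires. A minimal sketch follows; the `units:` identifiers and the `to_kelvin` function are hypothetical placeholders, not terms from an existing units dictionary.

```python
# Conversion functions into kelvin, keyed by unit identifier.
# The "units:..." identifiers are invented for this sketch.
TO_KELVIN = {
    "units:kelvin": lambda v: v,
    "units:celsius": lambda v: v + 273.15,
    "units:fahrenheit": lambda v: (v - 32.0) * 5.0 / 9.0 + 273.15,
}


def to_kelvin(value, unit):
    """Convert a temperature in the named unit to kelvin, rejecting unknown units."""
    try:
        convert = TO_KELVIN[unit]
    except KeyError:
        raise ValueError(f"unknown unit {unit!r}") from None
    kelvin = convert(value)
    if kelvin < 0.0:
        raise ValueError("temperature below absolute zero")
    return kelvin


print(to_kelvin(273.0, "units:kelvin"))
```

Keeping the unit as an identifier next to the value, rather than as free text, is what lets the conversion be looked up instead of guessed.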
OWL? Is CIF a proper ontology? It’s not RDF…
…but we’ve global URIs, like cif:_diffrn_ambient_temperature
Because IUCr controls the namespace prefix: cif= http://www.iucr.org/cif

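The prefixed name works as a global identifier because the prefix stands for a namespace that one body controls. A minimal sketch of the expansion, assuming the local name is joined to the namespace with a `/`; the slide does not say how the two parts are joined, so the separator and the `expand` function are assumptions, and the resulting URI is not claimed to resolve.

```python
# Prefix table; the "cif" namespace is the one given on the slide.
PREFIXES = {"cif": "http://www.iucr.org/cif"}


def expand(qname, prefixes=PREFIXES, separator="/"):
    """Expand 'prefix:local' into a full URI using the prefix table.

    The separator between namespace and local name is an assumption.
    """
    prefix, colon, local = qname.partition(":")
    if not colon or prefix not in prefixes:
        raise KeyError(f"no known namespace prefix in {qname!r}")
    return prefixes[prefix] + separator + local


print(expand("cif:_diffrn_ambient_temperature"))
```

Two groups that never talk to each other can still mean the same thing by `cif:_diffrn_ambient_temperature`, because neither of them owns the `cif` namespace.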
CIF had 20 years of community involvement through IUCr
But most top-down chemistry projects don’t work
So we’ll do this bottom-up.

Every compchem program uses basically the same scientific concepts
We think each should build its own dictionary so we understand the output
Won’t that just be a mess?
No. It’s the first step to interoperability.

The programs will use CML* for chemical output
Hyperchem builds ITS dictionary
NWChem builds ITS dictionary
Each annotates their own program output
*CML: Chemical Markup Language PMR/Rzepa http://www.xml-cml.org

Alpha-electrons: Hyperchem uses hchem:e_alpha
NWChem has nwchem:_alpha_elec
We agree they are the same so create compchem:alphae in a communal cml:compchem dictionary that everyone uses

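The agreement step can be sketched as a lookup from each program's own term to the communal one. The three term names are those on the slide; the `COMMUNAL` table, the two functions, and the sample value of 5 are made up for illustration.

```python
# Program-specific terms mapped to the agreed communal term.
COMMUNAL = {
    "hchem:e_alpha": "compchem:alphae",
    "nwchem:_alpha_elec": "compchem:alphae",
}


def to_communal(term):
    """Return the communal term if one has been agreed, else the term unchanged."""
    return COMMUNAL.get(term, term)


def translate(record):
    """Rewrite the keys of one program's annotated output into communal terms."""
    return {to_communal(term): value for term, value in record.items()}


# Hypothetical annotated output from two programs (the value 5 is invented).
hyperchem_output = {"hchem:e_alpha": 5}
nwchem_output = {"nwchem:_alpha_elec": 5}

print(translate(hyperchem_output) == translate(nwchem_output))
```

Terms with no agreed mapping pass through untouched, so each program's dictionary stays usable on its own while the communal one grows.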
What if the data structure or concepts don’t map?
CML provides conventions so each group can define their data structure
Data can then be machine validated against each convention!

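A rough sketch of what machine validation against a convention could look like, using Python's standard `xml.etree.ElementTree`. It is a simplified stand-in and not the actual CML convention machinery: the `CONVENTION` rules and the sample document are hypothetical, and the element and attribute names (`module`, `property`, `scalar`, `dictRef`) follow my reading of CML rather than a checked schema.

```python
import xml.etree.ElementTree as ET

CML_NS = "http://www.xml-cml.org/schema"

# Hypothetical convention: allowed dictionary prefixes and required terms.
CONVENTION = {
    "allowed_prefixes": {"compchem"},
    "required_terms": {"compchem:alphae"},
}

# Illustrative fragment; the second property uses a program-specific term.
DOC = """
<module xmlns="http://www.xml-cml.org/schema">
  <property dictRef="compchem:alphae"><scalar>5</scalar></property>
  <property dictRef="hchem:e_alpha"><scalar>5</scalar></property>
</module>
"""


def validate(xml_text, convention):
    """Return a list of problems found when checking the document against the convention."""
    root = ET.fromstring(xml_text.strip())
    errors = []
    seen = set()
    for prop in root.iter(f"{{{CML_NS}}}property"):
        ref = prop.get("dictRef")
        if ref is None:
            errors.append("property without dictRef")
            continue
        seen.add(ref)
        prefix = ref.split(":", 1)[0]
        if prefix not in convention["allowed_prefixes"]:
            errors.append(f"{ref}: prefix not allowed by convention")
    for term in sorted(convention["required_terms"] - seen):
        errors.append(f"{term}: required term missing")
    return errors


for problem in validate(DOC, CONVENTION):
    print(problem)
```

The point of the sketch is only that a convention can be written down as data and checked by a program, so a group's output is either conformant or comes back with a list of specific problems.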
But there are over 20 program codes.
GULP, DPOLY, CASTEP, SIESTA, MOPAC…
We’ve prototyped with many before. They’ll be encouraged.
I think it’s going to work. BUT TTT*
*TTT: Things Take Time (Piet Hein)

Will it work? It depends on people
National labs CSIRO/AU and PNNL/US are committed
And we have companies like Hyperchem and Kitware
I wish we had some publishers

We’ll need tools
We’ve got FoX* for FORTRAN output
JUMBOTemplates to parse logfiles
RDF for navigating dictionaries
*FoX: XML/FORTRAN Toby White, Andrew Walker

Benefits of semantic dictionaries:
• FORTRAN logfile can be made semantic
• High degree of interoperability in chemistry
• Semantic publication (HTML5, CML, MathML)
• Interoperates with mainstream Web
• Easily scalable to other phys sci.

Problems:
• Closed code/minds is short-term market advantage
• Non-trivial commitment (updates, code revision)
• Getting top-down approval (e.g. IUPAC)
Perhaps the chemists could use OWL-DL
Chemists don’t use ANY ontologies
Top-down schemas like AniML haven’t (yet) taken off
