SlideShare ist ein Scribd-Unternehmen logo
1 von 25
AnIML:
A New Analytical Data Standard
Stuart J. Chalk, Department of Chemistry, University of North Florida
schalk@unf.edu
ACS Meeting Boston 2015
 Data Formats
 Goals for Data Handling
 Introduction to AnIML
 Sections of an AnIML file
 AnIML Schemas and Files
 AnIML Technique Definitions
 Publishing Instrument Data
 Referencing Data Elements
 Calculations on Data
 Future Developments
 Conclusion
Overview
 Native Data Formats
 Proprietary formats
 "Metadata" separated from result data
 Metadata and data in multiple files
 Metadata not available electronically
 No way to link metadata with result data
 Interchange Data Formats
 Available for only a few techniques
 ANDI — GC, LC, MS
 JCAMP-DX — UV-Vis, IR, NMR, UV/Vis, IMS
 Fixed order, fixed syntax, immutable formats
 Content limitations
 Inconsistent implementations
Current Data Formats
 Extensible
 Easy to add new elements without breaking existing
applications
 Flexible
 Useful for diverse needs: Interchange, Interconversion,
Archiving...
 Useable & Maintainable
 Easy to create, use, adapt, maintain...
 Readily available tools
 Acceptable
 Use standard mechanisms accepted by mainstream
computing
 Human readable
 eXtensible Markup Language
Goals for Data Handling
 Extensible Markup Language (XML) specification
 Development under ASTM E13.15 ‘AnIML Task Group’
 Data standard to:
“Develop an analytical data standard that can
be used to store data from any analytical instrument”
Introduction to AnIML
http://animl.sourceforge.net
 JCAMP-DX
 http://www.jcamp-dx.org/
 ANDI (netCDF)
 ThermoML (NIST)
 SpectroML
 Nguyen, A. D. T., Arslan, A., Travis, J., Smith, M., Schafer, R., &
Kramer, G. W. (2004) ‘Molecular Spectrometry Data
Interchange Applications for NIST's SpectroML’, JALA 9 (6),
346-354. doi:10.1016/j.jala.2004.09.001
 Generalized Analytical Markup Language (GAML)
 http://www.gaml.org/
 First official meeting March 23, 2003 @ ASTM
Brief History of Time AnIML
 Broad scope
 Different types of data
 Size of data sets
 Everyone calls ‘widgit’ something different
 Need for metadata dictionaries
 One size does not fit all
 Getting broad community involvement
 Domain experts
 User communities
 What format?
Challenges for AnIML
 AnIML XML elements are ‘pigeon holes’ for metadata
 Minimal ‘required’ information
 If it’s not required you don’t have to include the element
 Extensible
 Store raw data not processed data
(except for FT techniques)
 Support for legacy data
 Record of changes
 Validatable
 Signable (digital sense)
AnIML Design Philosophy
AnIML Schemas and Files
Sections of an AnIML File
AnIML Technique Definitions
AnIML - Sample
AnIML - Sample
AnIML
-
Experiment
AnIML - Result
 Data storage
format
 Not just for
spectral data
 Access
 Data
 Metadata
 Manipulate
using XSLT
 Validate
 Signable
AnIML in an ELN
 AnIML Viewer -> Jmol/JSpecView (http://jmol.sourceforge.net)
Publish Supplementary Data
 Conversion of AnIML data to SVG using XSLT
Convert to Image File for Publication
 Expose an AnIML file at a URL
 Optional: Define a DOI for that URL
 Use XPath to reference a specific data point in an AnIML file
 //ExperimentStepSet[1]/ExperimentStep[1]/Method[1]/Auth
or[1]/Name[1]
 Encode the XPath expression so it can be part of the URL
Open Instrument Data
Part of a Data Management Plan
 Federal agencies are mandating data be made available
 Long term archive format for research data
 Referenceable if available online
 Searchable with Xquery
 Publish data processing algorithms (XSLT)
 Future proof data -> conversion to future data formats
 The Healthcare and Life Science (HCLS) Community Profile
is a Note from the Semantic Web HCLS Interest Group
 Access to consistent, high-quality metadata is critical to finding,
understanding, and reusing scientific data. This document
describes a consensus among participating stakeholders in the
Health Care and the Life Sciences domain on the description of
datasets using the Resource Description Framework (RDF). This
specification meets key functional requirements, reuses existing
vocabularies to the extent that it is possible, and addresses
elements of data description, versioning, provenance,
discovery, exchange, query, and retrieval.
Data Descriptions:
HCLS Community Profile
http://www.w3.org/TR/hcls-dataset/
 AnIML 1.0 Deliverables
 Core Schema - Fundamental framework for AnIML documents
 Technique Schema - Fundamental framework for technique definition and
extension documents
 AnIML Technique Definition Documents (ATDD) - Rules for content of
specific technique file
 AnIML Naming and Design Rules - Specifies rules about data element
structure for interoperability
 Standard Practice for AnIML Files - Describes how the specification is
supposed to work
 How to Create a Technique Definition Document - Guidelines for creating
new technique definition documents
 Other documents
 Draft Requirements Specification for AnIML Version 1.0
 Requirements and Goals of the Analytical Information Markup Language
AnIML Specification
http://animl.sourceforge.net
 Documentation
 Core specification
 Technique and extension specification
 Naming and design rules
 Annotated technique definitions
(UV/Vis, IR, 1D NMR, MS, Chromatography)
 Balloting through ASTM (end of 2015)
 Vendor, User, Developer extensions
 Semantic extension of AnIML metadata items
Future Developments
Conclusion
 AnIML is a great solution
for storing instrument data
 Human readable (UTF-8)
 Platform neutral
 Archivable
 Validatable
 AnIML leverages the extensive
XML ecosystem of tools
 Software engineers know XML
 schalk@unf.edu
 Phone: 904-620-1938
 Skype: stuartchalk
 LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk
 ORCID: http://orcid.org/0000-0002-0703-7776
 ResearcherID: http://www.researcherid.com/rid/D-8577-2013
Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Semantic data integration proof of concept
Semantic data integration proof of conceptSemantic data integration proof of concept
Semantic data integration proof of concept
Nicolas Bertrand
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP Project
Stuart Chalk
 
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
European School of Oncology
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ES
Ed Simons
 
Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...
Valery Tkachenko
 
Introduction to Metadata Standards
Introduction to Metadata StandardsIntroduction to Metadata Standards
Introduction to Metadata Standards
David Massart
 

Was ist angesagt? (20)

ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
 
Clinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesClinical modelling with openEHR Archetypes
Clinical modelling with openEHR Archetypes
 
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v12016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
 
Schemas and Schema-driven Metadata Software
Schemas and Schema-driven Metadata SoftwareSchemas and Schema-driven Metadata Software
Schemas and Schema-driven Metadata Software
 
Why ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and MaintainabilityWhy ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and Maintainability
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
 
ELIXIR-UK and the ELIXIR Interoperability Platform
ELIXIR-UK and the ELIXIR Interoperability PlatformELIXIR-UK and the ELIXIR Interoperability Platform
ELIXIR-UK and the ELIXIR Interoperability Platform
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
Crosslinks
Crosslinks Crosslinks
Crosslinks
 
Semantic data integration proof of concept
Semantic data integration proof of conceptSemantic data integration proof of concept
Semantic data integration proof of concept
 
Knowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentKnowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents Environment
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP Project
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...
 
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ES
 
Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...
 
Bh14 ogo
Bh14 ogoBh14 ogo
Bh14 ogo
 
Introduction to Metadata Standards
Introduction to Metadata StandardsIntroduction to Metadata Standards
Introduction to Metadata Standards
 
Data mining
Data miningData mining
Data mining
 

Ähnlich wie AnIML: A New Analytical Data Standard

AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
Timothy Cook
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
OSTHUS
 

Ähnlich wie AnIML: A New Analytical Data Standard (20)

Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)
 
From allotrope to reference master data management
From allotrope to reference master data management From allotrope to reference master data management
From allotrope to reference master data management
 
Designing and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryDesigning and launching the Clinical Reference Library
Designing and launching the Clinical Reference Library
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
Rescuing Data from Decaying and Moribund Clinical Information Systems
Rescuing Data from Decaying and Moribund Clinical Information SystemsRescuing Data from Decaying and Moribund Clinical Information Systems
Rescuing Data from Decaying and Moribund Clinical Information Systems
 
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
2009 PLANETS Vienna - MIXED migration to XML
2009 PLANETS Vienna - MIXED migration to XML2009 PLANETS Vienna - MIXED migration to XML
2009 PLANETS Vienna - MIXED migration to XML
 
Connectivity challenges APC Europe by Alan Weber
Connectivity challenges APC Europe by Alan WeberConnectivity challenges APC Europe by Alan Weber
Connectivity challenges APC Europe by Alan Weber
 
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
 
Addressing Connectivity Challenges of Disparate Data Sources in Smart Manufac...
Addressing Connectivity Challengesof Disparate Data Sourcesin Smart Manufac...Addressing Connectivity Challengesof Disparate Data Sourcesin Smart Manufac...
Addressing Connectivity Challenges of Disparate Data Sources in Smart Manufac...
 
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
 
MLHIM FHIES 2013
MLHIM FHIES 2013 MLHIM FHIES 2013
MLHIM FHIES 2013
 
Azure Databricks for Data Scientists
Azure Databricks for Data ScientistsAzure Databricks for Data Scientists
Azure Databricks for Data Scientists
 
International Journal of Database Management Systems (IJDMS)
International Journal of Database Management Systems (IJDMS) International Journal of Database Management Systems (IJDMS)
International Journal of Database Management Systems (IJDMS)
 
Database Systems Concepts, 5th Ed
Database Systems Concepts, 5th EdDatabase Systems Concepts, 5th Ed
Database Systems Concepts, 5th Ed
 
Text Analytics in the EU Fusepool project (at II-SDV 2013 conference)
Text Analytics in the EU Fusepool project (at II-SDV 2013 conference)Text Analytics in the EU Fusepool project (at II-SDV 2013 conference)
Text Analytics in the EU Fusepool project (at II-SDV 2013 conference)
 

Mehr von Stuart Chalk

Mehr von Stuart Chalk (17)

Semantic properties and units
Semantic properties and unitsSemantic properties and units
Semantic properties and units
 
Open semantic chemical structures
Open semantic chemical structuresOpen semantic chemical structures
Open semantic chemical structures
 
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
 
Bringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebBringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic Web
 
Reactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseReactions to the Open Spectral Database
Reactions to the Open Spectral Database
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSX
 
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
ACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData Project
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP Project
 
ACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka CollaborationACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka Collaboration
 
247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench
 
247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)
 
Liberating Laboratory Data - Eureka
Liberating Laboratory Data - EurekaLiberating Laboratory Data - Eureka
Liberating Laboratory Data - Eureka
 
Liberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIMLLiberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIML
 

Kürzlich hochgeladen

Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 

Kürzlich hochgeladen (20)

Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 

AnIML: A New Analytical Data Standard

  • 1. AnIML: A New Analytical Data Standard Stuart J. Chalk, Department of Chemistry, University of North Florida schalk@unf.edu ACS Meeting Boston 2015
  • 2.  Data Formats  Goals for Data Handling  Introduction to AnIML  Sections of an AnIML file  AnIML Schemas and Files  AnIML Technique Definitions  Publishing Instrument Data  Referencing Data Elements  Calculations on Data  Future Developments  Conclusion Overview
  • 3.  Native Data Formats  Proprietary formats  "Metadata" separated from result data  Metadata and data in multiple files  Metadata not available electronically  No way to link metadata with result data  Interchange Data Formats  Available for only a few techniques  ANDI — GC, LC, MS  JCAMP-DX — UV-Vis, IR, NMR, UV/Vis, IMS  Fixed order, fixed syntax, immutable formats  Content limitations  Inconsistent implementations Current Data Formats
  • 4.  Extensible  Easy to add new elements without breaking existing applications  Flexible  Useful for diverse needs: Interchange, Interconversion, Archiving...  Useable & Maintainable  Easy to create, use, adapt, maintain...  Readily available tools  Acceptable  Use standard mechanisms accepted by mainstream computing  Human readable  eXtensible Markup Language Goals for Data Handling
  • 5.  Extensible Markup Language (XML) specification  Development under ASTM E13.15 ‘AnIML Task Group’  Data standard to: “Develop an analytical data standard that can be used to store data from any analytical instrument” Introduction to AnIML http://animl.sourceforge.net
  • 6.  JCAMP-DX  http://www.jcamp-dx.org/  ANDI (netCDF)  ThermoML (NIST)  SpectroML  Nguyen, A. D. T., Arslan, A., Travis, J., Smith, M., Schafer, R., & Kramer, G. W. (2004) ‘Molecular Spectrometry Data Interchange Applications for NIST's SpectroML’, JALA 9 (6), 346-354. doi:10.1016/j.jala.2004.09.001  Generalized Analytical Markup Language (GAML)  http://www.gaml.org/  First official meeting March 23, 2003 @ ASTM Brief History of Time AnIML
  • 7.  Broad scope  Different types of data  Size of data sets  Everyone calls ‘widgit’ something different  Need for metadata dictionaries  One size does not fit all  Getting broad community involvement  Domain experts  User communities  What format? Challenges for AnIML
  • 8.  AnIML XML elements are ‘pigeon holes’ for metadata  Minimal ‘required’ information  If it’s not required you don’t have to include the element  Extensible  Store raw data not processed data (except for FT techniques)  Support for legacy data  Record of changes  Validatable  Signable (digital sense) AnIML Design Philosophy
  • 10. Sections of an AnIML File
  • 16.  Data storage format  Not just for spectral data  Access  Data  Metadata  Manipulate using XSLT  Validate  Signable AnIML in an ELN
  • 17.  AnIML Viewer -> Jmol/JSpecView (http://jmol.sourceforge.net) Publish Supplementary Data
  • 18.  Conversion of AnIML data to SVG using XSLT Convert to Image File for Publication
  • 19.  Expose an AnIML file at a URL  Optional: Define a DOI for that URL  Use XPath to reference a specific data point in an AnIML file  //ExperimentStepSet[1]/ExperimentStep[1]/Method[1]/Auth or[1]/Name[1]  Encode the XPath expression so it can be part of the URL Open Instrument Data
  • 20. Part of a Data Management Plan  Federal agencies are mandating data be made available  Long term archive format for research data  Referenceable if available online  Searchable with Xquery  Publish data processing algorithms (XSLT)  Future proof data -> conversion to future data formats
  • 21.  The Healthcare and Life Science (HCLS) Community Profile is a Note from the Semantic Web HCLS Interest Group  Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. This document describes a consensus among participating stakeholders in the Health Care and the Life Sciences domain on the description of datasets using the Resource Description Framework (RDF). This specification meets key functional requirements, reuses existing vocabularies to the extent that it is possible, and addresses elements of data description, versioning, provenance, discovery, exchange, query, and retrieval. Data Descriptions: HCLS Community Profile http://www.w3.org/TR/hcls-dataset/
  • 22.  AnIML 1.0 Deliverables  Core Schema - Fundamental framework for AnIML documents  Technique Schema - Fundamental framework for technique definition and extension documents  AnIML Technique Definition Documents (ATDD) - Rules for content of specific technique file  AnIML Naming and Design Rules - Specifies rules about data element structure for interoperability  Standard Practice for AnIML Files - Describes how the specification is supposed to work  How to Create a Technique Definition Document - Guidelines for creating new technique definition documents  Other documents  Draft Requirements Specification for AnIML Version 1.0  Requirements and Goals of the Analytical Information Markup Language AnIML Specification http://animl.sourceforge.net
  • 23.  Documentation  Core specification  Technique and extension specification  Naming and design rules  Annotated technique definitions (UV/Vis, IR, 1D NMR, MS, Chromatography)  Balloting through ASTM (end of 2015)  Vendor, User, Developer extensions  Semantic extension of AnIML metadata items Future Developments
  • 24. Conclusion  AnIML is a great solution for storing instrument data  Human readable (UTF-8)  Platform neutral  Archivable  Validatable  AnIML leverages the extensive XML ecosystem of tools  Software engineers know XML
  • 25.  schalk@unf.edu  Phone: 904-620-1938  Skype: stuartchalk  LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk  ORCID: http://orcid.org/0000-0002-0703-7776  ResearcherID: http://www.researcherid.com/rid/D-8577-2013 Questions?