SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Overview of the ISA format and software suite
Help researchers to
curate, store, analyse, share and publish their experiments
Susanna-Assunta Sansone, PhD (associate director, PI)

Philippe Rocca-Serra, PhD (technical coordinator)
Alejandra Gonzalez-Beltran, PhD (senior developer)
Eamonn Maguire, DPhil candidate (senior developer)
Pavlos Georgiou, MSc candidate (developer)
and new team member to be recruited
Focus on the experimental context and compliance to standards

user community

The International Conference on Systems Biology (ICSB), 22-28 August, 2008

Susanna-Assunta Sansone
www.ebi.ac.uk/net-project
Rationale for developing ISA

 Researchers and bioinformaticians in both
academic and commercial arenas, along with
funding agencies and publishers, embrace
the concept that community-developed
standards are pivotal to structure and enrich
the annotation of
• entities of interest (e.g., genes,
metabolites, phenotypes) and
• experimental steps (e.g., provenance of
study materials, technology and
measurement types)
Rationale for developing ISA







Capture all salient features of
the experimental workflow
Make annotation explicit and
discoverable
Support data provenance
tracking
Use community standards
Rationale for developing ISA

transcriptomics

proteomics

genomics
A wealth of community, different norms and standards, e.g.:

allow data to flow from
one system to another

use the same word and
refer to the same ‘thing’

report the same core,
essential information

To track provenance of the information and ensure richness of data and experimental
metadata descriptions, to maximize sharing and reusability

Key challenges:
lack of coordination, fragmentation and uneven coverage
To compare and integrate data we need interoperable standards
epidemiology

plant biology

microbiology

Biologically-delineated
views of the world

Generic features (‘common core’)
- description of source biomaterial
- experimental design components
MS
Arrays

Gels
Columns

Scanning

transcriptomics

Arrays &
Scanning

proteomics

MS

Technologically-delineated
views of the world

NMR
FTIR
Columns

metabolomics
Mapping the landscape of standards, work in progress
See more at:

+ 303

Estimated

Source: MIBBI,
EQUATOR

Source: BioPortal

+ 130

+ 150

Databases,
annotation,
curation
tools

MAGE-Tab
GCDML

AAO
CHEBI

SRAxml

CML

SOFT
DICOM FASTA

GELML

MITAB

….

SEDML…

OBI

VO

PATO

ENVO

XAO

MzML

DO

….

MIAPA
MIRIAM

MIQAS

MIX

REMARK

MIGEN

MOD

SBRML

….

MIAME

TEDDY

PRO

BTO
IDO…

….

MIQE

MIAPE

CIMR
MIASE

….

CONSORT
MISFISHIE….

….
Dealing with fragmented standards for the experimental context

user community

The International Conference on Systems Biology (ICSB), 22-28 August, 2008

Susanna-Assunta Sansone
www.ebi.ac.uk/net-project
General-purpose, configurable format, designed to
support:
- several omics standards checklists, terminologies
- reference to CDISC SDTM file(s), and
- conversions to (a growing number of) other metadata
formats, used by public repositories
1
1

Create template(s) to fit the type of
experiments to be described
Create templates detailing the steps to be
reported for different investigations, complying
to community standards, e.g. configuring the
value(s) allowed for each field to be
• text (with/without regular expression testing),
• ontology terms,
• numbers etc.
We now have configurations for submission
to EBI repositories, complying to several
community standards.
1

Or describe, curate your experiment using a
desktop-based tool
Report and edit the description using this tool,
(also customized using the templates) with a
spreadsheet like look and feel, packed with
functionalities such as
• ontology search (access via
)
• term-tagging features
• import from spreadsheets etc…
1

Describe, curate your experiment with
geographically- distributed collaborators
Report and edit the description of the
investigation using customized Google
Spreadsheets (importing the ‘template’ created
by the ISA configurator) enabled with ontology
search and term-tagging features.
2
3
4
5
6
Data papers
•
•
•
•
•

New open-access, online-only publication for descriptions of scientifically valuable datasets
Only content type: Data Descriptor, narrative + structured parts
Initially focused on the life, environmental and biomedical sciences
Data Descriptor will be complementary to traditional research journals and data repositories
Designed to foster data sharing and reuse, and ultimately to accelerate scientific discovery
www.nature.com/scientificdata


A grass-root collaborative that works to facilitate collection, curation and
sharing of experiments using a common, structured representation of the
experiments that
•

transcends individual biological and technological domains and

•

can be ‘configured’ to implement (several of) the community standards


A grass-root collaborative that works to facilitate collection, curation and
sharing of experiments using a common, structured representation of the
experiments that
•

transcends individual biological and technological domains and

•

can be ‘configured’ to implement (several of) the community standards







environmental health
genomics
metabolomics
metagenomics
nanotechnology







proteomics
stem cell discovery
system biology
transcriptomics
toxicogenomics
Community involvement and uptake
1st ISA-Tab
workshop
2nd ISA-Tab
workshop

3rd ISA-Tab
workshop

User workshops/visits - start
Other tools
implement ISA-Tab

Core developments
Straw man
ISA-Tab spec

ISA software v1
Final ISA-Tab spec

2007

2008

1st public instance:
Harvard Stem Cell Growing number of
Discovery Engine systems starts to adopt
ISA framework
Conversions to
Pride-XML/SRA-XML/
MAGE-Tab

Database instance
at EBI

2009

2010

Links to
analysis tools
starts
RDF/OWL format starts

2011

2012

Publications

The ISA software suite:
supporting standards-

ISA chapter in : Open Source

compliant curation at the

Software in Life Science

community level

Research

OntoMaton: a Bioportal
powered ontology widget for
Google Spreadsheets.

Bioinformatics

Woodhead Publishing

Bioinformatics

2013
2012

2013

2013

2012

2013

Nanotechnology
Informatics Working Group

Weitere ähnliche Inhalte

Was ist angesagt?

NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
Susanna-Assunta Sansone
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
Susanna-Assunta Sansone
 

Was ist angesagt? (20)

NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
 
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdfEOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRness
 
FAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databasesFAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databases
 
FAIR, FAIRplus and the FAIR Cookbook
FAIR, FAIRplus and the FAIR Cookbook FAIR, FAIRplus and the FAIR Cookbook
FAIR, FAIRplus and the FAIR Cookbook
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 response
 
FAIR: standards and services
FAIR: standards and servicesFAIR: standards and services
FAIR: standards and services
 
FAIRsharing poster
FAIRsharing posterFAIRsharing poster
FAIRsharing poster
 
The FAIR Cookbook poster
The FAIR Cookbook posterThe FAIR Cookbook poster
The FAIR Cookbook poster
 
Metadata for Interoperable Bioscience
Metadata for Interoperable BioscienceMetadata for Interoperable Bioscience
Metadata for Interoperable Bioscience
 
RDA17 FAIRsharing WG sessions: on repositories and policies
RDA17 FAIRsharing WG sessions: on repositories and policiesRDA17 FAIRsharing WG sessions: on repositories and policies
RDA17 FAIRsharing WG sessions: on repositories and policies
 
FAIR resources, selected examples from ELIXIR-related projects
FAIR resources, selected examples from ELIXIR-related projectsFAIR resources, selected examples from ELIXIR-related projects
FAIR resources, selected examples from ELIXIR-related projects
 
The FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data WeekThe FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data Week
 
All Things Biocuration
All Things BiocurationAll Things Biocuration
All Things Biocuration
 
The FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharingThe FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharing
 
FAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders ForumFAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders Forum
 
The FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshellThe FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshell
 
EnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UKEnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UK
 

Andere mochten auch (8)

Computer conversions isa_specsheet
Computer conversions isa_specsheetComputer conversions isa_specsheet
Computer conversions isa_specsheet
 
Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0
 
Cache memory
Cache memoryCache memory
Cache memory
 
Pauci- Aleksa Pejić- Marja Cvetković
Pauci- Aleksa Pejić- Marja CvetkovićPauci- Aleksa Pejić- Marja Cvetković
Pauci- Aleksa Pejić- Marja Cvetković
 
Chap1 chipset
Chap1 chipsetChap1 chipset
Chap1 chipset
 
Motherboard components and their functions
Motherboard components and their functionsMotherboard components and their functions
Motherboard components and their functions
 
cache memory
cache memorycache memory
cache memory
 
Motherboard and its components
Motherboard and its componentsMotherboard and its components
Motherboard and its components
 

Ähnlich wie ISA - a short overview - Dec 2013

BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
Alejandra Gonzalez-Beltran
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 

Ähnlich wie ISA - a short overview - Dec 2013 (20)

Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introduction
 
Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
eScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-BrazileScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-Brazil
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
NETTAB 2012
NETTAB 2012NETTAB 2012
NETTAB 2012
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and Neuroscience
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
 
B4OS-2012
B4OS-2012B4OS-2012
B4OS-2012
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
 

Mehr von Susanna-Assunta Sansone

Mehr von Susanna-Assunta Sansone (17)

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
FAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdfFAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdf
 
FAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdfFAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdf
 
FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIR
 
Metadata Standards
Metadata StandardsMetadata Standards
Metadata Standards
 
FAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-SingaporeFAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-Singapore
 
FAIR Cookbook
FAIR Cookbook FAIR Cookbook
FAIR Cookbook
 
FAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipesFAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipes
 
FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook
 
FAIRsharing for EOSC
FAIRsharing for EOSC FAIRsharing for EOSC
FAIRsharing for EOSC
 
ELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - ExamplarsELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - Examplars
 
FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features
 
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
 
The FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectThe FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus project
 
FAIR and FAIRsharing - ESOF 2020
FAIR and FAIRsharing - ESOF 2020FAIR and FAIRsharing - ESOF 2020
FAIR and FAIRsharing - ESOF 2020
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

ISA - a short overview - Dec 2013

  • 1. Overview of the ISA format and software suite Help researchers to curate, store, analyse, share and publish their experiments Susanna-Assunta Sansone, PhD (associate director, PI) Philippe Rocca-Serra, PhD (technical coordinator) Alejandra Gonzalez-Beltran, PhD (senior developer) Eamonn Maguire, DPhil candidate (senior developer) Pavlos Georgiou, MSc candidate (developer) and new team member to be recruited
  • 2. Focus on the experimental context and compliance to standards user community The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project
  • 3. Rationale for developing ISA  Researchers and bioinformaticians in both academic and commercial arenas, along with funding agencies and publishers, embrace the concept that community-developed standards are pivotal to structure and enrich the annotation of • entities of interest (e.g., genes, metabolites, phenotypes) and • experimental steps (e.g., provenance of study materials, technology and measurement types)
  • 4. Rationale for developing ISA     Capture all salient features of the experimental workflow Make annotation explicit and discoverable Support data provenance tracking Use community standards
  • 5. Rationale for developing ISA transcriptomics proteomics genomics
  • 6. A wealth of community, different norms and standards, e.g.: allow data to flow from one system to another use the same word and refer to the same ‘thing’ report the same core, essential information To track provenance of the information and ensure richness of data and experimental metadata descriptions, to maximize sharing and reusability Key challenges: lack of coordination, fragmentation and uneven coverage
  • 7. To compare and integrate data we need interoperable standards epidemiology plant biology microbiology Biologically-delineated views of the world Generic features (‘common core’) - description of source biomaterial - experimental design components MS Arrays Gels Columns Scanning transcriptomics Arrays & Scanning proteomics MS Technologically-delineated views of the world NMR FTIR Columns metabolomics
  • 8. Mapping the landscape of standards, work in progress See more at: + 303 Estimated Source: MIBBI, EQUATOR Source: BioPortal + 130 + 150 Databases, annotation, curation tools MAGE-Tab GCDML AAO CHEBI SRAxml CML SOFT DICOM FASTA GELML MITAB …. SEDML… OBI VO PATO ENVO XAO MzML DO …. MIAPA MIRIAM MIQAS MIX REMARK MIGEN MOD SBRML …. MIAME TEDDY PRO BTO IDO… …. MIQE MIAPE CIMR MIASE …. CONSORT MISFISHIE…. ….
  • 9.
  • 10. Dealing with fragmented standards for the experimental context user community The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project
  • 11. General-purpose, configurable format, designed to support: - several omics standards checklists, terminologies - reference to CDISC SDTM file(s), and - conversions to (a growing number of) other metadata formats, used by public repositories
  • 12. 1
  • 13. 1 Create template(s) to fit the type of experiments to be described Create templates detailing the steps to be reported for different investigations, complying to community standards, e.g. configuring the value(s) allowed for each field to be • text (with/without regular expression testing), • ontology terms, • numbers etc. We now have configurations for submission to EBI repositories, complying to several community standards.
  • 14. 1 Or describe, curate your experiment using a desktop-based tool Report and edit the description using this tool, (also customized using the templates) with a spreadsheet like look and feel, packed with functionalities such as • ontology search (access via ) • term-tagging features • import from spreadsheets etc…
  • 15.
  • 16. 1 Describe, curate your experiment with geographically- distributed collaborators Report and edit the description of the investigation using customized Google Spreadsheets (importing the ‘template’ created by the ISA configurator) enabled with ontology search and term-tagging features.
  • 17. 2
  • 18. 3
  • 19. 4
  • 20. 5
  • 21. 6
  • 23. • • • • • New open-access, online-only publication for descriptions of scientifically valuable datasets Only content type: Data Descriptor, narrative + structured parts Initially focused on the life, environmental and biomedical sciences Data Descriptor will be complementary to traditional research journals and data repositories Designed to foster data sharing and reuse, and ultimately to accelerate scientific discovery www.nature.com/scientificdata
  • 24.  A grass-root collaborative that works to facilitate collection, curation and sharing of experiments using a common, structured representation of the experiments that • transcends individual biological and technological domains and • can be ‘configured’ to implement (several of) the community standards
  • 25.  A grass-root collaborative that works to facilitate collection, curation and sharing of experiments using a common, structured representation of the experiments that • transcends individual biological and technological domains and • can be ‘configured’ to implement (several of) the community standards      environmental health genomics metabolomics metagenomics nanotechnology      proteomics stem cell discovery system biology transcriptomics toxicogenomics
  • 26. Community involvement and uptake 1st ISA-Tab workshop 2nd ISA-Tab workshop 3rd ISA-Tab workshop User workshops/visits - start Other tools implement ISA-Tab Core developments Straw man ISA-Tab spec ISA software v1 Final ISA-Tab spec 2007 2008 1st public instance: Harvard Stem Cell Growing number of Discovery Engine systems starts to adopt ISA framework Conversions to Pride-XML/SRA-XML/ MAGE-Tab Database instance at EBI 2009 2010 Links to analysis tools starts RDF/OWL format starts 2011 2012 Publications The ISA software suite: supporting standards- ISA chapter in : Open Source compliant curation at the Software in Life Science community level Research OntoMaton: a Bioportal powered ontology widget for Google Spreadsheets. Bioinformatics Woodhead Publishing Bioinformatics 2013