MPDB Presentation

MPDB - Integrated system for storage
and analysis of metabolomic data
Design and implementation of the
data acquisition and analysis
pipeline
Alexander Raskind, SFRES MTU

Omics data availability
http://www.ncbi.nlm.nih.gov/Genbank/genbankstats.html
Transcriptomics data:
ArrayExpress – 3670 experiments,
109666 hybridizations
http://www.ebi.ac.uk/microarray-as/aer/
Proteomics data:
PRIDE – 3,537 Experiments
645,869 Identified Proteins
http://www.ebi.ac.uk/microarray-as/aer/
Metabolomics data:
MMCD – 20.306 compounds
http://mmcd.nmrfam.wisc.edu/
Human Metabolome Database –
2500 compounds
http://www.hmdb.ca/

Shifting research paradigm
genome.uiowa.edu
http://www.shimadzu.com
Targeted analysis High-throughput analysis

Populus as model system
• Wide ecological range
• Small genome relative to other trees
• Relatively easy transformation and cloning
• Belongs to Salicaceae – Willow family,
produces large amount of phenolic
compounds that may influence carbon
sequestration

Project rationale
• Affordable equipment generates limited
amount of metabolomic data with modest
quality
• Proper information storage and maximal
extraction of useful information are essential
• Free open source laboratory information
system tailored to metabolomics workflow
would benefit to a large scientific community

System requirements
• Easy access to large arrays of analytical
results and biological metadata
• Tools for data analysis
• Addition of analysis modules
• Accommodation of other types of analytical
data
• USER FRIENDLY

Major analytical problems
• Chemical complexity of the sample
o human metabolome - 2500 metabolites, plants – much more
• Wide dynamic range of response
o difference between most and least abundant components may be more
than 10,000
• Biological variation
• Matrix effects
o Interactions between sample componets leading to shifts in retention time
and sensitivity of detection comparative to pure compounds
• Instrument effects
o Shifting retention time (column wearing out and maintenance)
o Changes in sensitivity

Data analysis pipeline
• Raw data cleanup, peak detection,
deconvolution and quantification
• Compound identification (library search)
• Export of analysis results and biological
metadata to the database
• Peak alignment and normalization
• Final data analysis

System Outline
Analyzer-Pro
Result (XML format)
MP-align
GC/MS or LC/MS raw data
MPDB
Offline
Online
Data analysis
Biological
information

Compound identification
• NIST 2002 database for GCMS (MS only,
~140,000 entries)
• In-house database of essential metabolites
(MS and retention time, ~200 entries)

Why we need alignment
Single batch Multiple batches

Alignment algorithm
Peak
list
RI
MS
Grou
p
Consist
ency
Aligned
groups

Signal normalization
Raw data Normalized to TIC

User interface - tasks
• Data entry
• New analysis
• Review analysis
• Quality control
• Help

Sample groups review and annotation

Data assessment and analysis
• Data for individual compound groups
• Data for individual samples and compounds
• Principal component analysis
• Clustering of samples and compounds
• Graphical maps of compound ratios

Individual compound group data

Mass spectral data for the group

Individual sample and peak details

Sample analysis – effects of nitrogen
stress on the Populus leaf metabolism
• Plants grown hydroponically
• N-stress for 8 weeks
• Samples taken from leaves at different
developmental stages (lamina and mid-vien)
• Metabolites fractionated by SPE
• Hydrophylic fractions additionally analyzed at 1:20
dilution
• Fractions were also subjected to glucosidase
hydrolysis and LPE
• 3-5 biological and 1-2 technical replicas

Leaf hydrophilic fraction
• Up-regulated by N-stress:
o Galacturonic acid (X7), D-Arabinonate,
o Turanose, Syringin
o Ribose(?), methyl-Galactoside, 3-Hydroxy-3-
methylglutaric acid (HMGA), D-(-)-3-
Phosphoglyceric acid

Leaf hydrophilic fraction
• Down-regulated by N-stress:
o Most of free aminoacids and polyamines below
detection level or strongly reduced. Also some
sugars and polyols, but not clearly identified)
o Small organic acids (fumaric, succinic, threonic,
citric, malic, oxaloacetic)
o Sugar phosphates (glucose, fructose)
o Xylose, melibiose, cellobiose

Acknowledgements
• Prof. Scott Harding
• Prof. Chung-Jui Tsai
• Dr. Changyu Hu
• Prof. Meir Edelman (WIS)

MPDB Presentation

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (6)

Ähnlich wie MPDB Presentation

Ähnlich wie MPDB Presentation (20)

MPDB Presentation