Bioinformàtica per a la
Recerca Biomèdica
Ricardo Gonzalo Sanz
ricardo.gonzalo@vhir.org
20/05/14
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Basic aspects of Microarray technology
• Introduction
• Different types of arrays. Manufacturing. DNA/RNA/Protein
• Different types of Affymetrix arrays
• Affymetrix microarrays manufacture
• Microarray experiment workflow
• Quality Controls
1 Introduction
• Reproducibility
• Microarrays only show you what you are looking for
• What about indels, inversions, translocations...?
• Accuracy
• Sensitivity
1 Introduction
• RNA-Seq was superior in detecting low-abundance transcripts
• It was also better at differentiating biological isoforms
• RNA-Seq demonstrated a broader dynamic range than microarrays
1 Introduction
• Molecular biology offers many techniques to measure gene expression (e.g. Northern blot).
• The key novelty of microarrays (Schena et al. (1995) Science 270:467-70) was not what could be measured, but how many measurements could be made simultaneously.
• Before microarrays: genes were studied one by one.
• After microarrays: all genes are studied together.
1 Introduction
• But... what is a microarray, in a few words?
  - DNA fixed to a solid surface (nylon, silica, glass, ...)
  - The sample (“problem”) RNA is labeled and must bind specifically to the DNA fixed on the solid surface
  - The bound DNA is usually called the “probe”
  - The labeled RNA is usually called the “target”
Important to know in advance...
1 Introduction
• Microarrays are usually hypothesis-generating:
They highlight specific genes or features that are particularly
interesting for follow-up experiments.
An exception would be biomarker discovery studies.
• This does not reduce the importance of experimental design.
2 Different types of arrays. Manufacturing. DNA/RNA
Two-color microarrays (cDNA)
• Probes are usually long (hundreds of nucleotides)
• The probe is fixed to a glass slide
• Labeling uses two fluorochromes (Cy3/Cy5)
• The two samples are compared directly because they are hybridized on the same array
• Each gene appears only a few times on the array
• Long probes facilitate cross-hybridization
• Reproducibility is not very good
2 Different types of arrays. Manufacturing. DNA/RNA
One-color microarrays
• Short probes (20-25 nt)
• The target is labeled with a single fluorochrome
• Only one sample is hybridized on each array
• Each gene is represented by many probes on the array
2 Different types of arrays. Manufacturing. DNA/RNA
DNA:
• DNA polymorphism (GWAS)
• Transcription factors
• Resequencing
• Cytogenetics
RNA:
• Expression
• Alternative splicing
• microRNA
2 Different types of Affymetrix arrays.
3’ IVT Arrays
• Biased measurement of gene expression
• The most widely used array type in the literature. Available for many species.
Only transcripts with a poly(A) tail and a good 3’ end will be amplified and have the chance to hybridize correctly.
2 Different types of Affymetrix arrays.
Gene/Exon Arrays
(Figure: Gene Arrays vs. Exon Arrays, 3’ to 5’)
• Gene arrays are the most widely used (good quality/price ratio)
• Gene arrays 2.0 have a more up-to-date library and also include lncRNAs
2 Different types of expression arrays.
miRNA
• 153 organisms on the array (human, mouse, rat, canine, ...)
• 100% miRBase v17 coverage
• 2,216 human snoRNAs and scaRNAs (small nucleolar and small Cajal body-specific RNAs)
• Low input amount (130 ng total RNA)
• 2,999 probe sets unique to pre-miRNA hairpins
• Able to differentiate pre-miRNAs from mature miRNAs
• Useful for FFPE samples
2 Different types of expression arrays.
HTA array
3 Affymetrix microarrays manufacture.
Photolithography
5 Microarray experiment workflow
6 Quality Controls
Length of amplified cRNA
6 Quality Controls
Length of fragmented cRNA
Basic aspects of Microarray Data Analysis
1 Introduction. Experimental design
2 Quality control
3 Normalization
4 Filtering
5 Statistical inference of differential expression
6 Clustering
7 Annotation
8 Biological interpretation
1 Introduction. Experimental design
Microarray Analysis Workflow
2 Quality Control
Was the experiment a success?
• Microarray experiments generate huge quantities of data
• The standard statistical approach uses plots to check quality. Plots:
  - show all the data together
  - highlight structures
  - may help to detect problems (“unusual patterns”)
It is hard to decide whether things “seem to be all right” just by looking at the numbers.
2 Quality Control
Diagnostic plots for microarrays:
• Microarray data are usually considered at two levels:
1. Low level. Data coming directly from the scanner.
2. High level. Processed from low-level data: expression values, normalized or not.
• Some plots are specific to certain array types or to one of the two levels
2 Quality Control
Diagnostic plots for microarrays:
1. Low level:
  - Layout image
  - Degradation plots (only for 3’ IVT arrays)
  - Histogram/density plots
  - PCA, boxplot
2. High level:
  - MA plots
  - Model-based plots (NUSE, RLE)
  - PCA, boxplot
2 Quality Control
Diagnostic plots for microarrays. Low level. Layout image.
2 Quality Control
Diagnostic plots for microarrays. Low level. RNA degradation plot (3’ IVT arrays)
2 Quality Control
Diagnostic plots for microarrays. Low level. Histogram/density plot
2 Quality Control
Diagnostic plots for microarrays. Low level. Boxplot
2 Quality Control
Diagnostic plots for microarrays. Low level. PCA
2 Quality Control
Diagnostic plots for microarrays. High level. RLE
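The slide only names the RLE (Relative Log Expression) plot, so the following is a minimal sketch of the usual definition, not the course's own code: for each gene, subtract its median log-expression across arrays, then look at the distribution of these deviations per array. The data and the function name `rle` are hypothetical.

```python
from statistics import median

def rle(log_expr):
    """Relative log expression.

    log_expr: list of arrays, each a list of log2 expression values (one per gene).
    Returns a list of the same shape holding each value minus the gene-wise median.
    """
    gene_medians = [median(values) for values in zip(*log_expr)]
    return [[v - gm for v, gm in zip(arr, gene_medians)] for arr in log_expr]

# Hypothetical log2 expression values: 3 arrays x 3 genes
log_expr = [
    [7.0, 9.0, 11.0],
    [7.2, 8.9, 11.1],
    [8.5, 10.4, 12.6],  # array with globally shifted values
]
rle_values = rle(log_expr)

# A good-quality array should have RLE values centered near 0
array_medians = [median(arr) for arr in rle_values]
```

In an RLE boxplot, an array whose box sits far from 0 or is much wider than the others (here the third one) would be a candidate for closer inspection.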
2 Quality Control
Diagnostic plots for microarrays. High level. NUSE
2 Quality Control
Diagnostic plots for microarrays. High level. MA plots
• MA plots allow pairwise comparison of the log-intensity of each array with a reference array, and the identification of intensity-dependent biases.
• The Y axis shows the log-ratio of the intensity of one array to the reference (median) array, called “M”, while the X axis shows the average log-intensity of both arrays, called “A”.
• Most probe levels are not expected to differ much, so we expect an MA plot centered on the Y=0 axis from low to high intensities.
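As a minimal sketch of the M and A values described above, assuming raw (unlogged) intensities and a probe-wise median across arrays as the reference array; the data and function names are hypothetical:

```python
import math
from statistics import median

def median_reference(arrays):
    """Probe-wise median across arrays, used as the pseudo reference array."""
    return [median(values) for values in zip(*arrays)]

def ma_values(array, reference):
    """Return one (A, M) pair per probe for an array against the reference."""
    pairs = []
    for x, ref in zip(array, reference):
        lx, lr = math.log2(x), math.log2(ref)
        m = lx - lr          # M: log-ratio (Y axis)
        a = (lx + lr) / 2.0  # A: average log-intensity (X axis)
        pairs.append((a, m))
    return pairs

# Hypothetical raw intensities: 3 arrays x 4 probes
arrays = [
    [100.0, 200.0, 400.0, 800.0],
    [110.0, 190.0, 420.0, 790.0],
    [400.0, 800.0, 1600.0, 3200.0],  # array that is globally brighter
]
reference = median_reference(arrays)
ma_per_array = [ma_values(arr, reference) for arr in arrays]

# The median M of each array should be close to 0 if there is no global bias
median_m = [median(m for _, m in pairs) for pairs in ma_per_array]
```

Plotting M against A for each array and checking that the cloud stays around M = 0 across the whole range of A is the visual check the slide describes; here the third array would sit well above 0.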
3 Normalization
The goal of normalization is to adjust for the effects that are due to variations in the
technology rather than the biology.
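The slides do not say which normalization method is used. As one common illustration, quantile normalization forces every array to share the same intensity distribution. The sketch below is a simplified version under that assumption (ties are broken by position rather than averaged), with hypothetical data.

```python
def quantile_normalize(arrays):
    """Quantile-normalize a list of arrays (each a list of intensities, same length)."""
    n_probes = len(arrays[0])
    sorted_arrays = [sorted(arr) for arr in arrays]
    # Reference distribution: mean of the k-th smallest value across arrays
    reference = [sum(col) / len(arrays) for col in zip(*sorted_arrays)]
    normalized = []
    for arr in arrays:
        # Probe indices ordered from lowest to highest intensity
        order = sorted(range(n_probes), key=lambda i: arr[i])
        out = [0.0] * n_probes
        for rank, idx in enumerate(order):
            out[idx] = reference[rank]  # replace the value by the reference quantile
        normalized.append(out)
    return normalized

# Hypothetical intensities: 2 arrays x 3 probes
arrays = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
normalized = quantile_normalize(arrays)
```

After this step both arrays contain the same set of values, so the remaining differences between arrays are in the ranking of probes, not in the overall intensity scale.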
4 Filtering
• In a microarray experiment only a few hundred to a few thousand genes change their expression between conditions.
• The researcher wants to keep the number of tests (genes) as low as possible while keeping the interesting genes in the selected subset.
• If the truly differentially expressed genes are over-represented among those selected in the filtering step, the FDR associated with a given threshold of the test statistic will be lowered by the filtering.
Genes that do not change introduce noise, so it is better to exclude them before the statistical analysis is done.
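The effect of filtering on the FDR can be illustrated with a small sketch. It assumes the Benjamini-Hochberg procedure as the FDR method (the slides do not name one) and uses hypothetical p-values with an idealized filter that only discards unchanged genes.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])  # indices, smallest p first
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value to the smallest, enforcing monotonicity
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted

# Hypothetical p-values
p_changing = [0.001, 0.004, 0.01]                    # truly changing genes
p_unchanged = [0.2, 0.35, 0.5, 0.6, 0.7, 0.8, 0.9]   # genes that do not change

adj_all = bh_adjust(p_changing + p_unchanged)        # no filtering: 10 tests
adj_filtered = bh_adjust(p_changing + [0.2, 0.5])    # after filtering: 5 tests
```

With fewer tests, the same raw p-value gets a smaller adjusted value: the third changing gene goes from about 0.033 without filtering to about 0.017 with it.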
4 Filtering
There are different types of filtering:
• Annotation features (specific):
  - Specific gene features (e.g. GO term, presence of transcriptional regulatory elements in promoters, etc.)
  - Data derived from IPA
• Signal features (non-specific):
  - Percentage of intensities greater than a user-defined value
  - Interquartile range (IQR) greater than a defined value
4 Filtering
Signal filtering: this technique removes genes that are deemed not expressed or unchanged according to a specific, user-controlled criterion.
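A minimal sketch of the two signal criteria, assuming log2 expression values per gene; the thresholds, gene names and the function `signal_filter` are hypothetical and only illustrate the idea:

```python
from statistics import quantiles

def iqr(values):
    """Interquartile range (Q3 - Q1) of a list of values."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 - q1

def signal_filter(expr, min_intensity, min_fraction, min_iqr):
    """Keep genes that are expressed in enough samples AND vary enough across samples.

    expr: dict mapping gene name -> list of log2 expression values (one per array).
    """
    kept = {}
    for gene, values in expr.items():
        fraction_above = sum(v > min_intensity for v in values) / len(values)
        if fraction_above >= min_fraction and iqr(values) > min_iqr:
            kept[gene] = values
    return kept

# Hypothetical log2 expression values over 8 arrays
expr = {
    "geneA": [5.0, 5.1, 5.0, 5.2, 9.0, 9.2, 9.1, 8.9],  # expressed and variable
    "geneB": [7.0, 7.1, 7.0, 7.1, 7.0, 7.1, 7.0, 7.1],  # expressed but flat
    "geneC": [2.0, 2.1, 2.2, 2.0, 2.1, 4.9, 2.0, 2.1],  # mostly not expressed
}
kept = signal_filter(expr, min_intensity=4.0, min_fraction=0.25, min_iqr=0.5)
```

Only geneA passes both criteria here; geneB fails the IQR criterion and geneC fails the intensity criterion.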
5 Statistical inference of differential expression
• Indirect comparisons: 2 groups, unpaired
• Direct comparisons: 2 groups, paired
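To show why the paired/unpaired distinction matters, here is a minimal sketch for a single gene using ordinary t statistics (Welch for unpaired groups, a paired t for matched samples). This is not the moderated statistic used by limma, and the data are hypothetical.

```python
import math
from statistics import mean, variance

def welch_t(group_a, group_b):
    """Unpaired (Welch) t statistic for the difference mean(a) - mean(b)."""
    se = math.sqrt(variance(group_a) / len(group_a) + variance(group_b) / len(group_b))
    return (mean(group_a) - mean(group_b)) / se

def paired_t(before, after):
    """Paired t statistic computed on the per-subject differences after - before."""
    diffs = [a - b for a, b in zip(after, before)]
    return mean(diffs) / math.sqrt(variance(diffs) / len(diffs))

# Hypothetical log2 expression of one gene in 4 subjects, before and after treatment
before = [6.0, 7.0, 8.0, 9.0]
after = [6.5, 7.4, 8.6, 9.5]

t_unpaired = welch_t(after, before)   # ignores the pairing
t_paired = paired_t(before, after)    # uses the pairing
```

The change is small compared with the differences between subjects, so the unpaired statistic is weak, while the paired one is large because each subject acts as its own control.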
5 Statistical inference of differential expression
limma package (Gordon Smyth)
6 Clustering
Types:
  - Supervised clustering tries to find the best partition for data that belong to a known set of classes
  - Unsupervised clustering tries to define the number and size of the classes into which the transcription profiles can be fitted
6 Clustering
Hierarchical Clustering (HCL)
• HCL is an agglomerative/divisive clustering method.
• The iterative process continues until all groups are connected in a hierarchical tree.
• Samples that are more similar to each other are placed closer together.
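The agglomerative variant can be sketched in a few lines. This is a minimal illustration assuming Euclidean distance and average linkage (the slides do not specify either), with hypothetical sample profiles; it records the order in which groups are joined, which is what the tree represents.

```python
import math

def euclidean(p, q):
    """Euclidean distance between two expression profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def average_linkage(c1, c2, data):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(euclidean(data[i], data[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))

def agglomerative(data):
    """Merge the two closest clusters repeatedly until only one is left.

    Returns the list of merges as (cluster_a, cluster_b, distance), in order.
    """
    clusters = [(i,) for i in range(len(data))]  # start with one cluster per sample
    merges = []
    while len(clusters) > 1:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = average_linkage(clusters[x], clusters[y], data)
                if best is None or d < best[0]:
                    best = (d, x, y)
        d, x, y = best
        merges.append((clusters[x], clusters[y], d))
        merged = clusters[x] + clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
    return merges

# Hypothetical profiles: samples 0 and 1 are similar, samples 2 and 3 are similar
samples = [[1.0, 1.0], [1.1, 1.0], [5.0, 5.0], [5.0, 5.3]]
merges = agglomerative(samples)
```

The first merges join the most similar samples at small distances, and the last merge connects the two groups at a much larger distance, which is what the branch heights of the tree show.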
7 Annotation
8 Biological interpretation
Gene Ontology
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 

Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona)

  • 1. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray technology
  • 2. Outline: 1 Introduction. 2 Different types of arrays. Manufacturing. DNA/RNA/Protein. Sections 3 to 5: Different types of Affymetrix arrays; Affymetrix microarrays manufacture; Microarray experiment workflow. 6 Quality Controls.
  • 3. 1 Introduction: reproducibility; only show you what you're looking for; what about 'indels', inversions, translocations...; accuracy; sensitivity
  • 5. 1 Introduction: RNA-Seq was superior in detecting low-abundance transcripts; also better at differentiating biologically relevant isoforms; RNA-Seq demonstrated a broader dynamic range than microarrays.
  • 6. 1 Introduction • Molecular biology has many techniques for measuring gene expression (e.g. Northern blot). • The main novelty of microarrays (Schena et al. (1995) Science 270:467-70) was not what could be measured but the number of measurements that could be made simultaneously. • Pre-microarray era: genes were studied one by one. • Post-microarray era: all genes are studied together.
  • 7. 1 Introduction • But... what is a microarray, in a few words? DNA is fixed to a solid surface (nylon, silica, glass, ...); the RNA sample under study is labeled and must bind specifically to the DNA fixed on the solid surface; the fixed DNA is usually called the "probe"; the labeled RNA is usually called the "target".
  • 8. Important to know in advance... 1 Introduction • Microarrays are usually hypothesis-generating: they highlight specific genes or features that are particularly interesting for follow-up experiments. An exception would be biomarker discovery studies. • This does not reduce the importance of experimental design.
  • 9. 2 Different types of arrays. Manufacturing. DNA/RNA. Two-color microarrays (cDNA) • Probes are usually long (cDNA fragments, typically hundreds of nucleotides) • Probes are fixed to a glass slide • Labeling uses two fluorochromes (Cy3/Cy5) • The two samples are compared directly because they are hybridized on the same array • Each gene appears only a few times on the array • Long probes facilitate cross-hybridization • Reproducibility is not very good
  • 10. 2 Different types of arrays. Manufacturing. DNA/RNA. One-color microarrays • Short probes (20-25 nt) • The target is labeled with a single fluorochrome • Only one sample is hybridized on each array • Each gene is represented by many probes on the array
  • 11. 2 Different types of arrays. Manufacturing. DNA/RNA. DNA: • Polymorphism (GWAS) • Transcription factors • Resequencing • Cytogenetics. RNA: • Expression • Alternative splicing • microRNA
  • 12. 2 Different types of Affymetrix arrays. 3' IVT Arrays • Biased measurement of gene expression • The most widely used array type in the literature; available for many species. Only genes with a polyA tail and a good 3' site are amplified and have the chance to hybridize correctly.
  • 13. 2 Different types of Affymetrix arrays. Gene/Exon Arrays (Gene Arrays, Exon Arrays) • Gene arrays are the most widely used (good quality/price ratio) • Gene 2.0 arrays have a more up-to-date library and also include lncRNAs
  • 14. 2 Different types of expression arrays. miRNA • 153 organisms on the array (human, mouse, rat, canine, ...) • 100% miRBase v17 coverage • 2,216 human snoRNAs and scaRNAs (small nucleolar and small Cajal body-specific RNAs) • Low input amounts (130 ng total RNA) • 2,999 probe sets unique to pre-miRNA hairpins • Able to differentiate precursor and mature miRNAs • Useful for FFPE samples
  • 15. 2 Different types of expression arrays. HTA array
  • 23. 6 Quality Controls Length of amplified cRNA
  • 24. 6 Quality Controls Length of fragmented cRNA
  • 25. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray Data Analysis
  • 26. Outline: 1 Introduction. Experimental design. 2 Quality control. 3 Normalization. 4 Filtering. 5 Statistical inference of differential expression. 6 Clustering. 7 Annotation. 8 Biological interpretation.
  • 32. 1 Introduction. Experimental design Microarrays Analysis Workflow
  • 34. 2 Quality Control Was the experiment a success? • Microarray experiments generate huge quantities of data. • The standard statistical approach is to use plots to check quality: they show all the data together; highlight structures; may help to detect problems ("unusual patterns"). It is hard to decide whether things "seem to be all right" just by looking at the numbers.
  • 35. 2 Quality Control Diagnostic plots for microarrays: • Microarray data are usually considered at two levels: 1. Low level: data coming directly from the scanner. 2. High level: data processed from the low-level data, i.e. expression values, normalized or not. • Some plots are specific to certain array types or to one level.
  • 36. 2 Quality Control Diagnostic plots for microarrays: 1. Low level: layout image; degradation plots (3' IVT only); histogram/density plots; PCA, boxplot. 2. High level: MA plots; model-based plots (NUSE, RLE); PCA, boxplot.
  • 37. 2 Quality Control Diagnostic plots for microarrays. Low level. Layout image.
  • 38. 2 Quality Control Diagnostic plots for microarrays. Low level. RNA degradation plot (3' IVT arrays)
  • 39. 2 Quality Control Diagnostic plots for microarrays. Low level. Histogram/density plot
  • 40. 2 Quality Control Diagnostic plots for microarrays. Low level. Boxplot
  • 42. 2 Quality Control Diagnostic plots for microarrays. Low level. PCA
  • 43. 2 Quality Control Diagnostic plots for microarrays. Low level. PCA
  • 45. 2 Quality Control Diagnostic plots for microarrays. High level. RLE
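The slide names the RLE plot without defining it. As commonly defined, the Relative Log Expression of a gene on an array is its log expression minus the median log expression of that gene across all arrays; a good array has RLE values centered near zero. A minimal sketch in plain Python follows. The array names and values are made up for illustration, and the function name `rle` is hypothetical.

```python
from statistics import median

def rle(log_expr):
    """Relative Log Expression per array.

    log_expr maps array name -> list of log2 expression values,
    with genes in the same order on every array.
    """
    names = list(log_expr)
    n_genes = len(log_expr[names[0]])
    # Median of each gene across all arrays (the "reference" level).
    gene_medians = [median(log_expr[a][g] for a in names) for g in range(n_genes)]
    return {a: [log_expr[a][g] - gene_medians[g] for g in range(n_genes)]
            for a in names}

# Illustrative data: array C is shifted upwards for every gene.
example = {
    "A": [8.0, 10.0, 12.0],
    "B": [8.2, 9.8, 12.1],
    "C": [9.5, 11.4, 13.6],
}
rle_values = rle(example)
rle_medians = {a: median(v) for a, v in rle_values.items()}
print(rle_medians)
```

In an RLE boxplot, one box is drawn per array from these values; an array whose box sits clearly away from zero (array C here) deserves a closer look.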
  • 47. 2 Quality Control Diagnostic plots for microarrays. High level. NUSE
  • 48. 2 Quality Control Diagnostic plots for microarrays. High level. MA plots • MA plots allow pairwise comparison of the log-intensity of each array with a reference array and identification of intensity-dependent biases. • The Y axis shows the log-ratio of the intensity of one array to the reference median array, called "M", while the X axis shows the average log-intensity of both arrays, called "A". • Most probe levels are not expected to differ much, so we expect an MA plot centered on the Y=0 line from low to high intensities.
  • 49. 2 Quality Control Diagnostic plots for microarrays. High level. MA plots
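The M and A values described for the MA plot can be computed directly from log-intensities. The sketch below assumes log2 values and builds the reference as the per-probe median over all arrays, as the slide describes; the function name `ma_values` and the data are illustrative only.

```python
from statistics import median

def ma_values(log_intensities, array):
    """Return (M, A) lists for one array against the median reference array.

    log_intensities maps array name -> list of log2 intensities,
    with probes in the same order on every array.
    """
    names = list(log_intensities)
    n = len(log_intensities[array])
    # Reference "median array": per-probe median across all arrays.
    reference = [median(log_intensities[a][i] for a in names) for i in range(n)]
    m = [x - r for x, r in zip(log_intensities[array], reference)]        # log-ratio
    a = [(x + r) / 2 for x, r in zip(log_intensities[array], reference)]  # mean log-intensity
    return m, a

data = {
    "arr1": [6.0, 8.0, 10.0],
    "arr2": [6.5, 8.0, 9.5],
    "arr3": [7.0, 9.0, 11.0],
}
m, a = ma_values(data, "arr3")
print(m, a)
```

Plotting M against A for each array gives the MA plot; a cloud that drifts away from M = 0 as A increases suggests an intensity-dependent bias.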
  • 51. 3 Normalization The goal of normalization is to adjust for the effects that are due to variations in the technology rather than the biology.
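The transcript does not say which normalization method the course used. As one common illustration (an assumption, not the author's stated method), quantile normalization forces every array to share the same intensity distribution: each value is replaced by the mean of the values holding the same rank across arrays. The sketch below ignores ties for simplicity.

```python
def quantile_normalize(arrays):
    """Quantile-normalize a list of arrays (each a list of intensities).

    Ties are not handled specially; this is a simplified sketch.
    """
    n = len(arrays[0])
    sorted_arrays = [sorted(arr) for arr in arrays]
    # Mean intensity at each rank across all arrays.
    rank_means = [sum(arr[i] for arr in sorted_arrays) / len(arrays) for i in range(n)]
    normalized = []
    for arr in arrays:
        order = sorted(range(n), key=lambda i: arr[i])  # probe indices by increasing value
        out = [0.0] * n
        for rank, idx in enumerate(order):
            out[idx] = rank_means[rank]
        normalized.append(out)
    return normalized

raw = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
norm = quantile_normalize(raw)
print(norm)
```

After this step both arrays contain exactly the same set of values, so differences between arrays reflect only the ranking of the probes, not array-wide technical shifts.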
  • 55. 4 Filtering • In a microarray experiment only a few hundred to a few thousand genes change their expression between conditions. • The researcher wants to keep the number of tests (genes) as low as possible while retaining the interesting genes in the selected subset. • If the truly differentially expressed genes are over-represented among those selected in the filtering step, the FDR associated with a given threshold of the test statistic will be lowered by the filtering. Genes that do not change introduce noise, so it is better to exclude them before the statistical analysis.
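The effect of filtering on the FDR can be illustrated with a small multiple-testing calculation. The slide does not name an FDR procedure; the sketch below assumes the Benjamini-Hochberg adjustment, and the p-values are invented for illustration. Removing unchanged genes before testing reduces the number of tests, so the same raw p-value receives a smaller adjusted value.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

# Hypothetical p-values: three changing genes followed by unchanged genes.
all_p = [0.001, 0.004, 0.012, 0.30, 0.45, 0.52, 0.61, 0.74, 0.88, 0.95]
kept_p = all_p[:5]  # pretend the filter removed five unchanged genes

adj_all = bh_adjust(all_p)
adj_kept = bh_adjust(kept_p)
print(adj_all[2], adj_kept[2])
```

The third gene has the same raw p-value in both runs, but its adjusted value is halved once the five unchanged genes are no longer tested.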
  • 56. 4 Filtering There are different types of filtering: • Annotation features (specific): specific gene features (e.g. GO term, presence of transcriptional regulatory elements in promoters, etc.); data derived from IPA. • Signal features (non-specific): percentage of intensities greater than a user-defined value; interquartile range (IQR) greater than a defined value.
  • 57. 4 Filtering Signal filtering: This technique has as its premise the removal of genes that are deemed to be not expressed or unchanged according to some specific criterion that is under the control of the user.
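The two signal-based criteria mentioned in this section (fraction of intensities above a user-defined value, and IQR above a defined value) can be sketched as follows. The thresholds, gene names and expression values are all hypothetical, and combining both criteria with a logical AND is a choice made for the example, not something the slides prescribe.

```python
from statistics import quantiles

def signal_filter(expr, min_intensity=5.0, min_fraction=0.5, min_iqr=0.5):
    """Keep genes that are expressed in enough samples and vary enough.

    expr maps gene name -> list of log2 expression values across samples.
    All thresholds are user-defined; the defaults here are arbitrary.
    """
    kept = []
    for gene, values in expr.items():
        fraction_above = sum(v > min_intensity for v in values) / len(values)
        q1, _, q3 = quantiles(values, n=4, method="inclusive")
        if fraction_above >= min_fraction and (q3 - q1) >= min_iqr:
            kept.append(gene)
    return kept

genes = {
    "flat":     [7.0, 7.1, 7.0, 7.1, 7.0, 7.1],  # expressed but unchanged
    "low":      [2.0, 3.5, 2.2, 3.8, 2.1, 3.6],  # below the intensity threshold
    "changing": [6.0, 6.2, 6.1, 9.0, 9.2, 9.1],  # expressed and variable
}
selected = signal_filter(genes)
print(selected)
```

Neither criterion uses the group labels, which is why this kind of filtering is called non-specific.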
  • 59. 5 Statistical inference of differential expression • Indirect comparisons: 2 groups, unpaired • Direct comparisons: 2 groups, paired
  • 60. 5 Statistical inference of differential expression Limma package (Gordon Smyth)
  • 61. 5 Statistical inference of differential expression
  • 62. 5 Statistical inference of differential expression
  • 63. 5 Statistical inference of differential expression
  • 64. 5 Statistical inference of differential expression
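The difference between the unpaired and paired designs listed in this section can be seen with ordinary t-statistics for a single gene. This is only a sketch: it uses a plain Welch t-statistic and a paired t-statistic, not the moderated statistics computed by limma, and the expression values are invented.

```python
from math import sqrt
from statistics import mean, stdev, variance

def welch_t(x, y):
    """Unpaired (Welch) t-statistic for two independent groups."""
    return (mean(x) - mean(y)) / sqrt(variance(x) / len(x) + variance(y) / len(y))

def paired_t(before, after):
    """Paired t-statistic computed on within-subject differences."""
    diffs = [a - b for a, b in zip(after, before)]
    return mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))

# Hypothetical log2 expression of one gene in four subjects,
# measured before and after treatment.
before = [7.0, 8.0, 9.0, 10.0]
after = [7.5, 8.6, 9.4, 10.5]

t_unpaired = welch_t(after, before)
t_paired = paired_t(before, after)
print(round(t_unpaired, 2), round(t_paired, 2))
```

The treatment effect is small compared with the differences between subjects, so the unpaired statistic is weak, while the paired statistic removes the subject-to-subject variation and is much larger. This is why the pairing must be reflected in the analysis when the design is paired.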
  • 65. 6 Clustering Types: supervised clustering tries to find the best partition for data that belong to a known set of classes; unsupervised clustering tries to define the number and size of the classes into which the transcription profiles can be fitted.
  • 67. 6 Clustering Hierarchical Clustering (HCL) • HCL is an agglomerative/divisive clustering method. • The iterative process continues until all groups are connected in a hierarchical tree. • Samples that are more similar to each other are placed closer together in the tree.
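The agglomerative variant of HCL described above can be sketched in a few lines: start with each sample as its own cluster and repeatedly merge the two closest clusters until one tree remains. Euclidean distance and average linkage are assumptions made for this example (the slides do not specify either), and the sample names and profiles are hypothetical.

```python
from itertools import combinations
from math import dist

def agglomerative(samples):
    """Agglomerative clustering with average linkage.

    samples maps sample name -> expression profile (list of floats).
    Returns the merges in order as (cluster_a, cluster_b, distance).
    """
    def avg_link(c1, c2):
        # Mean pairwise Euclidean distance between members of two clusters.
        return sum(dist(samples[a], samples[b]) for a in c1 for b in c2) / (len(c1) * len(c2))

    clusters = [(name,) for name in samples]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: avg_link(*pair))
        merges.append((c1, c2, avg_link(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges

profiles = {
    "ctrl1": [1.0, 1.0],
    "ctrl2": [1.2, 0.9],
    "trt1": [5.0, 5.0],
    "trt2": [5.3, 4.8],
}
merges = agglomerative(profiles)
for c1, c2, d in merges:
    print(c1, c2, round(d, 2))
```

The merge distances correspond to the branch heights of the dendrogram: the two controls join first, then the two treated samples, and the final merge connects both groups at a much larger distance.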