SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Š Lexogen, 2013
Spike-In RNA Variants:
Design, Production and Application
ERCC 2.0 workshop
Stanford University – July 10-11, 2014
PPT Number TBD
Project Number 0221
Theme T5.2 Mixquer Transcript Quantification (WAFF)
Author Lukas Paul
Š Lexogen, 20142
1. Company introduction
2. ERCC spike-in  mixes  in  Lexogen‘s  R&D
3. Design and rational of Spike-In RNA Variants
4. Production and application of Spike-In RNA Variants
ERCC 2.0 Workshop
Vertraulich / Confidential
Š Lexogen, 20143Vertraulich / Confidential
Lexogen: Company
• Founded in 2007
• Based in Vienna, Austria
• 28 employees (75% in R&D)
• Lexogen, Inc.: o/n delivery to US customers
• Services & products with focus on
o Transcriptome profiling technologies
o Complementary technologies to Next Generation Sequencing
o Innovative solutions for transcriptome research
Lexogen’s mission is to develop innovative technologies that will allow to resolve
all complexities of the transcriptome - one of the most enigmatic and exciting
areas in biology.
www.LEXOGEN.com
Š Lexogen, 20144
1. Company introduction
2. ERCC spike-in  mixes  in  Lexogen‘s  R&D
3. Design and rational of Spike-In RNA Variants
4. Production and application of Spike-In RNA Variants
ERCC 2.0 Workshop
Vertraulich / Confidential
Š Lexogen, 20145
SENSETM mRNA-Seq Library Preparation Kit
• Convenient, fragmentation-free workflow
• Core technology: reverse transcription and ligation on intact RNA
• Results in very high preservation of strand orientation
Vertraulich / Confidential PN0203 PPT0383
Š Lexogen, 20146
ERCC-based Validation of Strandedness
• Strandedness usually quantified by comparing the orientation of a mapped
read with the genome annotation
• Problem: annotation incomplete & natural antisense transcription interferes
Use of ERCC transcripts with known orientation provides
an absolute means to determine strandedness
Vertraulich / Confidential PN0203 PPT0383
Total RNA Strand Specificity
(ERCCs only)a
False Antisense
Readsb
Sense Reads
(genome-wide)c
2 Âľg 99.997% 0.003% 99.890%
1 Âľg 99.986% 0.014% 99.815%
500 ng 99.997% 0.003% 99.821%
50 ng 99.965% 0.035% 99.779%
a number of reads mapping to ERCC genes in the sense direction divided by total number of ERCC reads
b number of antisense reads mapping to ERCC transcripts divided by the total number of reads mapped to the ERCC genome
c number of reads mapping to annotated genes in the sense orientation divided by the number of reads mapping in both directions. Note that this
measure includes biologically relevant antisense transcription.
Š Lexogen, 20147
ERCC-validated Strandedness Determines False Positive
Background of Library Preparation Method
Vertraulich / Confidential
Knowing the strandedness of the library preparation
protocol allows for determining whether a detected
transcript is truly antisense or belongs to the false positive
background.
98%
99.9%
strandedness
1153
2415
true antisense
transcripts
Š Lexogen, 20148
“ERCC-validated”  Strandedness  in  Lexogen’s  Portfolio  
• SENSE mRNA-Seq
library preparation kit
• SENSE Total RNA-Seq
library preparation kit
Vertraulich / Confidential PN0203 PPT0383
• QuantSeqTM 3’  mRNA  
library preparation Kit,
see workflow (right),
ERCCs also used to assess
correctness  of  3’  end  
mapping
Š Lexogen, 20149
Correlation Between ERCC Input and FPKM Measured
Vertraulich / Confidential PN0203 PPT0383
FPKM
N of molecules [102]
1 10 102 103 104 105 106
10-21101021037.5x104
o SENSE, R2=0.910
Competitors, R2=0.834
•
Š Lexogen, 201410
Further Use for ERCC: Transcript Length Coverage:
• Native genes: interference from divergent annotations and differentially
expressed transcript variants
• Primer selectivity: aa
 ERCCs with seamless coverage from first to last nucleotide
 Native transcripts start  with  high  coverage  indicative  of  5’  truncated  
annotations
Vertraulich / Confidential PN0203 PPT0383
Example: SQUARE TM library prep with intrinsic over-representation of termini
ERCC-0096 Top 500 transcripts
Š Lexogen, 201411
1. Company introduction
2. ERCC spike-in  mixes  in  Lexogen‘s  R&D
3. Design and rational of Spike-In RNA variants
4. Production and application of Spike-In RNA variants
ERCC 2.0 Workshop
Vertraulich / Confidential
Š Lexogen, 201412
Spike-In RNA Variants (SIRVs) - Rational
• ERCC spike-in controls were designed as mono-exonic RNAs without
sequence overlap.
• Complementary, we found it to be desirable to have a set of nucleic acids
simulating transcript variants that can be used as external spike-in controls.
• This reference set would
o comprise two or more transcript families, with transcripts of the same
family representing reference transcript variants of the same gene
o enable the controlled identification and/or quantification of transcript
variants in one or more samples and
o permit the assessment, validation and correction of Bioinformatics
pipelines.
Vertraulich / Confidential
Š Lexogen, 201413
Spike-In RNA Variants – Gene Structure
Reference genes
• 7 human genes selected because of diversity in exon-intron structure
• Annotated transcripts (Ensembl database) aligned to gene in CLC workbench
• „Master  transcript“  created  for  each  gene  (sequence  of  all  transcript  variants)
KLK5
LDHD
Vertraulich / Confidential
CLC main workbench 5
CLC main workbench 5
PN0203 PPT0383
Š Lexogen, 201414
Addition of Transcript Variants
• Annotated transcript variants were analyzed for AS events
• AS events not covered by a variant within a family were incorporated in a
new variant based on the master transcript
• To cover non-splicing variants, antisense and overlapping transcripts were
added (mono- and poly-exonic)
• Further, Transcription Start-Site (TSS) and End-Site (TES) variants were
added
KLK5
SIRV1
Vertraulich / Confidential
Š Lexogen, 201415
Spike-In RNA Variants (SIRV): Nucleotide Sequence
AIM
• The nucleotide sequence of the SIRVs should be non-homologous at least
to eukarytic genomes and transcriptomes.
• In the best case they should not align with any natural occurring sequence.
SOLUTION
• Genomic sequences from viruses were used to fill-in exon sequences.
 Would work in external controls for eukaryotes.
• Sequences were then inverted (flipped) to lose alignment identiy.
 Final sequences do not align with any entry in the NCBI nt collection when
blasted with standard parameters.
 SIRV sequences also do not align with ERCC sequences.
 In silico experiments confirmed that NGS reads generated from the SIRVs
would  not  map  to  the  genome  of  any  model  organism  or  the  “ERCCome”.
Vertraulich / Confidential
Š Lexogen, 201416
Re-establishing Exon-Intron Junction Dinucleotides
Vertraulich / Confidential
• Most junctions are common, i.e. are also
annotated in the master transcript.
• These intron sequences are currently annotated as
NN (see below), hence junction recognition is no
problem for alignment programs
NN-NN GT-AG GC-AG AT-AC
SIRVS
198 (61.11%) 116 (31.10%)
7 (2.16%) 3 (0.93)
314 (96.91%)
ICE database 98.70% 0.79% 0.08%
• Exon-defined intron boundaries
were converted to GT-AG (97%),
GC-AG (2%) or AT-AC (1%)
Nucleotide conversion to conform with GT-AG rule
Š Lexogen, 201417
SIRV Properties - Summary
SIRVs are modelled on mammalian sequences
• Set of seven SIRV families with 6-18 transcript variants each
• 74 transcript variants in total, average length 1200 nt (median 917 nt)
• Variants include alternative splicing, start- and end-site variations ,
antisense and overlapping transcripts
• GC content: 30-50% (in analogy to ERCC standards)
• Poly(A)  tail:  A(30)  at  3’-end (ERCCs: 19-25 adenosines)
• Length: 220-2,557 nt, longer SIRVs were trimmed by exon removal
Further modifications
• GT-AT exon-intron junction dinucleotide rule observed
• Homopolymer runs:  ≤7nt
• 5’  truncation  to  obtain  5’  G,  needed  for  T7  transcription
• No homology to NCBI nt collection entries or ERCC sequences due to
sequence inversion
Vertraulich / Confidential PN0203 PPT0383
Š Lexogen, 201418
SIRV Design - Overview
Vertraulich / Confidential
Take natural gene structure and annotated transcript variants
Shorten transcript length to a maximum of 2500 nt
Fill gene structure with heterologous sequence
Duplicate and modify to add alternative splicing variants
Add transcription start-site and end-site variants
Add antisense and overlapping variants
observe
GU-AG
intron rule
cassette exon
alternative
start-site
alternative
end-site
alternative last exon
intron retention
overlapping,
antisense antisense
A5SS
A3SS
MXEalternative first exon
overlapping
Š Lexogen, 201419
1. Company introduction
2. ERCC spike-in  mixes  in  Lexogen‘s  R&D
3. Design and rational of Spike-In RNA Variants
4. Production and application of Spike-In RNA Variants
ERCC 2.0 Workshop
Vertraulich / Confidential
Š Lexogen, 201420
SIRV Production: In vitro Transcription Construct
Vertraulich / Confidential
starts with 5’  G,
cap optional
poly(A) tail added Synthetic constructs
cloned for singularization
and amplification
Run-off T7 transcription
T7-PromoterRestr.Site G Sequence A(30) Restr.Site5’ 3’
220 - 2557 nt
Š Lexogen, 201421
SIRV Production, QC and quantification
Production
 Plasmid linearization
 T7 run-off transcription
 Purification (essential!)
 Storage in Na-Citrate buffer
Quality Control
 Photometric (Nanodrop): Purity, quantifcation
 Microfluidics (Bioanalyzer): Integrity, quantifcation
• Planned: qPCR: Accurate quantification
Vertraulich / Confidential
Š Lexogen, 201422
SIRVs: Mixes & RNA-Seq Samples
Initially, 2 mixes were prepared from 60 purified transcript variants:
1. Equimolar:  1:1:1…
2. Low dynamic range: 1:10:100
3 Samples were prepared from these:
1. Equimolar mix,
SIRVs only
illumina TruSeq library prep without poly(A) selection
2. Equimolar mix,
30% SIRVs, 3% ERCCs, 67% UHR (Universal Human Reference RNA)
illumina TruSeq library prep without poly(A) selection
3. Low dynamic range,
30% SIRVs, 3% ERCCs, 67% UHR (Universal Human Reference RNA)
illumina TruSeq library prep without poly(A) selection
Vertraulich / Confidential
Š Lexogen, 201423
SIRVs: RNA-Seq Experiment
• Illumina MiSeq run: 1x150 nt, 27M reads obtained
• Mapping with tophat (v.2.0.8) against combined transcriptomic and
genomic reference (Ensembl GRCh 37.75), Ambion’s ERCC92, and SIRVs
Vertraulich / Confidential
Total reads Mapping reads (%)
Uniquely
Mapping reads (%)
#1, equimolar
SIRVs 10,246,442 8,585,641 83.79% 8,505,344 83.01%
#2, equimolar
SIRVs, ERCCs, UHR 10,119,416 8,642,852 85.41% 8,399,336 83.00%
#3, 1:10:100
SIRVs, ERCCs, UHR 6,308,855 5,404,486 85.67% 5,268,757 83.51%
GRCh37.75 ERCC92 SIRVs
Sample #1 4,330 0.05% 11 0.00% 8,505,555 99.95%
Sample #2 7,521,308 89.55% 38,031 0.45% 839,997 10.00%
Sample #3 4,156,399 78.89% 22,207 0.42% 1,090,151 20.69%
Š Lexogen, 201424
SIRV RNA-Seq: Input / Output correlation
Vertraulich / Confidential
Molecules Molecules
Molecules sample #1 FPKM
sample#2FPKM
#1 #2
#3 #1 vs #2
Š Lexogen, 201425
SIRVs RNA-Seq: Transcript Hypotheses
Transcript Hypotheses by Cufflinks
• Not complete: e.g., 3ASS and exons not recognized despite multiple exon-
exon reads
Vertraulich / Confidential
cufflinks
Š Lexogen, 201426
Spike-In RNA Variants: Short Summary
Design & production
• 74 transcript variants in 7 families (6-18 variants / family)
• Mimic eukaryotic genes in length and GC content; A(30) tail
• Include variation on alternative splicing, transcription start-sites and end-
sites, sense/antisense and overlapping genes
• No homology to NCBI nt collection entries or ERCC sequences
• Produced from stock plasmids as T7 run-off transcripts
Mixtures
• 60 SIRVs were mixed in equimolar or low dynamic range (10²) concentrations
Application in RNA-Seq
• Mixtures showed high mapability and no cross-mapping with UHR or ERCCs
• Low input / output correlation as determined by tophat / cufflinks derived
FPKM
• Cufflinks cannot reconstruct all SIRV transcript variants, even in the
equimolar mix, which will lead to wrong FPKM values
Vertraulich / Confidential
Š Lexogen, 201427
Spike-In RNA Variants: Outlook
Optimizing production & quantification
• Large-scale production and purification of transcripts
• qPCR-based quantification in addition to Nanodrop & Bioanalyzer results
Application
• Evaluation of software for its performance in transcript hypothesis building
and transcript isoform quantification
Open questions
• Concentration range?
• Sufficient variant complexity? Length? Capping? SNPs?
• How many different mixes?
• Pipeline validation (Consortium?)
• Sample comparison (DE)
• Technical variation
• Master mix vs. modules: ERCCs, SIRVs, ncRNA standards & miRNA standards
(complexity, price, validation?)
Vertraulich / Confidential

Weitere ähnliche Inhalte

Was ist angesagt?

Bacterial physiologogy
Bacterial physiologogyBacterial physiologogy
Bacterial physiologogyManoj Mahato
 
The treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...
The  treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...The  treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...
The treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...Kishore Chinna
 
MDL 237 - Miscellaneous obligate intracellular bacteria
MDL 237 - Miscellaneous obligate intracellular bacteriaMDL 237 - Miscellaneous obligate intracellular bacteria
MDL 237 - Miscellaneous obligate intracellular bacteriadegarden
 
Bacteriology of air and water.pptx
Bacteriology of air and water.pptxBacteriology of air and water.pptx
Bacteriology of air and water.pptxDebasish Sahoo
 
Legionella
Legionella Legionella
Legionella Aamir Ali
 
Mycobacteria.pptx
Mycobacteria.pptxMycobacteria.pptx
Mycobacteria.pptxYadav Raj
 
Clinical microbiology
Clinical microbiologyClinical microbiology
Clinical microbiologyNursing Path
 
Micro chapter 31 biofilms - architects of disease
Micro   chapter 31 biofilms - architects of diseaseMicro   chapter 31 biofilms - architects of disease
Micro chapter 31 biofilms - architects of diseaseDonna Kim
 
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)Anup Bajracharya
 
Mycoplasma by aslam matania
Mycoplasma by aslam mataniaMycoplasma by aslam matania
Mycoplasma by aslam mataniaAslam Matania
 
Cultivation of Viruses
Cultivation of VirusesCultivation of Viruses
Cultivation of VirusesIslam Sarakbi
 
Module 7 bacilli
Module 7   bacilliModule 7   bacilli
Module 7 bacilliHuang Yu-Wen
 
CUTANEOUS MYCOSES.pdf
CUTANEOUS MYCOSES.pdfCUTANEOUS MYCOSES.pdf
CUTANEOUS MYCOSES.pdfanchalthakur84
 
Legionella pneumonia
Legionella pneumoniaLegionella pneumonia
Legionella pneumoniaYoseli Ventura
 
Bacterial Toxins and Toxin classification
Bacterial Toxins and Toxin classification Bacterial Toxins and Toxin classification
Bacterial Toxins and Toxin classification Meisam Ruzbahani
 

Was ist angesagt? (20)

Antibiotics Success and Failures what is our role by Dr.T.V.Rao MD
AntibioticsSuccess and Failureswhat is our role by Dr.T.V.Rao MDAntibioticsSuccess and Failureswhat is our role by Dr.T.V.Rao MD
Antibiotics Success and Failures what is our role by Dr.T.V.Rao MD
 
Bacterial physiologogy
Bacterial physiologogyBacterial physiologogy
Bacterial physiologogy
 
The treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...
The  treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...The  treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...
The treatment of multi drug resistant tuberculosis (mdr-tb) with sirturo (be...
 
MDL 237 - Miscellaneous obligate intracellular bacteria
MDL 237 - Miscellaneous obligate intracellular bacteriaMDL 237 - Miscellaneous obligate intracellular bacteria
MDL 237 - Miscellaneous obligate intracellular bacteria
 
Bioterrorism
BioterrorismBioterrorism
Bioterrorism
 
Biowarefare
BiowarefareBiowarefare
Biowarefare
 
Bacteriology of air and water.pptx
Bacteriology of air and water.pptxBacteriology of air and water.pptx
Bacteriology of air and water.pptx
 
Biofilms ashraf ..
Biofilms ashraf  ..Biofilms ashraf  ..
Biofilms ashraf ..
 
Legionella
Legionella Legionella
Legionella
 
Mycobacteria.pptx
Mycobacteria.pptxMycobacteria.pptx
Mycobacteria.pptx
 
Clinical microbiology
Clinical microbiologyClinical microbiology
Clinical microbiology
 
Toga virus
Toga virusToga virus
Toga virus
 
Micro chapter 31 biofilms - architects of disease
Micro   chapter 31 biofilms - architects of diseaseMicro   chapter 31 biofilms - architects of disease
Micro chapter 31 biofilms - architects of disease
 
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)
Microbiology -T.U. B.Sc. 1st Year General Microbiology Syllabus (Nepal)
 
Mycoplasma by aslam matania
Mycoplasma by aslam mataniaMycoplasma by aslam matania
Mycoplasma by aslam matania
 
Cultivation of Viruses
Cultivation of VirusesCultivation of Viruses
Cultivation of Viruses
 
Module 7 bacilli
Module 7   bacilliModule 7   bacilli
Module 7 bacilli
 
CUTANEOUS MYCOSES.pdf
CUTANEOUS MYCOSES.pdfCUTANEOUS MYCOSES.pdf
CUTANEOUS MYCOSES.pdf
 
Legionella pneumonia
Legionella pneumoniaLegionella pneumonia
Legionella pneumonia
 
Bacterial Toxins and Toxin classification
Bacterial Toxins and Toxin classification Bacterial Toxins and Toxin classification
Bacterial Toxins and Toxin classification
 

Andere mochten auch

Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...Candy Smellie
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GenomeInABottle
 
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...Candy Smellie
 
Genetic code
Genetic codeGenetic code
Genetic codejayarajgr
 
Characteristics of Genetic Code
Characteristics of Genetic CodeCharacteristics of Genetic Code
Characteristics of Genetic CodePankaj Kukreti
 
Gene regulation in prokaryotes
Gene regulation in prokaryotesGene regulation in prokaryotes
Gene regulation in prokaryotesJannat Iftikhar
 
OMICS tecnology
OMICS tecnologyOMICS tecnology
OMICS tecnologytrishaissar
 
Genetic code
Genetic codeGenetic code
Genetic codeIMma Khan
 
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike SnyderPersonalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike SnyderThe Hive
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial GeneticsRoshni Mehta
 

Andere mochten auch (20)

20140711 1 day2_nist_ercc2.0workshop
20140711 1 day2_nist_ercc2.0workshop20140711 1 day2_nist_ercc2.0workshop
20140711 1 day2_nist_ercc2.0workshop
 
20140710 1 day1_nist_ercc2.0workshop
20140710 1 day1_nist_ercc2.0workshop20140710 1 day1_nist_ercc2.0workshop
20140710 1 day1_nist_ercc2.0workshop
 
20140710 4 a_bergstrom_lucas_ercc2.0_workshop
20140710 4 a_bergstrom_lucas_ercc2.0_workshop20140710 4 a_bergstrom_lucas_ercc2.0_workshop
20140710 4 a_bergstrom_lucas_ercc2.0_workshop
 
20140710 5 k_thompson_ercc2.0_workshop
20140710 5 k_thompson_ercc2.0_workshop20140710 5 k_thompson_ercc2.0_workshop
20140710 5 k_thompson_ercc2.0_workshop
 
20140711 4 e_tseng_ercc2.0_workshop
20140711 4 e_tseng_ercc2.0_workshop20140711 4 e_tseng_ercc2.0_workshop
20140711 4 e_tseng_ercc2.0_workshop
 
20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop
 
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005
 
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...
Blueprints to blue sky – analyzing the challenges and solutions for IHC compa...
 
Genetic code
Genetic codeGenetic code
Genetic code
 
Genetic code
Genetic codeGenetic code
Genetic code
 
Characteristics of Genetic Code
Characteristics of Genetic CodeCharacteristics of Genetic Code
Characteristics of Genetic Code
 
Ch3 the genetic code
Ch3 the genetic codeCh3 the genetic code
Ch3 the genetic code
 
Gene regulation in prokaryotes
Gene regulation in prokaryotesGene regulation in prokaryotes
Gene regulation in prokaryotes
 
OMICS tecnology
OMICS tecnologyOMICS tecnology
OMICS tecnology
 
Omics era
Omics eraOmics era
Omics era
 
Genetic code
Genetic codeGenetic code
Genetic code
 
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike SnyderPersonalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
 
The Genetic Code
The Genetic Code The Genetic Code
The Genetic Code
 

Ähnlich wie 20140710 3 l_paul_ercc2.0_workshop

Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingARUNDHATI MEHTA
 
nextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdfnextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdfAkhileshPathak33
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqHimanshu Sethi
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityMonica Munoz-Torres
 
Evolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAEvolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAUpendra Kumar Devisetty
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5kMonica Munoz-Torres
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionJatinder Singh
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
 
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSExploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSGolden Helix Inc
 
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome Amplification
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome AmplificationEnabling RNA-Seq With Limited RNA Using Whole Transcriptome Amplification
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome AmplificationQIAGEN
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Monica Munoz-Torres
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisMonica Munoz-Torres
 
Exploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencingExploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencingQIAGEN
 
20150601 bio sb_assembly_course
20150601 bio sb_assembly_course20150601 bio sb_assembly_course
20150601 bio sb_assembly_coursehansjansen9999
 
The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)creativebiolabs11
 
ATAC-seq protocol--Creative Biogene
ATAC-seq protocol--Creative BiogeneATAC-seq protocol--Creative Biogene
ATAC-seq protocol--Creative BiogeneDonglin Bao
 
Assign 2.0 software for the analysis of Phred quality values for quality con...
Assign 2.0  software for the analysis of Phred quality values for quality con...Assign 2.0  software for the analysis of Phred quality values for quality con...
Assign 2.0 software for the analysis of Phred quality values for quality con...Crystal Sanchez
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsGolden Helix Inc
 

Ähnlich wie 20140710 3 l_paul_ercc2.0_workshop (20)

Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
nextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdfnextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdf
 
Iplant pag
Iplant pagIplant pag
Iplant pag
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research Community
 
Evolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAEvolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNA
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSExploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
 
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome Amplification
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome AmplificationEnabling RNA-Seq With Limited RNA Using Whole Transcriptome Amplification
Enabling RNA-Seq With Limited RNA Using Whole Transcriptome Amplification
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinis
 
Exploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencingExploring new frontiers with next-generation sequencing
Exploring new frontiers with next-generation sequencing
 
20150601 bio sb_assembly_course
20150601 bio sb_assembly_course20150601 bio sb_assembly_course
20150601 bio sb_assembly_course
 
The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)
 
ATAC-seq protocol--Creative Biogene
ATAC-seq protocol--Creative BiogeneATAC-seq protocol--Creative Biogene
ATAC-seq protocol--Creative Biogene
 
Assign 2.0 software for the analysis of Phred quality values for quality con...
Assign 2.0  software for the analysis of Phred quality values for quality con...Assign 2.0  software for the analysis of Phred quality values for quality con...
Assign 2.0 software for the analysis of Phred quality values for quality con...
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and Variants
 

Mehr von External RNA Controls Consortium

Mehr von External RNA Controls Consortium (6)

140926 ercc2 workshopreport
140926 ercc2 workshopreport140926 ercc2 workshopreport
140926 ercc2 workshopreport
 
20140711 7 j_myerson_ercc2.0_workshop
20140711 7 j_myerson_ercc2.0_workshop20140711 7 j_myerson_ercc2.0_workshop
20140711 7 j_myerson_ercc2.0_workshop
 
20140711 3 t_clark_ercc2.0_workshop
20140711 3 t_clark_ercc2.0_workshop20140711 3 t_clark_ercc2.0_workshop
20140711 3 t_clark_ercc2.0_workshop
 
20140711 6 s_munro_ercc2.0_workshop
20140711 6 s_munro_ercc2.0_workshop20140711 6 s_munro_ercc2.0_workshop
20140711 6 s_munro_ercc2.0_workshop
 
20140711 5 s_pond_ercc2.0_workshop
20140711 5 s_pond_ercc2.0_workshop20140711 5 s_pond_ercc2.0_workshop
20140711 5 s_pond_ercc2.0_workshop
 
20140710 6 c_mason_ercc2.0_workshop
20140710 6 c_mason_ercc2.0_workshop20140710 6 c_mason_ercc2.0_workshop
20140710 6 c_mason_ercc2.0_workshop
 

KĂźrzlich hochgeladen

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...SĂŠrgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSĂŠrgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 

KĂźrzlich hochgeladen (20)

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 

20140710 3 l_paul_ercc2.0_workshop

  • 1. Š Lexogen, 2013 Spike-In RNA Variants: Design, Production and Application ERCC 2.0 workshop Stanford University – July 10-11, 2014 PPT Number TBD Project Number 0221 Theme T5.2 Mixquer Transcript Quantification (WAFF) Author Lukas Paul
  • 2. Š Lexogen, 20142 1. Company introduction 2. ERCC spike-in  mixes  in  Lexogen‘s  R&D 3. Design and rational of Spike-In RNA Variants 4. Production and application of Spike-In RNA Variants ERCC 2.0 Workshop Vertraulich / Confidential
  • 3. Š Lexogen, 20143Vertraulich / Confidential Lexogen: Company • Founded in 2007 • Based in Vienna, Austria • 28 employees (75% in R&D) • Lexogen, Inc.: o/n delivery to US customers • Services & products with focus on o Transcriptome profiling technologies o Complementary technologies to Next Generation Sequencing o Innovative solutions for transcriptome research Lexogen’s mission is to develop innovative technologies that will allow to resolve all complexities of the transcriptome - one of the most enigmatic and exciting areas in biology. www.LEXOGEN.com
  • 4. Š Lexogen, 20144 1. Company introduction 2. ERCC spike-in  mixes  in  Lexogen‘s  R&D 3. Design and rational of Spike-In RNA Variants 4. Production and application of Spike-In RNA Variants ERCC 2.0 Workshop Vertraulich / Confidential
  • 5. Š Lexogen, 20145 SENSETM mRNA-Seq Library Preparation Kit • Convenient, fragmentation-free workflow • Core technology: reverse transcription and ligation on intact RNA • Results in very high preservation of strand orientation Vertraulich / Confidential PN0203 PPT0383
  • 6. Š Lexogen, 20146 ERCC-based Validation of Strandedness • Strandedness usually quantified by comparing the orientation of a mapped read with the genome annotation • Problem: annotation incomplete & natural antisense transcription interferes Use of ERCC transcripts with known orientation provides an absolute means to determine strandedness Vertraulich / Confidential PN0203 PPT0383 Total RNA Strand Specificity (ERCCs only)a False Antisense Readsb Sense Reads (genome-wide)c 2 Âľg 99.997% 0.003% 99.890% 1 Âľg 99.986% 0.014% 99.815% 500 ng 99.997% 0.003% 99.821% 50 ng 99.965% 0.035% 99.779% a number of reads mapping to ERCC genes in the sense direction divided by total number of ERCC reads b number of antisense reads mapping to ERCC transcripts divided by the total number of reads mapped to the ERCC genome c number of reads mapping to annotated genes in the sense orientation divided by the number of reads mapping in both directions. Note that this measure includes biologically relevant antisense transcription.
  • 7. Š Lexogen, 20147 ERCC-validated Strandedness Determines False Positive Background of Library Preparation Method Vertraulich / Confidential Knowing the strandedness of the library preparation protocol allows for determining whether a detected transcript is truly antisense or belongs to the false positive background. 98% 99.9% strandedness 1153 2415 true antisense transcripts
  • 8. Š Lexogen, 20148 “ERCC-validated”  Strandedness  in  Lexogen’s  Portfolio   • SENSE mRNA-Seq library preparation kit • SENSE Total RNA-Seq library preparation kit Vertraulich / Confidential PN0203 PPT0383 • QuantSeqTM 3’  mRNA   library preparation Kit, see workflow (right), ERCCs also used to assess correctness  of  3’  end   mapping
  • 9. Š Lexogen, 20149 Correlation Between ERCC Input and FPKM Measured Vertraulich / Confidential PN0203 PPT0383 FPKM N of molecules [102] 1 10 102 103 104 105 106 10-21101021037.5x104 o SENSE, R2=0.910 Competitors, R2=0.834 •
  • 10. Š Lexogen, 201410 Further Use for ERCC: Transcript Length Coverage: • Native genes: interference from divergent annotations and differentially expressed transcript variants • Primer selectivity: aa  ERCCs with seamless coverage from first to last nucleotide  Native transcripts start  with  high  coverage  indicative  of  5’  truncated   annotations Vertraulich / Confidential PN0203 PPT0383 Example: SQUARE TM library prep with intrinsic over-representation of termini ERCC-0096 Top 500 transcripts
  • 11. Š Lexogen, 201411 1. Company introduction 2. ERCC spike-in  mixes  in  Lexogen‘s  R&D 3. Design and rational of Spike-In RNA variants 4. Production and application of Spike-In RNA variants ERCC 2.0 Workshop Vertraulich / Confidential
  • 12. Š Lexogen, 201412 Spike-In RNA Variants (SIRVs) - Rational • ERCC spike-in controls were designed as mono-exonic RNAs without sequence overlap. • Complementary, we found it to be desirable to have a set of nucleic acids simulating transcript variants that can be used as external spike-in controls. • This reference set would o comprise two or more transcript families, with transcripts of the same family representing reference transcript variants of the same gene o enable the controlled identification and/or quantification of transcript variants in one or more samples and o permit the assessment, validation and correction of Bioinformatics pipelines. Vertraulich / Confidential
  • 13. Š Lexogen, 201413 Spike-In RNA Variants – Gene Structure Reference genes • 7 human genes selected because of diversity in exon-intron structure • Annotated transcripts (Ensembl database) aligned to gene in CLC workbench • „Master  transcript“  created  for  each  gene  (sequence  of  all  transcript  variants) KLK5 LDHD Vertraulich / Confidential CLC main workbench 5 CLC main workbench 5 PN0203 PPT0383
  • 14. Š Lexogen, 201414 Addition of Transcript Variants • Annotated transcript variants were analyzed for AS events • AS events not covered by a variant within a family were incorporated in a new variant based on the master transcript • To cover non-splicing variants, antisense and overlapping transcripts were added (mono- and poly-exonic) • Further, Transcription Start-Site (TSS) and End-Site (TES) variants were added KLK5 SIRV1 Vertraulich / Confidential
  • 15. Š Lexogen, 201415 Spike-In RNA Variants (SIRV): Nucleotide Sequence AIM • The nucleotide sequence of the SIRVs should be non-homologous at least to eukarytic genomes and transcriptomes. • In the best case they should not align with any natural occurring sequence. SOLUTION • Genomic sequences from viruses were used to fill-in exon sequences.  Would work in external controls for eukaryotes. • Sequences were then inverted (flipped) to lose alignment identiy.  Final sequences do not align with any entry in the NCBI nt collection when blasted with standard parameters.  SIRV sequences also do not align with ERCC sequences.  In silico experiments confirmed that NGS reads generated from the SIRVs would  not  map  to  the  genome  of  any  model  organism  or  the  “ERCCome”. Vertraulich / Confidential
  • 16. Š Lexogen, 201416 Re-establishing Exon-Intron Junction Dinucleotides Vertraulich / Confidential • Most junctions are common, i.e. are also annotated in the master transcript. • These intron sequences are currently annotated as NN (see below), hence junction recognition is no problem for alignment programs NN-NN GT-AG GC-AG AT-AC SIRVS 198 (61.11%) 116 (31.10%) 7 (2.16%) 3 (0.93) 314 (96.91%) ICE database 98.70% 0.79% 0.08% • Exon-defined intron boundaries were converted to GT-AG (97%), GC-AG (2%) or AT-AC (1%) Nucleotide conversion to conform with GT-AG rule
  • 17. Š Lexogen, 201417 SIRV Properties - Summary SIRVs are modelled on mammalian sequences • Set of seven SIRV families with 6-18 transcript variants each • 74 transcript variants in total, average length 1200 nt (median 917 nt) • Variants include alternative splicing, start- and end-site variations , antisense and overlapping transcripts • GC content: 30-50% (in analogy to ERCC standards) • Poly(A)  tail:  A(30)  at  3’-end (ERCCs: 19-25 adenosines) • Length: 220-2,557 nt, longer SIRVs were trimmed by exon removal Further modifications • GT-AT exon-intron junction dinucleotide rule observed • Homopolymer runs:  ≤7nt • 5’  truncation  to  obtain  5’  G,  needed  for  T7  transcription • No homology to NCBI nt collection entries or ERCC sequences due to sequence inversion Vertraulich / Confidential PN0203 PPT0383
  • 18. Š Lexogen, 201418 SIRV Design - Overview Vertraulich / Confidential Take natural gene structure and annotated transcript variants Shorten transcript length to a maximum of 2500 nt Fill gene structure with heterologous sequence Duplicate and modify to add alternative splicing variants Add transcription start-site and end-site variants Add antisense and overlapping variants observe GU-AG intron rule cassette exon alternative start-site alternative end-site alternative last exon intron retention overlapping, antisense antisense A5SS A3SS MXEalternative first exon overlapping
  • 19. Š Lexogen, 201419 1. Company introduction 2. ERCC spike-in  mixes  in  Lexogen‘s  R&D 3. Design and rational of Spike-In RNA Variants 4. Production and application of Spike-In RNA Variants ERCC 2.0 Workshop Vertraulich / Confidential
  • 20. Š Lexogen, 201420 SIRV Production: In vitro Transcription Construct Vertraulich / Confidential starts with 5’  G, cap optional poly(A) tail added Synthetic constructs cloned for singularization and amplification Run-off T7 transcription T7-PromoterRestr.Site G Sequence A(30) Restr.Site5’ 3’ 220 - 2557 nt
  • 21. Š Lexogen, 201421 SIRV Production, QC and quantification Production  Plasmid linearization  T7 run-off transcription  Purification (essential!)  Storage in Na-Citrate buffer Quality Control  Photometric (Nanodrop): Purity, quantifcation  Microfluidics (Bioanalyzer): Integrity, quantifcation • Planned: qPCR: Accurate quantification Vertraulich / Confidential
  • 22. Š Lexogen, 201422 SIRVs: Mixes & RNA-Seq Samples Initially, 2 mixes were prepared from 60 purified transcript variants: 1. Equimolar:  1:1:1… 2. Low dynamic range: 1:10:100 3 Samples were prepared from these: 1. Equimolar mix, SIRVs only illumina TruSeq library prep without poly(A) selection 2. Equimolar mix, 30% SIRVs, 3% ERCCs, 67% UHR (Universal Human Reference RNA) illumina TruSeq library prep without poly(A) selection 3. Low dynamic range, 30% SIRVs, 3% ERCCs, 67% UHR (Universal Human Reference RNA) illumina TruSeq library prep without poly(A) selection Vertraulich / Confidential
  • 23. Š Lexogen, 201423 SIRVs: RNA-Seq Experiment • Illumina MiSeq run: 1x150 nt, 27M reads obtained • Mapping with tophat (v.2.0.8) against combined transcriptomic and genomic reference (Ensembl GRCh 37.75), Ambion’s ERCC92, and SIRVs Vertraulich / Confidential Total reads Mapping reads (%) Uniquely Mapping reads (%) #1, equimolar SIRVs 10,246,442 8,585,641 83.79% 8,505,344 83.01% #2, equimolar SIRVs, ERCCs, UHR 10,119,416 8,642,852 85.41% 8,399,336 83.00% #3, 1:10:100 SIRVs, ERCCs, UHR 6,308,855 5,404,486 85.67% 5,268,757 83.51% GRCh37.75 ERCC92 SIRVs Sample #1 4,330 0.05% 11 0.00% 8,505,555 99.95% Sample #2 7,521,308 89.55% 38,031 0.45% 839,997 10.00% Sample #3 4,156,399 78.89% 22,207 0.42% 1,090,151 20.69%
  • 24. Š Lexogen, 201424 SIRV RNA-Seq: Input / Output correlation Vertraulich / Confidential Molecules Molecules Molecules sample #1 FPKM sample#2FPKM #1 #2 #3 #1 vs #2
  • 25. Š Lexogen, 201425 SIRVs RNA-Seq: Transcript Hypotheses Transcript Hypotheses by Cufflinks • Not complete: e.g., 3ASS and exons not recognized despite multiple exon- exon reads Vertraulich / Confidential cufflinks
  • 26. Š Lexogen, 201426 Spike-In RNA Variants: Short Summary Design & production • 74 transcript variants in 7 families (6-18 variants / family) • Mimic eukaryotic genes in length and GC content; A(30) tail • Include variation on alternative splicing, transcription start-sites and end- sites, sense/antisense and overlapping genes • No homology to NCBI nt collection entries or ERCC sequences • Produced from stock plasmids as T7 run-off transcripts Mixtures • 60 SIRVs were mixed in equimolar or low dynamic range (10²) concentrations Application in RNA-Seq • Mixtures showed high mapability and no cross-mapping with UHR or ERCCs • Low input / output correlation as determined by tophat / cufflinks derived FPKM • Cufflinks cannot reconstruct all SIRV transcript variants, even in the equimolar mix, which will lead to wrong FPKM values Vertraulich / Confidential
  • 27. Š Lexogen, 201427 Spike-In RNA Variants: Outlook Optimizing production & quantification • Large-scale production and purification of transcripts • qPCR-based quantification in addition to Nanodrop & Bioanalyzer results Application • Evaluation of software for its performance in transcript hypothesis building and transcript isoform quantification Open questions • Concentration range? • Sufficient variant complexity? Length? Capping? SNPs? • How many different mixes? • Pipeline validation (Consortium?) • Sample comparison (DE) • Technical variation • Master mix vs. modules: ERCCs, SIRVs, ncRNA standards & miRNA standards (complexity, price, validation?) Vertraulich / Confidential