Luca Cozzuto
Bioinformatics Core Facility
vectorQC
A pipeline for the assembly
and annotation of vectors
Background
A vector is a DNA molecule used as a vehicle to carry foreign genetic material into a
cell, where it can be replicated and/or expressed.
The vector itself is generally a DNA sequence that consists of an insert (transgene) and
a larger sequence that serves as the "backbone" of the vector.
Figure labels: vector, host cell, amplification (cloning vector), expression (expression vector)
Background
A vector is composed of different elements:
• Origin of replication
• Cloning sites: one or more targets for restriction enzymes
• Reporter genes: genes whose function is activated or inactivated by a successful insertion, so that positive colonies can be told apart by colour
• Antibiotic resistance: for selecting only the colonies containing the vector
• Promoter
• …
Figure: the pBR322 plasmid (source: Wikipedia)
The problem
Vectors are now a basic tool in biotechnology, and it is quite common for a lab or facility to keep a library of vectors.
With each passing year, the risk of mislabelling, construct degradation and contamination increases.
Quality control of the integrity of the vector backbone and of the inserted DNA could help avoid wasted time and money and reduce errors.
Solution
Biomolecular Screening & Protein Technologies Unit
Genomics Unit
Bioinformatics Unit
Solution
• Pool of vectors
• Massive sequencing
• Analysis: reproducible pipeline
• Result: report and map of each vector
• Database
The pipeline: vectorQC
• Quality trimming and assembly: fragmented DNA → scaffolds / whole constructs
• Annotation of features: scaffolds + DB of features + list of inserts → annotations
• Generating maps, report and sequences from the annotations
Quality control and trimming
• FastQC: QC of initial and trimmed reads
• Skewer: trimming of the raw reads
Read assembly
• FLASH: merging of overlapping reads (optional)
• SPAdes: assembly, which is then corrected with a custom script that addresses circularity
• Custom script: randomly joins the scaffolds into a single molecule
Annotation
• BLAST: annotating features and, where possible, detecting the DNA insert
• Restrict (EMBOSS): detecting restriction enzyme sites
• Circular Genome Viewer: generating the maps
• MultiQC: collecting the results in a comprehensive report
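The circularity correction listed under read assembly is not shown in the slides. As a rough sketch of one common approach (an assumption, not necessarily what the vectorQC custom script does): an assembler can emit a circular plasmid as a linear contig whose end repeats its start, so the longest suffix that equals a prefix can be trimmed away. The function names and the `min_overlap` threshold are hypothetical.

```python
def terminal_overlap(seq, min_overlap=20):
    """Return the length of the longest suffix of seq that equals a prefix.

    Only overlaps of at least min_overlap bases, and shorter than the
    whole sequence, are considered; 0 means no such overlap was found.
    """
    for k in range(len(seq) - 1, min_overlap - 1, -1):
        if seq[:k] == seq[-k:]:
            return k
    return 0


def trim_circular(seq, min_overlap=20):
    """Drop the repeated end of a linear contig that came from a circular molecule."""
    k = terminal_overlap(seq, min_overlap)
    return seq[:-k] if k else seq
```

Exact matching is used here for brevity; real reads contain errors, so a practical version would likely tolerate mismatches in the overlap.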
Available resources
• Database of features: from the PlasMapper tool, but can be expanded
• Database of restriction enzymes: REBASE
Custom resources
• Insert list: custom FASTA file with the names of the inserts
https://github.com/biocorecrg/vectorQC
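To illustrate what a restriction enzyme database is used for, here is a minimal sketch of locating recognition sites on a circular molecule. It is not how the pipeline does it (the slides name the EMBOSS Restrict tool with REBASE); the three-entry `ENZYMES` table is a hand-written stand-in for REBASE, and `find_sites` is a hypothetical helper.

```python
# Tiny hand-written stand-in for a REBASE-derived table of recognition sites.
# All three sites are palindromic, so scanning one strand is sufficient.
ENZYMES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def find_sites(seq, enzymes=ENZYMES):
    """Return 0-based start positions of each recognition site in a circular sequence.

    Assumes the sequence is longer than every recognition site.
    """
    seq = seq.upper()
    hits = {}
    for name, site in enzymes.items():
        # Append the first len(site) - 1 bases so that sites spanning
        # the origin of the circular molecule are also found.
        extended = seq + seq[: len(site) - 1]
        positions = []
        start = extended.find(site)
        while start != -1:
            positions.append(start)
            start = extended.find(site, start + 1)
        hits[name] = positions
    return hits
```

A linear search would miss any site that straddles the arbitrary start of the assembled sequence, which is why the sequence is extended before scanning.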
Good practices
• Continuous integration
• Docker image on Docker Hub with automatic builds
Next developments
• Improving the assembly: removing low-coverage contigs
• Comparison with a reference: if one is provided, check the concordance of the contigs with it
• Detection of variants: SNP / indel calling against the reference, if provided
https://github.com/biocorecrg/vectorQC
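The planned removal of low-coverage contigs could look roughly like the sketch below. It relies on the SPAdes contig naming scheme (`NODE_<n>_length_<len>_cov_<coverage>`); the relative cutoff `min_fraction` and the function name are assumptions for illustration, not part of vectorQC.

```python
import re

# SPAdes names contigs like "NODE_1_length_5386_cov_152.3".
HEADER = re.compile(r"NODE_\d+_length_(\d+)_cov_([\d.]+)")


def filter_low_coverage(fasta_text, min_fraction=0.1):
    """Keep contigs whose coverage is at least min_fraction of the highest coverage.

    Returns a list of (name, sequence) tuples in input order.
    """
    records = []
    for block in fasta_text.split(">")[1:]:
        header, _, body = block.partition("\n")
        match = HEADER.search(header)
        if match is None:
            continue  # not a SPAdes-style header; skipped in this sketch
        coverage = float(match.group(2))
        records.append((header.strip(), coverage, body.replace("\n", "")))
    if not records:
        return []
    cutoff = min_fraction * max(cov for _, cov, _ in records)
    return [(name, seq) for name, cov, seq in records if cov >= cutoff]
```

A cutoff relative to the best-covered contig is only one possible choice; an absolute coverage threshold would work the same way.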
Thank you!
Toni Hermoso Pulido
Julia Ponomarenko
Sarah Bonnin
Jochen Hecht (Genomics Unit)
Carlo Carolis (BS&PT Unit)
