SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Utility of the Salmonella in Silico Typing
Resource(SISTR) to outbreak investigations
James Robertson1, Catherine Yoshida1, Peter Kruczkiewicz2, Eduardo N.
Taboada2 and John H. E. Nash3
1 National Microbiology Laboratory @Guelph , Public Health Agency of Canada
2 National Microbiology Laboratory @Lethbridge, Public Health Agency of Canada
3 National Microbiology Laboratory @Toronto, Public Health Agency of Canada
Salmonella is a leading public health concern
 Salmonella is a leading food-borne pathogen both in Canada and around the world
 Globally, there are an estimated 94 million Salmonella infections every year
 Human costs:
• acute illness
• loss of life (155,000 deaths)
 Societal costs:
• health care costs
• lost productivity
• legal costs
• impact to food industry
2
3
Potential Sources
Challenges in Salmonella typing and epidemiology
 Small number of highly prevalent/globally distributed serovars account for most
outbreaks (e.g. Enteritidis, Typhimurium)
 Epidemiologicaly unrelated isolates within same serovar  difficult to
investigate
 Additional subtyping resolution within a serovar needed (e.g. phage typing)
 Increasing use of genotypic methods (i.e. molecular typing)
 Driven by need for methods with higher discriminatory power
 A number of different approaches have been applied to molecular typing of
Salmonella
4
5
GATCGATCGATCG
GATCAATCGATCG
MLST cgMLST wgSNP’sSerotyping
Discriminatory Power
Low Low-Mid Mid-High High
• Based on reaction
of antibodies to
surface antigens
• Broad usage and
common
nomenclature in
use since the
1930’s
• Multi-Locus Sequence Typing:
developed by Maiden et al. (1998)
• Indexes genetic variation in 7 core (i.e.
“housekeeping”) genes
• cgMLST extends this principle to 100’s
to 1000’s of loci
• Provides a portable naming scheme
which correlates with historical
serotypes
• Utilizes individual
SNP’s and gives
very high
resolution
• Results are not
portable to other
public health
professionals
7
• Initial dataset of 4330 genomes
• 94.6% concordance between predicted
and reported serovar
• in silico serovar predictions based on O
and H antigens
• cgMLST refinement of serovar
assignment and analysis
• Uses minimally processed genome
assemblies
• Very fast ~30 seconds to process a
genome
What does SISTR do?
In silico analysis of WGS data
 assembly statistics
 serovar prediction
 in silico typing (MLST,
cgMLST)
 AMR prediction
Comparative genomic analyses
 cgMLST
 accessory gene content
 core SNPs
Epidemiologic analysis
 geospatial distribution
 temporal distribution
 source association
https://lfz.corefacility.ca/sistr-app/
SISTR cgMLST
• Current cgMLST scheme in SISTR based on 330 core
genes with high “assignability” (i.e. very low levels of
“missing” data)
• Will include international Salmonella cgMLST scheme (i.e.
once it is developed!)
• cgMLST information is used to:
– Assess quality of WGS data  complete, partial,
missing loci
– Supplement genoserotyping predictions
9
Testing the accuracy of SISTR
• ~45,000 Salmonella genomes were downloaded from the
SRA
• Raw reads were assembled using FLASH and Spades
• Assemblies were loaded into SISTR and the serovar
predictions were compared between predicted and
reported (where available)
• Assemblies were checked for contamination using Kraken
• Quality was assessed using Quast
10
Recovery rates of 330 cgMLST genes from Assembled
SRA genomes
11
41781
1393
1905
Number of Genomes with
Complete 330
Number of Genomes with >300
Genes
Number of Genomes with <300
Genes
N=45,079
SISTR Accuracy
12
2347
29884
N=32,321
• 93.7% Overall concordance with serovar
specified
Discordant
Concordant
13
• Two outbreaks of Salmonella Enteriditis were retrospectively sequenced
• Examined the feasibility of WGS to outbreak investigations
• Compared results of traditional molecular and microbial tests to WGS
14
15
16
17
18
SISTR (cgMLST) PARSNP (core SNP)
SNP Tree (Wuyts et al 2015)
• All three methods produce concordant
trees.
• cgMLST has a tendency to overgroup
Outbreak Clustering Categories
B
A
C
B
A+C
B
C
A
A
Correct Incorrectly Split
Over-grouped
A+B
A+C
Incorrectly Split and grouped
Concordance between cgMLST and SNP trees
Study Correct Over-grouped Split Combination Serovar(s)
1 1 1 0 0 Enteriditis
2 2 3 0 0 Enteriditis
3 5 1 0 0 Enteriditis,Typhimurium,
Derby
4 2 7 0 0 Enteriditis
5 2 0 0 0 Enteriditis
6 5 2 0 0 Enteriditis
Total 18 13 0 0
20
Conclusions
• SISTR is a a robust and accurate platform for Salmonella in silico
typing with 93.7% concordance between specified serovar and
predicted serovar
• The prototype 330 gene cgMLST scheme is readily retrievable from
HTS assemblies of varying quality levels.
• The current scheme provides coarse grain separation of Salmonella
genetic lineages that will be useful in outbreak analysis
21
22
Acknowledgements
Team:
 Ed Taboada, Peter Kruczkiewicz, Catherine Yoshida, John Nash
Research partners:
 Public Health Agency of Canada:
 OIE Laboratory for Salmonellosis – National Microbiology Lab (NML) @
Guelph
 Genomics Core and Bioinformatics Core – NML @ Winnipeg
 Public Health Genomics team – NML @ Winnipeg
 IRIDA project team
 Animal Health Veterinary Laboratory Agency – UK
 Austrian Institute of Technology – Austria
Funding:
 Genomics Research and Development Initiative
 Genome Canada (IRIDA project)
 Public Health Agency of Canada

Weitere ähnliche Inhalte

Was ist angesagt?

Antiviral Hammerhead ribozymes are ef
Antiviral Hammerhead ribozymes are efAntiviral Hammerhead ribozymes are ef
Antiviral Hammerhead ribozymes are ef
Siddhesh Sapre
 
symposium-poster-viral ER FINAL FINAL 2015
symposium-poster-viral ER FINAL FINAL 2015symposium-poster-viral ER FINAL FINAL 2015
symposium-poster-viral ER FINAL FINAL 2015
Omar M. Al-Hashimi, MPH
 
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
Katherine McGowan
 
PNAS-2016-Shaffer-OptimalCocktails
PNAS-2016-Shaffer-OptimalCocktailsPNAS-2016-Shaffer-OptimalCocktails
PNAS-2016-Shaffer-OptimalCocktails
Scott Shaffer
 
art%3A10.1186%2Fs12936-016-1304-8
art%3A10.1186%2Fs12936-016-1304-8art%3A10.1186%2Fs12936-016-1304-8
art%3A10.1186%2Fs12936-016-1304-8
wagatua njoroge
 
Sigma xi presentation final1
Sigma xi presentation   final1Sigma xi presentation   final1
Sigma xi presentation final1
mukta936
 
Sigma xi presentation mukta asnani
Sigma xi presentation   mukta asnaniSigma xi presentation   mukta asnani
Sigma xi presentation mukta asnani
mukta936
 

Was ist angesagt? (20)

Genomics: The coming challenge to the health system
Genomics: The coming challenge to the health systemGenomics: The coming challenge to the health system
Genomics: The coming challenge to the health system
 
New developments in vaccines against African swine fever
New developments in vaccines against African swine feverNew developments in vaccines against African swine fever
New developments in vaccines against African swine fever
 
2016.11.08 doherty institute symposium
2016.11.08 doherty institute symposium2016.11.08 doherty institute symposium
2016.11.08 doherty institute symposium
 
The Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groupsThe Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groups
 
Big Data and Immunology at GFII Paris fevrier 2014
Big Data and Immunology at GFII Paris fevrier 2014Big Data and Immunology at GFII Paris fevrier 2014
Big Data and Immunology at GFII Paris fevrier 2014
 
Hospital Microbiome Project Proposal
Hospital Microbiome Project ProposalHospital Microbiome Project Proposal
Hospital Microbiome Project Proposal
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
 
iGEM 2018 Tokyo Tech presentation
iGEM 2018 Tokyo Tech presentationiGEM 2018 Tokyo Tech presentation
iGEM 2018 Tokyo Tech presentation
 
Antiviral Hammerhead ribozymes are ef
Antiviral Hammerhead ribozymes are efAntiviral Hammerhead ribozymes are ef
Antiviral Hammerhead ribozymes are ef
 
symposium-poster-viral ER FINAL FINAL 2015
symposium-poster-viral ER FINAL FINAL 2015symposium-poster-viral ER FINAL FINAL 2015
symposium-poster-viral ER FINAL FINAL 2015
 
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
McGowan Katherine In vivo bioluminescence imaging can be used to monitor vari...
 
2016.03.08 immem talk salmonella outbreaks
2016.03.08 immem talk   salmonella outbreaks2016.03.08 immem talk   salmonella outbreaks
2016.03.08 immem talk salmonella outbreaks
 
Indo-SA HIVR4P
Indo-SA HIVR4PIndo-SA HIVR4P
Indo-SA HIVR4P
 
Folding
FoldingFolding
Folding
 
IsoPlexis Trifold
IsoPlexis TrifoldIsoPlexis Trifold
IsoPlexis Trifold
 
PNAS-2016-Shaffer-OptimalCocktails
PNAS-2016-Shaffer-OptimalCocktailsPNAS-2016-Shaffer-OptimalCocktails
PNAS-2016-Shaffer-OptimalCocktails
 
art%3A10.1186%2Fs12936-016-1304-8
art%3A10.1186%2Fs12936-016-1304-8art%3A10.1186%2Fs12936-016-1304-8
art%3A10.1186%2Fs12936-016-1304-8
 
Sigma xi presentation final1
Sigma xi presentation   final1Sigma xi presentation   final1
Sigma xi presentation final1
 
Sigma xi presentation mukta asnani
Sigma xi presentation   mukta asnaniSigma xi presentation   mukta asnani
Sigma xi presentation mukta asnani
 
Dr. David Mooney - Simposio Internacional 'Terapias oncológicas avanzadas'
Dr. David Mooney - Simposio Internacional 'Terapias oncológicas avanzadas'Dr. David Mooney - Simposio Internacional 'Terapias oncológicas avanzadas'
Dr. David Mooney - Simposio Internacional 'Terapias oncológicas avanzadas'
 

Ähnlich wie Robertson immemxi final March 2016

Ähnlich wie Robertson immemxi final March 2016 (20)

Listeria monocytogenes from population structure to genomic epidemiology
Listeria monocytogenes from population structure to genomic epidemiologyListeria monocytogenes from population structure to genomic epidemiology
Listeria monocytogenes from population structure to genomic epidemiology
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
 
Genome in a bottle april 30 2015 hvp Leiden
Genome in a bottle april 30 2015 hvp LeidenGenome in a bottle april 30 2015 hvp Leiden
Genome in a bottle april 30 2015 hvp Leiden
 
152310_brisse-invs-genoepi-nov2015.pdf
152310_brisse-invs-genoepi-nov2015.pdf152310_brisse-invs-genoepi-nov2015.pdf
152310_brisse-invs-genoepi-nov2015.pdf
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
220614-FEMS-2022-TmpSmx_final.pdf
220614-FEMS-2022-TmpSmx_final.pdf220614-FEMS-2022-TmpSmx_final.pdf
220614-FEMS-2022-TmpSmx_final.pdf
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
16S MVRSION at Washington University
16S MVRSION at Washington University16S MVRSION at Washington University
16S MVRSION at Washington University
 
Chapter 2.4 tmb and ngs
Chapter 2.4 tmb and ngsChapter 2.4 tmb and ngs
Chapter 2.4 tmb and ngs
 
Whole Genome Trait Association in SVS
Whole Genome Trait Association in SVSWhole Genome Trait Association in SVS
Whole Genome Trait Association in SVS
 
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerBioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
Teresa Coque Hospital Universitario Ramón y Cajal.
Teresa Coque  Hospital Universitario Ramón y Cajal. Teresa Coque  Hospital Universitario Ramón y Cajal.
Teresa Coque Hospital Universitario Ramón y Cajal.
 
Optical Sensors in Plant Disease Detection
Optical Sensors in Plant Disease DetectionOptical Sensors in Plant Disease Detection
Optical Sensors in Plant Disease Detection
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
 

Mehr von IRIDA_community

Mehr von IRIDA_community (14)

Hetman immem xi final March 2016
Hetman immem xi final March 2016Hetman immem xi final March 2016
Hetman immem xi final March 2016
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016
 
Emma FoodON poster3
Emma FoodON poster3Emma FoodON poster3
Emma FoodON poster3
 
Emma Food on workshop allergy_eg
Emma Food on workshop allergy_egEmma Food on workshop allergy_eg
Emma Food on workshop allergy_eg
 
Biocuration gen epio_poster
Biocuration gen epio_posterBiocuration gen epio_poster
Biocuration gen epio_poster
 
Emma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_posterEmma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_poster
 
Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016
 
Integrate Ontologies into your apps
Integrate Ontologies into your appsIntegrate Ontologies into your apps
Integrate Ontologies into your apps
 
Report Calc for Quality Control
Report Calc for Quality ControlReport Calc for Quality Control
Report Calc for Quality Control
 
Irida immemxi hsiao
Irida immemxi hsiaoIrida immemxi hsiao
Irida immemxi hsiao
 
Gen epio immem_griffiths
Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffiths
 
Irida bccdc dec10_2015
Irida bccdc dec10_2015Irida bccdc dec10_2015
Irida bccdc dec10_2015
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
 
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
 

Kürzlich hochgeladen

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 

Kürzlich hochgeladen (20)

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curve
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 

Robertson immemxi final March 2016

  • 1. Utility of the Salmonella in Silico Typing Resource(SISTR) to outbreak investigations James Robertson1, Catherine Yoshida1, Peter Kruczkiewicz2, Eduardo N. Taboada2 and John H. E. Nash3 1 National Microbiology Laboratory @Guelph , Public Health Agency of Canada 2 National Microbiology Laboratory @Lethbridge, Public Health Agency of Canada 3 National Microbiology Laboratory @Toronto, Public Health Agency of Canada
  • 2. Salmonella is a leading public health concern  Salmonella is a leading food-borne pathogen both in Canada and around the world  Globally, there are an estimated 94 million Salmonella infections every year  Human costs: • acute illness • loss of life (155,000 deaths)  Societal costs: • health care costs • lost productivity • legal costs • impact to food industry 2
  • 4. Challenges in Salmonella typing and epidemiology  Small number of highly prevalent/globally distributed serovars account for most outbreaks (e.g. Enteritidis, Typhimurium)  Epidemiologicaly unrelated isolates within same serovar  difficult to investigate  Additional subtyping resolution within a serovar needed (e.g. phage typing)  Increasing use of genotypic methods (i.e. molecular typing)  Driven by need for methods with higher discriminatory power  A number of different approaches have been applied to molecular typing of Salmonella 4
  • 5. 5 GATCGATCGATCG GATCAATCGATCG MLST cgMLST wgSNP’sSerotyping Discriminatory Power Low Low-Mid Mid-High High • Based on reaction of antibodies to surface antigens • Broad usage and common nomenclature in use since the 1930’s • Multi-Locus Sequence Typing: developed by Maiden et al. (1998) • Indexes genetic variation in 7 core (i.e. “housekeeping”) genes • cgMLST extends this principle to 100’s to 1000’s of loci • Provides a portable naming scheme which correlates with historical serotypes • Utilizes individual SNP’s and gives very high resolution • Results are not portable to other public health professionals
  • 6.
  • 7. 7 • Initial dataset of 4330 genomes • 94.6% concordance between predicted and reported serovar • in silico serovar predictions based on O and H antigens • cgMLST refinement of serovar assignment and analysis • Uses minimally processed genome assemblies • Very fast ~30 seconds to process a genome
  • 8. What does SISTR do? In silico analysis of WGS data  assembly statistics  serovar prediction  in silico typing (MLST, cgMLST)  AMR prediction Comparative genomic analyses  cgMLST  accessory gene content  core SNPs Epidemiologic analysis  geospatial distribution  temporal distribution  source association https://lfz.corefacility.ca/sistr-app/
  • 9. SISTR cgMLST • Current cgMLST scheme in SISTR based on 330 core genes with high “assignability” (i.e. very low levels of “missing” data) • Will include international Salmonella cgMLST scheme (i.e. once it is developed!) • cgMLST information is used to: – Assess quality of WGS data  complete, partial, missing loci – Supplement genoserotyping predictions 9
  • 10. Testing the accuracy of SISTR • ~45,000 Salmonella genomes were downloaded from the SRA • Raw reads were assembled using FLASH and Spades • Assemblies were loaded into SISTR and the serovar predictions were compared between predicted and reported (where available) • Assemblies were checked for contamination using Kraken • Quality was assessed using Quast 10
  • 11. Recovery rates of 330 cgMLST genes from Assembled SRA genomes 11 41781 1393 1905 Number of Genomes with Complete 330 Number of Genomes with >300 Genes Number of Genomes with <300 Genes N=45,079
  • 12. SISTR Accuracy 12 2347 29884 N=32,321 • 93.7% Overall concordance with serovar specified Discordant Concordant
  • 13. 13 • Two outbreaks of Salmonella Enteriditis were retrospectively sequenced • Examined the feasibility of WGS to outbreak investigations • Compared results of traditional molecular and microbial tests to WGS
  • 14. 14
  • 15. 15
  • 16. 16
  • 17. 17
  • 18. 18 SISTR (cgMLST) PARSNP (core SNP) SNP Tree (Wuyts et al 2015) • All three methods produce concordant trees. • cgMLST has a tendency to overgroup
  • 19. Outbreak Clustering Categories B A C B A+C B C A A Correct Incorrectly Split Over-grouped A+B A+C Incorrectly Split and grouped
  • 20. Concordance between cgMLST and SNP trees Study Correct Over-grouped Split Combination Serovar(s) 1 1 1 0 0 Enteriditis 2 2 3 0 0 Enteriditis 3 5 1 0 0 Enteriditis,Typhimurium, Derby 4 2 7 0 0 Enteriditis 5 2 0 0 0 Enteriditis 6 5 2 0 0 Enteriditis Total 18 13 0 0 20
  • 21. Conclusions • SISTR is a a robust and accurate platform for Salmonella in silico typing with 93.7% concordance between specified serovar and predicted serovar • The prototype 330 gene cgMLST scheme is readily retrievable from HTS assemblies of varying quality levels. • The current scheme provides coarse grain separation of Salmonella genetic lineages that will be useful in outbreak analysis 21
  • 22. 22 Acknowledgements Team:  Ed Taboada, Peter Kruczkiewicz, Catherine Yoshida, John Nash Research partners:  Public Health Agency of Canada:  OIE Laboratory for Salmonellosis – National Microbiology Lab (NML) @ Guelph  Genomics Core and Bioinformatics Core – NML @ Winnipeg  Public Health Genomics team – NML @ Winnipeg  IRIDA project team  Animal Health Veterinary Laboratory Agency – UK  Austrian Institute of Technology – Austria Funding:  Genomics Research and Development Initiative  Genome Canada (IRIDA project)  Public Health Agency of Canada

Hinweis der Redaktion

  1. Serovar prediction provides antigenic formula and serovar name – compatible with historical data