SlideShare ist ein Scribd-Unternehmen logo
1 von 19
1
Bayesian Models
for Chagas Disease
Kimberley M. Zorn, Mary A. Lingerfelt, Jair L. de Siqueira-Neto,
Alex M. Clark, Sean Ekins
2
Epimastigote stage in the bug
Trypomastigote stage to travel
Amastigote stage to replicate
3
▶ Asymptomatic for ~70% of people (infected for life)
▶ Fatal cardiac, neurological, & digestive symptoms can
develop up to 25 years later
▶ Curable… if caught early
▶ Current treatments are not approved in the United States
Chagas Disease
Nifurtimox
Benznidazole
4
Epidemiology
Estimated of 300-500K
in the United States
Estimated 7-8 million
infected worldwide
https://www.cdc.gov/parasites/chagas/gen_info/vectors/index.html
https://www.dndi.org/diseases-projects/chagas/
Machine Learning and Drug Discovery
▶ Simply put: Molecular pattern recognition of biological data
▶ Fingerprints to identify these patterns
▶ Define active and inactive features
▶ Statistics to watch for: Receiver Operator Characteristic (ROC)
▶ Used to generate predictions for drug activity at a certain target
▶ Real life example - Pyronaridine (an approved antimalarial)
5
Pyronaridine, Repurposed
▶ Broad Institute, 4064 compounds
▶ PubChem AID 2044 (EC50)
▶ 1853 active compounds (EC50 < 1 µM)
▶ PubChem AID 2010 (Cytotoxicity)
▶ 1698 active compounds (>10 fold difference in EC50)
▶ ~ 100 compounds tested in vitro, eleven had EC50 < 10 µM
▶ Pyronaridine: 85% in vivo efficacy, EC50 = 225 nM
6
Vehicle | Pyronaridine
Ekins et al., PLoS Negl Trop
Dis. 2015 Jun 26;9(6):e0003878
How can the everyday scientist
use Machine Learning?
7
Private Data
Public Data
Predict Activity
8
AID 2044/2010 in Assay Central
9
▶ Inconclusive = Inactive
▶ EC50 (< 1 µM)
▶ 1853 actives
▶ ROC = 0.78
▶ EC50 + Cytotoxicity (> 10 fold)
▶ 1689 actives
▶ ROC = 0.80
Subvalidations in Assay Central
10
▶ Testing AID 2044 vs Ekins
▶ Defined testing/training set
▶ Threshold = 1 µM
▶ Six actives
▶ ROC = 0.72
▶ What else can we do with
Ekins results?
Predict  Test  Retrain
11
AID2044 predicting Test2017 AID2044+Ekins predicting Test2017
Chagas Models in Assay Central
12
▶ Tulahuen strains targeting specific life cycle stage
▶ Combined strains or stages
▶ Ki measurements
▶ PubChem data discussed herein
▶ Target specific models (cruzain & cruzipain)
▶ Various thresholds
▶ More to come!
13
▶ CPI database currently contains > 150 models
▶ Molecular properties, Disease & ADME Targets
▶ Predictions for more than ten ongoing projects
▶ Assay Central compound predictions being selected for
T. cruzi bioactivity testing
▶ Share models with Java executable on any computer
www.assaycentral.org
How would you care to collaborate?
14
▶ Inexpensive, fast & easy
▶ We need more data & feedback
▶ Curious about your compounds?
Predict them in Assay Central!
▶ Ongoing projects for rare & neglected
disease drug discovery, including
Ebola & TB
More information at:
www.collaborationspharma.com
Thanks!
15
Collaborations Pharmaceuticals, Inc.
Dr. Sean Ekins
Dr. Maggie Hupcey
Dr. Mary Lingerfelt
Software + Chagas Testing
Dr. Alex Clark
Dr. Jair de Siqueira-Neto
Funded by R43GM122196 NIGMS
16
Data Curation & Management
▶ Collect bioactivity data from public & private sources
▶ Bayesian algorithm
▶ ECFP6 descriptors
▶ GitHub to share datasets and models in-house
▶ Private server for additional data backup in-house
▶ Share executable files over Google Drive or DropBox
Prediction Scores
17
Clark, A.M., et al., J. Chem. Inf. Model. 2015, 55, 1231−1245.
Drug Repurposing for Tuberculosis
18
▶ Tuberculosis (https://www.cdc.gov/tb/statistics/default.htm)
▶ 1/3 of the population is infected
▶ 1.8 million deaths in 2015
▶ Assay Central Models (~10)
▶ Public in vitro data & collaborator in vivo data
▶ Targeted models for PyrG & PanK
▶ Predicted compounds & sent for testing
▶ Vendor libraries + FDA approved drugs
▶ Two compounds active at either target, one at both
Work completed by Tom Lane
19
TB Subvalidations
Work completed by Tom Lane

Weitere ähnliche Inhalte

Was ist angesagt?

Team 5 imputing_medical_missing_data_ga approach_preseatation
Team 5 imputing_medical_missing_data_ga approach_preseatationTeam 5 imputing_medical_missing_data_ga approach_preseatation
Team 5 imputing_medical_missing_data_ga approach_preseatation
Nafiz Ishtiaque Ahmed
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven Edwards
Health Data Consortium
 

Was ist angesagt? (13)

Artificial Intelligence in Life Sciences and Agriculture.
Artificial Intelligence in Life Sciences and Agriculture.Artificial Intelligence in Life Sciences and Agriculture.
Artificial Intelligence in Life Sciences and Agriculture.
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?
 
Bloustein Poster
Bloustein PosterBloustein Poster
Bloustein Poster
 
Zen and the Art of Data Science Maintenance
Zen and the Art of Data Science MaintenanceZen and the Art of Data Science Maintenance
Zen and the Art of Data Science Maintenance
 
Multi-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gainMulti-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gain
 
AI applications in life sciences - drug development
AI applications in life sciences - drug developmentAI applications in life sciences - drug development
AI applications in life sciences - drug development
 
Team 5 imputing_medical_missing_data_ga approach_preseatation
Team 5 imputing_medical_missing_data_ga approach_preseatationTeam 5 imputing_medical_missing_data_ga approach_preseatation
Team 5 imputing_medical_missing_data_ga approach_preseatation
 
Presentatie
PresentatiePresentatie
Presentatie
 
Ai in drug discovery and drug development
Ai in drug discovery and drug developmentAi in drug discovery and drug development
Ai in drug discovery and drug development
 
Permanently eliminate the herpes virus from your body
Permanently eliminate the herpes virus from your bodyPermanently eliminate the herpes virus from your body
Permanently eliminate the herpes virus from your body
 
05 zittartz presentation_ph_v_day_2014
05 zittartz presentation_ph_v_day_201405 zittartz presentation_ph_v_day_2014
05 zittartz presentation_ph_v_day_2014
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven Edwards
 
Role of AI in Drug Discovery and Development
Role of AI in  Drug Discovery and DevelopmentRole of AI in  Drug Discovery and Development
Role of AI in Drug Discovery and Development
 

Ähnlich wie Bayesian Models for Chagas Disease

2010StanfordE25 Michele dragoescu e25 project
2010StanfordE25 Michele dragoescu e25 project2010StanfordE25 Michele dragoescu e25 project
2010StanfordE25 Michele dragoescu e25 project
mdragoescu
 

Ähnlich wie Bayesian Models for Chagas Disease (20)

C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
InSyBio at Open Coffee Athens CI
InSyBio at Open Coffee Athens CIInSyBio at Open Coffee Athens CI
InSyBio at Open Coffee Athens CI
 
Bladder Cancer Diagnostic-Initial Team Project
Bladder Cancer Diagnostic-Initial Team ProjectBladder Cancer Diagnostic-Initial Team Project
Bladder Cancer Diagnostic-Initial Team Project
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug Discovery
 
JALANov2000
JALANov2000JALANov2000
JALANov2000
 
Oscar Rodríguez-El impacto de las ciencias ómicas en la medicina, la nutrició...
Oscar Rodríguez-El impacto de las ciencias ómicas en la medicina, la nutrició...Oscar Rodríguez-El impacto de las ciencias ómicas en la medicina, la nutrició...
Oscar Rodríguez-El impacto de las ciencias ómicas en la medicina, la nutrició...
 
37º Congresso Brasileiro de Medicina Farmacêutica | Dr. João Batista Calixto
37º Congresso Brasileiro de Medicina Farmacêutica | Dr. João Batista Calixto37º Congresso Brasileiro de Medicina Farmacêutica | Dr. João Batista Calixto
37º Congresso Brasileiro de Medicina Farmacêutica | Dr. João Batista Calixto
 
Impact of Big Data & Artificial Intelligence in Drug Discovery & Development ...
Impact of Big Data & Artificial Intelligence in Drug Discovery & Development ...Impact of Big Data & Artificial Intelligence in Drug Discovery & Development ...
Impact of Big Data & Artificial Intelligence in Drug Discovery & Development ...
 
2010StanfordE25 Michele dragoescu e25 project
2010StanfordE25 Michele dragoescu e25 project2010StanfordE25 Michele dragoescu e25 project
2010StanfordE25 Michele dragoescu e25 project
 
CDx-NGS-webinar
CDx-NGS-webinarCDx-NGS-webinar
CDx-NGS-webinar
 
인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령
 
Cyrcadia Health - Health & Wearable Singapore 2015 - The Propell Group
Cyrcadia Health - Health & Wearable Singapore 2015 - The Propell GroupCyrcadia Health - Health & Wearable Singapore 2015 - The Propell Group
Cyrcadia Health - Health & Wearable Singapore 2015 - The Propell Group
 
Pediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationPediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events Presentation
 
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
 
Cco retroviruses_2013_art_slides
Cco  retroviruses_2013_art_slidesCco  retroviruses_2013_art_slides
Cco retroviruses_2013_art_slides
 
EuroBioForum2014_speaker_Manolio
EuroBioForum2014_speaker_ManolioEuroBioForum2014_speaker_Manolio
EuroBioForum2014_speaker_Manolio
 
EarlySense - NOAH19 Tel Aviv
EarlySense - NOAH19 Tel AvivEarlySense - NOAH19 Tel Aviv
EarlySense - NOAH19 Tel Aviv
 
Malcolm Pradhan on Pathology in Clincial Decision Support and the role of Dee...
Malcolm Pradhan on Pathology in Clincial Decision Support and the role of Dee...Malcolm Pradhan on Pathology in Clincial Decision Support and the role of Dee...
Malcolm Pradhan on Pathology in Clincial Decision Support and the role of Dee...
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
 
Amia tbi-14-final
Amia tbi-14-finalAmia tbi-14-final
Amia tbi-14-final
 

Mehr von Sean Ekins

Mehr von Sean Ekins (20)

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptx
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issue
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1
 
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b ig
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models -
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientists
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
 
Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...
 

Kürzlich hochgeladen

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 

Kürzlich hochgeladen (20)

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 

Bayesian Models for Chagas Disease

  • 1. 1 Bayesian Models for Chagas Disease Kimberley M. Zorn, Mary A. Lingerfelt, Jair L. de Siqueira-Neto, Alex M. Clark, Sean Ekins
  • 2. 2 Epimastigote stage in the bug Trypomastigote stage to travel Amastigote stage to replicate
  • 3. 3 ▶ Asymptomatic for ~70% of people (infected for life) ▶ Fatal cardiac, neurological, & digestive symptoms can develop up to 25 years later ▶ Curable… if caught early ▶ Current treatments are not approved in the United States Chagas Disease Nifurtimox Benznidazole
  • 4. 4 Epidemiology Estimated of 300-500K in the United States Estimated 7-8 million infected worldwide https://www.cdc.gov/parasites/chagas/gen_info/vectors/index.html https://www.dndi.org/diseases-projects/chagas/
  • 5. Machine Learning and Drug Discovery ▶ Simply put: Molecular pattern recognition of biological data ▶ Fingerprints to identify these patterns ▶ Define active and inactive features ▶ Statistics to watch for: Receiver Operator Characteristic (ROC) ▶ Used to generate predictions for drug activity at a certain target ▶ Real life example - Pyronaridine (an approved antimalarial) 5
  • 6. Pyronaridine, Repurposed ▶ Broad Institute, 4064 compounds ▶ PubChem AID 2044 (EC50) ▶ 1853 active compounds (EC50 < 1 µM) ▶ PubChem AID 2010 (Cytotoxicity) ▶ 1698 active compounds (>10 fold difference in EC50) ▶ ~ 100 compounds tested in vitro, eleven had EC50 < 10 µM ▶ Pyronaridine: 85% in vivo efficacy, EC50 = 225 nM 6 Vehicle | Pyronaridine Ekins et al., PLoS Negl Trop Dis. 2015 Jun 26;9(6):e0003878
  • 7. How can the everyday scientist use Machine Learning? 7 Private Data Public Data Predict Activity
  • 8. 8
  • 9. AID 2044/2010 in Assay Central 9 ▶ Inconclusive = Inactive ▶ EC50 (< 1 µM) ▶ 1853 actives ▶ ROC = 0.78 ▶ EC50 + Cytotoxicity (> 10 fold) ▶ 1689 actives ▶ ROC = 0.80
  • 10. Subvalidations in Assay Central 10 ▶ Testing AID 2044 vs Ekins ▶ Defined testing/training set ▶ Threshold = 1 µM ▶ Six actives ▶ ROC = 0.72 ▶ What else can we do with Ekins results?
  • 11. Predict  Test  Retrain 11 AID2044 predicting Test2017 AID2044+Ekins predicting Test2017
  • 12. Chagas Models in Assay Central 12 ▶ Tulahuen strains targeting specific life cycle stage ▶ Combined strains or stages ▶ Ki measurements ▶ PubChem data discussed herein ▶ Target specific models (cruzain & cruzipain) ▶ Various thresholds ▶ More to come!
  • 13. 13 ▶ CPI database currently contains > 150 models ▶ Molecular properties, Disease & ADME Targets ▶ Predictions for more than ten ongoing projects ▶ Assay Central compound predictions being selected for T. cruzi bioactivity testing ▶ Share models with Java executable on any computer www.assaycentral.org
  • 14. How would you care to collaborate? 14 ▶ Inexpensive, fast & easy ▶ We need more data & feedback ▶ Curious about your compounds? Predict them in Assay Central! ▶ Ongoing projects for rare & neglected disease drug discovery, including Ebola & TB More information at: www.collaborationspharma.com
  • 15. Thanks! 15 Collaborations Pharmaceuticals, Inc. Dr. Sean Ekins Dr. Maggie Hupcey Dr. Mary Lingerfelt Software + Chagas Testing Dr. Alex Clark Dr. Jair de Siqueira-Neto Funded by R43GM122196 NIGMS
  • 16. 16 Data Curation & Management ▶ Collect bioactivity data from public & private sources ▶ Bayesian algorithm ▶ ECFP6 descriptors ▶ GitHub to share datasets and models in-house ▶ Private server for additional data backup in-house ▶ Share executable files over Google Drive or DropBox
  • 17. Prediction Scores 17 Clark, A.M., et al., J. Chem. Inf. Model. 2015, 55, 1231−1245.
  • 18. Drug Repurposing for Tuberculosis 18 ▶ Tuberculosis (https://www.cdc.gov/tb/statistics/default.htm) ▶ 1/3 of the population is infected ▶ 1.8 million deaths in 2015 ▶ Assay Central Models (~10) ▶ Public in vitro data & collaborator in vivo data ▶ Targeted models for PyrG & PanK ▶ Predicted compounds & sent for testing ▶ Vendor libraries + FDA approved drugs ▶ Two compounds active at either target, one at both Work completed by Tom Lane