Alexandre Borrel, PhD
Postdoctoral Research Fellow,
National Institute of Environmental Health Sciences,
RTP, North Carolina, USA
Exploring the Chemical Universe
using www.ChemMaps.com
@AlBorrel | ORCID 0000-0001-6499-4540
National Institutes of Health
U.S. Department of Health and Human Services
More than 1 × 10^60 accessible molecules (1, 2, 3)
(1) Hann, M.M., and Oprea, T.I. (2004). Curr. Opin. Chem. Biol. 8: 255–263.
(2) Ursu, O., Rayan, A., Goldblum, A., and Oprea, T.I. (2011). Wiley Interdiscip. Rev. Comput. Mol. Sci. 1: 760–781.
(3) Drew, K.L.M., Baiman, H., Khwaounjoo, P., Yu, B., and Reynisson, J. (2012). J. Pharm. Pharmacol. 64: 490–495.
Introduction
Chemical space: “…‘Chemical space’ is a term often used
in place of ‘multi-dimensional descriptor space’: it is a
region defined by a particular choice of descriptors…”
Dobson CM (2004) Nature 432:824–828
Chemical space
Lipinski C, Hopkins A (2004) Nature 432:855–861.
Chemical space
• Define and visualize domains
• Locate chemical of interest
• Optimization: define analogues, replacements, …
• Investigate ADME/Tox properties
• Drug repurposing
Investigate
new area
Efficient navigation tool
Google Maps
Google Maps approach
• Interactive
• Easy to use
• Informative
• Responsive
• ….
Chemical space
DrugMap: compounds
~8,000 drug entries (release 12-2018):
• ~2,500 FDA-approved small molecule drugs
• Over 5,000 experimental drugs.
https://www.drugbank.ca/
DrugMap: descriptors
1D descriptors: chemical formula (C23H34O5)
• Molecular weight
• Count of atoms
• …
2D descriptors: connectivity
• Pharmacophore based
• …
3D descriptors: spatial coordinates (LigPrep to generate 3D)
• Volume
• Surface
• …
Descriptor selection:
• Remove null variance
• Pearson’s correlation coefficient < 0.9
https://www.drugbank.ca/
RDKit: http://www.rdkit.org/
PaDEL: http://www.yapcwsoft.com/dd/padeldescriptor/
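The two descriptor-selection rules above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the pipeline used for ChemMaps: the function names, the greedy left-to-right order, and the use of the absolute value of the correlation are all assumptions, and the toy descriptor values are invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def select_descriptors(columns, max_abs_r=0.9):
    """Return the descriptor names that survive both filters.

    columns: dict mapping descriptor name -> list of values (one per compound).
    Assumption: descriptors are visited in insertion order, and a descriptor
    is dropped when |r| >= max_abs_r with any descriptor already kept.
    """
    kept = []
    for name, values in columns.items():
        if len(set(values)) <= 1:  # null variance: same value for every compound
            continue
        if any(abs(pearson(values, columns[k])) >= max_abs_r for k in kept):
            continue
        kept.append(name)
    return kept


# Hypothetical toy data: four compounds, four descriptors.
toy = {
    "MW": [100.0, 200.0, 300.0, 400.0],
    "nAtoms": [10.0, 20.0, 30.0, 41.0],   # nearly collinear with MW
    "const": [1.0, 1.0, 1.0, 1.0],        # null variance
    "logP": [2.0, -1.0, 0.5, 3.0],
}
selected = select_descriptors(toy)
```

With this toy input, `const` is removed by the variance filter and `nAtoms` by the correlation filter, so `MW` and `logP` remain.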
DrugMap: descriptors space
Rows: samples (compounds); columns: variables (descriptors)
        X1        X2        ...   X238
1       X1,1      X1,2      ...   X1,238
2       X2,1      X2,2      ...   X2,238
...     ...       ...       ...   ...
8550    X8550,1   X8550,2   ...   X8550,238
116 1D/2D descriptors
122 3D descriptors
DrugMap: descriptors space
Multiple PCA
• x, y = 116 1D-2D descriptors
• z = 70 3D descriptors
PC1 = 14%; PC2 = 9%; PC3 = 26%
• Sufficient coverage of variance
• Understandable
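The "multiple PCA" idea on this slide (x and y from a PCA on the 1D/2D descriptors, z from a separate PCA on the 3D descriptors) can be sketched as follows. This is only an illustration under stated assumptions: the principal axes are found with a simple power iteration rather than a numerical library, descriptors are centered but not scaled, and every function name and all toy values are hypothetical.

```python
from math import sqrt


def center(rows):
    """Subtract the column mean from every value."""
    n, m = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(m)]
    return [[r[j] - means[j] for j in range(m)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centered rows."""
    n, m = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(m)]
            for i in range(m)]


def top_components(cov, k, iters=500):
    """First k principal axes by power iteration with deflation."""
    m = len(cov)
    cov = [row[:] for row in cov]
    axes = []
    for _ in range(k):
        v = [1.0 / (i + 1) for i in range(m)]  # deterministic start vector
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(m)) for i in range(m)]
            norm = sqrt(sum(x * x for x in w))
            if norm == 0:
                break
            v = [x / norm for x in w]
        lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(m)) for i in range(m))
        axes.append(v)
        # Deflate: remove the variance already explained by this axis.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(m)] for i in range(m)]
    return axes


def project(rows, axes):
    return [[sum(a * b for a, b in zip(r, axis)) for axis in axes] for r in rows]


def multiple_pca(desc_2d, desc_3d):
    """x, y from the 1D/2D block; z from the 3D block."""
    c2, c3 = center(desc_2d), center(desc_3d)
    xy = project(c2, top_components(covariance(c2), 2))
    z = project(c3, top_components(covariance(c3), 1))
    return [(p[0], p[1], q[0]) for p, q in zip(xy, z)]


# Hypothetical toy data: four compounds, three 1D/2D and two 3D descriptors.
desc_2d = [[1.0, 2.0, 0.0], [2.0, 4.0, 1.0], [3.0, 6.0, 0.0], [4.0, 8.0, 1.0]]
desc_3d = [[10.0, 1.0], [12.0, 1.5], [15.0, 1.2], [11.0, 2.0]]
coords = multiple_pca(desc_2d, desc_3d)
```

Because z comes from its own PCA, its share of variance is measured against the 3D descriptors only, which is consistent with the slide reporting a larger percentage for PC3 than for PC1 and PC2.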
Environmental Chemical Space
~48,000 chemicals with 3D descriptors
Informed by regulatory lists*:
• Endocrine Disruptor Screening Program
• Toxic Substances Control Act Inventory
• Canadian Domestic Substances List
• Swedish Chemicals Agency
~12,000 chemicals with acute systemic toxicity data
• Rat oral LD50 values
• GHS/EPA classifications
*not inclusive
https://comptox.epa.gov/dashboard/
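As an illustration of how a rat oral LD50 value maps to the GHS and EPA classes mentioned above, the sketch below applies the standard published acute oral toxicity cutoffs (in mg/kg body weight). The slides do not state which cutoffs were applied to the ~12,000 chemicals, so the thresholds and function names here are assumptions for the example only.

```python
# Upper LD50 limits (mg/kg, inclusive) for each class; assumed standard cutoffs.
GHS_ORAL_LIMITS = [(5, 1), (50, 2), (300, 3), (2000, 4), (5000, 5)]
EPA_ORAL_LIMITS = [(50, "I"), (500, "II"), (5000, "III")]


def ghs_category(ld50_mg_per_kg):
    """GHS acute oral toxicity category (1-5), or None above 5000 mg/kg."""
    for limit, category in GHS_ORAL_LIMITS:
        if ld50_mg_per_kg <= limit:
            return category
    return None  # not classified under GHS


def epa_category(ld50_mg_per_kg):
    """EPA acute oral toxicity category (I-IV)."""
    for limit, category in EPA_ORAL_LIMITS:
        if ld50_mg_per_kg <= limit:
            return category
    return "IV"


example = (ghs_category(250), epa_category(250))
```

For an LD50 of 250 mg/kg, `example` is GHS category 3 and EPA category II.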
www.chemmaps.com
version 1.0
ACS New Orleans (March 2018)
www.ChemMaps.com
~30,000 unique users since March 2018 (ACS - New Orleans)
~1,000 new users each month (~ 8,000 in June 2018)
@SpaceChemMaps
Challenges for version 2
• Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox)
• > 800,000 chemicals (chemical infrastructure for EPA’s Safer Chemicals Research, including the ToxCast and Tox21 high-throughput toxicology efforts)
https://www.epa.gov
ChemMaps v2 work-in-progress
• Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox Dashboard)
• > 800,000 chemicals
• Chemical infrastructure for EPA’s Safer Chemicals Research, including ToxCast and Tox21 high-throughput toxicology efforts
• OPERA model predictions
https://comptox.epa.gov/dashboard
• More interactive
• Users can input their data
• Customizable
• Accessibility
Live demo (video)
Future Vision
• Select option on the map
• Compute distances between several chemicals using various metrics
• Project a map on the fly
Conclusions
• www.chemmaps.com (version 2)
• Project the whole DSSTox database
• Update DrugBank (release 12-2018)
• Customize the map
• Upload chemicals on the fly
• Accessibility
Requirements
• Multiplatform (phone, tablet, computer)
• Firefox >59, Chrome >65, Safari >5 (WebGL technology)
• 1 GB of GPU memory
Beta version on the NIEHS network
Release planned for the end of April 2019
@SpaceChemMaps
Fourches’ lab
Dr. Denis Fourches
NIEHS
Dr. Nicole Kleinstreuer
Office of Data Science
Dr. Kamel Mansouri
(contractor, ILS)
www.chemmaps.com
@SpaceChemMaps
Annexes
EnvMap: projections
Principal component analysis: 216 descriptors; PC1 = 14%, PC2 = 11%, PC3 = 9%
Independent component analysis
DrugMap: projections
Principal component analysis: 186 descriptors; PC1 = 16%, PC2 = 11%, PC3 = 8%
Independent component analysis: 186 descriptors
Multidimensional scaling (3D): 186 descriptors, Euclidean distance
DrugMap: compounds
MolVS (RDkit):
- SMILES standardization, normalize
- Remove salts
- Remove hydrogen
- Remove fragments (mixture)
8,752 SMILES
8,551 canonical SMILES
C[S@@](=O)CC[C@H](N)C(O)=O
DB02235
C[S+]([O-])CCC(N)C(=O)O
DB02165
[Zn2+] ….
MolVS: https://molvs.readthedocs.io/en/latest/
Fourches,D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
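The reduction from 8,752 input SMILES to 8,551 unique structures follows from removing salts and mixture fragments and then merging entries that end up with the same structure. The sketch below shows only that bookkeeping, using a string-level stand-in: it keeps the largest dot-separated fragment and deduplicates on the resulting string. It is not MolVS or RDKit, it does not canonicalize or normalize charges, and the helper names and toy records are hypothetical.

```python
import re

# Rough atom tokenizer for SMILES: bracket atoms, two-letter halogens,
# then the one-letter organic subset (aliphatic and aromatic).
ATOM = re.compile(r"\[[^\]]+\]|Br|Cl|[BCNOPSFIbcnops]")


def atom_count(smiles):
    """Approximate number of atoms written explicitly in a SMILES string."""
    return len(ATOM.findall(smiles))


def largest_fragment(smiles):
    """Keep the dot-separated component with the most atoms (drops counter-ions)."""
    return max(smiles.split("."), key=atom_count)


def deduplicate(records):
    """Map each cleaned structure string to the first identifier that produced it.

    records: iterable of (identifier, smiles) pairs.
    Assumption: identical strings mean identical structures; a real pipeline
    would compare canonical SMILES produced by a cheminformatics toolkit.
    """
    unique = {}
    for identifier, smiles in records:
        unique.setdefault(largest_fragment(smiles), identifier)
    return unique


# Hypothetical toy records (identifiers are invented).
toy_records = [
    ("entry_A", "CCO.Cl"),            # hydrochloride written as a mixture
    ("entry_B", "CCO"),               # same parent structure as entry_A
    ("entry_C", "CC(=O)[O-].[Na+]"),  # sodium salt
]
unique_structures = deduplicate(toy_records)
```

In this toy case three records collapse to two structures, `CCO` and `CC(=O)[O-]`, which is the same kind of shrinkage the slide reports at full scale.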
DrugMap: descriptors space
1D descriptors: chemical formula (C23H34O5)
• Molecular weight
• Count of atoms
• …
2D descriptors: connectivity
• Pharmacophore based
• …
1D/2D RDKit descriptors (648)
https://www.drugbank.ca/
RDKit: http://www.rdkit.org/
DrugMap: 3D generation
RDkit: http://www.rdkit.org/
https://www.schrodinger.com/ligprep
Riniker, S.; Landrum, G. A. J. Chem. Inf. Model. 55:2562–2574 (2015)
3D generation
• Riniker and Landrum (RDKit)
SMILES → SDF
DrugMap: 3D descriptors
Cao,D.-S. et al. (2013) J. Chem. Inf. Model., 53, 3086–3096
3D descriptors:
spatial coordinates
• Volume
• Surface
• Charge distribution
• …
3D PyDPI descriptors (420)
DrugMap: descriptors space
Rows: samples (compounds); columns: variables (descriptors)
        X1        X2        ...   X1068
1       X1,1      X1,2      ...   X1,1068
2       X2,1      X2,2      ...   X2,1068
...     ...       ...       ...   ...
8550    X8550,1   X8550,2   ...   X8550,1068
Descriptor selection:
• Remove null variance
• Pairwise Pearson’s correlation coefficient < 0.9
Projection
Multiple PCA
• x, y = 138 1D-2D descriptors
• z = 78 3D descriptors
Coverage of variance: PC1: 13%; PC2: 9%; PC3: 24%
Future Vision: Environmental Maps
• Define and project several domains
• Add entire DSSTox Inventory (>700,000 chemicals)
• Incorporate diverse biological datasets (e.g. ToxRefDB, HTT)
Future Vision: map on the fly
• Define new map on the fly
• Chemical databases
• Precomputed coordinates
• Local version
Future Vision: Virtual reality
Future Vision: Navigation
• Select and redefine part of the chemical map on the fly
• Compute distances between several chemicals using various metrics
• Project new chemical lists on map
• Add your chemicals/data/model predictions on the map
• Download area and matrix of distance
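The distance items above could look like the following sketch, which builds a pairwise distance matrix from map coordinates under a selectable metric. The slides do not say which metrics are planned, so Euclidean and Manhattan are chosen here only as examples; the function names and the coordinates are hypothetical.

```python
from math import sqrt


def euclidean(a, b):
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


# Assumed set of metrics for illustration.
METRICS = {"euclidean": euclidean, "manhattan": manhattan}


def distance_matrix(coords, metric="euclidean"):
    """Pairwise distances between chemicals.

    coords: dict mapping chemical name -> (x, y, z) map coordinates.
    Returns a nested dict so that matrix[a][b] is the distance from a to b.
    """
    dist = METRICS[metric]
    names = list(coords)
    return {a: {b: dist(coords[a], coords[b]) for b in names} for a in names}


# Hypothetical map coordinates for three selected chemicals.
selection = {
    "chem_A": (0.0, 0.0, 0.0),
    "chem_B": (3.0, 4.0, 0.0),
    "chem_C": (3.0, 4.0, 12.0),
}
d_euclid = distance_matrix(selection)
d_manhattan = distance_matrix(selection, metric="manhattan")
```

A nested dictionary keeps the chemical names attached to every value, which makes it straightforward to write the matrix out as a table for download.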

ChemMaps version 2, ACS Orlando 2019

  • 1. Exploring the Chemical Universe using www.ChemMaps.com. Alexandre Borrel, PhD, Postdoctoral Research Fellow, National Institute of Environmental Health Sciences, RTP, North Carolina, USA. @AlBorrel, ORCID 0000-0001-6499-4540
  • 2. National Institutes of Health, U.S. Department of Health and Human Services. Introduction: more than 10^60 accessible molecules (1,2,3). (1) Hann, M.M., and Oprea, T.I. (2004). Curr. Opin. Chem. Biol. 8: 255–263. (2) Ursu, O., Rayan, A., Goldblum, A., and Oprea, T.I. (2011). Rev. Comput. Mol. Sci. 1: 760–781. (3) Drew, K.L.M., Baiman, H., Khwaounjoo, P., Yu, B., and Reynisson, J. (2012). Journal of Pharmacy and Pharmacology 64(4): 490–495.
  • 4. Chemical space: “…‘Chemical space’ is a term often used in place of ‘multi-dimensional descriptor space’: it is a region defined by a particular choice of descriptors…” Dobson CM (2004) Nature 432:824–828.
  • 5. Chemical space. Lipinski C, Hopkins A (2004) Nature 432:855–861.
  • 7. Locate chemical of interest
  • 8. Locate chemical of interest • Optimization, define analogue, replacement, …
  • 9. Investigate ADME/Tox properties • Locate chemical of interest • Optimization, define analogue, replacement, …
  • 10. Drug repurposing • Locate chemical of interest • Optimization, define analogue, replacement, … • Investigate ADME/Tox properties
  • 11. Define, visualize domains • Locate chemical of interest • Optimization, define analogue, replacement, … • Investigate ADME/Tox properties • Drug repurposing
  • 12. Investigate new area
  • 13. Efficient navigation tool
  • 15. Chemical space: Google Maps approach • Interactive • Easy to use • Informative • Responsive • …
  • 16. DrugMap: compounds. ~8,000 drug entries (release 12-2018): • ~2,500 FDA-approved small molecule drugs • Over 5,000 experimental drugs. https://www.drugbank.ca/
  • 17. DrugMap: descriptors. 1D descriptors (chemical formula, e.g. C23H34O5): • Molecular weight • Count of atoms • … 2D descriptors (connectivity): • Pharmacophore based • … 3D descriptors (spatial coordinates; LigPrep to generate 3D): • Volume • Surface • … Descriptor selection: • Remove null variance • Pearson’s correlation coefficient < 0.9. Sources: https://www.drugbank.ca/ RDKit: http://www.rdkit.org/ PaDEL: http://www.yapcwsoft.com/dd/padeldescriptor/
  • 18. DrugMap: descriptors space. Matrix of 8,550 samples (compounds) × 238 variables (descriptors), X1,1 … X8550,238: 116 1D/2D descriptors and 122 3D descriptors.
  • 19. DrugMap: descriptors space. Multiple PCA: x, y = 116 1D-2D descriptors; z = 70 3D descriptors. PC1 = 14%; PC2 = 9%; PC3 = 26% • Sufficient coverage of variance • Understandable
  • 20. Environmental Chemical Space. ~48,000 chemicals with 3D descriptors. Informed by regulatory lists*: • Endocrine Disruptor Screening Program • Toxic Substances Control Act Inventory • Canadian Domestic Substances List • Swedish Chemicals Agency. ~12,000 chemicals with acute systemic toxicity data: • Rat oral LD50 values • GHS/EPA classifications. *not inclusive. https://comptox.epa.gov/dashboard/
  • 21. www.chemmaps.com version 1.0 ACS New Orleans (March 2018)
  • 22. www.ChemMaps.com: ~30,000 unique users since March 2018 (ACS New Orleans); ~1,000 new users each month (~8,000 in June 2018). @SpaceChemMaps
  • 23. Challenges for version 2 • Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox) • > 800,000 chemicals (chemical infrastructure for EPA’s Safer Chemicals Research, including the ToxCast and Tox21 high-throughput toxicology efforts). https://www.epa.gov
  • 24. ChemMaps v2 work-in-progress • Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox dashboard) • > 800,000 chemicals • Chemical infrastructure for EPA’s Safer Chemicals Research, including ToxCast and Tox21 high-throughput toxicology efforts • OPERA model predictions. https://comptox.epa.gov/dashboard • More interactive • Users can input their data • Customizable • Accessibility
  • 25. Live demo (video)
  • 26. Future Vision • Project a map on the fly
  • 27. Future Vision • Select option on the map • Project a map on the fly
  • 28. Future Vision • Select option on the map • Compute distances between several chemicals using various metrics • Project a map on the fly
  • 29. Conclusions • www.chemmaps.com (version 2) • Project the whole DSSTox database • Update DrugBank (release 12-2018) • Customize the map • Upload chemicals on the fly • Accessibility
  • 30. Conclusions • www.chemmaps.com (version 2) • Project the whole DSSTox database • Update DrugBank (release 12-2018) • Customize the map • Upload chemicals on the fly • Accessibility. Requirements • Multiplatform (phone, tablet, computer) • Firefox >59, Chrome >65, Safari >5 (WebGL technology) • 1 GB of GPU memory
  • 31. Conclusions • www.chemmaps.com (version 2) • Project the whole DSSTox database • Update DrugBank (release 12-2018) • Customize the map • Upload chemicals on the fly • Accessibility. Requirements • Multiplatform (phone, tablet, computer) • Firefox >59, Chrome >65, Safari >5 (WebGL technology) • 1 GB of GPU memory. Beta version on the NIEHS network. Release planned for the end of April 2019. @SpaceChemMaps
  • 32. Fourches’ lab: Dr. Denis Fourches; NIEHS: Dr. Nicole Kleinstreuer; Office of Data Science; Dr. Kamel Mansouri (contractor, ILS)
  • 34. Annexes
  • 35. EnvMap: projections. Principal component analysis, 216 descriptors: PC1 = 14%, PC2 = 11%, PC3 = 9%. Independent component analysis.
  • 36. DrugMap: projections. Principal component analysis, 186 descriptors: PC1 = 16%, PC2 = 11%, PC3 = 8%. Independent component analysis, 186 descriptors. Multidimensional scaling (3D), 186 descriptors, Euclidean distance.
  • 37. DrugMap: compounds. 8,752 SMILES → MolVS (RDKit): - SMILES standardization, normalize - Remove salts - Remove hydrogen - Remove fragments (mixture) → 8,551 canonical SMILES. MolVS: https://molvs.readthedocs.io/en/latest/ Fourches, D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
  • 38. DrugMap: compounds. 8,752 SMILES → MolVS (RDKit): - SMILES standardization, normalize - Remove salts - Remove hydrogen - Remove fragments (mixture) → 8,551 canonical SMILES. Examples: C[S@@](=O)CC[C@H](N)C(O)=O DB02235 C[S+]([O-])CCC(N)C(=O)O DB02165 [Zn2+] … MolVS: https://molvs.readthedocs.io/en/latest/ Fourches, D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
  • 39. DrugMap: descriptors space. 1D descriptors (chemical formula, e.g. C23H34O5): • Molecular weight • Count of atoms • … 2D descriptors (connectivity): • Pharmacophore based • … 1D/2D RDKit descriptors (648). https://www.drugbank.ca/ RDKit: http://www.rdkit.org/
  • 40. DrugMap: 3D generation. SMILES → SDF, 3D generation following Riniker and Landrum (RDKit). Riniker, S.; Landrum, G. A. J. Chem. Inf. Model. 55:2562-74 (2015). RDKit: http://www.rdkit.org/ https://www.schrodinger.com/ligprep
  • 41. DrugMap: 3D descriptors. 3D descriptors (spatial coordinates): • Volume • Surface • Charge distribution • … 3D PyDPI descriptors (420). Cao, D.-S. et al. (2013) J. Chem. Inf. Model., 53, 3086–3096.
  • 42. DrugMap: descriptors space. Matrix of 8,550 samples (compounds) × 1,068 variables (descriptors), X1,1 … X8550,1068.
  • 43. DrugMap: descriptors space. Matrix of 8,550 samples (compounds) × 1,068 variables (descriptors), X1,1 … X8550,1068. Descriptor selection: • Remove null variance • Pairwise Pearson’s correlation coefficient < 0.9
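
The two descriptor-selection rules on slide 43 can be sketched in plain Python. This is only an illustration, not the ChemMaps code: the greedy "keep the first descriptor of each correlated pair" strategy, the use of the absolute value of the correlation, the function names, and the toy descriptor values are all assumptions.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length value lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def select_descriptors(columns, threshold=0.9):
    """Return the names of the descriptors that survive both filters.

    columns: dict mapping descriptor name -> list of values (one per compound).
    Assumed strategy: scan descriptors in order and keep one only if its
    |r| with every already-kept descriptor is below the threshold.
    """
    kept = []
    for name, values in columns.items():
        # Rule 1: remove null-variance descriptors (all values identical).
        if len(set(values)) <= 1:
            continue
        # Rule 2: require pairwise |Pearson r| < threshold with kept descriptors.
        if any(abs(pearson(values, columns[k])) >= threshold for k in kept):
            continue
        kept.append(name)
    return kept

# Hypothetical toy table: four compounds, four descriptors.
descriptors = {
    "MolWt": [180.2, 151.2, 206.3, 194.2],
    "HeavyAtomCount": [13, 11, 15, 14],   # nearly collinear with MolWt
    "NumRings": [1, 1, 1, 2],
    "Constant": [0, 0, 0, 0],             # null variance
}
selected = select_descriptors(descriptors)
print(selected)
```

With this toy table the constant column and the heavy-atom count are dropped. The result of a greedy filter depends on column order, so a real pipeline would need to fix that order or choose another tie-breaking rule.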
  • 44. Projection. Multiple PCA: x, y = 138 1D-2D descriptors; z = 78 3D descriptors. Coverage of variance: PC1: 13%, PC2: 9%, PC3: 24%.
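
One possible reading of the "Multiple PCA" on slide 44 is two separate PCAs: the first two components of the 1D/2D block give x and y, and the first component of the 3D block gives z. The NumPy sketch below follows that reading. The autoscaling step, the function names, and the random stand-in data are assumptions, not details taken from the slides.

```python
import numpy as np

def pca_scores(block, n_components):
    """Project a (compounds x descriptors) block onto its leading components."""
    # Autoscale each descriptor to zero mean and unit variance (assumed step).
    scaled = (block - block.mean(axis=0)) / block.std(axis=0)
    # SVD of the scaled matrix: rows of vt are the principal axes.
    u, s, vt = np.linalg.svd(scaled, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = s ** 2 / np.sum(s ** 2)
    return scores, explained[:n_components]

def multiple_pca(desc_2d, desc_3d):
    """Build 3D map coordinates from two independent PCAs."""
    xy, var_xy = pca_scores(desc_2d, 2)   # x, y from the 1D/2D descriptors
    z, var_z = pca_scores(desc_3d, 1)     # z from the 3D descriptors
    return np.hstack([xy, z]), np.concatenate([var_xy, var_z])

# Random stand-in data: 50 compounds, 6 1D/2D and 4 3D descriptors.
rng = np.random.default_rng(0)
desc_2d = rng.normal(size=(50, 6))
desc_3d = rng.normal(size=(50, 4))
coords, variance = multiple_pca(desc_2d, desc_3d)
print(coords.shape, variance)
```

Under this reading, each explained-variance figure is relative to its own descriptor block, which would be consistent with the third value on the slide being larger than the first two.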
  • 45. Future Vision: Environmental Maps
  • 46. Future Vision: Environmental Maps • Define and project several domains
  • 47. Future Vision: Environmental Maps • Define and project several domains • Add entire DSSTox Inventory (>700,000 chemicals)
  • 48. Future Vision: Environmental Maps • Define and project several domains • Add entire DSSTox Inventory (>700,000 chemicals) • Incorporate diverse biological datasets (e.g. ToxRefDB, HTT)
  • 49. Future Vision: map on the fly • Define new map on the fly • Chemical databases • Precomputed coordinates • Local version
  • 50. Future Vision: Virtual reality
  • 51. Future Vision: Navigation
  • 52. Future Vision: Navigation • Project new chemical lists on map • Add your chemicals/data/model predictions on the map
  • 53. Future Vision: Navigation • Select and redefine part of the chemical map on the fly • Project new chemical lists on map • Add your chemicals/data/model predictions on the map
  • 54. Future Vision: Navigation • Select and redefine part of the chemical map on the fly • Compute distances between several chemicals using various metrics • Project new chemical lists on map • Add your chemicals/data/model predictions on the map • Download area and distance matrix
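
The "compute distances between several chemicals using various metrics" item on slide 54 could look roughly like the sketch below, which builds a pairwise distance matrix from map coordinates. The slides do not say which metrics are planned, so Euclidean and Manhattan distances, the function names, and the three made-up chemicals are all assumptions.

```python
from math import sqrt

def euclidean(a, b):
    """Straight-line distance between two coordinate tuples."""
    return sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(p - q) for p, q in zip(a, b))

# Registry of available metrics (an assumed, extensible design).
METRICS = {"euclidean": euclidean, "manhattan": manhattan}

def distance_matrix(coords, metric="euclidean"):
    """Return a nested dict: matrix[a][b] is the distance between chemicals a and b."""
    dist = METRICS[metric]
    names = list(coords)
    return {a: {b: dist(coords[a], coords[b]) for b in names} for a in names}

# Hypothetical map coordinates (x, y, z) for three chemicals.
coords = {
    "chem_A": (0.0, 0.0, 0.0),
    "chem_B": (3.0, 4.0, 0.0),
    "chem_C": (3.0, 4.0, 12.0),
}
matrix = distance_matrix(coords)
for name, row in matrix.items():
    print(name, row)
```

A nested dict keeps the example dependency-free and easy to export; for the map sizes quoted in the talk (hundreds of thousands of chemicals), a full matrix would be impractical and distances would more plausibly be computed only for a selected area.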