SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
October 2019
Exploiting medicinal chemistry knowledge to accelerate projects
Emerging Challenges for Artificial Intelligence in
Medicinal Chemistry
Dr Ed Griffen
IBSA Lugano October 2019
Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects
• Founded in 2012 by experienced large Pharma
medicinal/computational chemists to accelerate drug
hunting by exploiting data driven knowledge
• Domain leaders in SAR knowledge extraction and
knowledge based design
• > 10 years experience of building AI systems that suggest
actions to chemists (7 years as MedChemica)
• Creators of largest ever documented database of
medicinal chemistry ADMET knowledge
Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects
…7 Years of working with pharma
companies
“Our median number of compounds per LO project is 3000 - this is
unsustainable… [it should be] 300”
– Director of Chemistry (large pharma)
“Can we define the text book of medincal chemistry?”
– Director of Comp Chem (large pharma)
“We are aiming at 300 compound per project – currently we are
about 400, we will get better”
– ExScienta scientist at SCI ‘What can BigData do for chemistry’ –
London Oct 2017
MedChemica uses knowledge extraction techniques to build “expert
systems” to suggest actions to chemists and reduce the time and cost
to critical compounds and candidate drugs.
Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects
Explainable AI
The future of AI lies in enabling people to collaborate with machines to solve complex
problems.
Like any efficient collaboration, this requires good communication, trust, clarity and
understanding.
- Freddy Lecue, Explainable AI Research Lead, Accenture Labs
https://www.accenture.com/gb-en/insights/technology/explainable-ai-human-machine
Black box machine learning models are currently being used for high-stakes decision
making throughout society, causing problems in healthcare, criminal justice and other
domains. Some people hope that creating methods for explaining these black box
models will alleviate some of the problems, but trying to explain black box models,
rather than creating models that are interpretable in the first place, is likely to
perpetuate bad practice and can potentially cause great harm to society. The way
forward is to design models that are inherently interpretable.
- Cynthia Rudin Nature Machine Intelligence (2019), 206–215.
Exploiting medicinal chemistry knowledge to accelerate projects
Use the right Machine Learning tool for the right problem
Where is Medicinal Chemistry?
Interpretable
Failure cost high
Immature science
Highly skilled, critical users
Business-2-Business
Transparent and auditable
Black Box
Failure cost is low
Real time response critical
Interactive = self correcting
Business-2-consumer
User agnostic of process
Exploiting medicinal chemistry knowledge to accelerate projects
Help the HiPPOs – or they’ll crush you
1. McAfee & Brynjolfsson “Big Data: The Management Revolution”,
Harvard Business Review October 2012
“Companies often make most of their
important decisions by relying on
“HiPPO”—the highest-paid person’s
opinion.”1
Chemistry HiPPs:
• experts in pattern recognition
• judged on their ability to make the best decisions with partial data
• highly trained
• time poor
• delivery focused
• gatekeepers to the adoption of new approaches
Exploiting medicinal chemistry knowledge to accelerate projects
Data
Warehouse
rule
finder
Exploitable
Knowledge
Molecule
problem
solving
Explainable
QSAR
Automated
loader
MMPA
Clean
Structures &
Data
Property
Prediction
Idea ranking
Instant SAR
analysis
MCPairs
REST API & GUI
Explainable AI for Medicinal Chemistry Design
Exploiting medicinal chemistry knowledge to accelerate projects
Molecule Problem Solving
Compounds from Rules
• Exploitable Knowledge is a rule database derived from MMPA
• User puts in a problem molecule with a property they wish to
improve – eg solubility, metabolism, hERG….
• System generates potential improved molecules based on data
Exploitable
Knowledge
MC Expert
Enumerator
System
Problem molecule + property to improve
Solution molecules
Compounds from Rules
https://www.youtube.com/watch?v=lITAT6_-i1E&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=3
Exploiting medicinal chemistry knowledge to accelerate projects
https://youtu.be/nQxXddJDTfc
Exploiting medicinal chemistry knowledge to accelerate projects
MMPA Enables knowledge sharing
MMPA
MMPA
MMPA
Combine
and
Extract
Rules
Multiple Pharma
ADMET data
>437000 rules
Better
Project
decisions
Increased
Medicinal
Chemistry
learning
Kramer, Robb, Ting, Zheng, Griffen, et al. J. Med. Chem. 2018, 61(8), 3277-3292
http://pubs.acs.org/doi/10.1021/acs.jmedchem.7b00935
Our MMPA technology enabled knowledge sharing between multiple
organisations (AstraZeneca, Hoffman La Roche and Genentech)
Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects
Griffen, E. et al. J. Med. Chem. 2011, 54(22), pp.7739-7750.
Fully Automated Matched Molecular Pair Analysis (MMPA)
Knowledge Extraction that’s understandable by chemists
Δ Data
A-B1
2
2
3
3
3
4
4
4
12
23
3
34
4
4A B
• Matched Molecular Pairs – Molecules that differ only by a
particular, well-defined structural transformation
• Capture the change and environment – MMPs can be
recorded as transformations from Aà B
• Statistical analysis to define “medicinal chemistry rules”
Defined transformations with high probability of improving
properties of molecules
• Store in a high performance database and provide an
intuitive user interface
Exploiting medicinal chemistry knowledge to accelerate projects
Identify and group matching SMIRKS
Calculate statistical parameters for each unique
SMIRKS (n, median, sd, se, n_up/n_down)
Is n ≥ 6?
Not enough data:
ignore transformation
Is the |median| ≤ 0.05 and the
intercentile range (10-90%) ≤ 0.3?
Perform two-tailed binomial test on the
transformation to determine the
significance of the up/ down frequency
transformation is
classified as ‘neutral’
Transformation classified as
‘NED’ (No Effect Determined)
Transformation classified as
‘increase’ or ‘decrease’
depending on which direction the
property is changing
pass	fail	
yes	no	
yes	no	
Rule selection
0 +ve-ve
Median data difference
Neutral IncreaseDecrease
NED
• No assumption of normal
distribution
• Manages ‘censored’ =
qualified / out-of-range
data
Exploiting medicinal chemistry knowledge to accelerate projects
Base of Success Story from Genentech
193 compounds
Enumerated
Objective:
improve
metabolic
stability
Enumeration
Calculated Property
Docking
8 compounds
synthesized
100 cmpds x ($2K make + $1K test) = $ 300 000
8 cmpds x ($2K make + $1K test) = $ 24 000
It is not just money, it is actually time
100 cmpds make & test ~ 15 – 25 weeks
8 cmpds make & test ~ 2 – 4 weeks
Exploiting medicinal chemistry knowledge to accelerate projects
tBu metabolism issue
Benchmark
compound
Predicted to offer most improvement in microsomal stability (in at least 1 species / assay)
R2
R1
tBu Me Et iPr
99
392
16
64
78
410
53
550
99
288
78
515
41
35
98
327
92
372
24
247
35
128
24
62
60
395
39
445
3
21
20
27
57
89
54
89
• Data shown are Clint for HLM and MLM (top and bottom, respectively)
R1 R2R1tBu
Roger Butlin
Rebecca Newton
Allan Jordan
Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects
Tubulin Polymerization Inhibitors
15
Exploiting medicinal chemistry knowledge to accelerate projects
Indole-3-glyoxylamide Based Series of Tubulin Polymerization Inhibitors
– Increase potency, solubility and reduce metabolism
– Enable in-vivo xenograft studies
Thompson, M. et al J. Med. Chem., 2015, 58 (23),
pp 9309–9333
MMPA solubility
& QSAR calcsIndibulin D-24851
LC50 0.032
XlogP 3.35
~ potent
In-vivo activity
poor solubility (~ 1uM)
LC50 0.027
XlogP 2.02
LC50 0.055
XlogP 2.91
solubility (~10-80uM)
LC50 0.031
XlogP 2.57
solubility (~10-80uM)
59
Exploiting medicinal chemistry knowledge to accelerate projects
Idea Ranking
SpotDesign
• Use the knowledge database to estimate how good an idea is
compared to a benchmark molecule
• System generates assessment based on data
17
Exploitable
Knowledge
SpotDesign
Idea molecule + benchmark
molecule + property
Assessment of idea molecule
compared to benchmark
SpotDesign
https://www.youtube.com/watch?v=JMhQvNdBOFs&index=2&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL
Exploiting medicinal chemistry knowledge to accelerate projects
https://youtu.be/fDpFo53IdOE
Exploiting medicinal chemistry knowledge to accelerate projects
Property Prediction
Automated Explainable QSAR
• Chemists get predictions with the substructures highlighted that are
driving prediction and the molecules used to support that part of the
model – transparent / explainable AI.
Explainable
QSAR
Clean
Structures &
Data
Property
Prediction
Molecule Structure
+ property to predict
Prediction
+ clear drivers of prediction
Exploiting medicinal chemistry knowledge to accelerate projects
2
Feature Definition
Basic Group Atom or group most likely protonated at pH 7.4
Acidic Group Atom or group most likely deprotonated at pH 7.4, includes N
and C acids
Acceptor Definitions derived from Taylor & Cosgrove
Donor Definitions derived from Taylor & Cosgrove
Hydrophobic C4 or greater cyclic or acyclic alkyl group
Aromatic Attachment connection of any group to an aromatic atom excluding
connections within rings
Aliphatic Attachment connection of any atom to an aliphatic group not in a ring.
Halo F,Cl, Br, I
Reference for Donor acceptor feature definitions:
Taylor, R.; Cole, J. C.; Cosgrove, D. A.; Gardiner, E. J.; Gillet, V. J.; Korb, O. J Comput Aided Mol Des 2012, 26 (4), 451–
472.
Acid & Base definitions are SMARTS including C, N, heteroaromatic acids, bases excluding weak aniline bases,
including amidines, guanidine’s - MedChemica definitions.
MedChemica Advanced Pharmacophore Pairs
Gobbi, A.; Poppinger, D. Biotechnology and Bioengineering 1998, 61 (1), 47–54.
Reutlinger, M.; Koch, C. P.; Reker, D.; Todoroff, N.; Schneider, P.; Rodrigues, T.; Schneider, G. Mol. Inf. 2013, 32 (2),
133–138.
Exploiting medicinal chemistry knowledge to accelerate projects
Pay attention to your descriptors
• Chemistry must make sense
Simple
H bond
acceptor
base acid
Precise
Diclofenac
(1973)
Sulfadiazine
(1941)
DMAP
Exploiting medicinal chemistry knowledge to accelerate projects
Regression Forest & Pharmacophore understanding
• hERG – auditable models
• Identify important chemical features driving potency
• Predict hERG potency from RF model [10 fold CV]
Pharmacophore fp length 280
10 fold CV
Compounds in training 5968
RMSE 0.16
Pearson R2 0.27
Exploiting medicinal chemistry knowledge to accelerate projects
• hERG – auditable models
• Predict hERG potency from RF model [10 fold CV]
• Example CHEMBL12713 sertindole
• Colour structure by feature importance
weighted sum of of pharmacophore pair
fingerprints – show the chemists where the
hotspots are.
• Drill deeper to show the most important
positive and negative features. RF prediction pIC50 7.7
median_with: 5.1
median_without: 4.7
median_diff: 0.4
n_examples_with: 4585
n_examples_without : 1383
median_with: 5.1,
median_without: 5.3
median_diff: -0.2
n_examples_with: 3106
n_examples_without : 2862
Regression Forest & Pharmacophore understanding
Exploiting medicinal chemistry knowledge to accelerate projects
kNN – Understanding from neighbouring structures
• hERG – auditable models
• Predict hERG potency from kNN model [10 fold CV]
• Example CHEMBL12713 sertindole
• Identify the closest neighbours - by
Tanimoto to ECFP4 fingerprint
• Show chemists structures
kNN prediction pIC50 8.2
distance 0.17 0.2 0.23
pIC50 7.7 4.1 8.2
Exploiting medicinal chemistry knowledge to accelerate projects
• ML models built for 20 critical seizure related CNS targets
• Communicate to chemists activity prediction & if model out of domain
• Show close structures and/or toxophores
Seizure prediction by Composite Machine Learning
CHEMBL 12713 sertindole
seizure activity observed
clinically
Predictions in line with
measured data
More potent than 1µM
Less potent than 1µM
Out of Domain – no
prediction possible
Exploiting medicinal chemistry knowledge to accelerate projects
Estimating Risks, finding toxophores
26
Exploiting medicinal chemistry knowledge to accelerate projects
Pair & Rule
Database
Compounds
from Rules
API server
RESTful
API
Compound
to Pairs
MCRules
Corporate structures and measurements
from DB
Structure and
data clean up
Spot Design
Pair
finding
Web GUI
MedChemica
In-House
Design tools
CLI
MedChemica
Clean Structures
& Data
Explainable
QSAR
Engineering and Automation
Exploiting medicinal chemistry knowledge to accelerate projects
Data
Integrity and
curation Knowledge
extraction
algorithms
Engineering,
Automation
and
Interfaces
Interpretability
✓
✓
✓
✓
Knowledge
Database
MCPairs
Overcoming the Barriers to Implementing AI
MC GUI
Exploiting medicinal chemistry knowledge to accelerate projects
Exploiting medicinal chemistry knowledge to accelerate projects
A Less Simple Example
Increase logD and gain solubility
Property Number of
Observations
Direction Mean Change Probability
logD 8 Increase 1.2 100%
Log(Solubility) 14 Increase 1.4 92%
What is the effect on
lipophilicity and solubility?
Roche data is inconclusive! (2
pairs for logD, 1 pair for
solubility)
logD = 2.65
Kinetic solubility = 84 µg/ml
IC50 SST5 = 0.8 µM
logD = 3.63
Kinetic solubility = >452 µg/ml
IC50 SST5 = 0.19 µM
Question:
Available
Statistics:
Roche
Example:
Exploiting medicinal chemistry knowledge to accelerate projects
Instant SAR Analysis
Compound to Pairs
• Chemists can instantly see the pairs to a compound and explore
property changes
31
Exploitable
Knowledge
Compound to
Pairs
Molecule of interest
All the matched pairs of that molecule
Compound to Pairs
https://www.youtube.com/watch?v=OFhZJulxsAw&t=0s&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=2
Exploiting medicinal chemistry knowledge to accelerate projects
https://youtu.be/OFhZJulxsAw
Exploiting medicinal chemistry knowledge to accelerate projects
3 Possible input streams….
Rule
Database
REST - API
Your DB
crontab
MCPCLI
REST - API
ETL
custom
plugin
• Extract Transform Load (ETL)
• Custom plugin scripted by MedChemica
• Usually 3 – 4 weeks work
• On-site work and team interaction required
Exploitation
Your DB
Your DB
YOUR FIREWALL
assay1
• Export Flat files of data
• MCPCLI reads in files and deletes
1
2
3
• Direct Read Access to DB
• SQL searches compounds /
measurements
• https requests for compounds /
measurements
• Most robust option
data
10 years
experience
building
automated
systems
MCPairs
Server
Exploiting medicinal chemistry knowledge to accelerate projects
Example Current Pharma install
Rule
Database
In-House Design tools
and workflows
REST - API
MedChemica Web
tool
MedChemica CLI
3 WAYS OF EXPLOITATION
D360
crontab
MCPCLI
REST - API
ETL custom
plugin
• Every 2 days…
• Latest compounds structure pulled from D360 and loaded
• Latest measurements from assays pulled and loaded
• Custom plugin handled data input streaming
• Update the matched pairs and update rules
PHARMA FIREWALL
MCPairs
Server

Weitere ähnliche Inhalte

Was ist angesagt?

AI is the Future of Drug Discovery
AI is the Future of Drug DiscoveryAI is the Future of Drug Discovery
AI is the Future of Drug DiscoveryDavid Leahy
 
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET Journal
 
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics SoftwareHarnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Softwarejatwood3
 
Structure based and ligand based drug designing
Structure based and ligand based drug designingStructure based and ligand based drug designing
Structure based and ligand based drug designingDr Vysakh Mohan M
 
Driver Analysis and Product Optimization with Bayesian Networks
Driver Analysis and Product Optimization with Bayesian NetworksDriver Analysis and Product Optimization with Bayesian Networks
Driver Analysis and Product Optimization with Bayesian NetworksBayesia USA
 
Software Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsSoftware Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsIJCSES Journal
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayiKiranmayiKnv
 
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .tsysglobalsolutions
 
De novo drug design assignment
De novo drug design assignmentDe novo drug design assignment
De novo drug design assignmentAbi Judeen
 
Are Evolutionary Algorithms Required to Solve Sudoku Problems
Are Evolutionary Algorithms Required to Solve Sudoku ProblemsAre Evolutionary Algorithms Required to Solve Sudoku Problems
Are Evolutionary Algorithms Required to Solve Sudoku Problemscsandit
 
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...bioejjournal
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...bioejjournal
 
Gordon2003
Gordon2003Gordon2003
Gordon2003toluene
 
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET Journal
 
Descriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingDescriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingAshwani Dhingra
 
working_example_poster
working_example_posterworking_example_poster
working_example_posterHuikun Zhang
 

Was ist angesagt? (20)

Dissertation
DissertationDissertation
Dissertation
 
AI is the Future of Drug Discovery
AI is the Future of Drug DiscoveryAI is the Future of Drug Discovery
AI is the Future of Drug Discovery
 
Machine learning in computational docking
Machine learning in computational dockingMachine learning in computational docking
Machine learning in computational docking
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
 
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics SoftwareHarnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
 
Structure based and ligand based drug designing
Structure based and ligand based drug designingStructure based and ligand based drug designing
Structure based and ligand based drug designing
 
Driver Analysis and Product Optimization with Bayesian Networks
Driver Analysis and Product Optimization with Bayesian NetworksDriver Analysis and Product Optimization with Bayesian Networks
Driver Analysis and Product Optimization with Bayesian Networks
 
Software Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsSoftware Testing Using Genetic Algorithms
Software Testing Using Genetic Algorithms
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
 
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
 
bbbPaper
bbbPaperbbbPaper
bbbPaper
 
De novo drug design assignment
De novo drug design assignmentDe novo drug design assignment
De novo drug design assignment
 
Are Evolutionary Algorithms Required to Solve Sudoku Problems
Are Evolutionary Algorithms Required to Solve Sudoku ProblemsAre Evolutionary Algorithms Required to Solve Sudoku Problems
Are Evolutionary Algorithms Required to Solve Sudoku Problems
 
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
 
Gordon2003
Gordon2003Gordon2003
Gordon2003
 
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human BodyIRJET - A Framework for Predicting Drug Effectiveness in Human Body
IRJET - A Framework for Predicting Drug Effectiveness in Human Body
 
Descriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingDescriptive versus Mechanistic Modeling
Descriptive versus Mechanistic Modeling
 
working_example_poster
working_example_posterworking_example_poster
working_example_poster
 

Ähnlich wie Emerging Challenges for Artificial Intelligence in Medicinal Chemistry

Practical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial IntelligencePractical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial IntelligenceAl Dossetter
 
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Bigfinite
 
Data Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analyticsData Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analyticsAkin Osman Kazakci
 
SMi Group's AI in Drug Discovery 2020 conference
SMi Group's AI in Drug Discovery 2020 conferenceSMi Group's AI in Drug Discovery 2020 conference
SMi Group's AI in Drug Discovery 2020 conferenceDale Butler
 
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdfApplications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdfArunPrasad880048
 
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...Databricks
 
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...Medicines Discovery Catapult
 
Artificial intelligence in Drug discovery and delivery.pptx
Artificial intelligence in Drug discovery and delivery.pptxArtificial intelligence in Drug discovery and delivery.pptx
Artificial intelligence in Drug discovery and delivery.pptxManjusha Bandi
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataPhilip Cheung
 
SCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemicaSCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemicaEd Griffen
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...ChemAxon
 
IRJET - Machine Learning for Diagnosis of Diabetes
IRJET - Machine Learning for Diagnosis of DiabetesIRJET - Machine Learning for Diagnosis of Diabetes
IRJET - Machine Learning for Diagnosis of DiabetesIRJET Journal
 
SMi Group's 14th annual Drug Design 2015 conference
SMi Group's 14th annual Drug Design 2015 conferenceSMi Group's 14th annual Drug Design 2015 conference
SMi Group's 14th annual Drug Design 2015 conferenceDale Butler
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERiQHub
 
Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics Chandrakant Kharude
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinarPistoia Alliance
 
Roadmap to next generation digital lab
Roadmap to next generation digital labRoadmap to next generation digital lab
Roadmap to next generation digital labStephan Gürtler
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALJohn Koch
 

Ähnlich wie Emerging Challenges for Artificial Intelligence in Medicinal Chemistry (20)

Practical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial IntelligencePractical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial Intelligence
 
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
 
Data Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analyticsData Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analytics
 
SMi Group's AI in Drug Discovery 2020 conference
SMi Group's AI in Drug Discovery 2020 conferenceSMi Group's AI in Drug Discovery 2020 conference
SMi Group's AI in Drug Discovery 2020 conference
 
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdfApplications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
 
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
 
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...
MDC Connect: In-Silico Drug Design - what to do, what not to do - project dri...
 
Artificial intelligence in Drug discovery and delivery.pptx
Artificial intelligence in Drug discovery and delivery.pptxArtificial intelligence in Drug discovery and delivery.pptx
Artificial intelligence in Drug discovery and delivery.pptx
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
SCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemicaSCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemica
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
rev_2 (1) (3)
rev_2 (1) (3)rev_2 (1) (3)
rev_2 (1) (3)
 
AI.pptx
AI.pptxAI.pptx
AI.pptx
 
IRJET - Machine Learning for Diagnosis of Diabetes
IRJET - Machine Learning for Diagnosis of DiabetesIRJET - Machine Learning for Diagnosis of Diabetes
IRJET - Machine Learning for Diagnosis of Diabetes
 
SMi Group's 14th annual Drug Design 2015 conference
SMi Group's 14th annual Drug Design 2015 conferenceSMi Group's 14th annual Drug Design 2015 conference
SMi Group's 14th annual Drug Design 2015 conference
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
 
Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Roadmap to next generation digital lab
Roadmap to next generation digital labRoadmap to next generation digital lab
Roadmap to next generation digital lab
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
 

Mehr von Ed Griffen

Griffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelGriffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelEd Griffen
 
RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches
RSC Hatfield 2018  Kinase meeting : potency patents MMPA approachesRSC Hatfield 2018  Kinase meeting : potency patents MMPA approaches
RSC Hatfield 2018 Kinase meeting : potency patents MMPA approachesEd Griffen
 
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017Ed Griffen
 
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...Ed Griffen
 
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...Ed Griffen
 
Extracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology dataExtracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology dataEd Griffen
 
Pharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair AnalysisPharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair AnalysisEd Griffen
 

Mehr von Ed Griffen (7)

Griffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelGriffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox Panel
 
RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches
RSC Hatfield 2018  Kinase meeting : potency patents MMPA approachesRSC Hatfield 2018  Kinase meeting : potency patents MMPA approaches
RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches
 
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
 
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
 
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
 
Extracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology dataExtracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology data
 
Pharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair AnalysisPharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair Analysis
 

Kürzlich hochgeladen

fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 

Kürzlich hochgeladen (20)

fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 

Emerging Challenges for Artificial Intelligence in Medicinal Chemistry

  • 1. October 2019 Exploiting medicinal chemistry knowledge to accelerate projects Emerging Challenges for Artificial Intelligence in Medicinal Chemistry Dr Ed Griffen IBSA Lugano October 2019
  • 2. Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects • Founded in 2012 by experienced large Pharma medicinal/computational chemists to accelerate drug hunting by exploiting data driven knowledge • Domain leaders in SAR knowledge extraction and knowledge based design • > 10 years experience of building AI systems that suggest actions to chemists (7 years as MedChemica) • Creators of largest ever documented database of medicinal chemistry ADMET knowledge
  • 3. Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects …7 Years of working with pharma companies “Our median number of compounds per LO project is 3000 - this is unsustainable… [it should be] 300” – Director of Chemistry (large pharma) “Can we define the text book of medincal chemistry?” – Director of Comp Chem (large pharma) “We are aiming at 300 compound per project – currently we are about 400, we will get better” – ExScienta scientist at SCI ‘What can BigData do for chemistry’ – London Oct 2017 MedChemica uses knowledge extraction techniques to build “expert systems” to suggest actions to chemists and reduce the time and cost to critical compounds and candidate drugs.
  • 4. Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects Explainable AI The future of AI lies in enabling people to collaborate with machines to solve complex problems. Like any efficient collaboration, this requires good communication, trust, clarity and understanding. - Freddy Lecue, Explainable AI Research Lead, Accenture Labs https://www.accenture.com/gb-en/insights/technology/explainable-ai-human-machine Black box machine learning models are currently being used for high-stakes decision making throughout society, causing problems in healthcare, criminal justice and other domains. Some people hope that creating methods for explaining these black box models will alleviate some of the problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practice and can potentially cause great harm to society. The way forward is to design models that are inherently interpretable. - Cynthia Rudin Nature Machine Intelligence (2019), 206–215.
  • 5. Exploiting medicinal chemistry knowledge to accelerate projects Use the right Machine Learning tool for the right problem Where is Medicinal Chemistry? Interpretable Failure cost high Immature science Highly skilled, critical users Business-2-Business Transparent and auditable Black Box Failure cost is low Real time response critical Interactive = self correcting Business-2-consumer User agnostic of process
  • 6. Exploiting medicinal chemistry knowledge to accelerate projects Help the HiPPOs – or they’ll crush you 1. McAfee & Brynjolfsson “Big Data: The Management Revolution”, Harvard Business Review October 2012 “Companies often make most of their important decisions by relying on “HiPPO”—the highest-paid person’s opinion.”1 Chemistry HiPPs: • experts in pattern recognition • judged on their ability to make the best decisions with partial data • highly trained • time poor • delivery focused • gatekeepers to the adoption of new approaches
  • 7. Exploiting medicinal chemistry knowledge to accelerate projects Data Warehouse rule finder Exploitable Knowledge Molecule problem solving Explainable QSAR Automated loader MMPA Clean Structures & Data Property Prediction Idea ranking Instant SAR analysis MCPairs REST API & GUI Explainable AI for Medicinal Chemistry Design
  • 8. Exploiting medicinal chemistry knowledge to accelerate projects Molecule Problem Solving Compounds from Rules • Exploitable Knowledge is a rule database derived from MMPA • User puts in a problem molecule with a property they wish to improve – eg solubility, metabolism, hERG…. • System generates potential improved molecules based on data Exploitable Knowledge MC Expert Enumerator System Problem molecule + property to improve Solution molecules Compounds from Rules https://www.youtube.com/watch?v=lITAT6_-i1E&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=3
  • 9. Exploiting medicinal chemistry knowledge to accelerate projects https://youtu.be/nQxXddJDTfc
  • 10. Exploiting medicinal chemistry knowledge to accelerate projects MMPA Enables knowledge sharing MMPA MMPA MMPA Combine and Extract Rules Multiple Pharma ADMET data >437000 rules Better Project decisions Increased Medicinal Chemistry learning Kramer, Robb, Ting, Zheng, Griffen, et al. J. Med. Chem. 2018, 61(8), 3277-3292 http://pubs.acs.org/doi/10.1021/acs.jmedchem.7b00935 Our MMPA technology enabled knowledge sharing between multiple organisations (AstraZeneca, Hoffman La Roche and Genentech)
  • 11. Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects Griffen, E. et al. J. Med. Chem. 2011, 54(22), pp.7739-7750. Fully Automated Matched Molecular Pair Analysis (MMPA) Knowledge Extraction that’s understandable by chemists Δ Data A-B1 2 2 3 3 3 4 4 4 12 23 3 34 4 4A B • Matched Molecular Pairs – Molecules that differ only by a particular, well-defined structural transformation • Capture the change and environment – MMPs can be recorded as transformations from Aà B • Statistical analysis to define “medicinal chemistry rules” Defined transformations with high probability of improving properties of molecules • Store in a high performance database and provide an intuitive user interface
  • 12. Exploiting medicinal chemistry knowledge to accelerate projects Identify and group matching SMIRKS Calculate statistical parameters for each unique SMIRKS (n, median, sd, se, n_up/n_down) Is n ≥ 6? Not enough data: ignore transformation Is the |median| ≤ 0.05 and the intercentile range (10-90%) ≤ 0.3? Perform two-tailed binomial test on the transformation to determine the significance of the up/ down frequency transformation is classified as ‘neutral’ Transformation classified as ‘NED’ (No Effect Determined) Transformation classified as ‘increase’ or ‘decrease’ depending on which direction the property is changing pass fail yes no yes no Rule selection 0 +ve-ve Median data difference Neutral IncreaseDecrease NED • No assumption of normal distribution • Manages ‘censored’ = qualified / out-of-range data
  • 13. Exploiting medicinal chemistry knowledge to accelerate projects Base of Success Story from Genentech 193 compounds Enumerated Objective: improve metabolic stability Enumeration Calculated Property Docking 8 compounds synthesized 100 cmpds x ($2K make + $1K test) = $ 300 000 8 cmpds x ($2K make + $1K test) = $ 24 000 It is not just money, it is actually time 100 cmpds make & test ~ 15 – 25 weeks 8 cmpds make & test ~ 2 – 4 weeks
  • 14. Exploiting medicinal chemistry knowledge to accelerate projects tBu metabolism issue Benchmark compound Predicted to offer most improvement in microsomal stability (in at least 1 species / assay) R2 R1 tBu Me Et iPr 99 392 16 64 78 410 53 550 99 288 78 515 41 35 98 327 92 372 24 247 35 128 24 62 60 395 39 445 3 21 20 27 57 89 54 89 • Data shown are Clint for HLM and MLM (top and bottom, respectively) R1 R2R1tBu Roger Butlin Rebecca Newton Allan Jordan
  • 15. Exploiting medicinal chemistry knowledge to accelerate projectsExploiting medicinal chemistry knowledge to accelerate projects Tubulin Polymerization Inhibitors 15
  • 16. Exploiting medicinal chemistry knowledge to accelerate projects Indole-3-glyoxylamide Based Series of Tubulin Polymerization Inhibitors – Increase potency, solubility and reduce metabolism – Enable in-vivo xenograft studies Thompson, M. et al J. Med. Chem., 2015, 58 (23), pp 9309–9333 MMPA solubility & QSAR calcsIndibulin D-24851 LC50 0.032 XlogP 3.35 ~ potent In-vivo activity poor solubility (~ 1uM) LC50 0.027 XlogP 2.02 LC50 0.055 XlogP 2.91 solubility (~10-80uM) LC50 0.031 XlogP 2.57 solubility (~10-80uM) 59
  • 17. Exploiting medicinal chemistry knowledge to accelerate projects Idea Ranking SpotDesign • Use the knowledge database to estimate how good an idea is compared to a benchmark molecule • System generates assessment based on data 17 Exploitable Knowledge SpotDesign Idea molecule + benchmark molecule + property Assessment of idea molecule compared to benchmark SpotDesign https://www.youtube.com/watch?v=JMhQvNdBOFs&index=2&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL
  • 18. Exploiting medicinal chemistry knowledge to accelerate projects https://youtu.be/fDpFo53IdOE
  • 19. Exploiting medicinal chemistry knowledge to accelerate projects Property Prediction Automated Explainable QSAR • Chemists get predictions with the substructures highlighted that are driving prediction and the molecules used to support that part of the model – transparent / explainable AI. Explainable QSAR Clean Structures & Data Property Prediction Molecule Structure + property to predict Prediction + clear drivers of prediction
  • 20. Exploiting medicinal chemistry knowledge to accelerate projects 2 Feature Definition Basic Group Atom or group most likely protonated at pH 7.4 Acidic Group Atom or group most likely deprotonated at pH 7.4, includes N and C acids Acceptor Definitions derived from Taylor & Cosgrove Donor Definitions derived from Taylor & Cosgrove Hydrophobic C4 or greater cyclic or acyclic alkyl group Aromatic Attachment connection of any group to an aromatic atom excluding connections within rings Aliphatic Attachment connection of any atom to an aliphatic group not in a ring. Halo F,Cl, Br, I Reference for Donor acceptor feature definitions: Taylor, R.; Cole, J. C.; Cosgrove, D. A.; Gardiner, E. J.; Gillet, V. J.; Korb, O. J Comput Aided Mol Des 2012, 26 (4), 451– 472. Acid & Base definitions are SMARTS including C, N, heteroaromatic acids, bases excluding weak aniline bases, including amidines, guanidine’s - MedChemica definitions. MedChemica Advanced Pharmacophore Pairs Gobbi, A.; Poppinger, D. Biotechnology and Bioengineering 1998, 61 (1), 47–54. Reutlinger, M.; Koch, C. P.; Reker, D.; Todoroff, N.; Schneider, P.; Rodrigues, T.; Schneider, G. Mol. Inf. 2013, 32 (2), 133–138.
  • 21. Exploiting medicinal chemistry knowledge to accelerate projects Pay attention to your descriptors • Chemistry must make sense Simple H bond acceptor base acid Precise Diclofenac (1973) Sulfadiazine (1941) DMAP
  • 22. Exploiting medicinal chemistry knowledge to accelerate projects Regression Forest & Pharmacophore understanding • hERG – auditable models • Identify important chemical features driving potency • Predict hERG potency from RF model [10 fold CV] Pharmacophore fp length 280 10 fold CV Compounds in training 5968 RMSE 0.16 Pearson R2 0.27
  • 23. Exploiting medicinal chemistry knowledge to accelerate projects • hERG – auditable models • Predict hERG potency from RF model [10 fold CV] • Example CHEMBL12713 sertindole • Colour structure by feature importance weighted sum of of pharmacophore pair fingerprints – show the chemists where the hotspots are. • Drill deeper to show the most important positive and negative features. RF prediction pIC50 7.7 median_with: 5.1 median_without: 4.7 median_diff: 0.4 n_examples_with: 4585 n_examples_without : 1383 median_with: 5.1, median_without: 5.3 median_diff: -0.2 n_examples_with: 3106 n_examples_without : 2862 Regression Forest & Pharmacophore understanding
  • 24. Exploiting medicinal chemistry knowledge to accelerate projects kNN – Understanding from neighbouring structures • hERG – auditable models • Predict hERG potency from kNN model [10 fold CV] • Example CHEMBL12713 sertindole • Identify the closest neighbours - by Tanimoto to ECFP4 fingerprint • Show chemists structures kNN prediction pIC50 8.2 distance 0.17 0.2 0.23 pIC50 7.7 4.1 8.2
  • 25. Exploiting medicinal chemistry knowledge to accelerate projects • ML models built for 20 critical seizure related CNS targets • Communicate to chemists activity prediction & if model out of domain • Show close structures and/or toxophores Seizure prediction by Composite Machine Learning CHEMBL 12713 sertindole seizure activity observed clinically Predictions in line with measured data More potent than 1µM Less potent than 1µM Out of Domain – no prediction possible
  • 26. Exploiting medicinal chemistry knowledge to accelerate projects Estimating Risks, finding toxophores 26
  • 27. Exploiting medicinal chemistry knowledge to accelerate projects Pair & Rule Database Compounds from Rules API server RESTful API Compound to Pairs MCRules Corporate structures and measurements from DB Structure and data clean up Spot Design Pair finding Web GUI MedChemica In-House Design tools CLI MedChemica Clean Structures & Data Explainable QSAR Engineering and Automation
  • 28. Exploiting medicinal chemistry knowledge to accelerate projects Data Integrity and curation Knowledge extraction algorithms Engineering, Automation and Interfaces Interpretability ✓ ✓ ✓ ✓ Knowledge Database MCPairs Overcoming the Barriers to Implementing AI MC GUI
  • 29. Exploiting medicinal chemistry knowledge to accelerate projects
  • 30. Exploiting medicinal chemistry knowledge to accelerate projects A Less Simple Example Increase logD and gain solubility Property Number of Observations Direction Mean Change Probability logD 8 Increase 1.2 100% Log(Solubility) 14 Increase 1.4 92% What is the effect on lipophilicity and solubility? Roche data is inconclusive! (2 pairs for logD, 1 pair for solubility) logD = 2.65 Kinetic solubility = 84 µg/ml IC50 SST5 = 0.8 µM logD = 3.63 Kinetic solubility = >452 µg/ml IC50 SST5 = 0.19 µM Question: Available Statistics: Roche Example:
  • 31. Exploiting medicinal chemistry knowledge to accelerate projects Instant SAR Analysis Compound to Pairs • Chemists can instantly see the pairs to a compound and explore property changes 31 Exploitable Knowledge Compound to Pairs Molecule of interest All the matched pairs of that molecule Compound to Pairs https://www.youtube.com/watch?v=OFhZJulxsAw&t=0s&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=2
  • 32. Exploiting medicinal chemistry knowledge to accelerate projects https://youtu.be/OFhZJulxsAw
  • 33. Exploiting medicinal chemistry knowledge to accelerate projects 3 Possible input streams…. Rule Database REST - API Your DB crontab MCPCLI REST - API ETL custom plugin • Extract Transform Load (ETL) • Custom plugin scripted by MedChemica • Usually 3 – 4 weeks work • On-site work and team interaction required Exploitation Your DB Your DB YOUR FIREWALL assay1 • Export Flat files of data • MCPCLI reads in files and deletes 1 2 3 • Direct Read Access to DB • SQL searches compounds / measurements • https requests for compounds / measurements • Most robust option data 10 years experience building automated systems MCPairs Server
  • 34. Exploiting medicinal chemistry knowledge to accelerate projects Example Current Pharma install Rule Database In-House Design tools and workflows REST - API MedChemica Web tool MedChemica CLI 3 WAYS OF EXPLOITATION D360 crontab MCPCLI REST - API ETL custom plugin • Every 2 days… • Latest compounds structure pulled from D360 and loaded • Latest measurements from assays pulled and loaded • Custom plugin handled data input streaming • Update the matched pairs and update rules PHARMA FIREWALL MCPairs Server