Exploiting medicinal chemistry knowledge to accelerate projects
May 2019
Explainable Artificial Intelligence in Drug Hunting
• Founded in 2012 by experienced large Pharma
medicinal/computational chemists to accelerate drug
hunting by exploiting data driven knowledge
• Domain leaders in SAR knowledge extraction and
knowledge based design
• > 10 years experience of building AI systems that suggest
actions to chemists (6 years as MedChemica)
• Creators of largest ever documented database of
medicinal chemistry ADMET knowledge
…7 Years of working with pharma
companies
“Our median number of compounds per LO project is 3000 - this is
unsustainable… [it should be] 300”
– Director of Chemistry (large pharma)
“Can we define the textbook of medicinal chemistry?”
– Director of Comp Chem (large pharma)
“We are aiming at 300 compounds per project – currently we are
about 400, we will get better”
– Exscientia scientist at SCI ‘What can BigData do for chemistry’ –
London Oct 2017
MedChemica uses knowledge extraction techniques to build “expert
systems” to suggest actions to chemists and reduce the time and cost
to critical compounds and candidate drugs.
Explainable AI
The future of AI lies in enabling people to collaborate with machines to solve complex
problems.
Like any efficient collaboration, this requires good communication, trust, clarity and
understanding.
- Freddy Lecue, Explainable AI Research Lead, Accenture Labs
https://www.accenture.com/gb-en/insights/technology/explainable-ai-human-machine
Black box machine learning models are currently being used for high-stakes decision
making throughout society, causing problems in healthcare, criminal justice and other
domains. Some people hope that creating methods for explaining these black box
models will alleviate some of the problems, but trying to explain black box models,
rather than creating models that are interpretable in the first place, is likely to
perpetuate bad practice and can potentially cause great harm to society. The way
forward is to design models that are inherently interpretable.
- Cynthia Rudin, Nature Machine Intelligence 2019, 1, 206–215.
Use the right ML tool for the right problem
Where is Medicinal Chemistry?
Interpretable
Failure cost high
Immature science
Highly skilled, critical users
Business-2-Business
Transparent and auditable
Black Box
Failure cost is low
Real time response critical
Interactive = self correcting
Business-2-consumer
User agnostic of process
Help the HiPPOs – or they’ll crush you
1. McAfee & Brynjolfsson “Big Data: The Management Revolution”,
Harvard Business Review October 2012
“Companies often make most of their
important decisions by relying on
“HiPPO”—the highest-paid person’s
opinion.”1
Chemistry HiPPs:
• experts in pattern recognition
• judged on their ability to make the best decisions with partial data
• highly trained
• time poor
• delivery focused
• gatekeepers to the adoption of new approaches
Explainable AI for Medicinal Chemistry Design
[Diagram: MCPairs components: Data Warehouse; Automated loader; Clean Structures & Data; MMPA rule finder; Exploitable Knowledge; Molecule problem solving; Idea ranking; Instant SAR analysis; Explainable QSAR; Property Prediction; REST API & GUI]
Molecule Problem Solving
Compounds from Rules
• Exploitable Knowledge is a rule database derived from MMPA
• User puts in a problem molecule with a property they wish to
improve – e.g. solubility, metabolism, hERG…
• System generates potential improved molecules based on data
[Diagram: Problem molecule + property to improve → MC Expert Enumerator System (drawing on Exploitable Knowledge) → Solution molecules]
Compounds from Rules
https://www.youtube.com/watch?v=lITAT6_-i1E&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=3
https://youtu.be/nQxXddJDTfc
MMPA Enables knowledge sharing
[Diagram: MMPA applied to ADMET data from multiple pharma companies; rules are combined and extracted (>437000 rules), giving better project decisions and increased medicinal chemistry learning]
Kramer, Robb, Ting, Zheng, Griffen, et al. J. Med. Chem. 2018, 61(8), 3277-3292
http://pubs.acs.org/doi/10.1021/acs.jmedchem.7b00935
Our MMPA technology enabled knowledge sharing between multiple
organisations (AstraZeneca, Hoffmann-La Roche and Genentech)
Griffen, E. et al. J. Med. Chem. 2011, 54(22), pp.7739-7750.
Fully Automated Matched Molecular Pair Analysis (MMPA)
Knowledge Extraction that’s understandable by chemists
[Figure: matched pairs A → B and the associated Δ data (A-B)]
• Matched Molecular Pairs – Molecules that differ only by a
particular, well-defined structural transformation
• Capture the change and environment – MMPs can be
recorded as transformations from A → B
• Statistical analysis to define “medicinal chemistry rules”:
defined transformations with a high probability of improving
the properties of molecules
• Store in a high performance database and provide an
intuitive user interface
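The pairing step described above can be sketched in a few lines. This is a toy illustration only, not MedChemica's implementation: it assumes each compound has already been split into a hypothetical (core, substituent) pair, and the compound ids, fragment strings and logD values are invented for the example. A real system would fragment structures with a cheminformatics toolkit and encode each transformation, with its environment, as SMIRKS.

```python
from collections import defaultdict
from itertools import combinations

# Invented, pre-fragmented records: (compound id, core, substituent, logD).
# The core and substituent strings are placeholders, not real fragmentation output.
records = [
    ("cpd1", "core_A", "H", 2.1),
    ("cpd2", "core_A", "F", 2.3),
    ("cpd3", "core_A", "OMe", 2.0),
    ("cpd4", "core_B", "H", 0.7),
    ("cpd5", "core_B", "F", 1.0),
]


def find_pairs(records):
    """Pair compounds that share a core and differ only in the substituent."""
    by_core = defaultdict(list)
    for cid, core, sub, value in records:
        by_core[core].append((cid, sub, value))

    pairs = []
    for core, members in by_core.items():
        for (id_a, sub_a, val_a), (id_b, sub_b, val_b) in combinations(members, 2):
            if sub_a != sub_b:
                pairs.append({
                    "transformation": f"{sub_a}>>{sub_b}",  # A >> B
                    "core": core,
                    "a": id_a,
                    "b": id_b,
                    "delta": val_b - val_a,  # property change going from A to B
                })
    return pairs


def group_deltas(pairs):
    """Collect the property changes observed for each transformation."""
    grouped = defaultdict(list)
    for pair in pairs:
        grouped[pair["transformation"]].append(round(pair["delta"], 2))
    return dict(grouped)


pairs = find_pairs(records)
grouped = group_deltas(pairs)
```

Grouping the deltas by transformation is what makes the later statistical step possible: each transformation ends up with a list of observed changes across different cores.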
Rule selection
1. Identify and group matching SMIRKS.
2. Calculate statistical parameters for each unique SMIRKS (n, median, sd, se, n_up/n_down).
3. Is n ≥ 6? If not, there is not enough data: ignore the transformation.
4. Is the |median| ≤ 0.05 and the interpercentile range (10-90%) ≤ 0.3? If yes, the transformation is classified as ‘neutral’.
5. Otherwise, perform a two-tailed binomial test on the transformation to determine the significance of the up/down frequency.
   • Pass: transformation classified as ‘increase’ or ‘decrease’, depending on which direction the property is changing.
   • Fail: transformation classified as ‘NED’ (No Effect Determined).
[Figure: rule classes along the median data difference axis (-ve, 0, +ve): Decrease, Neutral, Increase, with NED for undetermined effects]
• No assumption of normal
distribution
• Manages ‘censored’ =
qualified / out-of-range
data
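The rule selection steps above can be sketched as a small classifier. The n ≥ 6, |median| ≤ 0.05 and 10-90% range ≤ 0.3 thresholds come from the slide; the significance level `alpha = 0.05`, the quantile interpolation method, and the choice to leave zero changes out of the up/down counts are assumptions made here for illustration, not values stated in the deck.

```python
from math import comb
from statistics import median, quantiles


def binomial_two_tailed_p(n_up, n_down):
    """Exact two-tailed binomial p-value for the up/down split under p = 0.5."""
    n = n_up + n_down
    if n == 0:
        return 1.0
    k = min(n_up, n_down)
    # Probability of a split at least as lopsided in one direction, then doubled.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def classify(deltas, min_n=6, alpha=0.05):
    """Classify one transformation from its list of property changes (B - A)."""
    if len(deltas) < min_n:
        return "ignored"  # not enough data

    deciles = quantiles(deltas, n=10)  # 9 cut points; first ~10th, last ~90th percentile
    spread = deciles[-1] - deciles[0]
    if abs(median(deltas)) <= 0.05 and spread <= 0.3:
        return "neutral"

    n_up = sum(d > 0 for d in deltas)
    n_down = sum(d < 0 for d in deltas)  # exact zeros count for neither direction
    if binomial_two_tailed_p(n_up, n_down) < alpha:
        return "increase" if n_up > n_down else "decrease"
    return "NED"  # No Effect Determined
```

Because the test only counts directions, it makes no assumption about the distribution of the changes. Under the assumed alpha of 0.05, six pairs all moving the same way give p = 0.03125, whereas five would give 0.0625, which is consistent with a minimum of six observations.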
Base of Success Story from Genentech
Objective: improve metabolic stability
Enumeration: 193 compounds enumerated
Filters: Calculated Property, Docking
Result: 8 compounds synthesized
100 cmpds x ($2K make + $1K test) = $ 300 000
8 cmpds x ($2K make + $1K test) = $ 24 000
It is not just money, it is actually time
100 cmpds make & test ~ 15 – 25 weeks
8 cmpds make & test ~ 2 – 4 weeks
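The cost comparison above is plain arithmetic and can be checked with a short sketch. The per-compound figures ($2K to make, $1K to test) are the ones quoted on the slide; the function name and its defaults are illustrative only.

```python
def campaign_cost(n_compounds, make_cost=2_000, test_cost=1_000):
    """Total cost in dollars of making and testing n_compounds."""
    return n_compounds * (make_cost + test_cost)


baseline = campaign_cost(100)   # 100 compounds made and tested
focused = campaign_cost(8)      # 8 compounds selected after enumeration and filtering
saving = baseline - focused
fraction_of_baseline = focused / baseline
```

With the slide's figures, the focused set costs $24 000 against $300 000, that is 8% of the baseline spend.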
A Less Simple Example
Increase logD and gain solubility
Question: What is the effect on lipophilicity and solubility?
Roche data is inconclusive! (2 pairs for logD, 1 pair for solubility)
Available Statistics:
Property          Number of Observations   Direction   Mean Change   Probability
logD              8                        Increase    1.2           100%
Log(Solubility)   14                       Increase    1.4           92%
Roche Example:
logD = 2.65; Kinetic solubility = 84 µg/ml; IC50 SST5 = 0.8 µM
logD = 3.63; Kinetic solubility = >452 µg/ml; IC50 SST5 = 0.19 µM
tBu metabolism issue
Benchmark
compound
Predicted to offer most improvement in microsomal stability (in at least 1 species / assay)
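[Figure: grid of R1/R2 analogues (substituents tBu, Me, Et, iPr) annotated with Clint values; the assignment of values to compounds was lost in extraction. Values in extraction order: 99, 392, 16, 64, 78, 410, 53, 550, 99, 288, 78, 515, 41, 35, 98, 327, 92, 372, 24, 247, 35, 128, 24, 62, 60, 395, 39, 445, 3, 21, 20, 27, 57, 89, 54, 89]
• Data shown are Clint for HLM and MLM (top and bottom, respectively)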
Roger Butlin
Rebecca Newton
Allan Jordan
Tubulin Polymerization Inhibitors
Indole-3-glyoxylamide Based Series of Tubulin Polymerization Inhibitors
– Increase potency, solubility and reduce metabolism
– Enable in-vivo xenograft studies
Thompson, M. et al. J. Med. Chem. 2015, 58(23), 9309–9333
Indibulin D-24851: LC50 0.032; XlogP 3.35; ~ potent; in-vivo activity; poor solubility (~1 µM)
MMPA solubility & QSAR calcs
LC50 0.027; XlogP 2.02
LC50 0.055; XlogP 2.91; solubility (~10-80 µM)
LC50 0.031; XlogP 2.57; solubility (~10-80 µM)
59
Idea Ranking
SpotDesign
• Use the knowledge database to estimate how good an idea is
compared to a benchmark molecule
• System generates assessment based on data
[Diagram: Idea molecule + benchmark molecule + property → SpotDesign (drawing on Exploitable Knowledge) → Assessment of idea molecule compared to benchmark]
SpotDesign
https://www.youtube.com/watch?v=JMhQvNdBOFs&index=2&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL
https://youtu.be/fDpFo53IdOE
Instant SAR Analysis
Compound to Pairs
• Chemists can instantly see the pairs to a compound and explore
property changes
[Diagram: Molecule of interest → Compound to Pairs (drawing on Exploitable Knowledge) → All the matched pairs of that molecule]
Compound to Pairs
https://www.youtube.com/watch?v=OFhZJulxsAw&t=0s&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=2
https://youtu.be/OFhZJulxsAw
Property Prediction
Automated Explainable QSAR
• Chemists get predictions with the substructures highlighted that are
driving prediction and the molecules used to support that part of the
model – transparent / explainable AI.
[Diagram: Molecule structure + property to predict → Explainable QSAR (built on Clean Structures & Data) → Property Prediction → Prediction + clear drivers of prediction]
Feature: Definition
Basic Group: Atom or group most likely protonated at pH 7.4
Acidic Group: Atom or group most likely deprotonated at pH 7.4, includes N and C acids
Acceptor: Definitions derived from Taylor & Cosgrove
Donor: Definitions derived from Taylor & Cosgrove
Hydrophobic: C4 or greater cyclic or acyclic alkyl group
Aromatic Attachment: Connection of any group to an aromatic atom, excluding connections within rings
Aliphatic Attachment: Connection of any atom to an aliphatic group not in a ring
Halo: F, Cl, Br, I
Reference for Donor acceptor feature definitions:
Taylor, R.; Cole, J. C.; Cosgrove, D. A.; Gardiner, E. J.; Gillet, V. J.; Korb, O. J Comput Aided Mol Des 2012, 26 (4), 451–
472.
Acid & Base definitions are SMARTS (MedChemica definitions) covering C, N and heteroaromatic acids, and bases
including amidines and guanidines but excluding weak aniline bases.
MedChemica Advanced Pharmacophore Pairs
Gobbi, A.; Poppinger, D. Biotechnology and Bioengineering 1998, 61 (1), 47–54.
Reutlinger, M.; Koch, C. P.; Reker, D.; Todoroff, N.; Schneider, P.; Rodrigues, T.; Schneider, G. Mol. Inf. 2013, 32 (2),
133–138.
Pay attention to your descriptors
• Chemistry must make sense
Simple
H bond
acceptor
base acid
Precise
Diclofenac
(1973)
Sulfadiazine
(1941)
DMAP
Random Forest & Pharmacophore understanding
• hERG – auditable models
• Identify important chemical features driving potency
• Predict hERG potency from RF model [10 fold CV]
Pharmacophore fp length: 280
10 fold CV:
Compounds in training: 5968
RMSE: 0.375
Pearson R2: 0.51
Random Forest & Pharmacophore understanding
• hERG – auditable models
• Predict hERG potency from RF model [10 fold CV]
• Example CHEMBL12713 sertindole
• Colour structure by feature importance
weighted sum of pharmacophore pair
fingerprints – show the chemists where the
hotspots are.
• Drill deeper to show the most important
positive and negative features.
RF prediction pIC50 7.7
median_with: 5.1, median_without: 4.7, median_diff: 0.4, n_examples_with: 4585, n_examples_without: 1383
median_with: 5.1, median_without: 5.3, median_diff: -0.2, n_examples_with: 3106, n_examples_without: 2862
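The per-feature summaries above (median_with, median_without, median_diff and the two example counts) can be reproduced with a short sketch. It assumes each training compound is represented as a set of pharmacophore-pair feature names alongside its pIC50; the feature names and potency values below are invented for illustration and are not taken from the hERG model.

```python
from statistics import median


def feature_summary(fingerprints, potencies, feature):
    """Compare potencies of compounds with and without a given feature."""
    with_f = [p for fp, p in zip(fingerprints, potencies) if feature in fp]
    without_f = [p for fp, p in zip(fingerprints, potencies) if feature not in fp]
    if not with_f or not without_f:
        return None  # the comparison needs examples on both sides

    m_with, m_without = median(with_f), median(without_f)
    return {
        "median_with": m_with,
        "median_without": m_without,
        "median_diff": round(m_with - m_without, 2),
        "n_examples_with": len(with_f),
        "n_examples_without": len(without_f),
    }


# Invented toy data: sets of hypothetical feature names and pIC50 values.
fps = [
    {"base-6-aromatic", "halo-4-aromatic"},
    {"base-6-aromatic"},
    {"acid-3-acceptor"},
    {"halo-4-aromatic"},
    {"base-6-aromatic", "acid-3-acceptor"},
]
pic50 = [6.2, 5.8, 4.5, 5.0, 5.4]

summary = feature_summary(fps, pic50, "base-6-aromatic")
```

Reporting the counts alongside the medians lets a chemist judge how much evidence sits behind each highlighted feature.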
kNN – Understanding from neighbouring structures
• hERG – auditable models
• Predict hERG potency from kNN model [10 fold CV]
• Example CHEMBL12713 sertindole
• Identify the closest neighbours by Tanimoto
distance on ECFP4 fingerprints
• Show chemists structures
kNN prediction pIC50 8.2
distance 0.17 0.2 0.23
pIC50 7.7 4.1 8.2
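A nearest-neighbour prediction of this kind can be sketched as follows. Fingerprints are modelled here as plain sets of integer bit ids rather than real ECFP4 output, and the prediction is taken as the unweighted mean of the k nearest neighbours; both are assumptions for illustration, since the deck does not say how the neighbours are combined. The training data is invented.

```python
def tanimoto_distance(a, b):
    """1 - Tanimoto similarity between two fingerprints given as sets of bit ids."""
    union = len(a | b)
    if union == 0:
        return 0.0  # two empty fingerprints are treated as identical
    return 1.0 - len(a & b) / union


def knn_predict(query, training, k=3):
    """Return (prediction, neighbours) using the k closest training compounds.

    training is a list of (name, fingerprint, pIC50) tuples.
    """
    if not training:
        raise ValueError("training set is empty")
    ranked = sorted(training, key=lambda item: tanimoto_distance(query, item[1]))
    neighbours = [
        (name, round(tanimoto_distance(query, fp), 2), value)
        for name, fp, value in ranked[:k]
    ]
    prediction = sum(value for _, _, value in neighbours) / len(neighbours)
    return prediction, neighbours


# Invented toy data.
query = {1, 2, 3, 4, 5, 6}
training = [
    ("n1", {1, 2, 3, 4, 5}, 7.0),
    ("n2", {1, 2, 3, 4, 7}, 6.0),
    ("n3", {1, 2, 8, 9}, 5.0),
    ("n4", {10, 11}, 4.0),
]

prediction, neighbours = knn_predict(query, training, k=3)
```

Returning the neighbours with their distances and measured values, not just the number, is what makes the prediction auditable: the chemist can inspect the structures the model relied on.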
• ML models built for 19 critical seizure related CNS targets
• Communicate to chemists activity prediction & if model out of domain
• Show close structures and/or toxophores
[Diagram: targets covered: 5HT receptor antagonists (1A, 1B, 2A); opioid antagonists (δ, κ, µ); CB1; D1; D2; GABA a1; M1; M2; Monoamine oxidase; NMDA-NR1; PDE-4D; Acetylcholine esterase; Acetylcholine Receptor a1b2; Transporters: Noradrenaline, Dopamine, 5-HT]
Seizure prediction by Composite Machine Learning
CHEMBL 12713 sertindole
seizure activity observed
clinically
Predictions in line with
measured data
contact@medchemica.com
More potent than 1µM
Less potent than 1µM
Out of Domain – no
prediction possible
Engineering and Automation
[Diagram components: Corporate structures and measurements from DB; Structure and data clean up; MedChemica Clean Structures & Data; Pair finding; Pair & Rule Database; MCRules; API server with RESTful API; Compounds from Rules; Compound to Pairs; Spot Design; Explainable QSAR; MedChemica Web GUI; MedChemica CLI; In-House Design tools]
Example Current Pharma install
3 WAYS OF EXPLOITATION
• In-House Design tools and workflows (REST API)
• MedChemica Web tool
• MedChemica CLI
[Diagram components: Rule Database; D360; ETL custom plugin; crontab; MCPCLI; REST API]
• Every 2 days…
• Latest compound structures pulled from D360 and loaded
• Latest measurements from assays pulled and loaded
• Custom plugin handles data input streaming
• Matched pairs and rules updated
PHARMA FIREWALL
MCPairs
Server
Overcoming the Barriers to Implementing AI
✓ Data integrity and curation
✓ Knowledge extraction algorithms
✓ Engineering, automation and interfaces
✓ Interpretability
[Diagram components: MCPairs; Knowledge Database; MC GUI]
3 Possible input streams….
[Diagram components, inside YOUR FIREWALL: Your DB; assay data; crontab; MCPCLI; REST API; ETL custom plugin; MCPairs Server; Rule Database; Exploitation via REST API. The numbering (1, 2, 3) of the streams was lost in extraction.]
• Extract Transform Load (ETL)
• Custom plugin scripted by MedChemica
• Usually 3 – 4 weeks work
• On-site work and team interaction required
• Export flat files of data
• MCPCLI reads in files and deletes
• Direct read access to DB
• SQL searches compounds / measurements
• https requests for compounds / measurements
• Most robust option
10 years experience building automated systems

  • 2. Founded in 2012 by experienced large Pharma medicinal/computational chemists to accelerate drug hunting by exploiting data-driven knowledge • Domain leaders in SAR knowledge extraction and knowledge-based design • > 10 years' experience of building AI systems that suggest actions to chemists (6 years as MedChemica) • Creators of the largest ever documented database of medicinal chemistry ADMET knowledge
  • 3. …7 Years of working with pharma companies • “Our median number of compounds per LO project is 3000 - this is unsustainable… [it should be] 300” – Director of Chemistry (large pharma) • “Can we define the text book of medicinal chemistry?” – Director of Comp Chem (large pharma) • “We are aiming at 300 compounds per project – currently we are about 400, we will get better” – Exscientia scientist at SCI ‘What can BigData do for chemistry’, London, Oct 2017 • MedChemica uses knowledge extraction techniques to build “expert systems” to suggest actions to chemists and reduce the time and cost to critical compounds and candidate drugs.
  • 4. Explainable AI • “The future of AI lies in enabling people to collaborate with machines to solve complex problems. Like any efficient collaboration, this requires good communication, trust, clarity and understanding.” – Freddy Lecue, Explainable AI Research Lead, Accenture Labs https://www.accenture.com/gb-en/insights/technology/explainable-ai-human-machine • “Black box machine learning models are currently being used for high-stakes decision making throughout society, causing problems in healthcare, criminal justice and other domains. Some people hope that creating methods for explaining these black box models will alleviate some of the problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practice and can potentially cause great harm to society. The way forward is to design models that are inherently interpretable.” – Cynthia Rudin, Nature Machine Intelligence 2019, 1, 206–215.
  • 5. Use the right ML tool for the right problem. Where is Medicinal Chemistry? • Interpretable: failure cost high; immature science; highly skilled, critical users; Business-2-Business; transparent and auditable • Black Box: failure cost is low; real time response critical; interactive = self correcting; Business-2-consumer; user agnostic of process
  • 6. Help the HiPPOs – or they’ll crush you • “Companies often make most of their important decisions by relying on “HiPPO” – the highest-paid person’s opinion.” [1] • Chemistry HiPPs: experts in pattern recognition; judged on their ability to make the best decisions with partial data; highly trained; time poor; delivery focused; gatekeepers to the adoption of new approaches • [1] McAfee & Brynjolfsson, “Big Data: The Management Revolution”, Harvard Business Review, October 2012
  • 7. Explainable AI for Medicinal Chemistry Design [Diagram labels: Data Warehouse; Automated loader; Clean Structures & Data; MMPA; rule finder; Exploitable Knowledge; Explainable QSAR; Molecule problem solving; Idea ranking; Instant SAR analysis; Property Prediction; MCPairs REST API & GUI]
  • 8. Molecule Problem Solving: Compounds from Rules • Exploitable Knowledge is a rule database derived from MMPA • User puts in a problem molecule with a property they wish to improve, e.g. solubility, metabolism, hERG… • System generates potential improved molecules based on data • [Diagram labels: Problem molecule + property to improve; Exploitable Knowledge; MC Expert Enumerator System; Solution molecules] • https://www.youtube.com/watch?v=lITAT6_-i1E&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=3
  • 9. https://youtu.be/nQxXddJDTfc
  • 10. MMPA enables knowledge sharing • Our MMPA technology enabled knowledge sharing between multiple organisations (AstraZeneca, Hoffmann-La Roche and Genentech) • [Diagram labels: Multiple Pharma ADMET data; MMPA (×3); Combine and Extract Rules; >437000 rules; Better Project decisions; Increased Medicinal Chemistry learning] • Kramer, Robb, Ting, Zheng, Griffen, et al. J. Med. Chem. 2018, 61(8), 3277-3292 http://pubs.acs.org/doi/10.1021/acs.jmedchem.7b00935
  • 11. Fully Automated Matched Molecular Pair Analysis (MMPA): knowledge extraction that’s understandable by chemists • Matched Molecular Pairs – molecules that differ only by a particular, well-defined structural transformation • Capture the change and environment – MMPs can be recorded as transformations from A → B • Statistical analysis to define “medicinal chemistry rules” – defined transformations with high probability of improving properties of molecules • Store in a high performance database and provide an intuitive user interface • [Figure: Δ Data (A-B) for matched pairs A, B] • Griffen, E. et al. J. Med. Chem. 2011, 54(22), pp. 7739-7750.
  • 12. Rule selection (flowchart) • Identify and group matching SMIRKS • Calculate statistical parameters for each unique SMIRKS (n, median, sd, se, n_up/n_down) • Is n ≥ 6? No: not enough data, ignore transformation • Yes: is the |median| ≤ 0.05 and the intercentile range (10-90%) ≤ 0.3? Yes: transformation is classified as ‘neutral’ • No: perform two-tailed binomial test on the transformation to determine the significance of the up/down frequency • Pass: transformation classified as ‘increase’ or ‘decrease’ depending on which direction the property is changing • Fail: transformation classified as ‘NED’ (No Effect Determined) • [Figure: median data difference axis (-ve, 0, +ve) with regions Decrease, Neutral, Increase, NED] • No assumption of normal distribution • Manages ‘censored’ = qualified / out-of-range data
  • 13. Base of Success Story from Genentech • Objective: improve metabolic stability • Enumeration, Calculated Property, Docking: 193 compounds enumerated, 8 compounds synthesized • 100 cmpds x ($2K make + $1K test) = $300 000 • 8 cmpds x ($2K make + $1K test) = $24 000 • It is not just money, it is actually time: 100 cmpds make & test ~ 15 – 25 weeks; 8 cmpds make & test ~ 2 – 4 weeks
  • 14. A Less Simple Example: increase logD and gain solubility • Question: What is the effect on lipophilicity and solubility? • Available Statistics (Property: Number of Observations, Direction, Mean Change, Probability): logD: 8, Increase, 1.2, 100%; Log(Solubility): 14, Increase, 1.4, 92% • Roche Example: logD = 2.65, kinetic solubility = 84 µg/ml, IC50 SST5 = 0.8 µM; logD = 3.63, kinetic solubility > 452 µg/ml, IC50 SST5 = 0.19 µM • Roche data is inconclusive! (2 pairs for logD, 1 pair for solubility)
  • 15. tBu metabolism issue • Legend: benchmark compound; predicted to offer most improvement in microsomal stability (in at least 1 species / assay) • Data shown are Clint for HLM and MLM (top and bottom, respectively) • Substituents R1, R2: tBu, Me, Et, iPr • Values as extracted (table layout not recoverable): 99 392 16 64 78 410 53 550 99 288 78 515 41 35 98 327 92 372 24 247 35 128 24 62 60 395 39 445 3 21 20 27 57 89 54 89 • Roger Butlin, Rebecca Newton, Allan Jordan
  • 16. Tubulin Polymerization Inhibitors
  • 17. Indole-3-glyoxylamide Based Series of Tubulin Polymerization Inhibitors – Increase potency, solubility and reduce metabolism – Enable in-vivo xenograft studies • Thompson, M. et al. J. Med. Chem., 2015, 58 (23), pp 9309–9333 • MMPA solubility & QSAR calcs • Indibulin D-24851: LC50 0.032, XlogP 3.35, ~ potent in-vivo activity, poor solubility (~ 1 µM) • LC50 0.027, XlogP 2.02 • LC50 0.055, XlogP 2.91, solubility (~10-80 µM) • LC50 0.031, XlogP 2.57, solubility (~10-80 µM) • 59
  • 18. Idea Ranking: SpotDesign • Use the knowledge database to estimate how good an idea is compared to a benchmark molecule • System generates assessment based on data • [Diagram labels: Idea molecule + benchmark molecule + property; Exploitable Knowledge; SpotDesign; Assessment of idea molecule compared to benchmark] • https://www.youtube.com/watch?v=JMhQvNdBOFs&index=2&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL
  • 19. https://youtu.be/fDpFo53IdOE
  • 20. Instant SAR Analysis: Compound to Pairs • Chemists can instantly see the pairs to a compound and explore property changes • [Diagram labels: Molecule of interest; Exploitable Knowledge; Compound to Pairs; All the matched pairs of that molecule] • https://www.youtube.com/watch?v=OFhZJulxsAw&t=0s&list=PLtkCAojNL97xs1kd5JHngjIRhl4ZPFTlL&index=2
  • 21. https://youtu.be/OFhZJulxsAw
  • 22. Property Prediction: Automated Explainable QSAR • Chemists get predictions with the substructures that drive the prediction highlighted, together with the molecules used to support that part of the model – transparent / explainable AI • [Diagram labels: Molecule structure + property to predict; Clean Structures & Data; Explainable QSAR; Property Prediction; Prediction + clear drivers of prediction]
  • 23. MedChemica Advanced Pharmacophore Pairs • Feature: Definition • Basic Group: atom or group most likely protonated at pH 7.4 • Acidic Group: atom or group most likely deprotonated at pH 7.4, includes N and C acids • Acceptor: definitions derived from Taylor & Cosgrove • Donor: definitions derived from Taylor & Cosgrove • Hydrophobic: C4 or greater cyclic or acyclic alkyl group • Aromatic Attachment: connection of any group to an aromatic atom, excluding connections within rings • Aliphatic Attachment: connection of any atom to an aliphatic group not in a ring • Halo: F, Cl, Br, I • Acid & Base definitions are SMARTS including C, N, heteroaromatic acids, bases excluding weak aniline bases, including amidines, guanidines (MedChemica definitions) • Reference for Donor/Acceptor feature definitions: Taylor, R.; Cole, J. C.; Cosgrove, D. A.; Gardiner, E. J.; Gillet, V. J.; Korb, O. J Comput Aided Mol Des 2012, 26 (4), 451–472. • Gobbi, A.; Poppinger, D. Biotechnology and Bioengineering 1998, 61 (1), 47–54. • Reutlinger, M.; Koch, C. P.; Reker, D.; Todoroff, N.; Schneider, P.; Rodrigues, T.; Schneider, G. Mol. Inf. 2013, 32 (2), 133–138.
  • 24. Pay attention to your descriptors • Chemistry must make sense • [Figure: Simple vs Precise feature assignment (H bond acceptor, base, acid) shown on Diclofenac (1973), Sulfadiazine (1941), DMAP]
  • 25. Random Forest & Pharmacophore understanding • hERG – auditable models • Identify important chemical features driving potency • Predict hERG potency from RF model [10 fold CV] • Pharmacophore fp length: 280 • 10 fold CV • Compounds in training: 5968 • RMSE: 0.375 • Pearson R2: 0.51
  • 26. Random Forest & Pharmacophore understanding • hERG – auditable models • Predict hERG potency from RF model [10 fold CV] • Example CHEMBL12713 sertindole • Colour structure by feature importance weighted sum of pharmacophore pair fingerprints – show the chemists where the hotspots are • Drill deeper to show the most important positive and negative features • RF prediction pIC50 7.7 • median_with: 5.1, median_without: 4.7, median_diff: 0.4, n_examples_with: 4585, n_examples_without: 1383 • median_with: 5.1, median_without: 5.3, median_diff: -0.2, n_examples_with: 3106, n_examples_without: 2862
  • 27. kNN – Understanding from neighbouring structures • hERG – auditable models • Predict hERG potency from kNN model [10 fold CV] • Example CHEMBL12713 sertindole • Identify the closest neighbours by Tanimoto to ECFP4 fingerprint • Show chemists structures • kNN prediction pIC50 8.2 • Neighbours shown: distance 0.17, 0.2, 0.23; pIC50 7.7, 4.1, 8.2
  • 28. Seizure prediction by Composite Machine Learning • ML models built for 19 critical seizure related CNS targets • Communicate to chemists activity prediction & if model out of domain • Show close structures and/or toxophores • Target labels (as extracted from the figure): 5HT receptors antagonists 2A, 1B, 1A; Opioid antagonists δ, κ, µ; CB1; D1; D2; GABA α1; M1; M2; Monoamine oxidase; NMDA-NR1; PDE-4D; Acetylcholine esterase; Acetylcholine Receptor α1β2; Transporters: Noradrenaline, Dopamine, 5-HT • Example: CHEMBL12713 sertindole, seizure activity observed clinically; predictions in line with measured data • Legend: More potent than 1 µM; Less potent than 1 µM; Out of Domain – no prediction possible • contact@medchemica.com
  • 29. Engineering and Automation • [Diagram labels: Corporate structures and measurements from DB; Structure and data clean up; Pair finding; Pair & Rule Database; MCRules; API server (RESTful API); Compounds from Rules; Compound to Pairs; Spot Design; Clean Structures & Data; Explainable QSAR; Web GUI; MedChemica CLI; In-House Design tools]
  • 30. Example Current Pharma install • Every 2 days: latest compound structures pulled from D360 and loaded; latest measurements from assays pulled and loaded; custom plugin handles data input streaming; update the matched pairs and update rules • 3 ways of exploitation: In-House Design tools and workflows (REST API); MedChemica Web tool; MedChemica CLI • [Diagram labels: PHARMA FIREWALL; D360; crontab; MCPCLI; REST - API; ETL custom plugin; MCPairs Server; Rule Database]
  • 31. Overcoming the Barriers to Implementing AI • Data Integrity and curation ✓ • Knowledge extraction algorithms ✓ • Engineering, Automation and Interfaces ✓ • Interpretability ✓ • [Diagram labels: Knowledge Database; MCPairs; MC GUI]
  • 34. 3 Possible input streams… • Extract Transform Load (ETL): custom plugin scripted by MedChemica; usually 3 – 4 weeks work; on-site work and team interaction required • Flat files: export flat files of data; MCPCLI reads in files and deletes • Direct read access to DB: SQL searches compounds / measurements; https requests for compounds / measurements; most robust option • 10 years experience building automated systems • [Diagram labels: YOUR FIREWALL; Your DB; assay1 data; crontab; MCPCLI; REST - API; ETL custom plugin; MCPairs Server; Rule Database; Exploitation]
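
The rule-selection flow on slide 12 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not MedChemica's implementation: the function names, the significance level `alpha=0.05`, the use of `statistics.quantiles` for the 10-90% range, and the treatment of zero differences as neither up nor down are all assumptions, and censored (qualified / out-of-range) data are not handled.

```python
from math import comb
from statistics import median, quantiles


def binomial_two_tailed_p(n_up, n_down):
    """Exact two-tailed binomial p-value against a 50/50 up/down split."""
    n = n_up + n_down
    if n == 0:
        return 1.0
    k = max(n_up, n_down)
    # Probability of a split at least this lopsided in one direction.
    tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def classify_transformation(deltas, alpha=0.05):
    """Classify one transformation from its matched-pair property changes.

    `alpha` is an assumed significance level; the slide does not state one.
    """
    if len(deltas) < 6:
        return "ignored"  # not enough data
    deciles = quantiles(deltas, n=10)  # 9 cut points: 10%, 20%, ..., 90%
    spread = deciles[-1] - deciles[0]  # 10-90% intercentile range
    if abs(median(deltas)) <= 0.05 and spread <= 0.3:
        return "neutral"
    n_up = sum(d > 0 for d in deltas)
    n_down = sum(d < 0 for d in deltas)
    if binomial_two_tailed_p(n_up, n_down) < alpha:
        return "increase" if n_up > n_down else "decrease"
    return "NED"  # No Effect Determined


print(classify_transformation([0.5, 0.6, 0.7, 0.8, 0.9, 1.0, 1.1, 1.2]))  # increase
```

Because the classification relies on the median, a percentile range and a sign test, it makes no normality assumption, in line with the note on the slide.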
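
The per-feature statistics on slide 26 (median_with, median_without, median_diff, n_examples_with, n_examples_without) can be reproduced with a simple split of the training set. The sketch below is an assumption about how such numbers could be computed, not the deck's code: the record layout (a set of feature labels plus a pIC50) and the feature names are hypothetical.

```python
from statistics import median


def feature_summary(records, feature):
    """Compare measured values for compounds with and without one feature.

    `records` is a list of (feature_set, value) tuples; this layout is a
    hypothetical stand-in for a pharmacophore-pair fingerprint table.
    """
    with_f = [value for feats, value in records if feature in feats]
    without_f = [value for feats, value in records if feature not in feats]
    if not with_f or not without_f:
        return None  # no comparison possible
    return {
        "median_with": median(with_f),
        "median_without": median(without_f),
        "median_diff": median(with_f) - median(without_f),
        "n_examples_with": len(with_f),
        "n_examples_without": len(without_f),
    }


# Toy data with made-up feature labels and pIC50 values.
toy = [
    ({"base-aromatic"}, 6.0),
    ({"base-aromatic", "halo"}, 5.0),
    ({"halo"}, 4.0),
    (set(), 4.5),
    ({"base-aromatic"}, 5.5),
]
print(feature_summary(toy, "base-aromatic"))
```

Showing the counts next to the medians lets a chemist judge how much data stands behind each highlighted feature.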
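
The nearest-neighbour lookup on slide 27 can be illustrated as follows. This is a sketch under assumptions: fingerprints are represented as plain Python sets of on-bit indices (in practice ECFP4 bits would come from a cheminformatics toolkit), and the prediction is an unweighted mean of the k neighbours, which is an assumed choice because the slide does not say how neighbours are combined.

```python
def tanimoto_distance(fp_a, fp_b):
    """Tanimoto distance (1 - similarity) between two sets of on-bits."""
    union = len(fp_a | fp_b)
    if union == 0:
        return 0.0
    return 1.0 - len(fp_a & fp_b) / union


def knn_predict(query_fp, training, k=3):
    """Return (prediction, neighbours) for a query fingerprint.

    `training` is a list of (name, fingerprint_set, value) tuples.
    The prediction is the unweighted mean of the k nearest values (assumed).
    """
    scored = [
        (name, tanimoto_distance(query_fp, fp), value)
        for name, fp, value in training
    ]
    neighbours = sorted(scored, key=lambda item: item[1])[:k]
    prediction = sum(value for _, _, value in neighbours) / len(neighbours)
    return prediction, neighbours


# Toy data: compound names, bit sets and pIC50 values are made up.
training = [
    ("cmpd_a", {1, 2, 3, 4}, 7.0),
    ("cmpd_b", {1, 2, 3, 5}, 5.0),
    ("cmpd_c", {9, 10}, 3.0),
]
prediction, neighbours = knn_predict({1, 2, 3, 4}, training, k=2)
for name, distance, value in neighbours:
    print(name, round(distance, 2), value)
print("prediction", prediction)
```

Returning the neighbours alongside the prediction is what makes the model auditable: the chemist sees which structures, at what distance, produced the number.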