Automated QSAR Modelling
David E Leahy, Newcastle University, UK
Damjan Krstajic, Research Centre for Cheminformatics, Serbia
Discovery Bus
www.discoverybus.com
“The Discovery Bus is not a tool for users. It is a system for deriving QSAR models independent of any user.”
Discovery Bus QSAR
  • Chemical structure & response data
  • Transform response: 1/X, logX, X, class
  • Split and stratify: ?
  • Calculate descriptors: D, E, H, L, R, A
  • Combine descriptors: A&D, A&L, L&H&R, A&E, E&D, A&D&R, ...
  • Filter features: no filter, cfs1, cfs2, cfs3, cfs4, cfs5
  • Cross validate
  • Build models: Rnnet, Rrpart, Rlin, Rpls, GARMLR, NetlabNN, GUIDE, GAWRMLR
  • Test model
Model count: 4 x 8 x 6 x 8 = 1536 models
Extension points marked on the diagram: ?&?, ?, new ff, New method?
Adding a new feature filter: 4 x 8 = 32 filter feature requests; 32 filter feature requests x 8 = 256 models
Further diagram label: 10%
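The model counts quoted on the workflow slide can be checked by enumerating the option combinations. The sketch below is illustrative only. The transform, filter and learner names are copied from the slide, but the slide names only six of the eight descriptor combinations, so the descriptor sets are index placeholders rather than guessed names, and `count_models` is a hypothetical helper.

```python
from itertools import product

# Option names as they appear on the slide.
TRANSFORMS = ["1/X", "logX", "X", "class"]
FILTERS = ["no filter", "cfs1", "cfs2", "cfs3", "cfs4", "cfs5"]
LEARNERS = ["Rnnet", "Rrpart", "Rlin", "Rpls",
            "GARMLR", "NetlabNN", "GUIDE", "GAWRMLR"]

# The slide counts 8 descriptor combinations but names only six
# (A&D, A&L, L&H&R, A&E, E&D, A&D&R, ...), so index placeholders are
# used here instead of guessing the remaining names.
N_DESCRIPTOR_SETS = 8
DESCRIPTOR_SETS = [f"descriptor_set_{i}" for i in range(1, N_DESCRIPTOR_SETS + 1)]


def count_models(transforms, descriptor_sets, filters, learners):
    """Count every (transform, descriptor set, filter, learner) combination."""
    return sum(1 for _ in product(transforms, descriptor_sets, filters, learners))


total_models = count_models(TRANSFORMS, DESCRIPTOR_SETS, FILTERS, LEARNERS)

# Adding one new feature filter: one filter request per
# (transform, descriptor set) pair, each then passed to every learner.
new_filter_requests = len(TRANSFORMS) * len(DESCRIPTOR_SETS)
new_models = new_filter_requests * len(LEARNERS)

print(total_models, new_filter_requests, new_models)
```

Enumerating with `product` rather than multiplying the lengths keeps the sketch close to how a scheduler would actually walk the combinations.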
Solubility
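Solubility Results

| Learner | Filter | Reduction (-> Filter -> Learner) | Types | Linear Fit | Training (1167) Rel.MSE | Training (1167) r² | Test (130) Rel.MSE | Test (130) r² |
|---|---|---|---|---|---|---|---|---|
| GUIDE | H | 1990 -> 558 -> 54 | R,D | 1.46 | 0.11 | 0.89 | 0.12 | 0.89 |
| GUIDE | H | 170 -> 26 -> 14 | A,E,H,D | 0.13 | 0.11 | 0.89 | 0.13 | 0.88 |
| GUIDE | H | 80 -> 16 -> 12 | A,H,D | 0.14 | 0.11 | 0.88 | 0.12 | 0.87 |
| GUIDE | C | 250 -> 2 -> 2 | A,R | 0.18 | 0.13 | 0.87 | 0.16 | 0.84 |
| GUIDE | C | 8 -> 2 -> 2 | A,L | 0.16 | 0.13 | 0.87 | 0.16 | 0.86 |
| GA1 | H | 80 -> 16 -> 16 | A,H,D | 0.14 | 0.14 | 0.86 | 0.18 | 0.83 |
| GA1 | C | 8 -> 2 -> 2 | A,L | 0.16 | 0.17 | 0.84 | 0.17 | 0.83 |
| NN1 | H | 250 -> 54 -> 54 | A,R | 0.12 | 0.09 | 0.91 | 0.08 | 0.92 |
| NN1 | H | 80 -> 16 -> 16 | A,H,D | 0.14 | 0.10 | 0.90 | 0.12 | 0.88 |
| NN1 | H | 326 -> 46 -> 46 | H,R,D | 0.18 | 0.10 | 0.90 | 0.12 | 0.89 |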
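The slide does not define Rel.MSE or r². One common convention, assumed here and not confirmed by the source, is that relative MSE is the mean squared error divided by the variance of the observed values, with r² = 1 - Rel.MSE. That would be consistent with most rows, where the two columns sum to roughly 1. The function names below are hypothetical.

```python
def relative_mse(observed, predicted):
    """Mean squared error divided by the variance of the observed values.

    Assumed definition: the slide does not state how Rel.MSE is computed.
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_obs = sum(observed) / len(observed)
    # The 1/n factors of MSE and variance cancel, leaving a ratio of sums.
    sum_sq_error = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sum_sq_total = sum((o - mean_obs) ** 2 for o in observed)
    if sum_sq_total == 0:
        raise ValueError("observed values have zero variance")
    return sum_sq_error / sum_sq_total


def r_squared(observed, predicted):
    """Coefficient of determination under the same assumed convention."""
    return 1.0 - relative_mse(observed, predicted)


observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 5.0]
print(relative_mse(observed, predicted), r_squared(observed, predicted))
```

Under this convention a model that always predicts the mean response scores a Rel.MSE of 1, so values well below 1 indicate that the descriptors carry real signal.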
HSA Binding
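| Learner | Filter | Reduction (-> Filter -> Learner) | Types | Linear Fit | Training (82) Rel.MSE | Training (82) r² | Test (9) Rel.MSE | Test (9) r² |
|---|---|---|---|---|---|---|---|---|
| GUIDE | Hh2 | 332 -> 39 -> 8 | A,E,R | 0.92 | 0.40 | 0.62 | 0.25 | 0.81 |
| GUIDE | H | 250 -> 59 -> 12 | A,R | 1.62 | 0.47 | 0.56 | 0.30 | 0.76 |
| GUIDE | Hh4 | 382 -> 20 -> 1 | A | 0.25 | 0.50 | 0.50 | 0.57 | 0.49 |
| GA1 | Hh2 | 1998 -> 39 -> 26 | A,R,D | 0.42 | 0.23 | 0.77 | 0.20 | 0.85 |
| GA1 | Hh4 | 344 -> 20 -> 19 | H,R,D | 0.42 | 0.26 | 0.74 | 0.28 | 0.78 |
| GA1 | Hh10 | 302 -> 9 -> 9 | H,R | 0.27 | 0.27 | 0.73 | 0.40 | 0.64 |
| NN1 | H | 8 -> 5 -> 5 | A,L | 0.37 | 0.17 | 0.83 | 0.15 | 0.87 |
| NN1 | Hh10 | 346 -> 8 -> 8 | A,R,D | 0.30 | 0.30 | 0.70 | 0.16 | 0.84 |
| NN1 | H | 302 -> 19 -> 19 | H,R | 0.27 | 0.32 | 0.70 | 0.39 | 0.71 |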
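Both result tables hold out roughly a tenth of the data (130 of 1297 for solubility, 9 of 91 for HSA binding), which fits the "Split and stratify" step and the 10% label on the workflow slide. The slides do not say how the stratification is done. The sketch below assumes one simple scheme: sort by response value and hold out one randomly chosen item from each consecutive block of ten. The function name and parameters are hypothetical.

```python
import random


def stratified_holdout(responses, block_size=10, seed=0):
    """Return (train_idx, test_idx), holding out one item per block of
    `block_size` consecutive items in response-sorted order.

    Assumed scheme for illustration; not taken from the slides.
    """
    rng = random.Random(seed)
    # Indices ordered by response value, so each block covers a narrow range.
    order = sorted(range(len(responses)), key=lambda i: responses[i])
    test_idx = []
    for start in range(0, len(order), block_size):
        block = order[start:start + block_size]
        if len(block) == block_size:  # a short trailing block stays in training
            test_idx.append(rng.choice(block))
    held_out = set(test_idx)
    train_idx = [i for i in order if i not in held_out]
    return train_idx, sorted(test_idx)


# Toy data with the same size as the HSA binding set (82 + 9 = 91 values).
toy_responses = [0.1 * i for i in range(91)]
train_idx, test_idx = stratified_holdout(toy_responses)
print(len(train_idx), len(test_idx))
```

Drawing the test items from every part of the sorted response range keeps the test set representative even when it contains only a handful of compounds.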
P-Glycoprotein

| Technique | % Correctly Classified, Training Set | % Correctly Classified, Test Set |
|---|---|---|
| Neural Net Classifier | 95.6 | 69.7 |
| R Part | 90.4 | 81.0 |
Discovery Bus Architecture
Summary
Current & Future Work in QSAR
Reverse QSAR Engineering
Forager: A PSO for Reverse QSAR
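The bullet text of this slide did not survive extraction, so nothing below describes Forager itself. For readers unfamiliar with the term in the title, the following is a generic particle swarm optimisation (PSO) sketch that minimises a toy function. The inertia and acceleration constants, the function names and the objective are all illustrative assumptions.

```python
import random


def pso_minimise(objective, dim, bounds, n_particles=30, n_iter=200,
                 inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Generic PSO: each particle is pulled towards its own best position
    and towards the best position found by the whole swarm."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]               # personal best positions
    best_val = [objective(p) for p in pos]       # personal best values
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]   # swarm best so far
    for _ in range(n_iter):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                # Move the particle and clamp it to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


def sphere(x):
    """Toy objective with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best_x, best_f = pso_minimise(sphere, dim=2, bounds=(-5.0, 5.0))
print(best_x, best_f)
```

In a reverse QSAR setting the objective would presumably score candidate structures against a model's predictions, but that mapping is not recoverable from the slide.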
Forager Optimisation
Thanks to Tudor Oprea for a copy of Wombat.
Colonist
Acknowledgements

Discovery Bus: UK QSAR meeting at GSK
