SlideShare ist ein Scribd-Unternehmen logo
1 von 22
ARChem – Synthesizing Ideas
Bringing Computational Power to Organic Synthesis
RSC Presentation - SimBioSys Inc.
Orr Ravitz
Chief Operating Officer
What is the value of a good
synthetic idea?
The Motivation
Goal:
Synthesize quickly, efficiently, economically, ecologically.
Intuition
Knowledge
Experience
Literature
Data:
- 1000’s of high utility synthetic methods
- 100,000’s of building blocks
The value of a good idea:
 Faster R&D turnaround – remain ahead of the curve
 Synthetic efficiency – lower development, production costs
 Better use of human time – increased productivity
Chemical data have been used in the same manner for over 100 years!
Synthetic plan Thought
process
There is more than one way to synthesize
a compound. Little in current approaches
assists in finding the better ones.
• Non-linear
• Highly biased
• Driven by intuition and knowledge
• Gaps filled using literature searches
What is a synthetic idea?
A full synthetic route, a key step, a critical sequence of reactions
Not necessarily in your “comfort zone”
Utilizing starting materials efficiently
No obvious published precedent
Why Computer-Aided
Synthesis Design?
Cover more options, miss less opportunities
Chemist
Creativity
Intuition
Strategic perspective
Knowledge (what
works, what doesn’t)
Computer
Thoroughness
Lack of bias
Speed
Low cost
The Approach
• Comprehensive rule- and precedent-based retrosynthetic analysis back to
available starting materials.
• Automated rule generation with manual rule curation.
• Generate many alternatives.
• Provide supporting literature examples.
• Allow user guidance and control.
Rule Generation
Reactions
MOS
Reaction Rules
Reactions
Reaction Rules
Reaction Perception
Source reaction:
Extracted core
Extended core
Reaction file with atom mapping
Atoms attached to bonds changed, made or broken in the reaction
Include all structural motifs that are essential for the reaction to occur
Rule Extraction
Similar extended cores
Completed reaction rule
Common extracted core
Nucleofuge (NF) -
a leaving group which
carries away the bonding
electron pair.
Generalized rule
Generalized group (NF) is
replaced by the most
common group.
Reactions
Reaction Rules
Source reactions
Esterification examples
Other examples
··· → ···
··· → ···
··· → ···
Esterification rule
Other rule
··· → ···
Reactions
Reaction Rules
Rule Extraction
System Design
Reactions
MOS
Reaction Rules
Starting Materials
Expert Knowledge-
bases
Target
Limitations Associated with
Small Reaction Source
Methods in Organic Synthesis (MOS) – 44,000 mapped reactions.
• Partial coverage of synthetic methods
• Small clusters – higher risk of over- and under-constrained rules
• Not enough statistical power for supplementary information – yield,
regioselectivity
• Too few examples to determine functional group tolerance
• Exact matches are rare
Larger databases are available, but not as part of CDS:
• Reaxys - Elsevier
• ChemInform (CIRX) – Wiley & Sons
Solutions Ranking
Prioritization of the alternatives – show best solutions first
Transforms merit is evaluated using:
• Reduction of target complexity (simplifying transforms before FGI/FGA).
• Minimize wastage (atom efficient reactions).
• Starting material coverage.
• Prefer thoroughly explored chemistry (based on example count) .
• Penalty for interference.
• Yield.
Registering
Quota System
• Each institution is assigned a search quota
• Registration is open only when searches are available
• For registered users – new search page deactivated when
quota is filled
• Old searches remain accessible even when quota is filled
ARChem – Synthesizing Ideas
Bringing Computational Power to Organic Synthesis
SimBioSys Inc.
Thank you!
For more information:
Orr Ravitz, Ph.D.
ravitz@simbiosys.com
+1 (416) 741-4263
ARChem
Transforming data into Knowledge. Generating ideas.
Reactions Reaction
Rules
Starting
Materials
High Level
Reasoning
examples methods
Reaction mechanisms
Synthetic strategies
Search strategies
Solutions ranking
Methods
in Organic
Synthesis
US Patent 6,211,244
NPS Pharm. 2001
ARChem on the National Chemical Database Service Portal
ARChem on the National Chemical Database Service Portal

Weitere ähnliche Inhalte

Andere mochten auch

Impacto de las tic en la educación (1)
Impacto de las tic en la educación (1)Impacto de las tic en la educación (1)
Impacto de las tic en la educación (1)
yusmeily munoz
 
Final 2014 food and health survey executive summary
Final 2014 food and health survey executive summaryFinal 2014 food and health survey executive summary
Final 2014 food and health survey executive summary
Food Insight
 
How to Create an Epic Presence on Facebook for your Nonprofit
How to Create an Epic Presence on Facebook for your NonprofitHow to Create an Epic Presence on Facebook for your Nonprofit
How to Create an Epic Presence on Facebook for your Nonprofit
John Haydon
 

Andere mochten auch (18)

Raspberry Pi Hacks
Raspberry Pi HacksRaspberry Pi Hacks
Raspberry Pi Hacks
 
Responsive design - no size, fits all
Responsive design - no size, fits allResponsive design - no size, fits all
Responsive design - no size, fits all
 
Titulación universitaria versus empleo.
Titulación universitaria versus empleo.Titulación universitaria versus empleo.
Titulación universitaria versus empleo.
 
Vital Trends in Digital and Social in 2015 and Beyond by Dion Hinchcliffe
Vital Trends in Digital and Social in 2015 and Beyond by Dion HinchcliffeVital Trends in Digital and Social in 2015 and Beyond by Dion Hinchcliffe
Vital Trends in Digital and Social in 2015 and Beyond by Dion Hinchcliffe
 
Impacto de las tic en la educación (1)
Impacto de las tic en la educación (1)Impacto de las tic en la educación (1)
Impacto de las tic en la educación (1)
 
Calculating ledge profile 1997
Calculating ledge profile 1997Calculating ledge profile 1997
Calculating ledge profile 1997
 
Final 2014 food and health survey executive summary
Final 2014 food and health survey executive summaryFinal 2014 food and health survey executive summary
Final 2014 food and health survey executive summary
 
Periscope Session at #MayoClinicETF
Periscope Session at #MayoClinicETFPeriscope Session at #MayoClinicETF
Periscope Session at #MayoClinicETF
 
RESUME
RESUMERESUME
RESUME
 
Blogging and smart content management
Blogging and smart content managementBlogging and smart content management
Blogging and smart content management
 
EXPEDIA-ORBITZ MERGER
EXPEDIA-ORBITZ MERGEREXPEDIA-ORBITZ MERGER
EXPEDIA-ORBITZ MERGER
 
How to Create an Epic Presence on Facebook for your Nonprofit
How to Create an Epic Presence on Facebook for your NonprofitHow to Create an Epic Presence on Facebook for your Nonprofit
How to Create an Epic Presence on Facebook for your Nonprofit
 
ЦТТМ и Фаблаб Политех (Fab Lab Polytech)
ЦТТМ и Фаблаб Политех (Fab Lab Polytech)ЦТТМ и Фаблаб Политех (Fab Lab Polytech)
ЦТТМ и Фаблаб Политех (Fab Lab Polytech)
 
Síndrome de wolfram.
Síndrome de wolfram.Síndrome de wolfram.
Síndrome de wolfram.
 
Together App - Case Study
Together App - Case StudyTogether App - Case Study
Together App - Case Study
 
Lsgjune10
Lsgjune10Lsgjune10
Lsgjune10
 
Vls
VlsVls
Vls
 
How to not disable SELinux
How to not disable SELinuxHow to not disable SELinux
How to not disable SELinux
 

Ähnlich wie ARChem on the National Chemical Database Service Portal

June brownbagpressurvey
June brownbagpressurveyJune brownbagpressurvey
June brownbagpressurvey
Micah Altman
 
Elsevier Industry Talk - WSDM 2020
Elsevier Industry Talk - WSDM 2020Elsevier Industry Talk - WSDM 2020
Elsevier Industry Talk - WSDM 2020
Daniel Kershaw
 
Searching the medical literature aug 2010
Searching the medical literature aug 2010Searching the medical literature aug 2010
Searching the medical literature aug 2010
Robin Featherstone
 
10th Compound Libraries Conference - 27 - 29 October, 2014 - Hotel Palace Be...
10th Compound Libraries Conference  - 27 - 29 October, 2014 - Hotel Palace Be...10th Compound Libraries Conference  - 27 - 29 October, 2014 - Hotel Palace Be...
10th Compound Libraries Conference - 27 - 29 October, 2014 - Hotel Palace Be...
Torben Haagh
 
10th International Conference Compound Libraries 2014
10th International Conference Compound Libraries  201410th International Conference Compound Libraries  2014
10th International Conference Compound Libraries 2014
Torben Haagh
 
2016 AEHS Statistics Sediment Forensic Presentation CHERRY
2016 AEHS Statistics Sediment Forensic Presentation CHERRY2016 AEHS Statistics Sediment Forensic Presentation CHERRY
2016 AEHS Statistics Sediment Forensic Presentation CHERRY
Eric Cherry
 
Ecol Econ And Labs
Ecol Econ And LabsEcol Econ And Labs
Ecol Econ And Labs
DIv CHAS
 
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Three methodological issues for system dynamics practice
Three methodological issues for system dynamics practiceThree methodological issues for system dynamics practice
Three methodological issues for system dynamics practice
Andreas Größler
 

Ähnlich wie ARChem on the National Chemical Database Service Portal (20)

ICIC 2014 New Product Introduction Wiley
ICIC 2014 New Product Introduction WileyICIC 2014 New Product Introduction Wiley
ICIC 2014 New Product Introduction Wiley
 
Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies
 
June brownbagpressurvey
June brownbagpressurveyJune brownbagpressurvey
June brownbagpressurvey
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
 
Elsevier Industry Talk - WSDM 2020
Elsevier Industry Talk - WSDM 2020Elsevier Industry Talk - WSDM 2020
Elsevier Industry Talk - WSDM 2020
 
A new, automated retrosynthetic search engine: ARChem
A new, automated retrosynthetic search engine: ARChemA new, automated retrosynthetic search engine: ARChem
A new, automated retrosynthetic search engine: ARChem
 
Searching the medical literature aug 2010
Searching the medical literature aug 2010Searching the medical literature aug 2010
Searching the medical literature aug 2010
 
10th Compound Libraries Conference - 27 - 29 October, 2014 - Hotel Palace Be...
10th Compound Libraries Conference  - 27 - 29 October, 2014 - Hotel Palace Be...10th Compound Libraries Conference  - 27 - 29 October, 2014 - Hotel Palace Be...
10th Compound Libraries Conference - 27 - 29 October, 2014 - Hotel Palace Be...
 
10th International Conference Compound Libraries 2014
10th International Conference Compound Libraries  201410th International Conference Compound Libraries  2014
10th International Conference Compound Libraries 2014
 
How to conduct a systematic review
How to conduct a systematic reviewHow to conduct a systematic review
How to conduct a systematic review
 
2016 AEHS Statistics Sediment Forensic Presentation CHERRY
2016 AEHS Statistics Sediment Forensic Presentation CHERRY2016 AEHS Statistics Sediment Forensic Presentation CHERRY
2016 AEHS Statistics Sediment Forensic Presentation CHERRY
 
In Silico Approaches for Predicting Hazards from Chemical Structure and Exist...
In Silico Approaches for Predicting Hazards from Chemical Structure and Exist...In Silico Approaches for Predicting Hazards from Chemical Structure and Exist...
In Silico Approaches for Predicting Hazards from Chemical Structure and Exist...
 
HCF 2019 Panel 2: Jack de Bruijn
HCF 2019 Panel 2: Jack de BruijnHCF 2019 Panel 2: Jack de Bruijn
HCF 2019 Panel 2: Jack de Bruijn
 
Molecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryMolecular modelling for in silico drug discovery
Molecular modelling for in silico drug discovery
 
Ecol Econ And Labs
Ecol Econ And LabsEcol Econ And Labs
Ecol Econ And Labs
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
 
Three methodological issues for system dynamics practice
Three methodological issues for system dynamics practiceThree methodological issues for system dynamics practice
Three methodological issues for system dynamics practice
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
1 the science of patient safety
1 the science of patient safety1 the science of patient safety
1 the science of patient safety
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

ARChem on the National Chemical Database Service Portal

  • 1. ARChem – Synthesizing Ideas Bringing Computational Power to Organic Synthesis RSC Presentation - SimBioSys Inc. Orr Ravitz Chief Operating Officer
  • 2. What is the value of a good synthetic idea?
  • 3. The Motivation Goal: Synthesize quickly, efficiently, economically, ecologically. Intuition Knowledge Experience Literature Data: - 1000’s of high utility synthetic methods - 100,000’s of building blocks The value of a good idea:  Faster R&D turnaround – remain ahead of the curve  Synthetic efficiency – lower development, production costs  Better use of human time – increased productivity
  • 4. Chemical data have been used in the same manner for over 100 years! Synthetic plan Thought process There is more than one way to synthesize a compound. Little in current approaches assists in finding the better ones. • Non-linear • Highly biased • Driven by intuition and knowledge • Gaps filled using literature searches
  • 5. What is a synthetic idea? A full synthetic route, a key step, a critical sequence of reactions Not necessarily in your “comfort zone” Utilizing starting materials efficiently No obvious published precedent
  • 6. Why Computer-Aided Synthesis Design? Cover more options, miss less opportunities Chemist Creativity Intuition Strategic perspective Knowledge (what works, what doesn’t) Computer Thoroughness Lack of bias Speed Low cost
  • 7. The Approach • Comprehensive rule- and precedent-based retrosynthetic analysis back to available starting materials. • Automated rule generation with manual rule curation. • Generate many alternatives. • Provide supporting literature examples. • Allow user guidance and control.
  • 9. Reactions Reaction Rules Reaction Perception Source reaction: Extracted core Extended core Reaction file with atom mapping Atoms attached to bonds changed, made or broken in the reaction Include all structural motifs that are essential for the reaction to occur
  • 10. Rule Extraction Similar extended cores Completed reaction rule Common extracted core Nucleofuge (NF) - a leaving group which carries away the bonding electron pair. Generalized rule Generalized group (NF) is replaced by the most common group. Reactions Reaction Rules
  • 11. Source reactions Esterification examples Other examples ··· → ··· ··· → ··· ··· → ··· Esterification rule Other rule ··· → ··· Reactions Reaction Rules Rule Extraction
  • 12. System Design Reactions MOS Reaction Rules Starting Materials Expert Knowledge- bases Target
  • 13. Limitations Associated with Small Reaction Source Methods in Organic Synthesis (MOS) – 44,000 mapped reactions. • Partial coverage of synthetic methods • Small clusters – higher risk of over- and under-constrained rules • Not enough statistical power for supplementary information – yield, regioselectivity • Too few examples to determine functional group tolerance • Exact matches are rare Larger databases are available, but not as part of CDS: • Reaxys - Elsevier • ChemInform (CIRX) – Wiley & Sons
  • 14. Solutions Ranking Prioritization of the alternatives – show best solutions first Transforms merit is evaluated using: • Reduction of target complexity (simplifying transforms before FGI/FGA). • Minimize wastage (atom efficient reactions). • Starting material coverage. • Prefer thoroughly explored chemistry (based on example count) . • Penalty for interference. • Yield.
  • 16. Quota System • Each institution is assigned a search quota • Registration is open only when searches are available • For registered users – new search page deactivated when quota is filled • Old searches remain accessible even when quota is filled
  • 17. ARChem – Synthesizing Ideas Bringing Computational Power to Organic Synthesis SimBioSys Inc. Thank you! For more information: Orr Ravitz, Ph.D. ravitz@simbiosys.com +1 (416) 741-4263
  • 18. ARChem Transforming data into Knowledge. Generating ideas. Reactions Reaction Rules Starting Materials High Level Reasoning examples methods Reaction mechanisms Synthetic strategies Search strategies Solutions ranking Methods in Organic Synthesis
  • 19.
  • 20. US Patent 6,211,244 NPS Pharm. 2001