SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
AI Drug Discovery in Patent
Space
Hanjo Kim
Principal Scientist at Standigm Inc.
hanjo.kim@standigm.com
business@standigm.com
apply@standigm.com
www.standigm.com
Disclaimer
• Statements of fact and opinions expressed in this presentation
and on the following slides are solely those of the presenter and
not necessarily those of Standigm Inc.
Standigm Inc.
2015
Founded by three researchers at Samsung Advanced Institute of Technology
Jinhan Kim, PhD Artificial Intelligence (The University of Edinburgh)
Sang Ok Song, PhD Chemical Engineering (Seoul National University)
So Jeong Yun, PhD Systems Biology (POSTECH)
$23M
Funding raised
SK Holdings, Mirae Asset Capital, Mirae Asset Venture Investment, DSC
Investment, Wonik Investment, Atinum Investment, LB Investment, Kakao
Ventures
Seoul Korea (33)
Ann Arbor
Michigan (2)
Standigm= drug discovery company that generates and optimizes therapeutic
lead compounds by using advanced artificial intelligence toward license-out
Cambridge
UK (1)
AI, 16
Biology, 6
Chemistry, 8
Systems Biology,
4
Advisor, 3
PhD
20/37*
* Except Operation 5, Patent attorney 1
The AI solution
Disease Hit Lead Preclinical Clinical Drug
Drug
repositioning
The Standigm AI solution is industrializing drug discovery
Discovery at Scale
Target
* developing
BEST
TM
ASK
TM
Insight
TM
FIRST
*
Standigm ASKTM is freely available at
https://icluenask.standigm.com
Standigm BEST Platform
Standigm BESTStandigm
ASK
Knowledge
based biology
platform
for
novel targets,
pathways, and
MoA discovery
Standigm
FIRST
Hit generation
platform
for
novel and/or
undruggable
targets
Generative Models
Graph-based VAE
Scaffold-based
conditional enumerator
Novel Molecular
Representation
Scoring Functions
Simulations
AI rescoring models
Machine learning models
Compound Database
Known Molecules
Seed Molecules
Novel Virtual Structures
Commercial Library Privileged Standigm Library
Target Database Public data (gene, protein, function) BEST Feasibility
Public Library
Strategy setup Hit Generation Hit-2-Lead
Predictive Models
ADME/Tox predictors
Novelty (patentability)
Synthetic accessibility
Filters/Ranking models
External
CROs
Organic
synthesis,
In vitro/in vivo
Assays
Novel/Commercial Hits Lead Series
Graph-based VAE
Chemical
space
Encoder Decoder
Latent
space
Chemical
space
E DZ
Learning chemical space
Training DB
~4M
Y
Property/Target information
Contextualizing:
- substructures
- topology
- shape
- etc
property 1
property 2
property 3
Z : latent space
predictor
q(y|z)
seed molecules
decoder
p(x|z)
X : original chemical space
encoder
q(z|x)
Analogue structure generation
functionally similar
but novel scaffolds/molecules
Lead optimization
novel molecules
w/ better desired properties
decoder
p(x|z)
Smart library expansion
IP generation & expansion
Patent Space
Target A Compounds in latent space
Competitor 1
Competitor 2
Competitor 3
Interesting Area
potentweak
Chemical Space Navigation
• Chemical Space ~ Map
• Known scaffolds ~ POIs
• Information-rich space (ChEMBL, PubChem Bioassays, etc.)
• Novel scaffold ~ New POI
• El Dorado
• Patent
• Markush structure: How to protect as wide as possible area
• Exemplified compounds: boundary stones
Using ChemCurator
• Project types
• Google Patents (most cases)
• PDF files (do not use pdf files!)
• Text files (when google ocr is not good)
Using ChemCurator
Google patents
Using ChemCurator
Text files
OCR (and chemical OCR)
• Lessons
• Google patents is reliable in most cases
• It even provides the compound table though very primitive
• Professional OCR software can give better results
• Convert pdf file to plain text with chemical names
• Complex tables
• Image (not OCRed) tables (next 3 slides)
• Chemical OCR engine helps a lot
• Text-image comparison
• Chemical OCR engines
• CLiDE (recommended, proprietary)
• Osra (open-source, recommended on Linux machine)
• Imago (I have no experience)
• Unsupported engines (like ChemGrapher,
https://pubs.acs.org/doi/10.1021/acs.jcim.0c00459)
Chemical structures in patens
Chemical structures in patents
Chemical structures in patents
Better OCR result
Markush Structures
• Very expressive
• Same set of compounds can be written to very different forms
• Not well-validated
• ChemCurator helps
• Extracting example compounds
• Matching them to the Markush structure
• Require manual correction
• Sentence to chemical groups
• Ambiguous/incomplete R-group definitions
AI can help
• Reduction of frequent text OCR error
• NLP technique can correct frequent OCR errors
• The availability of large training set is important
• Extraction of relevant data
• Biological activities
• Analytical data
• Chemical OCR can be improved
• AI can do image recognition very well
• Different drawing styles can be managed
Acknowledgement
• Standigm Inc.
• Sanghyung JIN, Minkyu HA, Soyeon Kim, Sangok SONG
• T&J Tech. (Korean distributor)
• Jung-A HAN

Weitere ähnliche Inhalte

Ähnlich wie Patent Data for Artificial Intelligence based Drug Discovery

20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horseChris Southan
 
artifical intelligence (ai), robotics and cf in pharmaceutical dynamics
artifical intelligence (ai), robotics and cf in pharmaceutical dynamicsartifical intelligence (ai), robotics and cf in pharmaceutical dynamics
artifical intelligence (ai), robotics and cf in pharmaceutical dynamicssurya singh
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
IT Cluster Skolkovo Presentation at FRUCT.org conference
IT Cluster Skolkovo Presentation at FRUCT.org conferenceIT Cluster Skolkovo Presentation at FRUCT.org conference
IT Cluster Skolkovo Presentation at FRUCT.org conferenceAlbert Yefimov
 
A Peek Into a Must-Have Add-On Solution for Oracle Clinical
A Peek Into a Must-Have Add-On Solution for Oracle ClinicalA Peek Into a Must-Have Add-On Solution for Oracle Clinical
A Peek Into a Must-Have Add-On Solution for Oracle ClinicalPerficient
 
Resilience Engineering: A field of study, a community, and some perspective s...
Resilience Engineering: A field of study, a community, and some perspective s...Resilience Engineering: A field of study, a community, and some perspective s...
Resilience Engineering: A field of study, a community, and some perspective s...John Allspaw
 
Predicting medical tests results using Driverless AI
Predicting medical tests results using Driverless AIPredicting medical tests results using Driverless AI
Predicting medical tests results using Driverless AIAlexander Gedranovich
 
Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteICSM 2011
 
Liquilume NSF Final Presentation
Liquilume NSF Final PresentationLiquilume NSF Final Presentation
Liquilume NSF Final PresentationStanford University
 
Linking chemistry: wider lessons for how we publish research
Linking chemistry: wider lessons for how we publish researchLinking chemistry: wider lessons for how we publish research
Linking chemistry: wider lessons for how we publish researchRoyal Society of Chemistry
 
Short TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesShort TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesRichard Platt
 
Why and How to do a Software Startup
Why and How to do a Software StartupWhy and How to do a Software Startup
Why and How to do a Software Startupkyptin
 
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities  ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities Dr. Haxel Consult
 
Indian Jugaad Technology (Frugal Engineering)
Indian Jugaad Technology (Frugal Engineering)Indian Jugaad Technology (Frugal Engineering)
Indian Jugaad Technology (Frugal Engineering)Javed Rizvi
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataBarry Smith
 

Ähnlich wie Patent Data for Artificial Intelligence based Drug Discovery (20)

20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse
 
artifical intelligence (ai), robotics and cf in pharmaceutical dynamics
artifical intelligence (ai), robotics and cf in pharmaceutical dynamicsartifical intelligence (ai), robotics and cf in pharmaceutical dynamics
artifical intelligence (ai), robotics and cf in pharmaceutical dynamics
 
Nesher Tech I-Corps@NIH 121014
Nesher Tech I-Corps@NIH 121014Nesher Tech I-Corps@NIH 121014
Nesher Tech I-Corps@NIH 121014
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
IT Cluster Skolkovo Presentation at FRUCT.org conference
IT Cluster Skolkovo Presentation at FRUCT.org conferenceIT Cluster Skolkovo Presentation at FRUCT.org conference
IT Cluster Skolkovo Presentation at FRUCT.org conference
 
Osp 1st sep2015 OSDD
Osp 1st sep2015 OSDDOsp 1st sep2015 OSDD
Osp 1st sep2015 OSDD
 
A Peek Into a Must-Have Add-On Solution for Oracle Clinical
A Peek Into a Must-Have Add-On Solution for Oracle ClinicalA Peek Into a Must-Have Add-On Solution for Oracle Clinical
A Peek Into a Must-Have Add-On Solution for Oracle Clinical
 
Resilience Engineering: A field of study, a community, and some perspective s...
Resilience Engineering: A field of study, a community, and some perspective s...Resilience Engineering: A field of study, a community, and some perspective s...
Resilience Engineering: A field of study, a community, and some perspective s...
 
Overview of SureChEMBL
Overview of SureChEMBLOverview of SureChEMBL
Overview of SureChEMBL
 
Predicting medical tests results using Driverless AI
Predicting medical tests results using Driverless AIPredicting medical tests results using Driverless AI
Predicting medical tests results using Driverless AI
 
IntroVision investment
IntroVision investmentIntroVision investment
IntroVision investment
 
Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 Keynote
 
Liquilume NSF Final Presentation
Liquilume NSF Final PresentationLiquilume NSF Final Presentation
Liquilume NSF Final Presentation
 
Linking chemistry: wider lessons for how we publish research
Linking chemistry: wider lessons for how we publish researchLinking chemistry: wider lessons for how we publish research
Linking chemistry: wider lessons for how we publish research
 
Short TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesShort TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the Philippines
 
Why and How to do a Software Startup
Why and How to do a Software StartupWhy and How to do a Software Startup
Why and How to do a Software Startup
 
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities  ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities
 
Indian Jugaad Technology (Frugal Engineering)
Indian Jugaad Technology (Frugal Engineering)Indian Jugaad Technology (Frugal Engineering)
Indian Jugaad Technology (Frugal Engineering)
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort Data
 
SLA Summer 2008
SLA Summer 2008SLA Summer 2008
SLA Summer 2008
 

Mehr von ChemAxon

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?ChemAxon
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemAxon
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive modelsChemAxon
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...ChemAxon
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data managementChemAxon
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseChemAxon
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation ChemAxon
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...ChemAxon
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...ChemAxon
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...ChemAxon
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloudChemAxon
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationChemAxon
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction ChemAxon
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...ChemAxon
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology ChemAxon
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem MicroservicesChemAxon
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choralChemAxon
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5ChemAxon
 
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5ChemAxon
 

Mehr von ChemAxon (20)

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
 

Kürzlich hochgeladen

WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...masabamasaba
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
tonesoftg
tonesoftgtonesoftg
tonesoftglanshi9
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp Krisztián
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 

Kürzlich hochgeladen (20)

WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 

Patent Data for Artificial Intelligence based Drug Discovery

  • 1. AI Drug Discovery in Patent Space Hanjo Kim Principal Scientist at Standigm Inc. hanjo.kim@standigm.com business@standigm.com apply@standigm.com www.standigm.com
  • 2. Disclaimer • Statements of fact and opinions expressed in this presentation and on the following slides are solely those of the presenter and not necessarily those of Standigm Inc.
  • 3. Standigm Inc. 2015 Founded by three researchers at Samsung Advanced Institute of Technology Jinhan Kim, PhD Artificial Intelligence (The University of Edinburgh) Sang Ok Song, PhD Chemical Engineering (Seoul National University) So Jeong Yun, PhD Systems Biology (POSTECH) $23M Funding raised SK Holdings, Mirae Asset Capital, Mirae Asset Venture Investment, DSC Investment, Wonik Investment, Atinum Investment, LB Investment, Kakao Ventures Seoul Korea (33) Ann Arbor Michigan (2) Standigm= drug discovery company that generates and optimizes therapeutic lead compounds by using advanced artificial intelligence toward license-out Cambridge UK (1) AI, 16 Biology, 6 Chemistry, 8 Systems Biology, 4 Advisor, 3 PhD 20/37* * Except Operation 5, Patent attorney 1
  • 4. The AI solution Disease Hit Lead Preclinical Clinical Drug Drug repositioning The Standigm AI solution is industrializing drug discovery Discovery at Scale Target * developing BEST TM ASK TM Insight TM FIRST * Standigm ASKTM is freely available at https://icluenask.standigm.com
  • 5. Standigm BEST Platform Standigm BESTStandigm ASK Knowledge based biology platform for novel targets, pathways, and MoA discovery Standigm FIRST Hit generation platform for novel and/or undruggable targets Generative Models Graph-based VAE Scaffold-based conditional enumerator Novel Molecular Representation Scoring Functions Simulations AI rescoring models Machine learning models Compound Database Known Molecules Seed Molecules Novel Virtual Structures Commercial Library Privileged Standigm Library Target Database Public data (gene, protein, function) BEST Feasibility Public Library Strategy setup Hit Generation Hit-2-Lead Predictive Models ADME/Tox predictors Novelty (patentability) Synthetic accessibility Filters/Ranking models External CROs Organic synthesis, In vitro/in vivo Assays Novel/Commercial Hits Lead Series
  • 6. Graph-based VAE Chemical space Encoder Decoder Latent space Chemical space E DZ Learning chemical space Training DB ~4M Y Property/Target information Contextualizing: - substructures - topology - shape - etc property 1 property 2 property 3 Z : latent space predictor q(y|z) seed molecules decoder p(x|z) X : original chemical space encoder q(z|x) Analogue structure generation functionally similar but novel scaffolds/molecules Lead optimization novel molecules w/ better desired properties decoder p(x|z) Smart library expansion IP generation & expansion
  • 7. Patent Space Target A Compounds in latent space Competitor 1 Competitor 2 Competitor 3 Interesting Area potentweak
  • 8. Chemical Space Navigation • Chemical Space ~ Map • Known scaffolds ~ POIs • Information-rich space (ChEMBL, PubChem Bioassays, etc.) • Novel scaffold ~ New POI • El Dorado • Patent • Markush structure: How to protect as wide as possible area • Exemplified compounds: boundary stones
  • 9. Using ChemCurator • Project types • Google Patents (most cases) • PDF files (do not use pdf files!) • Text files (when google ocr is not good)
  • 12. OCR (and chemical OCR) • Lessons • Google patents is reliable in most cases • It even provides the compound table though very primitive • Professional OCR software can give better results • Convert pdf file to plain text with chemical names • Complex tables • Image (not OCRed) tables (next 3 slides) • Chemical OCR engine helps a lot • Text-image comparison • Chemical OCR engines • CLiDE (recommended, proprietary) • Osra (open-source, recommended on Linux machine) • Imago (I have no experience) • Unsupported engines (like ChemGrapher, https://pubs.acs.org/doi/10.1021/acs.jcim.0c00459)
  • 17. Markush Structures • Very expressive • Same set of compounds can be written to very different forms • Not well-validated • ChemCurator helps • Extracting example compounds • Matching them to the Markush structure • Require manual correction • Sentence to chemical groups • Ambiguous/incomplete R-group definitions
  • 18. AI can help • Reduction of frequent text OCR error • NLP technique can correct frequent OCR errors • The availability of large training set is important • Extraction of relevant data • Biological activities • Analytical data • Chemical OCR can be improved • AI can do image recognition very well • Different drawing styles can be managed
  • 19. Acknowledgement • Standigm Inc. • Sanghyung JIN, Minkyu HA, Soyeon Kim, Sangok SONG • T&J Tech. (Korean distributor) • Jung-A HAN