SlideShare ist ein Scribd-Unternehmen logo
1 von 12
http://pistoiaalliance.org @PistoiaAlliance
Pistoia Alliance HELM Project
- What About the Big Guys?
The emerging HELM standard for macromolecular
representation
Domain Lead – Sergio Rotstein
Business Technology, Pfizer
What is a “Biomolecule”?
2
Peptides
Therapeutic
Proteins
ADCs
Antibodies
Vaccines
ASOs
siRNAs
For our purposes, anything
that is not a small molecule is
a biomolecule
Goal
• Eliminate biomolecule
penalty
• Make these entities first-
class citizens of the
Informatics tool portfolio
G
A
P
So what’s the problem?
3
N
NH
O
O
O
N
NH
O
O
O
Small
Molecules
Sequences
Biomolecules
Small Molecule Tools Sequence-Based Tools
“Fit-for-Purpose” Structure Representation
We need to enable the
representation, manipulation and
visualization of each molecule type in
a way that is appropriate for its size
and complexity
4
Fit for Purpose: “Monomer” Level
• While you could draw out an oligonucleotide like this:
• The representation is likely more intuitive / practical:
5
Fit for Purpose: Sequence Level
• But even the monomer level representation would not scale well to
proteins with hundreds of amino acids. Larger molecules require a
more sequence-oriented representation:
6
Fit for Purpose: Component Level
• For multi-component structures such as antibody drug
conjugates, component level representations are required to enable
each component to dealt with separately.
7
“Collapsed” Antibody
Expanded Drug
Ab
Hierarchical Editing Language for Macromolecules
– Hierarchical – Amenable to the various “levels”
• Complex Polymer ⇒ Simple Polymer ⇒ Monomer ⇒ Atom
– Extensible
• Allowing addition of new biopolymer types
– (Reasonably) comprehensive
• e.g. Allowing representation of oligonucleotide
hybridization
– Canonicalizable
• Facilitating uniqueness checking
– (Somewhat) human-readable
8
HELM Example: Simple polymer
• HELM notation: A.R.G.[dF].C.K.[ahA].E.D.A
– Non-natural amino acid codes are enclosed in square
brackets
• Natural equivalent: ARGFCKXEDA
9
HELM Example: Complex Polymer
10
Monomer Database
• Each monomer used in the notation needs to be predefined in a
monomer database
• The database includes the chemical structure of the monomer and
a description of all acceptable attachment points
11
J. Chem. Inf. Model 2012, 52, 2796-2806
12

Weitere ähnliche Inhalte

Ähnlich wie HELM Notation Overview

Drug R&D Portfolio Challenges
Drug R&D Portfolio ChallengesDrug R&D Portfolio Challenges
Drug R&D Portfolio Challengesmeijia_yang
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Adam Ford
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardEMBL-ABR
 
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...Medicines Discovery Catapult
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgePaul Agapow
 
Biovays Discovery Summit Presentation
Biovays Discovery Summit PresentationBiovays Discovery Summit Presentation
Biovays Discovery Summit PresentationIguanaBio Iguana
 
Session 3 part 5
Session 3 part 5Session 3 part 5
Session 3 part 5plmiami
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxashharnomani
 
How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning Skyl.ai
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyNeil Swainston
 
Fake news detection
Fake news detection Fake news detection
Fake news detection shalushamil
 
Multi-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionMulti-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionAladdin Ayesh
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchQIAGEN
 
Lecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxLecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxSangeetaTripathi8
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Lee Larcombe
 

Ähnlich wie HELM Notation Overview (20)

Drug R&D Portfolio Challenges
Drug R&D Portfolio ChallengesDrug R&D Portfolio Challenges
Drug R&D Portfolio Challenges
 
Innovation og værdiskabelse i it-projekter
Innovation og værdiskabelse i it-projekterInnovation og værdiskabelse i it-projekter
Innovation og værdiskabelse i it-projekter
 
Session ii g2 lab modeling mmc
Session ii g2 lab modeling mmcSession ii g2 lab modeling mmc
Session ii g2 lab modeling mmc
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra Orchard
 
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledge
 
Biovays Discovery Summit Presentation
Biovays Discovery Summit PresentationBiovays Discovery Summit Presentation
Biovays Discovery Summit Presentation
 
Neolite Business Credential
Neolite Business CredentialNeolite Business Credential
Neolite Business Credential
 
Session 3 part 5
Session 3 part 5Session 3 part 5
Session 3 part 5
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
 
How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning
 
Neo4j and bioinformatics
Neo4j and bioinformaticsNeo4j and bioinformatics
Neo4j and bioinformatics
 
Switching from academia to industry - and back
Switching from academia to industry - and backSwitching from academia to industry - and back
Switching from academia to industry - and back
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
 
Fake news detection
Fake news detection Fake news detection
Fake news detection
 
Multi-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionMulti-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognition
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome Research
 
Lecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxLecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptx
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014
 

Kürzlich hochgeladen

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Kürzlich hochgeladen (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

HELM Notation Overview

  • 1. http://pistoiaalliance.org @PistoiaAlliance Pistoia Alliance HELM Project - What About the Big Guys? The emerging HELM standard for macromolecular representation Domain Lead – Sergio Rotstein Business Technology, Pfizer
  • 2. What is a “Biomolecule”? 2 Peptides Therapeutic Proteins ADCs Antibodies Vaccines ASOs siRNAs For our purposes, anything that is not a small molecule is a biomolecule Goal • Eliminate biomolecule penalty • Make these entities first- class citizens of the Informatics tool portfolio
  • 3. G A P So what’s the problem? 3 N NH O O O N NH O O O Small Molecules Sequences Biomolecules Small Molecule Tools Sequence-Based Tools
  • 4. “Fit-for-Purpose” Structure Representation We need to enable the representation, manipulation and visualization of each molecule type in a way that is appropriate for its size and complexity 4
  • 5. Fit for Purpose: “Monomer” Level • While you could draw out an oligonucleotide like this: • The representation is likely more intuitive / practical: 5
  • 6. Fit for Purpose: Sequence Level • But even the monomer level representation would not scale well to proteins with hundreds of amino acids. Larger molecules require a more sequence-oriented representation: 6
  • 7. Fit for Purpose: Component Level • For multi-component structures such as antibody drug conjugates, component level representations are required to enable each component to dealt with separately. 7 “Collapsed” Antibody Expanded Drug Ab
  • 8. Hierarchical Editing Language for Macromolecules – Hierarchical – Amenable to the various “levels” • Complex Polymer ⇒ Simple Polymer ⇒ Monomer ⇒ Atom – Extensible • Allowing addition of new biopolymer types – (Reasonably) comprehensive • e.g. Allowing representation of oligonucleotide hybridization – Canonicalizable • Facilitating uniqueness checking – (Somewhat) human-readable 8
  • 9. HELM Example: Simple polymer • HELM notation: A.R.G.[dF].C.K.[ahA].E.D.A – Non-natural amino acid codes are enclosed in square brackets • Natural equivalent: ARGFCKXEDA 9
  • 10. HELM Example: Complex Polymer 10
  • 11. Monomer Database • Each monomer used in the notation needs to be predefined in a monomer database • The database includes the chemical structure of the monomer and a description of all acceptable attachment points 11
  • 12. J. Chem. Inf. Model 2012, 52, 2796-2806 12

Hinweis der Redaktion

  1. Paper will soon be posted on the upcoming HELM web site.