SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
representation and display of
non-standard peptides using
semi-systematic amino acid
monomer naming
Roger Sayle
Nextmove software, cambridge, uk
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The primordial soup
• The vast majority of chemical entities stored in
chemical databases and pharmaceutical registration
systems are classical “small molecules”.
• For these molecules, the universal cheminformatics
“all-heavy-atom” representations work well,
providing duplicate checking, substructure search,
normalization, depiction, etc.
• Solutions include InChI, SMILES and V2000 mol file.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Biopolymers of life
• The low Shannon entropy of a small,
but scientifically significant, fraction
of compounds, biopolymers, allow
them to represented more succinctly.
• For these frequent subunits (or
monomers) can be used instead of
atoms in connection tables.
• Representations with small monomer
sets are easier for scientists to
comprehend.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Peptide representation
The all-atom depiction (left) is more complicated to
interpret than the 3-letter code and 1-letter code
forms (right).
Recognizing this is [Asp5]oxytocin is perhaps only
possible at the “sequence” level.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Protein representation
All atom representation vs. 3-letter and 1-letter sequence forms of crambin.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
human insulin representations
CC[C@H](C)[C@H]1C(=O)N[C@H]2CSSC[C@@H](C(=O)N[C@@H](CSSC[C@@H](C(=O)NCC(=O)N[C@H
](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N
[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N
C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC2
=O)CO)CC(C)C)CC3=CC=C(C=C3)O)CCC(=O)N)CC(C)C)CCC(=O)O)CC(=O)N)CC4=CC=C(C=C4)O)C(
=O)N[C@@H](CC(=O)N)C(=O)O)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C
(=O)NCC(=O)N[C@@H](CC5=CC=CC=C5)C(=O)N[C@@H](CC6=CC=CC=C6)C(=O)N[C@@H](CC7=CC=C(
C=C7)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N8CCC[C@H]8C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([
C@@H](C)O)C(=O)O)C(C)C)CC(C)C)CC9=CC=C(C=C9)O)CC(C)C)C)CCC(=O)O)C(C)C)CC(C)C)CC2
=CN=CN2)CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)NC(=O)
[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@H](C(=O)N[C@
H](C(=O)N1)CO)[C@@H](C)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C
(C)C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)CN Human Insulin
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Sequence bioinformatics
• Traditional bioinformatics represents sequences as
one letter codes, often in FASTA or Uniprot format.
• Four characters for nucleic acids:
– A,C,G and T for DNA and A,C,G and U for RNA.
• 20 (or 22) characters for proteinogenic amino acids:
– A,R,N,D,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y and V (plus O and U).
• The challenge arises when more than the primary
sequence needs to be represented.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
3-letter codes to the rescue?
• The use of 3-letter codes to represent monomers
allows for significantly more possible monomers.
• Conveniently common AAs have standard codes:
– Ala, Arg, Asn, Asp, Cys, Glu, Gln, Gly, His, Ile, Leu, Lys, Met,
Phe, Pro, Ser, Thr, Trp, Tyr and Val, plus Sec and Pyl.
• These assignments are mandated by IUPAC and IUB
recommendations known as 3AA (dated 1983).
• Unfortunately the prevalence of standard amino
acids and 3-letter codes has had adverse affects.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Pdb 3-letter residue codes
• wwPDB’s components.cif currently lists 17,764
3-letter residue names (digits and capitals).
• These contain not only amino acids, but
nucleic acids, carbohydrates, ligands, salts
solvents and co-factors.
• Completely different codes are assigned to the
L- and D- forms of amino acids.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Monomer database solutions
• Biopolymer systems based on monomer databases
and strict 3-letter codes breakdown when faced with
the huge number of potential chemical and post-
translational modifications.
• At best, a human being can remember 50-100 codes
before having to consult “dictionary” to lookup
solutions.
• Worse, once beyond the commonly encountered
amino acids, biochemists disagree on the meaning of
3-letter codes…
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Example ambiguities
• Cpa
– Cyano-propionic amino acid
– Beta-cyclopropylalanine
– 4-amino-1-carbamoyl-piperidine-4-carboxylic acid
• Hpg
– 4-hydroxyphenylglycine
– Homopropargylglycine
• Nty
– Nortyrosine [Chemical Abstracts Service]
– N-nitrotyrosine [Pisotoia HELM]
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
48 hexopyranoses
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
264 deoxyhexopyranoses
9540 substituted hexopyranoses
This only considers the four most common substituents:
amino, acetyl, acetylamino and methoxy
Bitter Sweet Conclusion
• In practice, a glycoinformatics registration system has
to handle over 232 monosaccharide monomers.
• This number is clearly too large to assign each its
own 3-letter code or to use a monomer database.
• For example, monosaccharidedb.org curates a
database of 771 monosaccharides, whereas 936 of
the simple monosaccharides on the previous page
can be found in PubChem.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The aa naming solution
• The solution to this problem is the systematic
naming of amino acid derivatives.
• This approach is well known to peptide chemists,
with these names routinely used in scientific articles,
patent applications and supplier catalogues, and
easily recognizable by chemists.
• Alas this “de facto” nomenclature, which hasn’t been
formalized by IUPAC nor CAS, has been overlooked
by the informatics community.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Systematic amino acid code examples
• Examples of systematic codes for representing
common non-standard amino acids
– Thr(tBu), Ser(tBu), Phe(4-Cl), Phe(3-Cl), Phe(4-Ph),
Tyr(PO3H2), Cys(Acm), Lys(Boc), Tyr(Bn(2,6-diCl), Tyr(3-F),
Lys(Cbz), Glu(OtBu), Tyr(SO3H), Phe(4-CF3), Tyr(tBu),
Arg(NO2), Trp(For), Lys(palmitoyl), aThr(tBu), Phe(4-F),
Thr(Bn), Ser(Bn), Asp(OtBu), Tyr(decyl), Glu(OBn), Gly(tBu),
Phe(4-hexyl), Thr(SO3H), Tyr(Me), Asp(OBn), Tyr(Bn),
Phe(4-NO2), Lys(Ac), Nle(Me), Cys(tBu), Ala(cyclopentyl),
Cys(StBu), Trp(2-Bn), Ser(PO3H2), Phe(4-I), Arg(Tos), etc.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
monosaccharide examples
• In fact constructing systematic names is the same
solution as used by the glycoinformatics community
– b-D-GlcpNAc, a-D-Neup5Ac, b-D-GlcpA, D-GlcNAc-ol, a-D-
GlcpN, a-L-4-en-4-deoxy-thrHexpA, b-D-Galp3Me, b-D-6-
deoxy-Glcp, b-D-2,6-deoxy-ribHexp, b-D-GlcpA6Me, a-D-
Rhap4N-formyl, b-D-Glcp6Ac, a-D-Neup5Ac9Ac…
• And in RNA informatics
– P-rGuo, P-Gua-Rib2Me, P-m22Gua-Rib, P-m2Gua-Rib, P-
Cyt-Rib2Me, P-m5Cyt-Rib, P-m5Ura-Rib, P-m1Ade-Rib…
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Rule #1: use of D- and DL- prefixes
• Rather than have separate code forms for D- form
(and DL-) amino acids, the widely understood and
accepted stereo configuration prefixes “D-”, “L-”
(default) and “DL-” should be used.
– D-Arg
– DL-Ala
– D-Ser
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Rule #2: retain widely used 3LC’s.
• Universally accepted code/definitions should be used
– Abu (2-aminobutyric acid), Aib (2-aminoisobutyric acid),
aIle (allo-isoleucine), aThe (allo-threonine), bAla (beta-
alanine), Cha (beta-cyclohexylalanine), Chg (alpha-
cyclohexylglycine), Cit (Citrulline), Dab (2,3-diaminobutyric
acid), Dap (2,3-diaminopropionic acid), hArg
(homoarginine), Hcy (homocysteine), Hse (homoserine),
hPhe (homophenylalanine), Nle (norleucine), Nva
(norvaline), Orn (ornithine), Pen (penicillamine), Phg
(phenylglycine), Sar (sarcosine), Sec (selenocysteine), 2Thi
(thien-2-ylalanine), 3Thi (thien-3-ylalanine), xiIle (xi-
isoleucine), xiThr (xi-threonine), and many more.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Rule #3: modifying prefixes
• “N(Me)” and friends are used to specify N-modified
variants. “N(Me)Gly” is equivalent to “Sar”.
• The “O” prefix is used to specify depsi-peptides that
connect via ester linkages, e.g. “Arg-OAla-Ser”.
• The “aMe” prefix is used to specify alpha-methyl
variants of amino acids. “aMeAla” is same as “Aib”.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Rule #4: line formulae substs
• Substitutions are indicated by line formulae
– Ph phenyl CF3 trifluoromethyl
– Me methyl Tos tosyl
– Ac acetyl Trt trityl
– Cl chloro Tf triflyl
– CN cyano SO3H sulfonic acid
– Bn benzyl OPfp perfluorophenoxy
– NH2 amino iPr isopropyl
– NO2 nitro For formyl
– tBu tert-butyl … and many more
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
RULE #5: DEFAULT SUBSTITUTION
• Many amino acids have default substitution locants
– Abu ABA.CG
– Ala ALA.CB
– Arg ARG.NH2
– Cys CYS.SG
– Dab DAB.ND
– Dap DPP.NG
– Gly GLY.CA
– Lys LYS.NZ
– Ser SER.OG
– Thr THR.OG1
– Tyr TYR.OH
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
RULE #6: specified SUBSTITUTION
• Amino acids may be substituted at particular locant
using the following syntax:
– Phe(4-Cl)
– Phg(4-CH2NH2)
– Tyr(2,6-diCl-Bn)
• Substitutions may occur at more than one locant:
– His(2,5-diI)
– Tyr(2,6-diF)
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Rule #7: implicit leaving groups
• For most amino acids, parenthesized substitution
implies replaced of hydrogen.
– Ala(Cl)
• Carboxylic acids assume an OH leaving group,
requiring an explicit “oxy” to form esters.
– Asp(NH2)
– Asp(OMe)
• Cysteine (and penicillamine) substitute on the sulfur
– Cys(tBu)
– Cys(StBu)
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The need for standardization #1
• For peptide registration, we need a canonical name.
• Preferred forms are most visible in line notations.
– For carboxy, do we prefer “COOH” or “CO2H”.
– For acetyl, “Ac” or “COCH3”.
– For ethylamine, “EtNH2” or “CH2CH2NH2”.
– For phosphate, “PO3H2” vs. “P”.
• However decomposition also affects preferred forms
– Gly(Me) is the same as Ala
– Asp(NH2) is the same as Asn
– Phe(4-Ph) is the same as 4-Bip
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The need for standardization #2
• More seriously we need to avoid ambiguous names.
• It is widely acknowledge by peptide chemists that
“Ac” means acetyl, rather than Actinium, and
connects to the parent at the carbonyl carbon.
• In supplier catalogues for a German vendor, the
name “Cys(AcOH)” implies “*CC(=O)O” instead of
“*C(=O)CO”.
• Which names should be used, “COCH2OH” etc.?
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Example Peptide names (chembl)
• The following names are machine generated
• H-Cys-Pro-Trp-His-Leu-Leu-Pro-Phe-Cys-OH CHEMBL501567
• H-Tyr-Pro-Phe-Phe-OtBu CHEMBL500195
• cyclo[Ala-Tyr-Val-Orn-Leu-D-Phe-Pro-Phe-D-Phe-Asn] CHEMBL438006
• H-Nle(Et)-Tyr-Pro-Trp-Phe-NH2 CHEMBL500704
• H-DL-hPhe-Val-Met-Tyr(PO3H2)-Asn-Leu-Gly-Glu-OH CHEMBL439086
• cyclo[Phe-D-Trp-Tyr(Me)-D-Pro] CHEMBL507127
• H-D-Pyr-D-Leu-pyrrolidide CHEMBL1181307
• Ac-DL-Phe-aThr-Leu-Asp-Ala-Asp-DL-Phe(4-Cl)-OH CHEMBL1791047
• H-D-Cys(1)-D-Asp-Gly-Tyr(3-NO2)-Gly-Hyp-Asp-D-Cys(1)-NH2 CHEMBL583516
• Boc-Tyr-Tyr(3-Br)-OMe CHEMBL1976073
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
Example Peptide names (pubchem)
• The following names are existing supplier synonyms
• Boc-Asp(OtBu)-Pro-OH CID57383532
• Tyr-D-Ala-Gly-NMePhe CID45483974
• Tyr-D-Pro-Gly-Trp-NMeNle-Asp-Phe-NH2 CID11693641
• Ac-Cys-Ile-Tyr-Lys-Phe(4-Cl)-Tyr CID44412001
• Z-D-Val-Lys(Z)-OH CID5978?
• deamino-Cys-D-Tyr(Et)-Ile-Thr-Asn-Cys-Pro-Orn-Gly-NH2 CID68613
• H-Arg(NO2)-OMe.HCl CID135193
• Unlike HELM or PLN, these IUPAC condensed line
notations (3AA) are already used in publications.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The big win: display of peptides
• One of the largest end-user benefits of systematic
naming is in the improved display of compounds:
Structure of Degarelix from wikipedia: http://en.wikipedia.org/wiki/Degarelix
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
The big win: Display of peptides
• The image of Degarelix becomes optimal (IMHO)
Image credit: WO2011066386A1
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
A real example: CID10033747
CC(C)[C@H]1C(=O)N[C@H](C(=O)N[C@H](
C(=O)N2CCC[C@H]2C(=O)N[C@H](C(=O)N
[C@H](C(=O)N1)CC3=CC=C(C=C3)OP(=O)(
O)O)CSSC[C@@H](C(=O)O)NC(=O)C)C(C)
C)CC(=O)N
IUPAC depiction
cyclo[Asn-Val-Pro-Cys(1)-Tyr(PO3H2)-Val].
Ac-Cys(1)-OH.
IUPAC Condensed
SMILES
PEPTIDE1{[ac].C}|PEPTIDE2{N.V.P.C}|
CHEM1{*N[C@@H](Cc1ccc(cc1)OP(=O)
(O)O)C(=O)*|$_R1;;;;;;;;;;;;;;;;;_R2$|}|
PEPTIDE3{V}$
PEPTIDE1,PEPTIDE2,2:R34:R3|
PEPTIDE2,CHEM1,4:R2-1:R1|
CHEM1,PEPTIDE3,1:R21:R1|
PEPTIDE3,PEPTIDE2,1:R2-1:R1$$$
HELM
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
conclusions
• The use of 3-letter codes to represent non-standard
amino acids doesn’t scale to peptide registration
systems.
• Adopting the systematic rules already frequently
used by peptide chemists looks to solve a number of
amino acid nomenclature challenges faced by PDB,
HELM and the pharmaceutical industry.
• This presentation/proposal formalizes some of the
syntax/rules for constructing systematic codes for
non-standard amino acids.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
acknowledgements
• Lisa Sach-Peltason, Hoffmann-La Roche, Basel.
• Evan Bolton, NCBI PubChem project, Bethesda, MD.
• Noel O’Boyle, NextMove Software, Cambridge, UK.
• Daniel Lowe, NextMove Software, Cambridge, UK.
247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014

Weitere ähnliche Inhalte

Was ist angesagt?

02 organic chemistry
02 organic chemistry02 organic chemistry
02 organic chemistrymrsjacobs
 
Dissecting NOS electronics_Biochem J 2013
Dissecting NOS electronics_Biochem J 2013Dissecting NOS electronics_Biochem J 2013
Dissecting NOS electronics_Biochem J 2013Luciana Hannibal, Ph.D.
 
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-6502014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650Konstantin German
 
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...NextMove Software
 
OPSIN: Taming the jungle of IUPAC chemical nomenclature
OPSIN: Taming the jungle of IUPAC chemical nomenclatureOPSIN: Taming the jungle of IUPAC chemical nomenclature
OPSIN: Taming the jungle of IUPAC chemical nomenclaturedan2097
 
OPSIN: Taming the Jungle of IUPAC Chemical Nomenclature
OPSIN: Taming the Jungle of IUPAC Chemical NomenclatureOPSIN: Taming the Jungle of IUPAC Chemical Nomenclature
OPSIN: Taming the Jungle of IUPAC Chemical Nomenclaturedan2097
 
DG051 JMC 2010 jm900838g
DG051 JMC 2010 jm900838gDG051 JMC 2010 jm900838g
DG051 JMC 2010 jm900838gAlex Kiselyov
 
Radical retrosynthesis
 Radical retrosynthesis Radical retrosynthesis
Radical retrosynthesisAadil Ali Wani
 
Basic Concepts Of Retrosynthesis (Part1)
Basic Concepts Of Retrosynthesis (Part1)Basic Concepts Of Retrosynthesis (Part1)
Basic Concepts Of Retrosynthesis (Part1)munirnizami
 
150202-Biophysical Society Poster2
150202-Biophysical Society Poster2150202-Biophysical Society Poster2
150202-Biophysical Society Poster2Lukasz Olenginski
 
Informational macromolecules (2)
Informational macromolecules (2)Informational macromolecules (2)
Informational macromolecules (2)hussamdr
 
2006 Hamada Et Al Srmnc
2006 Hamada Et Al Srmnc2006 Hamada Et Al Srmnc
2006 Hamada Et Al Srmncyzhamada
 

Was ist angesagt? (20)

LTA4H FBDD 2009 JMC
LTA4H FBDD 2009 JMCLTA4H FBDD 2009 JMC
LTA4H FBDD 2009 JMC
 
02 organic chemistry
02 organic chemistry02 organic chemistry
02 organic chemistry
 
Finished Dissertation
Finished DissertationFinished Dissertation
Finished Dissertation
 
EPCI paper
EPCI paperEPCI paper
EPCI paper
 
Dissecting NOS electronics_Biochem J 2013
Dissecting NOS electronics_Biochem J 2013Dissecting NOS electronics_Biochem J 2013
Dissecting NOS electronics_Biochem J 2013
 
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-6502014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650
2014 terachem-nuclear medicine and biology, v. 41, is. 7, p. 547-650
 
ogundele journal
ogundele journalogundele journal
ogundele journal
 
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
 
OPSIN: Taming the jungle of IUPAC chemical nomenclature
OPSIN: Taming the jungle of IUPAC chemical nomenclatureOPSIN: Taming the jungle of IUPAC chemical nomenclature
OPSIN: Taming the jungle of IUPAC chemical nomenclature
 
OPSIN: Taming the Jungle of IUPAC Chemical Nomenclature
OPSIN: Taming the Jungle of IUPAC Chemical NomenclatureOPSIN: Taming the Jungle of IUPAC Chemical Nomenclature
OPSIN: Taming the Jungle of IUPAC Chemical Nomenclature
 
C7sc04518b
C7sc04518bC7sc04518b
C7sc04518b
 
DG051 JMC 2010 jm900838g
DG051 JMC 2010 jm900838gDG051 JMC 2010 jm900838g
DG051 JMC 2010 jm900838g
 
Radical retrosynthesis
 Radical retrosynthesis Radical retrosynthesis
Radical retrosynthesis
 
C4DT02257B
C4DT02257BC4DT02257B
C4DT02257B
 
Basic Concepts Of Retrosynthesis (Part1)
Basic Concepts Of Retrosynthesis (Part1)Basic Concepts Of Retrosynthesis (Part1)
Basic Concepts Of Retrosynthesis (Part1)
 
ol200031t
ol200031tol200031t
ol200031t
 
150202-Biophysical Society Poster2
150202-Biophysical Society Poster2150202-Biophysical Society Poster2
150202-Biophysical Society Poster2
 
Informational macromolecules (2)
Informational macromolecules (2)Informational macromolecules (2)
Informational macromolecules (2)
 
Final_Randhawa
Final_RandhawaFinal_Randhawa
Final_Randhawa
 
2006 Hamada Et Al Srmnc
2006 Hamada Et Al Srmnc2006 Hamada Et Al Srmnc
2006 Hamada Et Al Srmnc
 

Andere mochten auch

Chemistry and reactions from non-US patents
Chemistry and reactions from non-US patentsChemistry and reactions from non-US patents
Chemistry and reactions from non-US patentsNextMove Software
 
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...NextMove Software
 
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...NextMove Software
 
Using Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextUsing Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextNextMove Software
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...NextMove Software
 
Evidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesEvidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesNextMove Software
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision TreeNextMove Software
 

Andere mochten auch (8)

Chemistry and reactions from non-US patents
Chemistry and reactions from non-US patentsChemistry and reactions from non-US patents
Chemistry and reactions from non-US patents
 
InChI for Large Molecules
InChI for Large MoleculesInChI for Large Molecules
InChI for Large Molecules
 
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
 
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
 
Using Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextUsing Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make next
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...
 
Evidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesEvidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular series
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision Tree
 

Ähnlich wie Representation and display of non-standard peptides using semi-systematic amino acid monomer naming

CINF 4: Naming algorithms for derivatives of peptide-like natural products
CINF 4: Naming algorithms for derivatives of peptide-like natural productsCINF 4: Naming algorithms for derivatives of peptide-like natural products
CINF 4: Naming algorithms for derivatives of peptide-like natural productsNextMove Software
 
Classification, Nomenclature and structural isomerism of organic compound
Classification, Nomenclature and structural isomerism of organic compound Classification, Nomenclature and structural isomerism of organic compound
Classification, Nomenclature and structural isomerism of organic compound Ganesh Mote
 
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...ritik
 
Organic Chemistry 4.pptx
Organic Chemistry 4.pptxOrganic Chemistry 4.pptx
Organic Chemistry 4.pptxAshishChhabra33
 
Chapter no 1 poc presentation no 1
Chapter no 1 poc presentation no 1Chapter no 1 poc presentation no 1
Chapter no 1 poc presentation no 1Uttara Joshi
 
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01Cleophas Rwemera
 
Organic Compounds Nomenclature handouts
Organic Compounds Nomenclature handoutsOrganic Compounds Nomenclature handouts
Organic Compounds Nomenclature handoutsLexter Supnet
 
Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)NextMove Software
 
Classification, representation and analysis of cyclic peptides and peptide-li...
Classification, representation and analysis of cyclic peptides and peptide-li...Classification, representation and analysis of cyclic peptides and peptide-li...
Classification, representation and analysis of cyclic peptides and peptide-li...NextMove Software
 
B.sc. frst yr fundamental concepts
B.sc. frst yr fundamental conceptsB.sc. frst yr fundamental concepts
B.sc. frst yr fundamental conceptsRam Darash Pandey
 
enzyme classfication.pptx
enzyme classfication.pptxenzyme classfication.pptx
enzyme classfication.pptxDrReyaIssac
 
06. structure of dna and rna
06. structure of dna and rna 06. structure of dna and rna
06. structure of dna and rna saveena solanki
 
Naming chemical compounds
Naming chemical compounds Naming chemical compounds
Naming chemical compounds K. Shahzad Baig
 
Nucleic Acid Chemistry session one.ppt
Nucleic Acid Chemistry session one.pptNucleic Acid Chemistry session one.ppt
Nucleic Acid Chemistry session one.pptPharmTecM
 
Nucleic Acid Chemistry session one-1.ppt
Nucleic Acid Chemistry session one-1.pptNucleic Acid Chemistry session one-1.ppt
Nucleic Acid Chemistry session one-1.pptPharmTecM
 
organic chemistry unit 1.pptx
organic chemistry unit 1.pptxorganic chemistry unit 1.pptx
organic chemistry unit 1.pptxManoharKumar81
 
54 ch14replication2008
54 ch14replication200854 ch14replication2008
54 ch14replication2008sbarkanic
 
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...NextMove Software
 

Ähnlich wie Representation and display of non-standard peptides using semi-systematic amino acid monomer naming (20)

CINF 4: Naming algorithms for derivatives of peptide-like natural products
CINF 4: Naming algorithms for derivatives of peptide-like natural productsCINF 4: Naming algorithms for derivatives of peptide-like natural products
CINF 4: Naming algorithms for derivatives of peptide-like natural products
 
Organic chemistry nomenclature
Organic chemistry nomenclatureOrganic chemistry nomenclature
Organic chemistry nomenclature
 
Classification, Nomenclature and structural isomerism of organic compound
Classification, Nomenclature and structural isomerism of organic compound Classification, Nomenclature and structural isomerism of organic compound
Classification, Nomenclature and structural isomerism of organic compound
 
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...
Iupac nomenclature class 11 CBSE-organic chemistry some basic principles and ...
 
Organic Chemistry 4.pptx
Organic Chemistry 4.pptxOrganic Chemistry 4.pptx
Organic Chemistry 4.pptx
 
Chapter no 1 poc presentation no 1
Chapter no 1 poc presentation no 1Chapter no 1 poc presentation no 1
Chapter no 1 poc presentation no 1
 
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01
Noteshydrocarbon nomenclaturesidechains-120524142846-phpapp01
 
Organic Compounds Nomenclature handouts
Organic Compounds Nomenclature handoutsOrganic Compounds Nomenclature handouts
Organic Compounds Nomenclature handouts
 
Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)
 
Classification, representation and analysis of cyclic peptides and peptide-li...
Classification, representation and analysis of cyclic peptides and peptide-li...Classification, representation and analysis of cyclic peptides and peptide-li...
Classification, representation and analysis of cyclic peptides and peptide-li...
 
B.sc. frst yr fundamental concepts
B.sc. frst yr fundamental conceptsB.sc. frst yr fundamental concepts
B.sc. frst yr fundamental concepts
 
enzyme classfication.pptx
enzyme classfication.pptxenzyme classfication.pptx
enzyme classfication.pptx
 
06. structure of dna and rna
06. structure of dna and rna 06. structure of dna and rna
06. structure of dna and rna
 
Naming chemical compounds
Naming chemical compounds Naming chemical compounds
Naming chemical compounds
 
Nucleic Acid Chemistry session one.ppt
Nucleic Acid Chemistry session one.pptNucleic Acid Chemistry session one.ppt
Nucleic Acid Chemistry session one.ppt
 
Nucleic Acid Chemistry session one-1.ppt
Nucleic Acid Chemistry session one-1.pptNucleic Acid Chemistry session one-1.ppt
Nucleic Acid Chemistry session one-1.ppt
 
organic chemistry unit 1.pptx
organic chemistry unit 1.pptxorganic chemistry unit 1.pptx
organic chemistry unit 1.pptx
 
54 ch14replication2008
54 ch14replication200854 ch14replication2008
54 ch14replication2008
 
Hydrocarbon nomenclature
Hydrocarbon nomenclatureHydrocarbon nomenclature
Hydrocarbon nomenclature
 
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
 

Mehr von NextMove Software

CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...NextMove Software
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedNextMove Software
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESNextMove Software
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionNextMove Software
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...NextMove Software
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsNextMove Software
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...NextMove Software
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...NextMove Software
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKitNextMove Software
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...NextMove Software
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical RepresentationsNextMove Software
 
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptionsChallenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptionsNextMove Software
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics DatabaseNextMove Software
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...NextMove Software
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesNextMove Software
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesNextMove Software
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...NextMove Software
 
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)NextMove Software
 
Challenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeChallenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeNextMove Software
 

Mehr von NextMove Software (20)

DeepSMILES
DeepSMILESDeepSMILES
DeepSMILES
 
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule Implementations
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKit
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical Representations
 
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptionsChallenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptions
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics Database
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
 
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)
 
Challenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeChallenges in Chemical Information Exchange
Challenges in Chemical Information Exchange
 

Kürzlich hochgeladen

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 

Kürzlich hochgeladen (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 

Representation and display of non-standard peptides using semi-systematic amino acid monomer naming

  • 1. representation and display of non-standard peptides using semi-systematic amino acid monomer naming Roger Sayle Nextmove software, cambridge, uk 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 2. The primordial soup • The vast majority of chemical entities stored in chemical databases and pharmaceutical registration systems are classical “small molecules”. • For these molecules, the universal cheminformatics “all-heavy-atom” representations work well, providing duplicate checking, substructure search, normalization, depiction, etc. • Solutions include InChI, SMILES and V2000 mol file. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 3. Biopolymers of life • The low Shannon entropy of a small, but scientifically significant, fraction of compounds, biopolymers, allow them to represented more succinctly. • For these frequent subunits (or monomers) can be used instead of atoms in connection tables. • Representations with small monomer sets are easier for scientists to comprehend. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 4. Peptide representation The all-atom depiction (left) is more complicated to interpret than the 3-letter code and 1-letter code forms (right). Recognizing this is [Asp5]oxytocin is perhaps only possible at the “sequence” level. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 5. Protein representation All atom representation vs. 3-letter and 1-letter sequence forms of crambin. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 6. human insulin representations CC[C@H](C)[C@H]1C(=O)N[C@H]2CSSC[C@@H](C(=O)N[C@@H](CSSC[C@@H](C(=O)NCC(=O)N[C@H ](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N [C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC2 =O)CO)CC(C)C)CC3=CC=C(C=C3)O)CCC(=O)N)CC(C)C)CCC(=O)O)CC(=O)N)CC4=CC=C(C=C4)O)C( =O)N[C@@H](CC(=O)N)C(=O)O)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C (=O)NCC(=O)N[C@@H](CC5=CC=CC=C5)C(=O)N[C@@H](CC6=CC=CC=C6)C(=O)N[C@@H](CC7=CC=C( C=C7)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N8CCC[C@H]8C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([ C@@H](C)O)C(=O)O)C(C)C)CC(C)C)CC9=CC=C(C=C9)O)CC(C)C)C)CCC(=O)O)C(C)C)CC(C)C)CC2 =CN=CN2)CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)NC(=O) [C@H](CC(=O)N)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@H](C(=O)N[C@ H](C(=O)N1)CO)[C@@H](C)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C (C)C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)CN Human Insulin 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 7. Sequence bioinformatics • Traditional bioinformatics represents sequences as one letter codes, often in FASTA or Uniprot format. • Four characters for nucleic acids: – A,C,G and T for DNA and A,C,G and U for RNA. • 20 (or 22) characters for proteinogenic amino acids: – A,R,N,D,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y and V (plus O and U). • The challenge arises when more than the primary sequence needs to be represented. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 8. 3-letter codes to the rescue? • The use of 3-letter codes to represent monomers allows for significantly more possible monomers. • Conveniently common AAs have standard codes: – Ala, Arg, Asn, Asp, Cys, Glu, Gln, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr and Val, plus Sec and Pyl. • These assignments are mandated by IUPAC and IUB recommendations known as 3AA (dated 1983). • Unfortunately the prevalence of standard amino acids and 3-letter codes has had adverse affects. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 9. Pdb 3-letter residue codes • wwPDB’s components.cif currently lists 17,764 3-letter residue names (digits and capitals). • These contain not only amino acids, but nucleic acids, carbohydrates, ligands, salts solvents and co-factors. • Completely different codes are assigned to the L- and D- forms of amino acids. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 10. Monomer database solutions • Biopolymer systems based on monomer databases and strict 3-letter codes breakdown when faced with the huge number of potential chemical and post- translational modifications. • At best, a human being can remember 50-100 codes before having to consult “dictionary” to lookup solutions. • Worse, once beyond the commonly encountered amino acids, biochemists disagree on the meaning of 3-letter codes… 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 11. Example ambiguities • Cpa – Cyano-propionic amino acid – Beta-cyclopropylalanine – 4-amino-1-carbamoyl-piperidine-4-carboxylic acid • Hpg – 4-hydroxyphenylglycine – Homopropargylglycine • Nty – Nortyrosine [Chemical Abstracts Service] – N-nitrotyrosine [Pisotoia HELM] 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 12. 48 hexopyranoses 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 14. 9540 substituted hexopyranoses This only considers the four most common substituents: amino, acetyl, acetylamino and methoxy
  • 15. Bitter Sweet Conclusion • In practice, a glycoinformatics registration system has to handle over 232 monosaccharide monomers. • This number is clearly too large to assign each its own 3-letter code or to use a monomer database. • For example, monosaccharidedb.org curates a database of 771 monosaccharides, whereas 936 of the simple monosaccharides on the previous page can be found in PubChem. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 16. The aa naming solution • The solution to this problem is the systematic naming of amino acid derivatives. • This approach is well known to peptide chemists, with these names routinely used in scientific articles, patent applications and supplier catalogues, and easily recognizable by chemists. • Alas this “de facto” nomenclature, which hasn’t been formalized by IUPAC nor CAS, has been overlooked by the informatics community. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 17. Systematic amino acid code examples • Examples of systematic codes for representing common non-standard amino acids – Thr(tBu), Ser(tBu), Phe(4-Cl), Phe(3-Cl), Phe(4-Ph), Tyr(PO3H2), Cys(Acm), Lys(Boc), Tyr(Bn(2,6-diCl), Tyr(3-F), Lys(Cbz), Glu(OtBu), Tyr(SO3H), Phe(4-CF3), Tyr(tBu), Arg(NO2), Trp(For), Lys(palmitoyl), aThr(tBu), Phe(4-F), Thr(Bn), Ser(Bn), Asp(OtBu), Tyr(decyl), Glu(OBn), Gly(tBu), Phe(4-hexyl), Thr(SO3H), Tyr(Me), Asp(OBn), Tyr(Bn), Phe(4-NO2), Lys(Ac), Nle(Me), Cys(tBu), Ala(cyclopentyl), Cys(StBu), Trp(2-Bn), Ser(PO3H2), Phe(4-I), Arg(Tos), etc. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 18. monosaccharide examples • In fact constructing systematic names is the same solution as used by the glycoinformatics community – b-D-GlcpNAc, a-D-Neup5Ac, b-D-GlcpA, D-GlcNAc-ol, a-D- GlcpN, a-L-4-en-4-deoxy-thrHexpA, b-D-Galp3Me, b-D-6- deoxy-Glcp, b-D-2,6-deoxy-ribHexp, b-D-GlcpA6Me, a-D- Rhap4N-formyl, b-D-Glcp6Ac, a-D-Neup5Ac9Ac… • And in RNA informatics – P-rGuo, P-Gua-Rib2Me, P-m22Gua-Rib, P-m2Gua-Rib, P- Cyt-Rib2Me, P-m5Cyt-Rib, P-m5Ura-Rib, P-m1Ade-Rib… 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 19. Rule #1: use of D- and DL- prefixes • Rather than have separate code forms for D- form (and DL-) amino acids, the widely understood and accepted stereo configuration prefixes “D-”, “L-” (default) and “DL-” should be used. – D-Arg – DL-Ala – D-Ser 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 20. Rule #2: retain widely used 3LC’s. • Universally accepted code/definitions should be used – Abu (2-aminobutyric acid), Aib (2-aminoisobutyric acid), aIle (allo-isoleucine), aThe (allo-threonine), bAla (beta- alanine), Cha (beta-cyclohexylalanine), Chg (alpha- cyclohexylglycine), Cit (Citrulline), Dab (2,3-diaminobutyric acid), Dap (2,3-diaminopropionic acid), hArg (homoarginine), Hcy (homocysteine), Hse (homoserine), hPhe (homophenylalanine), Nle (norleucine), Nva (norvaline), Orn (ornithine), Pen (penicillamine), Phg (phenylglycine), Sar (sarcosine), Sec (selenocysteine), 2Thi (thien-2-ylalanine), 3Thi (thien-3-ylalanine), xiIle (xi- isoleucine), xiThr (xi-threonine), and many more. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 21. Rule #3: modifying prefixes • “N(Me)” and friends are used to specify N-modified variants. “N(Me)Gly” is equivalent to “Sar”. • The “O” prefix is used to specify depsi-peptides that connect via ester linkages, e.g. “Arg-OAla-Ser”. • The “aMe” prefix is used to specify alpha-methyl variants of amino acids. “aMeAla” is same as “Aib”. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 22. Rule #4: line formulae substs • Substitutions are indicated by line formulae – Ph phenyl CF3 trifluoromethyl – Me methyl Tos tosyl – Ac acetyl Trt trityl – Cl chloro Tf triflyl – CN cyano SO3H sulfonic acid – Bn benzyl OPfp perfluorophenoxy – NH2 amino iPr isopropyl – NO2 nitro For formyl – tBu tert-butyl … and many more 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 23. RULE #5: DEFAULT SUBSTITUTION • Many amino acids have default substitution locants – Abu ABA.CG – Ala ALA.CB – Arg ARG.NH2 – Cys CYS.SG – Dab DAB.ND – Dap DPP.NG – Gly GLY.CA – Lys LYS.NZ – Ser SER.OG – Thr THR.OG1 – Tyr TYR.OH 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 24. RULE #6: specified SUBSTITUTION • Amino acids may be substituted at particular locant using the following syntax: – Phe(4-Cl) – Phg(4-CH2NH2) – Tyr(2,6-diCl-Bn) • Substitutions may occur at more than one locant: – His(2,5-diI) – Tyr(2,6-diF) 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 25. Rule #7: implicit leaving groups • For most amino acids, parenthesized substitution implies replaced of hydrogen. – Ala(Cl) • Carboxylic acids assume an OH leaving group, requiring an explicit “oxy” to form esters. – Asp(NH2) – Asp(OMe) • Cysteine (and penicillamine) substitute on the sulfur – Cys(tBu) – Cys(StBu) 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 26. The need for standardization #1 • For peptide registration, we need a canonical name. • Preferred forms are most visible in line notations. – For carboxy, do we prefer “COOH” or “CO2H”. – For acetyl, “Ac” or “COCH3”. – For ethylamine, “EtNH2” or “CH2CH2NH2”. – For phosphate, “PO3H2” vs. “P”. • However decomposition also affects preferred forms – Gly(Me) is the same as Ala – Asp(NH2) is the same as Asn – Phe(4-Ph) is the same as 4-Bip 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 27. The need for standardization #2 • More seriously we need to avoid ambiguous names. • It is widely acknowledge by peptide chemists that “Ac” means acetyl, rather than Actinium, and connects to the parent at the carbonyl carbon. • In supplier catalogues for a German vendor, the name “Cys(AcOH)” implies “*CC(=O)O” instead of “*C(=O)CO”. • Which names should be used, “COCH2OH” etc.? 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 28. Example Peptide names (chembl) • The following names are machine generated • H-Cys-Pro-Trp-His-Leu-Leu-Pro-Phe-Cys-OH CHEMBL501567 • H-Tyr-Pro-Phe-Phe-OtBu CHEMBL500195 • cyclo[Ala-Tyr-Val-Orn-Leu-D-Phe-Pro-Phe-D-Phe-Asn] CHEMBL438006 • H-Nle(Et)-Tyr-Pro-Trp-Phe-NH2 CHEMBL500704 • H-DL-hPhe-Val-Met-Tyr(PO3H2)-Asn-Leu-Gly-Glu-OH CHEMBL439086 • cyclo[Phe-D-Trp-Tyr(Me)-D-Pro] CHEMBL507127 • H-D-Pyr-D-Leu-pyrrolidide CHEMBL1181307 • Ac-DL-Phe-aThr-Leu-Asp-Ala-Asp-DL-Phe(4-Cl)-OH CHEMBL1791047 • H-D-Cys(1)-D-Asp-Gly-Tyr(3-NO2)-Gly-Hyp-Asp-D-Cys(1)-NH2 CHEMBL583516 • Boc-Tyr-Tyr(3-Br)-OMe CHEMBL1976073 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 29. Example Peptide names (pubchem) • The following names are existing supplier synonyms • Boc-Asp(OtBu)-Pro-OH CID57383532 • Tyr-D-Ala-Gly-NMePhe CID45483974 • Tyr-D-Pro-Gly-Trp-NMeNle-Asp-Phe-NH2 CID11693641 • Ac-Cys-Ile-Tyr-Lys-Phe(4-Cl)-Tyr CID44412001 • Z-D-Val-Lys(Z)-OH CID5978? • deamino-Cys-D-Tyr(Et)-Ile-Thr-Asn-Cys-Pro-Orn-Gly-NH2 CID68613 • H-Arg(NO2)-OMe.HCl CID135193 • Unlike HELM or PLN, these IUPAC condensed line notations (3AA) are already used in publications. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 30. The big win: display of peptides • One of the largest end-user benefits of systematic naming is in the improved display of compounds: Structure of Degarelix from wikipedia: http://en.wikipedia.org/wiki/Degarelix 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 31. The big win: Display of peptides • The image of Degarelix becomes optimal (IMHO) Image credit: WO2011066386A1 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 32. A real example: CID10033747 CC(C)[C@H]1C(=O)N[C@H](C(=O)N[C@H]( C(=O)N2CCC[C@H]2C(=O)N[C@H](C(=O)N [C@H](C(=O)N1)CC3=CC=C(C=C3)OP(=O)( O)O)CSSC[C@@H](C(=O)O)NC(=O)C)C(C) C)CC(=O)N IUPAC depiction cyclo[Asn-Val-Pro-Cys(1)-Tyr(PO3H2)-Val]. Ac-Cys(1)-OH. IUPAC Condensed SMILES PEPTIDE1{[ac].C}|PEPTIDE2{N.V.P.C}| CHEM1{*N[C@@H](Cc1ccc(cc1)OP(=O) (O)O)C(=O)*|$_R1;;;;;;;;;;;;;;;;;_R2$|}| PEPTIDE3{V}$ PEPTIDE1,PEPTIDE2,2:R34:R3| PEPTIDE2,CHEM1,4:R2-1:R1| CHEM1,PEPTIDE3,1:R21:R1| PEPTIDE3,PEPTIDE2,1:R2-1:R1$$$ HELM 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 33. conclusions • The use of 3-letter codes to represent non-standard amino acids doesn’t scale to peptide registration systems. • Adopting the systematic rules already frequently used by peptide chemists looks to solve a number of amino acid nomenclature challenges faced by PDB, HELM and the pharmaceutical industry. • This presentation/proposal formalizes some of the syntax/rules for constructing systematic codes for non-standard amino acids. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014
  • 34. acknowledgements • Lisa Sach-Peltason, Hoffmann-La Roche, Basel. • Evan Bolton, NCBI PubChem project, Bethesda, MD. • Noel O’Boyle, NextMove Software, Cambridge, UK. • Daniel Lowe, NextMove Software, Cambridge, UK. 247th ACS National Meeting, Dallas, TX, Wednesday 19th March 2014