Suche senden
Hochladen
Comparing Cahn-Ingold-Prelog Rule Implementations
âą
0 gefÀllt mir
âą
378 views
NextMove Software
Folgen
Posted on comparing Cahn-Ingold-Prelog Rule Implementations
Weniger lesen
Mehr lesen
Wissenschaft
Melden
Teilen
Melden
Teilen
1 von 1
Jetzt herunterladen
Downloaden Sie, um offline zu lesen
Empfohlen
RNA editing as a drug target in tryp. development of a high throughput sceeni...
RNA editing as a drug target in tryp. development of a high throughput sceeni...
Laurence Dawkins-Hall
Â
Calibration of scintillation crystals for air kerma rate castle
Calibration of scintillation crystals for air kerma rate castle
Leishman Associates
Â
Chemical Named Entity Recognition
Chemical Named Entity Recognition
SriTeja Allaparthi
Â
GPCRs_HouseLA
GPCRs_HouseLA
Lindsay House
Â
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
NextMove Software
Â
Randomizing genome-scale metabolic networks
Randomizing genome-scale metabolic networks
Areejit Samal
Â
Extracting Synthetic Knowledge from Reaction Databases - ARChem at the 246th ACS
Extracting Synthetic Knowledge from Reaction Databases - ARChem at the 246th ACS
SimBioSys_Inc
Â
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Cresset
Â
Empfohlen
RNA editing as a drug target in tryp. development of a high throughput sceeni...
RNA editing as a drug target in tryp. development of a high throughput sceeni...
Laurence Dawkins-Hall
Â
Calibration of scintillation crystals for air kerma rate castle
Calibration of scintillation crystals for air kerma rate castle
Leishman Associates
Â
Chemical Named Entity Recognition
Chemical Named Entity Recognition
SriTeja Allaparthi
Â
GPCRs_HouseLA
GPCRs_HouseLA
Lindsay House
Â
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
NextMove Software
Â
Randomizing genome-scale metabolic networks
Randomizing genome-scale metabolic networks
Areejit Samal
Â
Extracting Synthetic Knowledge from Reaction Databases - ARChem at the 246th ACS
Extracting Synthetic Knowledge from Reaction Databases - ARChem at the 246th ACS
SimBioSys_Inc
Â
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Cresset
Â
Bits protein structure
Bits protein structure
BITS
Â
foglar book.pdf
foglar book.pdf
BalqeesMustafa
Â
modelling assignment
modelling assignment
ShwetA Kumari
Â
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
Areejit Samal
Â
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
Â
Rna secondary structure prediction, a cuckoo search approach
Rna secondary structure prediction, a cuckoo search approach
eSAT Journals
Â
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
Kamel Mansouri
Â
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Yaser Kalifa
Â
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
N. Sukumar
Â
Event 32
Event 32
etheruoh
Â
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
University of California, San Diego
Â
Gordon2003
Gordon2003
toluene
Â
Fault detection in power transformers using random neural networks
Fault detection in power transformers using random neural networks
IJECEIAES
Â
GPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
Olexandr Isayev
Â
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
Â
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Bioinformatics and Computational Biosciences Branch
Â
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
Andrew McEachran
Â
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
Kamel Mansouri
Â
Computational Chemistry Robots
Computational Chemistry Robots
University of Cambridge
Â
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computations
Oregon State University
Â
DeepSMILES
DeepSMILES
NextMove Software
Â
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
NextMove Software
Â
Weitere Àhnliche Inhalte
Ăhnlich wie Comparing Cahn-Ingold-Prelog Rule Implementations
Bits protein structure
Bits protein structure
BITS
Â
foglar book.pdf
foglar book.pdf
BalqeesMustafa
Â
modelling assignment
modelling assignment
ShwetA Kumari
Â
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
Areejit Samal
Â
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
Â
Rna secondary structure prediction, a cuckoo search approach
Rna secondary structure prediction, a cuckoo search approach
eSAT Journals
Â
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
Kamel Mansouri
Â
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Yaser Kalifa
Â
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
N. Sukumar
Â
Event 32
Event 32
etheruoh
Â
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
University of California, San Diego
Â
Gordon2003
Gordon2003
toluene
Â
Fault detection in power transformers using random neural networks
Fault detection in power transformers using random neural networks
IJECEIAES
Â
GPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
Olexandr Isayev
Â
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
Â
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Bioinformatics and Computational Biosciences Branch
Â
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
Andrew McEachran
Â
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
Kamel Mansouri
Â
Computational Chemistry Robots
Computational Chemistry Robots
University of Cambridge
Â
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computations
Oregon State University
Â
Ăhnlich wie Comparing Cahn-Ingold-Prelog Rule Implementations
(20)
Bits protein structure
Bits protein structure
Â
foglar book.pdf
foglar book.pdf
Â
modelling assignment
modelling assignment
Â
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
Â
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Â
Rna secondary structure prediction, a cuckoo search approach
Rna secondary structure prediction, a cuckoo search approach
Â
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
Â
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Multiobjective Optimization Tool for a Free Structure Analog Circuits Design ...
Â
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Â
Event 32
Event 32
Â
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
NANO281 Lecture 01 - Introduction to Data Science in Materials Science
Â
Gordon2003
Gordon2003
Â
Fault detection in power transformers using random neural networks
Fault detection in power transformers using random neural networks
Â
GPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
Â
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
Â
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Â
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
Â
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
Â
Computational Chemistry Robots
Computational Chemistry Robots
Â
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computations
Â
Mehr von NextMove Software
DeepSMILES
DeepSMILES
NextMove Software
Â
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
NextMove Software
Â
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
NextMove Software
Â
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
NextMove Software
Â
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
NextMove Software
Â
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
NextMove Software
Â
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
NextMove Software
Â
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
NextMove Software
Â
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
NextMove Software
Â
Recent improvements to the RDKit
Recent improvements to the RDKit
NextMove Software
Â
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
NextMove Software
Â
Digital Chemical Representations
Digital Chemical Representations
NextMove Software
Â
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptions
NextMove Software
Â
PubChem as a Biologics Database
PubChem as a Biologics Database
NextMove Software
Â
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
NextMove Software
Â
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
NextMove Software
Â
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
NextMove Software
Â
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)
NextMove Software
Â
Challenges in Chemical Information Exchange
Challenges in Chemical Information Exchange
NextMove Software
Â
Automatic extraction of bioactivity data from patents
Automatic extraction of bioactivity data from patents
NextMove Software
Â
Mehr von NextMove Software
(20)
DeepSMILES
DeepSMILES
Â
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
Â
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
Â
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
Â
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
Â
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Â
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Â
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Â
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Â
Recent improvements to the RDKit
Recent improvements to the RDKit
Â
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Â
Digital Chemical Representations
Digital Chemical Representations
Â
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptions
Â
PubChem as a Biologics Database
PubChem as a Biologics Database
Â
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
Â
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
Â
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Â
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)
Â
Challenges in Chemical Information Exchange
Challenges in Chemical Information Exchange
Â
Automatic extraction of bioactivity data from patents
Automatic extraction of bioactivity data from patents
Â
KĂŒrzlich hochgeladen
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
Â
High Profile đ 8250077686 đ Call Girls Service in GTB Nagarđ
High Profile đ 8250077686 đ Call Girls Service in GTB Nagarđ
Damini Dixit
Â
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
monikaservice1
Â
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
Â
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
SĂ©rgio Sacani
Â
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
TukamushabaBismark
Â
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
PRADYUMMAURYA1
Â
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
Silpa
Â
High Class Escorts in Hyderabad âč7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad âč7.5k Pick Up & Drop With Cash Payment 969456...
chandars293
Â
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx .
Poonam Aher Patil
Â
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(âïž+971_581248768%)**%*]'#abortion pills for sale in dubai@
Â
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
Areesha Ahmad
Â
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
shivanisharma5244
Â
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
Sapana Sha
Â
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
FarihaAbdulRasheed
Â
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
Areesha Ahmad
Â
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Silpa
Â
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
muralinath2
Â
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
Â
Kochi â€CALL GIRL 84099*07087 â€CALL GIRLS IN Kochi ESCORT SERVICEâ€CALL GIRL
Kochi â€CALL GIRL 84099*07087 â€CALL GIRLS IN Kochi ESCORT SERVICEâ€CALL GIRL
kantirani197
Â
KĂŒrzlich hochgeladen
(20)
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
Â
High Profile đ 8250077686 đ Call Girls Service in GTB Nagarđ
High Profile đ 8250077686 đ Call Girls Service in GTB Nagarđ
Â
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Â
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Â
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Â
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
Â
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
Â
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
Â
High Class Escorts in Hyderabad âč7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad âč7.5k Pick Up & Drop With Cash Payment 969456...
Â
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx .
Â
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
Â
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
Â
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Â
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
Â
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
Â
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
Â
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Â
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Â
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
Â
Kochi â€CALL GIRL 84099*07087 â€CALL GIRLS IN Kochi ESCORT SERVICEâ€CALL GIRL
Kochi â€CALL GIRL 84099*07087 â€CALL GIRLS IN Kochi ESCORT SERVICEâ€CALL GIRL
Â
Comparing Cahn-Ingold-Prelog Rule Implementations
1.
John Mayfield, Daniel Lowe and Roger Sayle NextMove Software Ltd, Cambridge, UK. NextMove Software Limited Innovation Centre (Unit 23) Cambridge Science Park Milton Road, Cambridge UK CB4 0EY www.nextmovesoftware.com Introduction Robert Hanson, Andrey
Yerin, Mikko Vainio, and Sophia Gillian Musacchio for initiating and participating in the âFix CIPâ collaboration and the many in-depth technical discussions that have lead to improvements in the tools. Karl Nedwed for providing KnowItAll results. Philip Skinner for providing ChemDraw licenses. Noel OâBoyle for feedback and suggestions. the need for open-cip The Cahn-Ingold-Prelog (CIP) priority rules rank atoms around a stereogenic unit to assign a stereo-descriptor that is invariant to atom order and layout, for example R (right) or S (left) for tetrahedral atoms. A directed acyclic graph (digraph) is constructed for each stereogenic unit and the out edges from the root node compared and ranked according to eight sequence rules[1]. Each rule is applied exhaustively and tested on the entire digraph before applying the next rule[2]. Acknowledgements Results 1. P-92.1.3 Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 2. Paulina Mata. The CIP System Again:â Respecting Hierarchies Is Always a Must. J. Chem. Inf. Comput. Sci., 1999, 39 (6) Bibliography Conclusion The CIP sequence rules provide a standard way for chemists to effectively describe the configurations of most stereogenic units. However, beyond simple cases the complexity of the rules necessitates software is used as an aid to naming configurations. The results demonstrate even then, software implementations do not all agree on the configuration. Through the results presented here and the on-going effort of the Fix CIP collaboration, software should aim to converge upon consistent stereochemistry naming. An Open CIP software tool could provide âblessedâ stereochemistry configuration names and provide a standard algorithm implementation for other vendors to integrate or adapt. Comparing Cahn-Ingold-Prelog Rule Implementations Rule 1 a. Higher atomic number precedes lower b. An atom node duplicated closer to the root ranks higher than one duplicated further Rule 2 Higher atomic mass number precedes lower Rule 3 Z precedes E and this precedes nonstereogenic (nst) double bonds Rule 4 a. Chiral stereogenic units precede pseudoasymmetric stereogenic units and these precede nonstereogenic units (R = S > r = s > nst) b. When two ligands have different descriptor pairs, the one with the first chosen like descriptor pairs has priority over the one with a corresponding unlike descriptor pairs c. r precedes s Rule 5 An atom or group with descriptor R has priority over its enantiomorph S Stereochemistry in Databases 154 _23 hem nce 601 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 % of Dataset Count 0 1 2 3 4 5 6 7 8 9 eMolecules (June 2017) PubChem Substance PubChem Compound (Aug 2017) ChEMBL 23 ChEBI 154 14 million records 234 million records 93 million records 1.7 million records 95 thousand records 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 % of Dataset Count 0 1 2 3 4 5 6 7 8 9 Number of Stereogenic Units + _154 l_23 hem ance 0601 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 % of Dataset Count 0 1 2 3 4 5 6 7 8 9 The number of defined stereogenic units per molecule varies between databases. The application of Rule 1a to the digraph for 2-butanol ranks the out edges connected to the root as giving the label S (4 > 2 > 5 are anticlockwise looking towards 6). ChEBI ChEMBL eMolecules PubChem Compound1 PubChem Substance Rule 1a 281K 99.6% 1.8M 98.6% 2.4M 97.0% 53.5M 100.0% 93.1M 98.7% Rule 1b 4 1 164 255 Rule 2 14 3,565 6,789 Rule 3 29 3 441 36 45 Rule 4a 122 126 273 4 12,770 Rule 4b 563 0.2% 4,037 0.2% 3,188 0.1% 125K 0.1% Rule 4c 19 558 Rule 5 285 0.1% 23.4K 1.2% 69K 2.8% 15 1.1M 1.2% Total 282K 1.9M 2.4M 53.5M 94.3M The majority of stereogenic units are constitutionally asymmetric and can be ranked using Rule 1a. However, in some datasets the number of stereogenic units requiring Rule 4b and 5 can be significant. I II III IV V VI VII VIII IX X XIa XIb XII XIII Centres 2.0 R R R R R R R R R r R R r R JMol 14.20.3 R R R R R R R R R r R R r R ACD/ChemSketch 14.05beta R R R R R R R R R r R R r R Balloon 1.6.5beta R R R R R R R R R r R R r R KnowItAll ChemWindow 2018 R R R R R R R R R r R R r R5 ChemDraw 16.0 R R R R S R R R R r R R r R BIOVIA Draw 2017 R R R - R R R R R -1 R R -1 R MarvinSketch 17.17 R - - - S R - R - r R R r - Indigo 1.3.0Beta.r16 -2 R - - R - R R R r S R - - RDKit 2017.03.03 S R S R S R R S R R R R - - DataWarrior 4.6.0 R R R - S R R S R R R3 R - - CACTVS (NCI Resolver Aug 17) R R S - S4 R R S R R S R - - OPSIN 2.3.1 R R R R R - - - - - S R - - LexiChem (OEChem) 20170613 R R - - R - - - - - S R - - ChemDoodle 7.0.2 R R - - S - - s - r S R - - CDK 2.0 - R R5 - S - - - - - S R - - JUMBO 6 R - S - - - - - - - S S - - Constitutional (Rule 1a, 1b, 2) Geometrical + Topographical (Rule 3,4a,4b,4c,5) Special (Mancude, Aux Descriptors) 1. Pseudoasymmetric r/s labels not displayed but must be calculated due to answers given for IX and XIII 2. Runtime error occurs 3. Impossible to test as different KekulĂ© forms are normalised 4. R in CACTVS since Feb 2015, NCI resolver is old version 5. Other descriptor is assigned differently A set of fourteen structures was collected to identify differences between software implementations. The structures were selected to cover all the sequence rules and their applications to special cases. Eight sequence rules (in essence) Fix CIP Collaboration Since submitting this work for presentation the developers: Centres, JMol, ACD/ ChemSketch, and Balloon have begun a collaboration. We are in the process of submitting for publication an extended in-depth validation set and proposing sequence rule refinements and additions where they are required. 1As part of the PubChem Compoundâs processing, non-constitutional stereochemistry is removed: for example the nine stereoisomers of inositols are all represented by CID 892. Atoms connected by double and triple bonds as well as ring closures result in duplicated nodes in the digraph. In the structure below atoms 5 and 6 appear twice and atom 1 (the root) appears three times. Due to this duplication, complex ring systems can generate exponentially large digraphs that are not computationally tractable. Further complexity in digraphs is caused by the use of fractional atomic numbers in mancude ring-systems and assignment of auxiliary descriptors for applying Rules 3-5. H OH H H H H H H H H H 1 7 6 5 (1) (1) 65234 O O 3 4 2 1 6 5 7 7 O H H H H H H H H H 321 5 4 6 1 2 3 5 6 4 H
Jetzt herunterladen