5. Command Line Applications
Mol2Nam
• Convert a file of molecules to names
bicyclo[3.2.1]octane
bicyclo[3.2.1]octane
Nam2Mol
• Convert a file of names to molecules
Glycinate Translate
• Convert a file of chemical names into a different
グリシナート language
5
8. Pipelines
• Large scale
conversion of
structures to names
and names to
structures
• Easy integration in
workflows
8
9. Webservices
• Lexichem Webservice available
• Integration with 3rd party webservices
• PubChem uses
Lexichem
Mol2Nam
Molecule
PUG Hits in
s
SOAP PubChem
9
10. Lexiparser
• Automated extraction of structures and names from documents
• Supported formats:
- .txt - .docx
- .html - .rtf
- .doc - .pdf FUTURE
10
14. Why a New Performance Metric ?
• Ensures consistent improvement of Lexichem
• Identify areas in need of development
• Gold standard for all chemical nomenclature software
14
15. Round Tripping
• Compare the initial Canonical
and final structure
after name generation Isomeric SMILES
• Easy to calculate
TP
%RTCS = ×100
TotalStructures
English IUPAC
Name
*E.O.Cannon. JCIM 2012, DOI: 10.1021/ci3000419
15
30. Main Window
• Converts input SMILES string • Chemical information:
or chemical name - Molecular weight
• Visual display of the structure - SMILES
- IUPAC name
22
Start by talking about some real world applications of Lexichem, what you can do with it and who is using it.then move on to talk about Lexichem, TK v2.11….released Metric to assess how well Lexichem is performingNew features from v2.0.2Finish off with GUI
Now lets talk about the main purpose of my were here, Lexichem.
Lexichem is OpenEyes chemical nomenclature software, you can:->convert names to molecules->molecules to names->translate names from one language to anotherLexichem comes in two flavours
Standalone applications run from command line.
For those who want to program and use Lexichem:Lexichem TK is written in C++ and Swig (Simplified Wrapper and Interface Generator) wrapped to python, Java and C#.
So what can Lexichem do, other than help you buy heroin?
Keith Taylor showed yesterday a Pipeline pilot workflow with a node for converting structures to namesWorkflow integration with Pipeline Pilot.Node use 1 of Lexichem’s functions-> mol2nam, nam2mol, translateMatt Stahl working on nodes using OpenEye Software.Lexichem node highlighted in square ->convert structures to names.
Craig Bruce hired recently, been developing Webservices for OE, one being LexichemUse Lexichem prior / post processing to Webservice.PUG (Power User Gateway)Search for structures with the names found in across PubChem.
Chemical name extraction from patents/documents and structure generation (Lexiparser uses Lexichem)->Uses Lexichem after it’s extracted chemical names to convert them to molecules.
Lexichem is the engine beneath the hood, when a user draws a structure a call to Lexichem is made which generates a name which can be rendered.Alternatively you can import chemical names convert them to molecules and visualize the image.
Purpose of metric-> ensure we are not regressing but improving Lexichem-> identify areas / features in need of improvement-> gold standard which other companies can then compare chemical nomenclature software
->Concept: start pt, and an end pt after some processing, then compare start pt to end pt+ve: quick to calculate, gives one figure value of how accurate Lexichem is on dataset->Paper acceptedAdv SMILEs: human readable, less verbose than inchi, tautomer support
->Concept: start pt, and an end pt after some processing, then compare start pt to end pt+ve: quick to calculate, gives one figure value of how accurate Lexichem is on dataset->Paper acceptedAdv SMILEs: human readable, less verbose than inchi, tautomer support
The concept of a benchmark is good, but do we have good results using it?
Whilst these results are good, is Lexichem feasible on a large scale?Seen Lexichem performs well and is feasible on large datasets, now lets look at what features been added.----- Meeting Notes (3/28/12 11:38) -----Mol2Nam -> canonicalize atoms & bonds,identify atom types, identify ring systems and size, bridges, locants and positions,identify stereo / walk the graph
So what have we added since v2.0.2?Our main drive has been looking nam2mol features (as they’re not quite as well supported as mol2nam), in particular the ability to generate molecules for large ring systems. (one of Lexichem’s weaker points in the previous releases)
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
Ring templates for conversion of molecules to names.Mainly bridge and fused ring templates have been added.
Primary goal of the GUI was to:-> Lower the bar to use Lexichem’s functionality (for people not keen on using an API to program against, or using the command line)
->Primarily modeled on the command line tools, but provided numerous additional features2 Menus: Open a molecular fileImport name, SMILES, set default options, clear the view.
Filter the results
Story about Lexichem, you know you want it, then please feel free to contact usFuture work: continue to work on fused polycyclic ring systems, natural products