Lexichem, a New Era
Overview

Lexichem
Lexichem




[Structure image: Nordefrin]

Supported Nomenclature
• IUPAC 79 / 93 / 2005
• Chemical Abstract / CAS
• Traditional
• MDL / Beilstein
• AutoNom
• OpenEye

Supported Languages (17)
• English (American / British)
• German
• Japanese
• Spanish
• Swedish
Command Line Applications

Mol2Nam
• Convert a file of molecules to names
• Example: [structure] -> bicyclo[3.2.1]octane

Nam2Mol
• Convert a file of names to molecules
• Example: bicyclo[3.2.1]octane -> [structure]

Translate
• Convert a file of chemical names into a different language
• Example: Glycinate -> グリシナート
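The example name bicyclo[3.2.1]octane follows von Baeyer nomenclature: the bracketed numbers give the atom counts of the three bridges between the two bridgeheads, so the ring system has x + y + z + 2 atoms. The sketch below is only an illustration of that rule, not Lexichem's parser; the function names, the small stem table, and the restriction to unsubstituted bicycloalkanes are all assumptions made for the example.

```python
import re

# Carbon counts for a few alkane stems (illustrative subset, not a full table).
STEMS = {"pentane": 5, "hexane": 6, "heptane": 7,
         "octane": 8, "nonane": 9, "decane": 10}

NAME_RE = re.compile(r"^bicyclo\[(\d+)\.(\d+)\.(\d+)\]([a-z]+)$")


def parse_bicyclo(name):
    """Return ((x, y, z), atom_count) for an unsubstituted bicyclo[x.y.z]alkane."""
    match = NAME_RE.match(name.strip().lower())
    if match is None:
        raise ValueError(f"not a simple bicyclo[x.y.z]alkane name: {name!r}")
    x, y, z = (int(g) for g in match.groups()[:3])
    stem = match.group(4)
    if stem not in STEMS:
        raise ValueError(f"stem not in the illustrative table: {stem!r}")
    if not (x >= y >= z and y >= 1):
        raise ValueError("bridges must be in descending order, with y >= 1")
    # Two bridgehead atoms plus the atoms in the three bridges.
    if x + y + z + 2 != STEMS[stem]:
        raise ValueError("bridge sizes do not match the stem's atom count")
    return (x, y, z), STEMS[stem]


def bicyclo_smiles(x, y, z):
    """Build a (non-canonical) SMILES string for bicyclo[x.y.z]alkane."""
    head = "C12" + "C" * x              # first bridgehead, then the main bridge
    if z == 0:
        # Bridgeheads are bonded directly: close ring 2 on the second bridgehead.
        return head + "C2" + "C" * (y - 1) + "C1"
    # Second bridgehead carries bridge y as a branch, then bridge z.
    return head + "C(" + "C" * (y - 1) + "C1)" + "C" * (z - 1) + "C2"


bridges, atoms = parse_bicyclo("bicyclo[3.2.1]octane")
print(bridges, atoms, bicyclo_smiles(*bridges))
```

A real name-to-structure engine also has to handle substituents, heteroatoms, stereochemistry and polycyclic descriptors with secondary bridges, none of which this sketch attempts.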
Lexichem TK




Applications
Pipelines




• Large-scale conversion of structures to names and names to structures

• Easy integration in workflows
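A batch conversion step of this kind can be sketched as a small streaming loop. The code below is a minimal illustration under stated assumptions: `convert_stream`, `write_tsv` and the toy lookup-table converter are hypothetical names invented for the example, and the converter stands in for whatever structure-to-name or name-to-structure call the workflow actually uses.

```python
import io


def convert_stream(lines, convert):
    """Yield (input, result) per non-empty line; result is None on failure."""
    for raw in lines:
        item = raw.strip()
        if not item:
            continue
        try:
            result = convert(item)
        except Exception:
            # Record the failure and keep going so one bad record
            # does not abort a large run.
            result = None
        yield item, result


def write_tsv(pairs, out):
    """Write 'input<TAB>result' lines and return (converted, failed) counts."""
    converted = failed = 0
    for item, result in pairs:
        if result is None:
            failed += 1
            out.write(f"{item}\t\n")
        else:
            converted += 1
            out.write(f"{item}\t{result}\n")
    return converted, failed


# Hypothetical stand-in converter: a lookup table instead of a real toolkit call.
TOY_NAMES = {"C": "methane", "CC": "ethane", "CCO": "ethanol"}


def toy_mol2nam(smiles):
    return TOY_NAMES[smiles]  # raises KeyError for anything not in the table


buffer = io.StringIO()
counts = write_tsv(convert_stream(["CC\n", "\n", "XYZ\n", "CCO\n"], toy_mol2nam), buffer)
print(counts)
print(buffer.getvalue(), end="")
```

Because the converter is passed in as a callable, the same loop serves both directions (structures to names, names to structures) and is easy to wrap as a single workflow node.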
Webservices


• Lexichem Webservice available

• Integration with 3rd party webservices

• PubChem uses Lexichem

[Diagram: Molecules -> Mol2Nam -> PUG SOAP -> Hits in PubChem]
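The last step of that flow, looking up a generated name in PubChem, can be sketched as follows. Note the swap: the slide shows PUG SOAP, whereas this example builds a request URL for PubChem's PUG REST interface, which is simpler to show in a few lines. The function name is hypothetical, and the sketch only constructs the URL; it does not send a request.

```python
from urllib.parse import quote

# Base address of PubChem's PUG REST interface.
PUG_REST_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def pubchem_name_lookup_url(name):
    """Build a PUG REST URL that asks for the compound IDs matching a name."""
    # Percent-encode everything outside the unreserved set, so brackets,
    # commas and spaces in systematic names are safe inside a URL path.
    return f"{PUG_REST_BASE}/compound/name/{quote(name, safe='')}/cids/JSON"


print(pubchem_name_lookup_url("glycine"))
print(pubchem_name_lookup_url("bicyclo[3.2.1]octane"))
```

In a pipeline, the name would come from a structure-to-name step and the URL would be fetched with an HTTP client of choice.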
Lexiparser

• Automated extraction of structures and names from documents
• Supported formats:
        - .txt
        - .html
        - .doc
        - .docx
        - .rtf
        - .pdf (FUTURE)
Extracting Structures from a Patent

[Diagram: Patent URL -> Names extracted -> Generate Structures]
Desktop Applications

• Electronic Laboratory Notebook

• NEW! Lexichem Workbench
Performance Metric
Why a New Performance Metric?

• Ensures consistent improvement of Lexichem

• Identifies areas in need of development

• Gold standard for all chemical nomenclature software
Round Tripping

  • Compare the initial and final structure after name generation

  • Easy to calculate

  %RTCS = (TP / TotalStructures) × 100

[Diagram: round trip between Canonical Isomeric SMILES and English IUPAC Name]

* E. O. Cannon, JCIM 2012, DOI: 10.1021/ci3000419
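The %RTCS formula can be sketched in a few lines. This is a minimal illustration, assuming that TP counts the structures whose canonical isomeric SMILES after the round trip is identical to the initial one, and that a failed conversion is recorded as `None`; the function name and the sample data are made up for the example.

```python
def rtcs(initial_smiles, final_smiles):
    """Percentage of structures whose SMILES survives the round trip unchanged.

    initial_smiles: canonical isomeric SMILES before name generation.
    final_smiles:   canonical isomeric SMILES after structure -> name -> structure,
                    with None where either conversion failed.
    """
    if len(initial_smiles) != len(final_smiles):
        raise ValueError("both lists must describe the same structures")
    if not initial_smiles:
        raise ValueError("at least one structure is required")
    # TP: the round-tripped structure matches the starting structure exactly.
    tp = sum(
        1
        for before, after in zip(initial_smiles, final_smiles)
        if after is not None and before == after
    )
    return 100.0 * tp / len(initial_smiles)


# Made-up data: two matches, one failed conversion, one changed structure.
before = ["CCO", "CC", "c1ccccc1", "CN"]
after = ["CCO", "CC", None, "CO"]
print(rtcs(before, after))  # 2 of 4 structures match
```

String comparison is only meaningful if both sides are canonicalized by the same toolkit with the same settings; otherwise equivalent structures can be counted as mismatches.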
Results
Performance

[Chart: %RTCS (axis 0 to 100) by dataset (Maybridge, MDDR, NCI, Wombat, PubChem) for V2.0.2 and V2.1.1. Bar labels as extracted: 98.71, 88.94, 89.05, 92.43, 89.88, 93.83, 84.54, 60.02, 48.69, 52.80]

[Chart: Speed in names s-1 and molecules s-1 (axis 0 to 12000) by dataset (Maybridge, MDDR, NCI, Wombat, PubChem) for V2.0.2 and V2.1.1]
New Features
Nam2mol – New Features

[Structure images: L-Arginine, D-Arginine]
Ring templates

• Octahydro-1H-4,7-epoxyisoindole
• Benzo[cd]indole
• 5,6,6a,7-tetrahydro-4H-dibenzo[de,g]quinoline
• Yohimban
Lexichem Workbench
Lexichem Workbench

Main Window

• Converts input SMILES string or chemical name
• Visual display of the structure
• Chemical information:
   - Molecular weight
   - SMILES
   - IUPAC name
Results

• Results history
• Original input on display
• Text options:
   - Copy selected cells
   - Save table
   - Display selection
• Display options:
   - Save
   - Copy
   - Print
Substructure Search




• Options:
  - Functional group from list
  - Custom SMILES/SMARTS pattern
  - Custom name
Conclusions

Jcup 3 (2012) Presentation: Lexichem, a new Era. By Ed Cannon

Editor's notes

  1. Start by talking about some real-world applications of Lexichem: what you can do with it and who is using it. Then move on to talk about Lexichem TK v2.1.1 … released. Metric to assess how well Lexichem is performing. New features since v2.0.2. Finish off with the GUI.
  2. Now let's talk about the main reason we're here, Lexichem.
  3. Lexichem is OpenEye's chemical nomenclature software. You can: -> convert names to molecules -> convert molecules to names -> translate names from one language to another. Lexichem comes in two flavours.
  4. Standalone applications run from the command line.
  5. For those who want to program and use Lexichem: Lexichem TK is written in C++ and wrapped with SWIG (Simplified Wrapper and Interface Generator) for Python, Java and C#.
  6. So what can Lexichem do, other than help you buy heroin?
  7. Keith Taylor showed yesterday a Pipeline Pilot workflow with a node for converting structures to names. Workflow integration with Pipeline Pilot: a node uses one of Lexichem's functions -> mol2nam, nam2mol, translate. Matt Stahl is working on nodes using OpenEye software. Lexichem node highlighted in the square -> converts structures to names.
  8. Craig Bruce, hired recently, has been developing webservices for OE, one being Lexichem. Use Lexichem for pre- / post-processing around a webservice. PUG (Power User Gateway): search across PubChem for structures with the names found.
  9. Chemical name extraction from patents/documents and structure generation (Lexiparser uses Lexichem) -> uses Lexichem, after it has extracted chemical names, to convert them to molecules.
  10. Lexichem is the engine under the hood: when a user draws a structure, a call to Lexichem generates a name, which can then be rendered. Alternatively, you can import chemical names, convert them to molecules and visualize the image.
  11. Purpose of the metric -> ensure we are improving Lexichem, not regressing -> identify areas / features in need of improvement -> a gold standard against which other companies can compare chemical nomenclature software.
  12-13. -> Concept: a start point and an end point after some processing; then compare the start point to the end point. +ve: quick to calculate, gives a single figure for how accurate Lexichem is on a dataset. -> Paper accepted. Advantages of SMILES: human readable, less verbose than InChI, tautomer support.
  14. The concept of a benchmark is good, but do we have good results using it?
  15. Whilst these results are good, is Lexichem feasible on a large scale? We have seen that Lexichem performs well and is feasible on large datasets; now let's look at what features have been added. ----- Meeting Notes (3/28/12 11:38) ----- Mol2Nam -> canonicalize atoms & bonds, identify atom types, identify ring systems and size, bridges, locants and positions, identify stereo / walk the graph.
  16. So what have we added since v2.0.2? Our main drive has been nam2mol features (as they're not quite as well supported as mol2nam), in particular the ability to generate molecules for large ring systems (one of Lexichem's weaker points in previous releases).
  17-24. Von Baeyer -> polyalicyclic ring systems -> previously only bicyclic supported. -> Working hard on augmenting natural products. Beta-carotene in carrots -> provitamin A carotenoid.
  25. Ring templates for conversion of molecules to names. Mainly bridged and fused ring templates have been added.
  26. Primary goal of the GUI was to: -> lower the bar to using Lexichem's functionality (for people not keen on programming against an API or using the command line).
  27. -> Primarily modeled on the command line tools, but provides numerous additional features. 2 menus: open a molecular file; import name, SMILES, set default options, clear the view.
  28. Filter the results.
  29. Story about Lexichem: you know you want it, so please feel free to contact us. Future work: continue to work on fused polycyclic ring systems and natural products.