SlideShare ist ein Scribd-Unternehmen logo
1 von 22
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
BERND RUPP
Chemical Information systems:
From compound collections to rationally
designed HTS library
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
The Library Design Team
Head: Ronald Kühne
Module Cheminformatic/Drug Design
M. Lisurek
 Design and enhancement of
Screening- Libraries
 Maintenance of Vendor-Data
- Library Design:
 Design of focussed Libraries
 Design of Virtual Screens
 Design of Docking Studies
 Development of Homology Models
- Modelling:
 Pharmacophore Models
 QSAR
Module Database Design and IT
M. Pawletta, R. AL-Yamori and B. Rupp
- Database design:
 Development and Maintenance of relational
Database of Screening results
 Development of commercial available
compound library
- Software development
- Hardware Management
 Tools for Processing Structural Data
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Screening Libraries @ FMP
[CBB 6: - 1.280 Cpds, LOPAC-library of
pharmacologically active annotated Cpds]
2005 CBB 1: - 16.544 Cpds from ChemDiv
2006/07 CBB 4: - 4.224 Cpds, ArtChem-library
CBB 3: - 4.576 Cpds, fragment library
2008/09 CBB 2: - 3.520 Cpds from Enamine, IBS,
ChemDiv, KeyOrganics, Maybridge
2011 CBB X: - 1.765 Cpds from IBS
2012 CBB Y: - 2.600 Cpds from Enamine
and UORSY
[CBB 5: - 1.780 Cpds synthesized by academic
groups, collected by E. Specker]
∑ ~35.000 Cpds
€ 130.000
€ ~10.000
€ ~30.000
€ ~80.000
€ ~45.000
€ ~45.000
€ 20.000
∑ € ~360.000
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Database of Available Chemical Substances (DACS)
Usage:
- Experimental:
design of HTS, Ligation,
Building Block und focused
Libraries
- Theoretical:
Virtual Screening, Docking
Testing new Methods
Advantages:
- Effective Data Control
- Individual Tagging and Search
functionality
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
DACS 2005
~20 vendors
~ 2 million Cpds
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Upload History
147 311
12 489 623
13 996 350
4 350 900
42 659 022
0
5 000 000
10 000 000
15 000 000
20 000 000
25 000 000
30 000 000
35 000 000
40 000 000
45 000 000
2006 2007 2008 2009 2010 2011 2012 2013
Import Year
Records
Structures
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Data Processing Routine
SD-File
SDB
UDB
FMP-Data
Management
Tool
SD-File
Webinterface
Searchtools
Vendor collections
SD-File registration
Unique Structure
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
SD File collection 2012
Vendor collection_count file_count mol_count Vendor collection_count file_count mol_count
enamine 38 89 21.238.992,00 labotest 4 5 122.867,00
ukrorgsynthesis 16 54 14.918.628,00 sigma_aldrich 3 4 118.392,00
chemdiv 50 134 6.826.433,00 apollo_scientific 6 16 111.361,00
aurora 9 31 5.494.875,00 zylexa 5 6 107.425,00
vitas_m_lab 7 41 4.255.322,00 matrix_scientific 1 2 91.556,00
otava 32 47 4.099.533,00 analyticon 5 15 84.037,00
chembridge 20 41 4.038.934,00 nano_syn 2 9 65.327,00
life_chemicals 76 95 2.875.379,00 toslab 9 15 47.944,00
Ibs 4 24 1.908.031,00 arkad 1 1 31.102,00
amri 9 18 1.258.137,00 fluorochem 1 1 29.304,00
asinex 14 20 1.253.381,00 emc 1 1 28.932,00
specs 16 19 1.140.632,00 asis_chem 19 19 24.485,00
princeton_bio 12 15 949.659,00 peakdale 2 2 15.462,00
timtec 11 21 535.435,00 spectrum_info 1 1 8.678,00
pharmeks 2 5 524.224,00 evoblocks 1 1 8.367,00
aronis 3 12 278.177,00 acb_blocks 3 3 6.561,00
chemical_block 2 6 258.925,00 microsource 7 7 5.120,00
nch_laboratories 3 3 230.831,00 sequoia 1 1 2.323,00
ko_bionet 25 57 192.096,00 selleckchem 7 7 1.828,00
arvi 2 58 180.728,00 combi_blocks 1 1 1.747,00
maybridge 13 15 171.881,00 menai_organics_limited 1 3 652,00
synthon_labs 3 3 129.544,00 Total: 43 448 928 73.673.247,00
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
DACS 2012
1
10
100
1 000
10 000
100 000
1 000 000
10 000 000
100 000 000
Records
Structures
Unique Structures
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
SDB-Routine
SD-File
SDB
UDB
FMP-Data
Management
Tool
SD-File
SDB
Webinterface
Searchtools
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
SD-File Registration
- Determination of key data (Orderno., saltdata, vendor,
file, catalog, import_date, user...)
- Testing of correct Syntax for Structure and Property
block
- Upload into SD-File registration Database
- Upload verification
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Catalog (SD-File) Registration
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Import Application Form
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Import controlling
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Import controlling
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
ER-Model of the SD-File Registration
ER-model := Entity relationship model
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
UDB Routine
SD-File
SDB
UDB
FMP-Data
Management
Tool
SDB
UDB
Webinterface
Searchtools
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Unique structure database
- Standardization of chemical Structures
- Normalization of the Data (removal of
redundancy)
- Storage in a Data Warehouse system
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
ER-Model of the UDB
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Summary
SDF-Registration 2007-2011: 99.2GB
SDF-Registration 2012 : 116.7GB
UDB 2012 : 142.9GB
Molecule Records: 73.64 Mill
Structure Records: 37.89 Mill
Property Records: 65.72 Mill
SaltData Records: 2386
Smallfragment Records: 4838
Lost-molecule Records: 6360
Duplicate Records: 2070
Empty Records: 3582
Error Records: 708
Java 7 (jdk 1.7.10)
Chemaxon 5.12
MySQL 5.5.
12 Missing files (42179 Records):
error in count line of SD-File:
Is fixed with jchem later than 5.12.3
3 2 0 0 0 0 0 0 0
3 2 0 0 0 0 0 0 0 0999 V2000
Wrong count line:
correct count line:
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Outlook
 Web interface and start of the Webserver
 Implementation of FMP Reactivity
and Tox. Tag
 Implementation of FMP Solubility
Prediction and other in-house models
 Development of a GUI for UDB Routine
 Connection to other Public recourses like
PubChem and ChEMBL
 Development of virtual Screening and
Docking interfaces
 Update and Bug fix for SDF-Registration
and UDB-Routine
Further Targets:
Coming Soon:
LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE
Acknowledgment
Drug Design / Molecular Modeling
• Ronald Kühne, Dr.
• Daniela Müller, Dr.
• Kirill Piotukh, Dr.
• Martyna Pawletta , master student
• Raed Al-Yamori, tech. Informatics
• Robert Opitz, Dr. student
• Matthias Barone, Dr. student
• Matthias Müller, Dr. student
• Michael Lisurek, Dr.
• Frank Eisenmenger, Dr.
Screnning Unit/ Compound
Managment
• Jens-Peter von Kries, Dr.
• Martin Neuenschwander, Dr.
• Edgar Specker, Dr.
Former member:
• Jörg Wichard, Dr. (Bayer AG)
• Lara Kuhnke , master student
Funding:
• Helmholtz Drug Research
initiative; Ronald Frank, Dr.
Partner:
• Prof. Gerhard Wolber (FU Berlin)
• Chemaxon

Weitere ähnliche Inhalte

Andere mochten auch

EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
ChemAxon
 
The Braun Foundation for International Exchange
The Braun Foundation for International ExchangeThe Braun Foundation for International Exchange
The Braun Foundation for International Exchange
Martin Moessmer
 

Andere mochten auch (19)

EUGM 2013 - David Deng, Daniel Bonniot (ChemAxon) - What’s New with Naming
EUGM 2013 - David Deng, Daniel Bonniot (ChemAxon) - What’s New with NamingEUGM 2013 - David Deng, Daniel Bonniot (ChemAxon) - What’s New with Naming
EUGM 2013 - David Deng, Daniel Bonniot (ChemAxon) - What’s New with Naming
 
EUGM 2013 - Roland Knispel (ChemAxon) - Biologics at ChemAxon From Old Powerh...
EUGM 2013 - Roland Knispel (ChemAxon) - Biologics at ChemAxon From Old Powerh...EUGM 2013 - Roland Knispel (ChemAxon) - Biologics at ChemAxon From Old Powerh...
EUGM 2013 - Roland Knispel (ChemAxon) - Biologics at ChemAxon From Old Powerh...
 
EUGM 2013 - Odon Farkas (Eotvos University) - Conformation search via cool dy...
EUGM 2013 - Odon Farkas (Eotvos University) - Conformation search via cool dy...EUGM 2013 - Odon Farkas (Eotvos University) - Conformation search via cool dy...
EUGM 2013 - Odon Farkas (Eotvos University) - Conformation search via cool dy...
 
EUGM 2013 - Andras Stracz (ChemAxon) - ChemAxon Plexus: A desktop application...
EUGM 2013 - Andras Stracz (ChemAxon) - ChemAxon Plexus: A desktop application...EUGM 2013 - Andras Stracz (ChemAxon) - ChemAxon Plexus: A desktop application...
EUGM 2013 - Andras Stracz (ChemAxon) - ChemAxon Plexus: A desktop application...
 
EUGM 2013 - Miklos Szabo (ChemAxon) - Recent Successful Discovery Strategies ...
EUGM 2013 - Miklos Szabo (ChemAxon) - Recent Successful Discovery Strategies ...EUGM 2013 - Miklos Szabo (ChemAxon) - Recent Successful Discovery Strategies ...
EUGM 2013 - Miklos Szabo (ChemAxon) - Recent Successful Discovery Strategies ...
 
EUGM 2013 - Attila Szabo (ChemAxon) - Collaborate and search in SharePoint wi...
EUGM 2013 - Attila Szabo (ChemAxon) - Collaborate and search in SharePoint wi...EUGM 2013 - Attila Szabo (ChemAxon) - Collaborate and search in SharePoint wi...
EUGM 2013 - Attila Szabo (ChemAxon) - Collaborate and search in SharePoint wi...
 
EUGM 2013 - Dragos Horváth (Labooratoire de Chemoinformatique Univ Strasbourg...
EUGM 2013 - Dragos Horváth (Labooratoire de Chemoinformatique Univ Strasbourg...EUGM 2013 - Dragos Horváth (Labooratoire de Chemoinformatique Univ Strasbourg...
EUGM 2013 - Dragos Horváth (Labooratoire de Chemoinformatique Univ Strasbourg...
 
EUGM 2013 - Peter Englert, Peter Kovacs (ChemAxon) - The Next Generation of M...
EUGM 2013 - Peter Englert, Peter Kovacs (ChemAxon) - The Next Generation of M...EUGM 2013 - Peter Englert, Peter Kovacs (ChemAxon) - The Next Generation of M...
EUGM 2013 - Peter Englert, Peter Kovacs (ChemAxon) - The Next Generation of M...
 
EUGM 2013 - Steve Hajkowski (Thomson Reuters): Patent analytics - what can Ma...
EUGM 2013 - Steve Hajkowski (Thomson Reuters): Patent analytics - what can Ma...EUGM 2013 - Steve Hajkowski (Thomson Reuters): Patent analytics - what can Ma...
EUGM 2013 - Steve Hajkowski (Thomson Reuters): Patent analytics - what can Ma...
 
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
 
EUGM 2013 - Björn Windshügel (European ScreeningPort): Chemoinformatic tools ...
EUGM 2013 - Björn Windshügel (European ScreeningPort): Chemoinformatic tools ...EUGM 2013 - Björn Windshügel (European ScreeningPort): Chemoinformatic tools ...
EUGM 2013 - Björn Windshügel (European ScreeningPort): Chemoinformatic tools ...
 
EUGM 2013 - Gyorgy Pirok (ChemAxon) - Prediction of Xenobiotic Metabolism
EUGM 2013 - Gyorgy Pirok (ChemAxon) - Prediction of Xenobiotic MetabolismEUGM 2013 - Gyorgy Pirok (ChemAxon) - Prediction of Xenobiotic Metabolism
EUGM 2013 - Gyorgy Pirok (ChemAxon) - Prediction of Xenobiotic Metabolism
 
EUGM 2013 - Ian Berry, Bob Marmon (Evotec): Classification and analysis of 21...
EUGM 2013 - Ian Berry, Bob Marmon (Evotec): Classification and analysis of 21...EUGM 2013 - Ian Berry, Bob Marmon (Evotec): Classification and analysis of 21...
EUGM 2013 - Ian Berry, Bob Marmon (Evotec): Classification and analysis of 21...
 
EUGM 2013 - Jon Patterson (ChemAxon) ChemAxon Platform for Scientists
EUGM 2013 - Jon Patterson (ChemAxon) ChemAxon Platform for Scientists EUGM 2013 - Jon Patterson (ChemAxon) ChemAxon Platform for Scientists
EUGM 2013 - Jon Patterson (ChemAxon) ChemAxon Platform for Scientists
 
EUGM 2013 - Anna Tomin (ChemAxon) - Reaction Library Design
EUGM 2013 - Anna Tomin (ChemAxon) - Reaction Library DesignEUGM 2013 - Anna Tomin (ChemAxon) - Reaction Library Design
EUGM 2013 - Anna Tomin (ChemAxon) - Reaction Library Design
 
EUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemists
EUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemistsEUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemists
EUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemists
 
Merchandise presentation
Merchandise presentationMerchandise presentation
Merchandise presentation
 
024 the force of gravity and orbital movements
024 the force of gravity and orbital movements024 the force of gravity and orbital movements
024 the force of gravity and orbital movements
 
The Braun Foundation for International Exchange
The Braun Foundation for International ExchangeThe Braun Foundation for International Exchange
The Braun Foundation for International Exchange
 

Ähnlich wie EUGM 2013 - Bernd Rupp (FMP) Chemical Information systems: From compound collections to rationally designed HTS library

FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
Carole Goble
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
Denis Shestakov
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
OSTHUS
 
ICIC 2014 From SureChem to SureChEMBL
ICIC 2014 From SureChem to SureChEMBLICIC 2014 From SureChem to SureChEMBL
ICIC 2014 From SureChem to SureChEMBL
Dr. Haxel Consult
 

Ähnlich wie EUGM 2013 - Bernd Rupp (FMP) Chemical Information systems: From compound collections to rationally designed HTS library (20)

Scientific Data Management
Scientific Data ManagementScientific Data Management
Scientific Data Management
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
 
Data-sharing in Metabolomics beyond your supplemental PDF
Data-sharing in Metabolomics beyond your supplemental PDFData-sharing in Metabolomics beyond your supplemental PDF
Data-sharing in Metabolomics beyond your supplemental PDF
 
MongoDB in the Healthcare Enterprise
MongoDB in the Healthcare EnterpriseMongoDB in the Healthcare Enterprise
MongoDB in the Healthcare Enterprise
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
 
EUGM 2014 - Marco Brazzarola (Aptuit): Aptuit Compound Registration and Inte...
EUGM 2014 -  Marco Brazzarola (Aptuit): Aptuit Compound Registration and Inte...EUGM 2014 -  Marco Brazzarola (Aptuit): Aptuit Compound Registration and Inte...
EUGM 2014 - Marco Brazzarola (Aptuit): Aptuit Compound Registration and Inte...
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 
GPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningGPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
 
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge SystemBio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
 
Jack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy Ontology
Jack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy OntologyJack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy Ontology
Jack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy Ontology
 
Benchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office DatasetBenchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office Dataset
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
ICIC 2014 From SureChem to SureChEMBL
ICIC 2014 From SureChem to SureChEMBLICIC 2014 From SureChem to SureChEMBL
ICIC 2014 From SureChem to SureChEMBL
 
ChemSpider – An Online Database and Registration System Linking the Web
ChemSpider – An Online Database and  Registration System Linking the WebChemSpider – An Online Database and  Registration System Linking the Web
ChemSpider – An Online Database and Registration System Linking the Web
 

Mehr von ChemAxon

Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
ChemAxon
 

Mehr von ChemAxon (20)

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 

EUGM 2013 - Bernd Rupp (FMP) Chemical Information systems: From compound collections to rationally designed HTS library

  • 1. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE BERND RUPP Chemical Information systems: From compound collections to rationally designed HTS library
  • 2. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE The Library Design Team Head: Ronald Kühne Module Cheminformatic/Drug Design M. Lisurek  Design and enhancement of Screening- Libraries  Maintenance of Vendor-Data - Library Design:  Design of focussed Libraries  Design of Virtual Screens  Design of Docking Studies  Development of Homology Models - Modelling:  Pharmacophore Models  QSAR Module Database Design and IT M. Pawletta, R. AL-Yamori and B. Rupp - Database design:  Development and Maintenance of relational Database of Screening results  Development of commercial available compound library - Software development - Hardware Management  Tools for Processing Structural Data
  • 3. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Screening Libraries @ FMP [CBB 6: - 1.280 Cpds, LOPAC-library of pharmacologically active annotated Cpds] 2005 CBB 1: - 16.544 Cpds from ChemDiv 2006/07 CBB 4: - 4.224 Cpds, ArtChem-library CBB 3: - 4.576 Cpds, fragment library 2008/09 CBB 2: - 3.520 Cpds from Enamine, IBS, ChemDiv, KeyOrganics, Maybridge 2011 CBB X: - 1.765 Cpds from IBS 2012 CBB Y: - 2.600 Cpds from Enamine and UORSY [CBB 5: - 1.780 Cpds synthesized by academic groups, collected by E. Specker] ∑ ~35.000 Cpds € 130.000 € ~10.000 € ~30.000 € ~80.000 € ~45.000 € ~45.000 € 20.000 ∑ € ~360.000
  • 4. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Database of Available Chemical Substances (DACS) Usage: - Experimental: design of HTS, Ligation, Building Block und focused Libraries - Theoretical: Virtual Screening, Docking Testing new Methods Advantages: - Effective Data Control - Individual Tagging and Search functionality
  • 5. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE DACS 2005 ~20 vendors ~ 2 million Cpds
  • 6. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Upload History 147 311 12 489 623 13 996 350 4 350 900 42 659 022 0 5 000 000 10 000 000 15 000 000 20 000 000 25 000 000 30 000 000 35 000 000 40 000 000 45 000 000 2006 2007 2008 2009 2010 2011 2012 2013 Import Year Records Structures
  • 7. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Data Processing Routine SD-File SDB UDB FMP-Data Management Tool SD-File Webinterface Searchtools Vendor collections SD-File registration Unique Structure
  • 8. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE SD File collection 2012 Vendor collection_count file_count mol_count Vendor collection_count file_count mol_count enamine 38 89 21.238.992,00 labotest 4 5 122.867,00 ukrorgsynthesis 16 54 14.918.628,00 sigma_aldrich 3 4 118.392,00 chemdiv 50 134 6.826.433,00 apollo_scientific 6 16 111.361,00 aurora 9 31 5.494.875,00 zylexa 5 6 107.425,00 vitas_m_lab 7 41 4.255.322,00 matrix_scientific 1 2 91.556,00 otava 32 47 4.099.533,00 analyticon 5 15 84.037,00 chembridge 20 41 4.038.934,00 nano_syn 2 9 65.327,00 life_chemicals 76 95 2.875.379,00 toslab 9 15 47.944,00 Ibs 4 24 1.908.031,00 arkad 1 1 31.102,00 amri 9 18 1.258.137,00 fluorochem 1 1 29.304,00 asinex 14 20 1.253.381,00 emc 1 1 28.932,00 specs 16 19 1.140.632,00 asis_chem 19 19 24.485,00 princeton_bio 12 15 949.659,00 peakdale 2 2 15.462,00 timtec 11 21 535.435,00 spectrum_info 1 1 8.678,00 pharmeks 2 5 524.224,00 evoblocks 1 1 8.367,00 aronis 3 12 278.177,00 acb_blocks 3 3 6.561,00 chemical_block 2 6 258.925,00 microsource 7 7 5.120,00 nch_laboratories 3 3 230.831,00 sequoia 1 1 2.323,00 ko_bionet 25 57 192.096,00 selleckchem 7 7 1.828,00 arvi 2 58 180.728,00 combi_blocks 1 1 1.747,00 maybridge 13 15 171.881,00 menai_organics_limited 1 3 652,00 synthon_labs 3 3 129.544,00 Total: 43 448 928 73.673.247,00
  • 9. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE DACS 2012 1 10 100 1 000 10 000 100 000 1 000 000 10 000 000 100 000 000 Records Structures Unique Structures
  • 10. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE SDB-Routine SD-File SDB UDB FMP-Data Management Tool SD-File SDB Webinterface Searchtools
  • 11. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE SD-File Registration - Determination of key data (Orderno., saltdata, vendor, file, catalog, import_date, user...) - Testing of correct Syntax for Structure and Property block - Upload into SD-File registration Database - Upload verification
  • 12. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Catalog (SD-File) Registration
  • 13. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Import Application Form
  • 14. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Import controlling
  • 15. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Import controlling
  • 16. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE ER-Model of the SD-File Registration ER-model := Entity relationship model
  • 17. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE UDB Routine SD-File SDB UDB FMP-Data Management Tool SDB UDB Webinterface Searchtools
  • 18. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Unique structure database - Standardization of chemical Structures - Normalization of the Data (removal of redundancy) - Storage in a Data Warehouse system
  • 19. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE ER-Model of the UDB
  • 20. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Summary SDF-Registration 2007-2011: 99.2GB SDF-Registration 2012 : 116.7GB UDB 2012 : 142.9GB Molecule Records: 73.64 Mill Structure Records: 37.89 Mill Property Records: 65.72 Mill SaltData Records: 2386 Smallfragment Records: 4838 Lost-molecule Records: 6360 Duplicate Records: 2070 Empty Records: 3582 Error Records: 708 Java 7 (jdk 1.7.10) Chemaxon 5.12 MySQL 5.5. 12 Missing files (42179 Records): error in count line of SD-File: Is fixed with jchem later than 5.12.3 3 2 0 0 0 0 0 0 0 3 2 0 0 0 0 0 0 0 0999 V2000 Wrong count line: correct count line:
  • 21. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Outlook  Web interface and start of the Webserver  Implementation of FMP Reactivity and Tox. Tag  Implementation of FMP Solubility Prediction and other in-house models  Development of a GUI for UDB Routine  Connection to other Public recourses like PubChem and ChEMBL  Development of virtual Screening and Docking interfaces  Update and Bug fix for SDF-Registration and UDB-Routine Further Targets: Coming Soon:
  • 22. LEIBNIZ-INSTITUT FÜR MOLEKULARE PHARMAKOLOGIE Acknowledgment Drug Design / Molecular Modeling • Ronald Kühne, Dr. • Daniela Müller, Dr. • Kirill Piotukh, Dr. • Martyna Pawletta , master student • Raed Al-Yamori, tech. Informatics • Robert Opitz, Dr. student • Matthias Barone, Dr. student • Matthias Müller, Dr. student • Michael Lisurek, Dr. • Frank Eisenmenger, Dr. Screnning Unit/ Compound Managment • Jens-Peter von Kries, Dr. • Martin Neuenschwander, Dr. • Edgar Specker, Dr. Former member: • Jörg Wichard, Dr. (Bayer AG) • Lara Kuhnke , master student Funding: • Helmholtz Drug Research initiative; Ronald Frank, Dr. Partner: • Prof. Gerhard Wolber (FU Berlin) • Chemaxon