SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
Derwent Markush Resource
(DWPIM) on STN
Andreas Barth, Steve Hajkowski
• Introduction
• Derwent and STN Generic Node Concepts
• Application of Match Levels
• Derwent and STN Bonding Conventions
• Some Examples
• STN Unified Markush Solution
• Derwent Markush Resource – Data Content
2
Agenda
From Specific Structures to Markush Structures
DWPIM on STN – ICIC, Nizza, 2015 3
4
A Typical “Real” Markush
R3
DWPIM on STN – ICIC, Nizza, 2015
 1.9 M Markush structures referencing 768 K DWPI documents
 Records in DWPIM are structure based
– Markush Compound Number (YYWW-AAANN) links to DWPI
– Single Markush structure including all variations
 Nodes in the structure may be a specific atom, a shortcut, a
superatom representing generically-described groups in the
structure or a variable group
 Only generic structures are indexed in DWPIM
– Specific structures can be simultaneously searched for in DCR
– In some rare cases specific structures are contained only in
DWPIM
 Substance Classes: organics, organometallics, inorganics,
polymers, fullerenes, peptides
 Indexing from 33 patent issuing authorities worldwide
DWPIM on STN – ICIC, Nizza, 2015 5
Content of the Derwent Markush Resource
Chemistry-related Databases of Thomson Reuters
DWPIM on STN – ICIC, Nizza, 2015 6
M3 *01* A111 A940 B115
B702 B713
Derwent World Patent
Index (DWPI)
AN 2012-N47214 [201312] DWPI
TI Preparation of active material …
28.8 m Inventions
Derwent Chemistry
Resource (DCR)
0334-20904
2.5 m Chemical Structures
Derwent Markush
Resource (DWPIM)
1154-11901-K
1.9 m Markush Structures
 Integrate different concepts of STN and Derwent
– Different set of homology nodes
– Different bonding conventions
 Retain indexing concept of Derwent within the STN
environment
 Develop a new state of the art Markush search engine
– Focus on full recall
– Provide full set of search functionalities for high precision
 Improve current possibilities for the evaluation of Markush
search results
 Provide high performance search environment
DWPIM on STN – ICIC, Nizza, 2015 7
Challenges
DWPIM Concept is based on 22 Superatoms
DWPIM on STN – ICIC, Nizza, 2015 8
Acyclic / Cyclic
(ATOM, CLASS, ANY)
CHK (Alkyl, Alkylene)
CHE (Alkenyl, Alkenylene)
CHY (Alkynyl, Alkynylene)
ARY (Aryl)
CYC (Cycloaliphatic)
HEA (Monocyclic heteroaryl)
HET (Monocyclic nonaromatic)
HEF (Fused heterocyclic)
DWPIM Concept is based on 22 Superatoms
DWPIM on STN – ICIC, Nizza, 2015 9
Acyclic / Cyclic
(ATOM, CLASS, ANY)
Elements
(ATOM, CLASS, ANY)
Others
(only match on themselves)
CHK (Alkyl, Alkylene) MX (Any metal) ACY (Acyl)
CHE (Alkenyl, Alkenylene) A35 (Group III A - V A metal) DYE (Chromophore)
CHY (Alkynyl, Alkynylene) ACT (Actinide) PEG (Polymer end group)
ARY (Aryl) AMX (Alkali/alkaline earth
metal)
POL (Polymer)
CYC (Cycloaliphatic) LAN (Lanthanide) PRT (Protecting group)
HEA (Monocyclic heteroaryl) TRM (Transition metal) UNK (Any atom or group including H)
HET (Monocyclic nonaromatic) HAL (Halogen) XX (Any atom or group excluding H)
HEF (Fused heterocyclic)
Derwent Indexing Hierarchy of Superatoms
DWPIM on STN – ICIC, Nizza, 2015 10
XX
ARY CYC HEF HEA HET CHK CHE CHY MX HAL
A35 ACT AMX LAN TRM
STN Query Hierarchy of Nodes
DWPIM on STN – ICIC, Nizza, 2015 11
R
Cb Ak
Cy
XMHy
DWPIM on STN – ICIC, Nizza, 2015 12
Integrated Query Hierarchy of Generic Nodes
R
Cb
AkCy
specific:
benzene
specific:
cyclohexane
specific:
quinoline
specific:
pyridine
specific:
piperidine
specific:
CH3 …
specific:
CH2 = CH3
specific:
ethyne
ARY CYC HEF HEA HET CHK CHE CHY
Hy
 Match levels control the degree of structure query matching
between the query structure and the structure in the search
file.
 Match levels are assigned to each atom and generic
group/superatom.
 All nodes of a ring system have the same match level
– Note: Special cases require different match levels for the
atoms of a ring system
 Default Match Levels on STN
– ATOM for ring nodes
– CLASS for chain nodes
DWPIM on STN – ICIC, Nizza, 2015 13
Match Levels Control the Search Process
R
Cb
AkCy
specific:
benzene
specific:
cyclohexane
specific:
quinoline
specific:
pyridine
specific:
piperidine
specific:
CH3 …
specific:
CH2 = CH3
specific:
ethyne
ARY CYC HEF HEA HET CHK CHE CHY
Hy
Effect of Match Levels on the Node Hierarchy
DWPIM on STN – ICIC, Nizza, 2015 14
CLASS
ATOM
ANY
Search: pyridine
MLE: ATOM
DWPIM on STN – ICIC, Nizza, 2015 15
Effect of Match Level in DWPIM: ATOM
R
Cb
AkCy
specific:
benzene
specific:
cyclohexane
specific:
quinoline
specific:
pyridine
specific:
piperidine
specific:
CH3 …
specific:
CH2 = CH3
specific:
ethyne
ARY CYC HEF HEA HET CHK CHE CHY
Hy
DWPIM on STN – ICIC, Nizza, 2015 16
Effect of Match Level in DWPIM: CLASS
R
Cb
AkCy
specific:
benzene
specific:
cyclohexane
specific:
quinoline
specific:
pyridine
specific:
piperidine
specific:
CH3 …
specific:
CH2 = CH3
specific:
ethyne
ARY CYC HEF HEA HET CHK CHE CHY
Hy
Search: pyridine
MLE: CLASS
DWPIM on STN – ICIC, Nizza, 2015 17
Effect of Match Level in DWPIM: ANY
Search: pyridine
MLE: ANY
R
Cb
AkCy
specific:
benzene
specific:
cyclohexane
specific:
quinoline
specific:
pyridine
specific:
piperidine
specific:
CH3 …
specific:
CH2 = CH3
specific:
ethyne
ARY CYC HEF HEA HET CHK CHE CHY
Hy
Match Level ATOM Results in Specific Nodes
DWPIM on STN – ICIC, Nizza, 2015 18
MCN 9912-JKW07Query (MLE ATOM)
Match Level CLASS Results in Generic Nodes
Answer #5: 9917-IVD01Query (MLE CLASS)
DWPIM on STN – ICIC, Nizza, 2015 19
 STN-Convention: Bond values in the structure editor
• Note that “exact” from structure editor is
translated to single exact or double exact
 Indexed bonds: Bond value in the indexed
structures, e.g.
• single exact can only match to a single bond
• single/normalized bond can either match to a
single or to a normalize bond in the indexed
structure.
DWPIM on STN – ICIC, Nizza, 2015 20
Different Types of Bond Conventions
Why are bond values important ?
DWPIM on STN – ICIC, Nizza, 2015 21
Query structure
Exact (= single exact)
normalized
exact/
normalized
 Rings: Normalized bonds are used in ring systems with an even
number of atoms containing alternate double and single bonds.
• Benzene, Pyridine, Naphtalene
• Exceptions: cyclopentadienyl anion, cycloheptatrienly cation
 Tautomers:
DWPIM on STN – ICIC, Nizza, 2015 22
Bond Normalization
Where Z can be: B, C, Si, N, P, As, S, Se, Te, F, Cl, Br, I
X and Y can be: O, S, Se, Te, N
DWPIM on STN – ICIC, Nizza, 2015 23
Query:
Search Example 1
DWPIM on STN – ICIC, Nizza, 2015 24
Search Example 2
Query:
 Consistent search of MARPAT
from CAS and DWPIM from
Thomson Reuters on a single
platform
 Offers same Markush attributes
as classic STN
 Retrieves results as Markush
structures, not references
 Highly efficient retrieval and
evaluation of Markush results
DWPIM hit structures:
Both the Assembled view and
Full view will be available.
With DWPIM new STN will deliver the first-ever,
Unified Markush Search Solution
 Full integration in Thomson Reuters and CAS content
 Simultaneous search for generic and specific chemical structures
DWPISM
Derwent World
Patents Index®
DCR Derwent
Chemistry
Resource
~2.5 Mio
DWPIM
Derwent Markush
Resource
~1.9 Mio
CAplusSM
Chemical
Abstracts
MARPAT®
CAS Markush
Database
~ 1.1 Mio
CAS
RegistrySM
~ 102 Mio
Structure
Search
REAXYSFILE
~ 25 Mio
DWPIM is integrated with DWPI for efficient searching together
with CAS Databases
• Introduction
• Derwent and STN Generic Node Concepts
• Application of Match Levels
• Derwent and STN Bonding Conventions
• Some Examples
• STN Unified Markush Solution
• Derwent Markush Resource – Data Content
27
Agenda
DERWENT MARKUSH RESOURCE ON STN
28
DWPIM on STN – ICIC, Nizza, 2015
Contains full DWPI and INPI Markush data backfile
 1.9 million Markush structures
 33 patent issuing authorities covered
 Indexing for >777,000 patent families in DWPI
 US, EP and WO coverage from 1978 onwards
 DWPI data from 1987 to date
 INPI sourced data from 1961 to 1998
 Full coverage of organics, organometallics, inorganic salts
and metal oxides
 Partial coverage of alloys, intermetallics and polymers
DERWENT MARKUSH RESOURCE
COUNTRY COVERAGE
29DWPIM on STN – ICIC, Nizza, 2015
DERWENT MARKUSH RESOURCE ON STN
CONTENT
• Integration of INPI content
– 213,000 structure files
– New DWPI format compound numbers created and added
into 120,000 DWPI records
• Format 82nn-nnnnn and 83nn-nnnnn
• INPI indexing was added to all relevant DWPI records
– including those already containing DWPI sourced Markush
indexing for the period 1987 – 1998 (ie double indexing)
• Re-indexed DWPI content
– 45,0000 structures were re-indexed, with new compound
numbers added into DWPI
• Format 84nn-nnnnn and 85nn-nnnnn
30

Weitere ähnliche Inhalte

Ähnlich wie DWPI Markush Database on STN – A New Perspective for Searching Markush Structures

Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Agepetermurrayrust
 
NACK_U3__Maeder_Quantum_Dots.pptx
NACK_U3__Maeder_Quantum_Dots.pptxNACK_U3__Maeder_Quantum_Dots.pptx
NACK_U3__Maeder_Quantum_Dots.pptxSoumyadeepPradhan7
 
Introduction to nanoparticles and bionanomaterials
Introduction to nanoparticles and bionanomaterialsIntroduction to nanoparticles and bionanomaterials
Introduction to nanoparticles and bionanomaterialsShreyaBhatt23
 
Standardising Lab tests and data using SNOMED CT
Standardising Lab tests and data using SNOMED CTStandardising Lab tests and data using SNOMED CT
Standardising Lab tests and data using SNOMED CTJay Kola
 
HPC Applications of Materials Modeling
HPC Applications of Materials ModelingHPC Applications of Materials Modeling
HPC Applications of Materials ModelingGeorge Fitzgerald
 
Computational Chemistry: From Theory to Practice
Computational Chemistry: From Theory to PracticeComputational Chemistry: From Theory to Practice
Computational Chemistry: From Theory to PracticeDavid Thompson
 
Chemistry Resource FS1:15
Chemistry Resource FS1:15Chemistry Resource FS1:15
Chemistry Resource FS1:15Krystal Huffer
 
Tutorial: Formal Methods for Hardware Verification - Overview and Application...
Tutorial: Formal Methods for Hardware Verification - Overview and Application...Tutorial: Formal Methods for Hardware Verification - Overview and Application...
Tutorial: Formal Methods for Hardware Verification - Overview and Application...Peter Breuer
 
03 plasma-surface-interaction-snyders-u mons
03 plasma-surface-interaction-snyders-u mons03 plasma-surface-interaction-snyders-u mons
03 plasma-surface-interaction-snyders-u monsSirris
 
The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...Kamel Mansouri
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Realityrobertstevens65
 
Cuantificacion de Amorfo por Difraccion de Rayos X
Cuantificacion de Amorfo por Difraccion de Rayos XCuantificacion de Amorfo por Difraccion de Rayos X
Cuantificacion de Amorfo por Difraccion de Rayos XFabioJurado2
 
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSEVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSAksw Group
 
University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012Brock University
 

Ähnlich wie DWPI Markush Database on STN – A New Perspective for Searching Markush Structures (20)

Omdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital AgeOmdi2021 Ontologies for (Materials) Science in the Digital Age
Omdi2021 Ontologies for (Materials) Science in the Digital Age
 
NACK_U3__Maeder_Quantum_Dots.pptx
NACK_U3__Maeder_Quantum_Dots.pptxNACK_U3__Maeder_Quantum_Dots.pptx
NACK_U3__Maeder_Quantum_Dots.pptx
 
Introduction to nanoparticles and bionanomaterials
Introduction to nanoparticles and bionanomaterialsIntroduction to nanoparticles and bionanomaterials
Introduction to nanoparticles and bionanomaterials
 
Bioinformatica t3-scoring matrices
Bioinformatica t3-scoring matricesBioinformatica t3-scoring matrices
Bioinformatica t3-scoring matrices
 
Standardising Lab tests and data using SNOMED CT
Standardising Lab tests and data using SNOMED CTStandardising Lab tests and data using SNOMED CT
Standardising Lab tests and data using SNOMED CT
 
HPC Applications of Materials Modeling
HPC Applications of Materials ModelingHPC Applications of Materials Modeling
HPC Applications of Materials Modeling
 
Computational Chemistry: From Theory to Practice
Computational Chemistry: From Theory to PracticeComputational Chemistry: From Theory to Practice
Computational Chemistry: From Theory to Practice
 
CFD_Lecture_1.pdf
CFD_Lecture_1.pdfCFD_Lecture_1.pdf
CFD_Lecture_1.pdf
 
Chemistry Resource FS1:15
Chemistry Resource FS1:15Chemistry Resource FS1:15
Chemistry Resource FS1:15
 
WP1.3 – Transient fluid dynamics of CO2 mixtures in pipelines - Alexandre Mor...
WP1.3 – Transient fluid dynamics of CO2 mixtures in pipelines - Alexandre Mor...WP1.3 – Transient fluid dynamics of CO2 mixtures in pipelines - Alexandre Mor...
WP1.3 – Transient fluid dynamics of CO2 mixtures in pipelines - Alexandre Mor...
 
Tutorial: Formal Methods for Hardware Verification - Overview and Application...
Tutorial: Formal Methods for Hardware Verification - Overview and Application...Tutorial: Formal Methods for Hardware Verification - Overview and Application...
Tutorial: Formal Methods for Hardware Verification - Overview and Application...
 
03 plasma-surface-interaction-snyders-u mons
03 plasma-surface-interaction-snyders-u mons03 plasma-surface-interaction-snyders-u mons
03 plasma-surface-interaction-snyders-u mons
 
The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Reality
 
qHNMR for purity determination
qHNMR for purity determinationqHNMR for purity determination
qHNMR for purity determination
 
Cuantificacion de Amorfo por Difraccion de Rayos X
Cuantificacion de Amorfo por Difraccion de Rayos XCuantificacion de Amorfo por Difraccion de Rayos X
Cuantificacion de Amorfo por Difraccion de Rayos X
 
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSEVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
 
Polymerase chain reaction
Polymerase chain reactionPolymerase chain reaction
Polymerase chain reaction
 
01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf
 
University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012
 

Mehr von Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

Mehr von Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Kürzlich hochgeladen

GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebJames Anderson
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts servicevipmodelshub1
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girladitipandeya
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirtrahman018755
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$kojalkojal131
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsstephieert
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGAPNIC
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607dollysharma2066
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts servicesonalikaur4
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With RoomVIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Roomishabajaj13
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)Damian Radcliffe
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...Diya Sharma
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 

Kürzlich hochgeladen (20)

GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girls
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girls
 
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With RoomVIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
 

DWPI Markush Database on STN – A New Perspective for Searching Markush Structures

  • 1. Derwent Markush Resource (DWPIM) on STN Andreas Barth, Steve Hajkowski
  • 2. • Introduction • Derwent and STN Generic Node Concepts • Application of Match Levels • Derwent and STN Bonding Conventions • Some Examples • STN Unified Markush Solution • Derwent Markush Resource – Data Content 2 Agenda
  • 3. From Specific Structures to Markush Structures DWPIM on STN – ICIC, Nizza, 2015 3
  • 4. 4 A Typical “Real” Markush R3 DWPIM on STN – ICIC, Nizza, 2015
  • 5.  1.9 M Markush structures referencing 768 K DWPI documents  Records in DWPIM are structure based – Markush Compound Number (YYWW-AAANN) links to DWPI – Single Markush structure including all variations  Nodes in the structure may be a specific atom, a shortcut, a superatom representing generically-described groups in the structure or a variable group  Only generic structures are indexed in DWPIM – Specific structures can be simultaneously searched for in DCR – In some rare cases specific structures are contained only in DWPIM  Substance Classes: organics, organometallics, inorganics, polymers, fullerenes, peptides  Indexing from 33 patent issuing authorities worldwide DWPIM on STN – ICIC, Nizza, 2015 5 Content of the Derwent Markush Resource
  • 6. Chemistry-related Databases of Thomson Reuters DWPIM on STN – ICIC, Nizza, 2015 6 M3 *01* A111 A940 B115 B702 B713 Derwent World Patent Index (DWPI) AN 2012-N47214 [201312] DWPI TI Preparation of active material … 28.8 m Inventions Derwent Chemistry Resource (DCR) 0334-20904 2.5 m Chemical Structures Derwent Markush Resource (DWPIM) 1154-11901-K 1.9 m Markush Structures
  • 7.  Integrate different concepts of STN and Derwent – Different set of homology nodes – Different bonding conventions  Retain indexing concept of Derwent within the STN environment  Develop a new state of the art Markush search engine – Focus on full recall – Provide full set of search functionalities for high precision  Improve current possibilities for the evaluation of Markush search results  Provide high performance search environment DWPIM on STN – ICIC, Nizza, 2015 7 Challenges
  • 8. DWPIM Concept is based on 22 Superatoms DWPIM on STN – ICIC, Nizza, 2015 8 Acyclic / Cyclic (ATOM, CLASS, ANY) CHK (Alkyl, Alkylene) CHE (Alkenyl, Alkenylene) CHY (Alkynyl, Alkynylene) ARY (Aryl) CYC (Cycloaliphatic) HEA (Monocyclic heteroaryl) HET (Monocyclic nonaromatic) HEF (Fused heterocyclic)
  • 9. DWPIM Concept is based on 22 Superatoms DWPIM on STN – ICIC, Nizza, 2015 9 Acyclic / Cyclic (ATOM, CLASS, ANY) Elements (ATOM, CLASS, ANY) Others (only match on themselves) CHK (Alkyl, Alkylene) MX (Any metal) ACY (Acyl) CHE (Alkenyl, Alkenylene) A35 (Group III A - V A metal) DYE (Chromophore) CHY (Alkynyl, Alkynylene) ACT (Actinide) PEG (Polymer end group) ARY (Aryl) AMX (Alkali/alkaline earth metal) POL (Polymer) CYC (Cycloaliphatic) LAN (Lanthanide) PRT (Protecting group) HEA (Monocyclic heteroaryl) TRM (Transition metal) UNK (Any atom or group including H) HET (Monocyclic nonaromatic) HAL (Halogen) XX (Any atom or group excluding H) HEF (Fused heterocyclic)
  • 10. Derwent Indexing Hierarchy of Superatoms DWPIM on STN – ICIC, Nizza, 2015 10 XX ARY CYC HEF HEA HET CHK CHE CHY MX HAL A35 ACT AMX LAN TRM
  • 11. STN Query Hierarchy of Nodes DWPIM on STN – ICIC, Nizza, 2015 11 R Cb Ak Cy XMHy
  • 12. DWPIM on STN – ICIC, Nizza, 2015 12 Integrated Query Hierarchy of Generic Nodes R Cb AkCy specific: benzene specific: cyclohexane specific: quinoline specific: pyridine specific: piperidine specific: CH3 … specific: CH2 = CH3 specific: ethyne ARY CYC HEF HEA HET CHK CHE CHY Hy
  • 13.  Match levels control the degree of structure query matching between the query structure and the structure in the search file.  Match levels are assigned to each atom and generic group/superatom.  All nodes of a ring system have the same match level – Note: Special cases require different match levels for the atoms of a ring system  Default Match Levels on STN – ATOM for ring nodes – CLASS for chain nodes DWPIM on STN – ICIC, Nizza, 2015 13 Match Levels Control the Search Process
  • 14. R Cb AkCy specific: benzene specific: cyclohexane specific: quinoline specific: pyridine specific: piperidine specific: CH3 … specific: CH2 = CH3 specific: ethyne ARY CYC HEF HEA HET CHK CHE CHY Hy Effect of Match Levels on the Node Hierarchy DWPIM on STN – ICIC, Nizza, 2015 14 CLASS ATOM ANY
  • 15. Search: pyridine MLE: ATOM DWPIM on STN – ICIC, Nizza, 2015 15 Effect of Match Level in DWPIM: ATOM R Cb AkCy specific: benzene specific: cyclohexane specific: quinoline specific: pyridine specific: piperidine specific: CH3 … specific: CH2 = CH3 specific: ethyne ARY CYC HEF HEA HET CHK CHE CHY Hy
  • 16. DWPIM on STN – ICIC, Nizza, 2015 16 Effect of Match Level in DWPIM: CLASS R Cb AkCy specific: benzene specific: cyclohexane specific: quinoline specific: pyridine specific: piperidine specific: CH3 … specific: CH2 = CH3 specific: ethyne ARY CYC HEF HEA HET CHK CHE CHY Hy Search: pyridine MLE: CLASS
  • 17. DWPIM on STN – ICIC, Nizza, 2015 17 Effect of Match Level in DWPIM: ANY Search: pyridine MLE: ANY R Cb AkCy specific: benzene specific: cyclohexane specific: quinoline specific: pyridine specific: piperidine specific: CH3 … specific: CH2 = CH3 specific: ethyne ARY CYC HEF HEA HET CHK CHE CHY Hy
  • 18. Match Level ATOM Results in Specific Nodes DWPIM on STN – ICIC, Nizza, 2015 18 MCN 9912-JKW07Query (MLE ATOM)
  • 19. Match Level CLASS Results in Generic Nodes Answer #5: 9917-IVD01Query (MLE CLASS) DWPIM on STN – ICIC, Nizza, 2015 19
  • 20.  STN-Convention: Bond values in the structure editor • Note that “exact” from structure editor is translated to single exact or double exact  Indexed bonds: Bond value in the indexed structures, e.g. • single exact can only match to a single bond • single/normalized bond can either match to a single or to a normalize bond in the indexed structure. DWPIM on STN – ICIC, Nizza, 2015 20 Different Types of Bond Conventions
  • 21. Why are bond values important ? DWPIM on STN – ICIC, Nizza, 2015 21 Query structure Exact (= single exact) normalized exact/ normalized
  • 22.  Rings: Normalized bonds are used in ring systems with an even number of atoms containing alternate double and single bonds. • Benzene, Pyridine, Naphtalene • Exceptions: cyclopentadienyl anion, cycloheptatrienly cation  Tautomers: DWPIM on STN – ICIC, Nizza, 2015 22 Bond Normalization Where Z can be: B, C, Si, N, P, As, S, Se, Te, F, Cl, Br, I X and Y can be: O, S, Se, Te, N
  • 23. DWPIM on STN – ICIC, Nizza, 2015 23 Query: Search Example 1
  • 24. DWPIM on STN – ICIC, Nizza, 2015 24 Search Example 2 Query:
  • 25.  Consistent search of MARPAT from CAS and DWPIM from Thomson Reuters on a single platform  Offers same Markush attributes as classic STN  Retrieves results as Markush structures, not references  Highly efficient retrieval and evaluation of Markush results DWPIM hit structures: Both the Assembled view and Full view will be available. With DWPIM new STN will deliver the first-ever, Unified Markush Search Solution
  • 26.  Full integration in Thomson Reuters and CAS content  Simultaneous search for generic and specific chemical structures DWPISM Derwent World Patents Index® DCR Derwent Chemistry Resource ~2.5 Mio DWPIM Derwent Markush Resource ~1.9 Mio CAplusSM Chemical Abstracts MARPAT® CAS Markush Database ~ 1.1 Mio CAS RegistrySM ~ 102 Mio Structure Search REAXYSFILE ~ 25 Mio DWPIM is integrated with DWPI for efficient searching together with CAS Databases
  • 27. • Introduction • Derwent and STN Generic Node Concepts • Application of Match Levels • Derwent and STN Bonding Conventions • Some Examples • STN Unified Markush Solution • Derwent Markush Resource – Data Content 27 Agenda
  • 28. DERWENT MARKUSH RESOURCE ON STN 28 DWPIM on STN – ICIC, Nizza, 2015 Contains full DWPI and INPI Markush data backfile  1.9 million Markush structures  33 patent issuing authorities covered  Indexing for >777,000 patent families in DWPI  US, EP and WO coverage from 1978 onwards  DWPI data from 1987 to date  INPI sourced data from 1961 to 1998  Full coverage of organics, organometallics, inorganic salts and metal oxides  Partial coverage of alloys, intermetallics and polymers
  • 29. DERWENT MARKUSH RESOURCE COUNTRY COVERAGE 29DWPIM on STN – ICIC, Nizza, 2015
  • 30. DERWENT MARKUSH RESOURCE ON STN CONTENT • Integration of INPI content – 213,000 structure files – New DWPI format compound numbers created and added into 120,000 DWPI records • Format 82nn-nnnnn and 83nn-nnnnn • INPI indexing was added to all relevant DWPI records – including those already containing DWPI sourced Markush indexing for the period 1987 – 1998 (ie double indexing) • Re-indexed DWPI content – 45,0000 structures were re-indexed, with new compound numbers added into DWPI • Format 84nn-nnnnn and 85nn-nnnnn 30