Richard Resnick
CEO
II-SDV 2015, Nice, France
Integrated Keyword and Biological
Sequence Searching in the Life Sciences
KEYWORD SEARCHING IN THE LIFE SCIENCES IS CHALLENGING
How do you spell “somatostatin”?
Ala-Gly-Cys-Lys-Asn-Phe-Phe-Trp-Lys-Thr-Phe-Thr-Ser-Cys
somato* AND (Mus musculus OR mouse)
TGAACCTCACAGC
ATGGAGCCCCTCT
CTTTGGCTTCCAC
ACCTAGCTGGAAT
GCCTCAGCTGCT
100%/4.2%/100%
is not aa
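One reason the same peptide can be "spelled" several ways is that amino acid sequences are written in either three-letter or one-letter notation. As a minimal sketch (the function name `to_one_letter` is illustrative, and the table covers only the 20 standard residues), the three-letter string on this slide can be converted like this:

```python
# Standard three-letter to one-letter amino acid codes (20 standard residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq: str) -> str:
    """Convert a hyphen-separated three-letter peptide to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split("-"))


# The peptide string as it appears on the slide.
peptide = "Ala-Gly-Cys-Lys-Asn-Phe-Phe-Trp-Lys-Thr-Phe-Thr-Ser-Cys"
print(to_one_letter(peptide))  # AGCKNFFWKTFTSC
```

A keyword search for the name, a wildcard such as somato*, and either sequence notation could all point at the same molecule, which is why text and sequence searching are hard to reconcile.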
[Figure: bubble chart plotting relevance of results to life sciences against completeness of patent authority coverage; the size of each bubble corresponds to the number of hits returned]
GOAL: HIGHLY RELEVANT RESULTS FROM BROAD PATENT AUTHORITY COVERAGE
SEQUENCE SEARCHING PRESENTS CHALLENGES
CCCTCCATCATTTCACCATCCACACTCATAATAATCATATATATTCATCAATCATCTATATAAGTAGTGGCAGGAGCAATGAGAGGGAGG
GTTTCTCCACTGATGCTGTTGCTAGGGATCCTTGTCCTGGCTTCAGTTTCTGCAACGCATGCCAAGTCATCACCTTACCAGAAGAAAACA
GAGAACCCCTGCGCCCAGAGGTGCCTCCAGAGTTGTCAACAGGAACCGGATGACTTGAAGCAAAAGGCATGCGAGTCTCGCTGCACCAAG
CTCGAGTATGATCCTCGTTGTGTCTATGATCCTCGAGGACACACTGGCACCACCAACCAACGTTCCCCTCCAGGGGAGCGGACACGTGGC
CGCCAACCCGGAGACTACGATGATGACCGCCGTCAACCCCGAAGAGAGGAAGGAGGCCGATGGGGACCAGCTGGACCGAGGGAGCGTGAA
AGAGAAGAAGACTGGAGACAACCAAGAGAAGATTGGAGGCGACCAAGTCATCAGCAGCCACGGAAAATAAGGCCCGAAGGAAGAGAAGGA
GAACAAGAGTGGGGAACACCAGGTAGCCATGTGAGGGAAGAAACATCTCGGAACAACCCTTTCTACTTCCCGTCAAGGCGGTTTAGCACC
CGCTACGGGAACCAAAACGGTAGGATCCGGGTCCTGCAGAGGTTTGACCAAAGGTCAAGGCAGTTTCAGAATCTCCAGAATCACCGTATT
GTGCAGATCGAGGCCAAACCTAACACTCTTGTTCTTCCCAAGCACGCTGATGCTGATAACATCCTTGTTATCCAGCAAGGTATCAAATCT
AATTCTATTCTAAACTACATATATTTTGTTGCTTGATACATATGATTCATTGGATTGCAGGGCAAGCCACCGTGACCGTAGCAAATGGCA
ATAACAGAAGAGCTTTAATCTTGACGAGGGCCATGCACTCAGAATCCCATCCGTTTCATTTCCTACATCTTGACGACATGACACCAGAAC
TCAGAGTAGCTAAATCTCATGCCGTTAACACACCCGGCCAGTTTGAGGTAGGTACCTCTTTCTTCTCACATATATATTCAATTCTCAATT
ATCATCTTACATGTTGTGGGTGTTGCTTCACAGGATTTCTTCCCGGCGAGCAGCCGAGACCAATCATCCTACTTGCAGGGATTCAGCAGG
AATACTTTGGAGGCCGCCTTCAATGTAAGCAAATGTGTCATAATTATGGAATTAAAAGAACGATCATGTTATAAACTTATAATATATATA
TACATAGGCGGAATTCAATGAGATACGGAGGGTGCTGTTAGAAGAGAATGCAGGAGGTGAGCAAGAGGAGAGAGGGCAGAGGCGATGGAG
TACTCGGAGTAGTGAGAACAATGAAGGAGTGATAGTCGAAGTGTCAAAGGAGCACGTTGAAGAACTTACTAAGCACGCTAAATCCGTCTC
AAAGAAAGGCTCCGAAGAAGAGGGAGATATCACCAACCCAATCAACTTGAGAGAAGGCGAGCCCGATCTTTCTGACAACTTTGGGAGGTT
ATTTGAGGTGAAGCCAGACAAGAAGAACCCCCAGCTTCAGGACCTGGACATGATGCTCACCTGTGTAGAGATCAAAGAAGGAGCTTTGAT
GCTCCCACACTTCAACTCAAAGGCCATGGTCATCGTCGTCATCAACAAAGGAACTGGAAACCTTGAACTCGTAGCTGTAAGAAAAGAGCA
ACAACAGAGGGGACGGCGGGAACAAGAGTGGGAAGAAGAGGAGGAAGATGAAGAAGAGGAGGGAAGTAACAGAGAGGTGCGTAGGTACAC
AGCGAGGTTGAAGGAAGGCGATGTGTTCATCATGCCAGCAGCTCATCCAGTAGCCATCAACGCTTCCTCCGAACTCCATCTGCTTGGCTT
CGGTATCAACGCTGAAAACAACCACAGAATCTTCCTTGCAGGTGATAAGGACAATGTGGTAGACCAGATAGAGAAGCAAGCGAAGGATTT
AGCATTCCCTGGTTCGGGTGAACAAGTTGAGAAGCTCATCAAAAACCAGAGGGAGTCTCACTTTGTGAGTGCTCGTCCTCAATCTCAATC
TCCGTCGTCTCCTGAAAAAGAGGACCAAGAGGAGGAAAACCAGGGAGGGAAGGGTCCACTCCTTTCAATTTTGAAGGCTTTTAACTGAGA
ATGGAGGAAACTTGTTATGTATCCATAATAAGATCACGCTTTTGTAATCTACTATCCAAAAACTTATCAATAAATAAAAACGTTTGTGCG
TTGTTTCTCCAAGAAATACGGGTGGCGCTTATGGTTGTTTATTTATACGAAACTAATTAAATACATCATAACGGCAACGACCTCTTATTT
TGTAATTTTCTT	
  
BLAST?
90% ID?
Do I want total query coverage or total subject coverage?
Global alignment?
What word size?
How do my sequence hits relate to my text search results?
Fragment?
Motif?
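The query-versus-subject question matters because the same alignment can look very different depending on which length is used as the denominator. The sketch below is illustrative only: the `Hit` class and both formulas are simplified assumptions (identical positions divided by a sequence length), not the scoring used by BLAST or by any particular platform.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    identical: int    # identical positions in the alignment
    query_len: int    # length of the query sequence
    subject_len: int  # length of the matched patent sequence


def pct_identity_over_query(hit: Hit) -> float:
    """Percent identity measured against the full query length."""
    return 100.0 * hit.identical / hit.query_len


def pct_identity_over_subject(hit: Hit) -> float:
    """Percent identity measured against the full subject length."""
    return 100.0 * hit.identical / hit.subject_len


def passes(hit: Hit, threshold: float = 90.0) -> bool:
    """Keep a hit if it clears the threshold over either length."""
    best = max(pct_identity_over_query(hit), pct_identity_over_subject(hit))
    return best >= threshold


# Hypothetical hit: a 100-nt patent fragment matching perfectly inside a 1000-nt query.
fragment = Hit(identical=100, query_len=1000, subject_len=100)
print(pct_identity_over_query(fragment))    # 10.0
print(pct_identity_over_subject(fragment))  # 100.0
print(passes(fragment))                     # True
```

Under these assumptions, a short claimed fragment is missed when identity is required over the query only, but found when either length may be used.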
pn:EP* AND somato*^5 AND [mus musculus] AND clm:[transgenic animal ~3] AND pd:[19950101 TO 20140215]
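The query above packs several features into one string: a field prefix, a wildcard, a boost (^5), a proximity operator (~3), and a date range. As a rough sketch, the same string can be assembled from labeled parts. The helper names are hypothetical, and the readings of pn, clm, and pd as patent number, claims, and publication date are assumptions inferred from the query itself.

```python
def field(name: str, value: str) -> str:
    """Restrict a term or group to a named field, e.g. clm:..."""
    return f"{name}:{value}"


def boost(term: str, weight: int) -> str:
    """Raise the relevance weight of a term, e.g. somato*^5."""
    return f"{term}^{weight}"


def date_range(start: str, end: str) -> str:
    """Inclusive range in YYYYMMDD form, e.g. [19950101 TO 20140215]."""
    return f"[{start} TO {end}]"


parts = [
    field("pn", "EP*"),                               # assumed: patent number starts with EP
    boost("somato*", 5),                              # wildcard term, boosted
    "[mus musculus]",                                 # bracketed term group, copied from the slide
    field("clm", "[transgenic animal ~3]"),           # assumed: proximity search in claims
    field("pd", date_range("19950101", "20140215")),  # assumed: publication date
]

query = " AND ".join(parts)
print(query)
```

Building the string piece by piece makes each clause easy to check on its own, which is harder to do when the whole query is typed by hand.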
KEYWORD SEARCHING IN THE LIFE SCIENCES PRESENTS CHALLENGES
How do my text search results relate to my sequence hits?
How do I figure out this system’s query syntax?
What if a keyword is misspelled in a patent claim?
How can I easily exclude patents unrelated to my domain?
How do I build and maintain reliable synonym lists?
Can I be sure that all of the documents I need to review exist in the underlying database?
BUILDING A REPORT FROM DIFFERENT PLATFORMS IS CHALLENGING
Lack of life science specificity in search platforms creates multiple false-positive hits that require additional user review
Varying underlying algorithms can create an apples-to-oranges comparison
Different output formats make it difficult to analyze and compare results
Little cross-platform integration necessitates downloading multiple files for manual collation
Identify prior art surrounding gene modification in peanut for gene families implicated in food allergies.
“Ara h 1” is a seed storage protein from Arachis hypogaea. It is well known because sensitization to it was found in 95% of peanut-allergic patients from North America.
We’re seeking prior art that describes vaccines related to these allergies, or sequences that hit the Ara h 1 gene.
CASE STUDY
Text Search
Identify relevant documents related to peanuts and claiming transgenic modifications of plants that decrease allergy risks, limited to documents published after January 1, 2010
Sequence Search
Run a sequence search against the prior art for the peanut “ara h 1” gene sequence: Arachis hypogaea cultivar LUHUA 8 Ara h 1 allergen (ara h 1) gene (cds)
CCCTCCATCATTTCACCATCCACACTCATAATAATCATATATA
TTCATCAATCATCTATATAAGTAGTGGCAGGAGCAATGAGA
GGGAGGGTTTCTCCACTGATGCTGTTGCT…
SOLUTION: INTEGRATED LIFE SCIENCE SEARCH PLATFORMS
Union
Combine into a single, unique workfile
A COMPLETE REPORT FOR ANALYSIS
Claims contain vaccin* in green
Bioinformatics-related patents in red
Sequence search results in blue
A single, unified report for analyzing results.
STANDARD KEYWORDS AND BOOLEAN SYNTAX AREN’T ENOUGH
Life science applications are more than collections of discrete, specific keywords.
They include field-specific ontological terms that can have synonyms, alternate spellings, and varying word order.
Building a single query that addresses all of these issues, while still allowing the flexibility of Boolean, proximity, wildcard, field grouping, range searches, and term boosting, can be difficult.
USE EXISTING ONTOLOGY TERMS OR DEFINE YOUR OWN
As you type, suggested matching terms appear, based on the ontologies you choose
Simply typing “transgenic” with the NCBI ontology list brings up “Transgenic Plants” as one option
At any time, type the ? symbol for a complete list of field choices
Specify words in claims, date ranges, and many more options to further refine your query
Define your own ontologies and synonyms that are relevant for your specific search area
Includes synonyms and alternate spellings for the genus and species of peanut
Hit “Search” or <return> to run the search
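A user-defined synonym list amounts to expanding one term into an OR group before the query runs. The sketch below shows the idea in generic Boolean syntax. The `expand` helper, the quoting of multi-word terms, and the example synonym entries are all illustrative assumptions, not the platform's own ontology format.

```python
def expand(term: str, synonyms: dict) -> str:
    """Replace a term with an OR group of itself and its listed synonyms."""
    variants = [term] + synonyms.get(term.lower(), [])
    # Quote multi-word variants so they are treated as phrases.
    quoted = [f'"{v}"' if " " in v else v for v in variants]
    if len(quoted) == 1:
        return quoted[0]
    return "(" + " OR ".join(quoted) + ")"


# Hypothetical synonym list covering common and scientific names of peanut.
SYNONYMS = {"peanut": ["peanuts", "groundnut", "Arachis hypogaea"]}

query = " AND ".join(expand(t, SYNONYMS) for t in ["peanut", "allergy"])
print(query)
# (peanut OR peanuts OR groundnut OR "Arachis hypogaea") AND allergy
```

Keeping the list in one place means every query that mentions the term picks up new spellings as soon as they are added.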
INSTANT RESULTS
A result preview is shown, and we save it as a workfile called “TEXT SEARCH”
THE “TEXT SEARCH” WORKFILE
Sort by any column
Rank for priority
Color code to categorize
Quickly assign colors/ranks using keyboard shortcuts: 3 (for 3 stars), O (for orange)
All the results seem relevant, but we want to annotate the documents that mention vaccines in the claims with a green color.
NAVIGATE A WORKFILE
Easily apply bulk annotations for future workfile manipulation
Keyboard shortcuts allow fast workfile evaluation: (next record), (close preview), (previous record)
FILTER A WORKFILE
Type in free text, use wildcards, or type in “?” to filter by terms in a specific field
FILTER A WORKFILE
Apply the filter to pull out the subset of documents that match your query.
12 documents contain the word “vaccine”, or related terms, in the claims.
Let’s annotate these in green.
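The filter-then-annotate step can be pictured as a wildcard match on one field followed by a bulk color assignment. This is a minimal sketch under stated assumptions: the record layout, the document IDs, and the claim texts are invented for illustration, and the wildcard is translated to a regular expression rather than using the platform's own matcher.

```python
import re


def wildcard_to_regex(pattern: str) -> re.Pattern:
    """Turn a pattern such as vaccin* into a case-insensitive whole-word regex."""
    escaped = re.escape(pattern).replace(r"\*", r"\w*")
    return re.compile(rf"\b{escaped}\b", re.IGNORECASE)


def filter_by_field(records: list, field_name: str, pattern: str) -> list:
    """Return the records whose given field matches the wildcard pattern."""
    rx = wildcard_to_regex(pattern)
    return [r for r in records if rx.search(r.get(field_name, ""))]


# Hypothetical workfile records (IDs and claim texts are made up).
workfile = [
    {"id": "DOC-1", "claims": "A vaccine composition comprising a modified peanut allergen."},
    {"id": "DOC-2", "claims": "A method of aligning nucleotide sequences."},
    {"id": "DOC-3", "claims": "Use of the protein for vaccination of a subject."},
]

subset = filter_by_field(workfile, "claims", "vaccin*")
for record in subset:
    record["color"] = "green"  # bulk annotation of the filtered subset

print([r["id"] for r in subset])  # ['DOC-1', 'DOC-3']
```

Because the annotation is stored on the record itself, resetting the filter afterwards shows the whole workfile with the green documents still marked.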
MAKING DOCUMENTS WITH VACCINES IN THE CLAIMS GREEN
Here is what our subset (vaccine in claims) looks like.
You can reset the filter to see other documents that are in the workfile.
Let’s annotate in red the documents that are probably not relevant.
Notice that “Bio-informatics” is a synonym list and includes multiple spellings.
MAKING BIOINFORMATICS DOCUMENTS RED
40 documents relate to bioinformatics methods.
vaccin* in claims
bioinformatics related
HERE IS OUR TEXT SEARCH WORKFILE
Now it’s time to complete the analysis with sequence search results.
ara h 1 CDS sequence
GenePast, 90% ID over the length of the query or the subject (1000 results)
PREPARE YOUR SEQUENCE SEARCH RESULTS
We export these results to a LifeQuest workfile.
Apply a filter to keep only the hits whose patent sequence location is in the claims: this leaves 81 results in 25 patents.
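The result count and the patent count differ because one patent can contain several matching sequences. A small sketch of the filter-and-count step follows. The records, IDs, and field names are hypothetical and stand in for whatever the exported hit table actually contains.

```python
# Hypothetical sequence hits: one row per matching sequence, not per patent.
hits = [
    {"patent": "DOC-A", "seq_id": 1, "location": "claims"},
    {"patent": "DOC-A", "seq_id": 2, "location": "claims"},
    {"patent": "DOC-B", "seq_id": 7, "location": "description"},
    {"patent": "DOC-C", "seq_id": 3, "location": "claims"},
]

# Keep only hits whose sequence appears in the claims.
in_claims = [h for h in hits if h["location"] == "claims"]

# Collapse the remaining hits to distinct patents.
patents = {h["patent"] for h in in_claims}

print(len(in_claims), "results in", len(patents), "patents")
# 3 results in 2 patents
```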
FILTER YOUR SEQUENCE SEARCH RESULTS & EXPORT
Save it as a new “SEQ search” workfile, and open to analyze.
EXPORT YOUR SEARCH RESULTS TO A WORKFILE
In the “SEQ search” workfile, color code all as blue.
MARK ALL OF THE SEQUENCE SEARCH DOCUMENTS BLUE
CONSOLIDATE TEXT SEARCH AND SEQUENCE SEARCH RESULTS
Merge the two workfiles together (union) to get a complete set for final analysis.
Sort, filter, analyze, and export!
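A union of two workfiles can be sketched as a merge keyed on the document identifier, so that a patent found by both searches appears once and keeps both annotations. The record layout and IDs below are hypothetical, and keeping every color in a set is one possible design choice. How a real platform resolves conflicting colors is not stated in the slides.

```python
def union(*workfiles: list) -> list:
    """Merge workfiles into one list with a single entry per document."""
    merged = {}
    for workfile in workfiles:
        for record in workfile:
            entry = merged.setdefault(
                record["id"], {"id": record["id"], "colors": set()}
            )
            entry["colors"].add(record["color"])
    return list(merged.values())


# Hypothetical annotated workfiles from the two searches.
text_search = [
    {"id": "DOC-1", "color": "green"},  # vaccin* in claims
    {"id": "DOC-2", "color": "red"},    # bioinformatics related
]
seq_search = [
    {"id": "DOC-1", "color": "blue"},   # sequence hit in claims
    {"id": "DOC-9", "color": "blue"},
]

combined = union(text_search, seq_search)
for doc in combined:
    print(doc["id"], sorted(doc["colors"]))
# DOC-1 ['blue', 'green']
# DOC-2 ['red']
# DOC-9 ['blue']
```

Documents carrying more than one color are the ones found by both searches, which makes them natural candidates for first review.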
EVALUATE THE MERGED DATA SETS
vaccin* in claims
bioinformatics related
sequence hit in claims
GENERATE A COMPLETE REPORT FOR ANALYSIS
Includes results from both sequence & text searches
Create color codes for your specific categories
Merge with other outputs or export to any format
Sort or filter by any field
Rank hits (1, 2, 3 stars) to easily identify priority
Claims contain vaccin*
bioinformatics related
found using the “ara h 1” DNA sequence
A single, unified report for analyzing results.
PLEASE COME BY OUR BOOTH FOR MORE INFORMATION.

 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
 
valsad Escorts Service ☎️ 6378878445 ( Sakshi Sinha ) High Profile Call Girls...
valsad Escorts Service ☎️ 6378878445 ( Sakshi Sinha ) High Profile Call Girls...valsad Escorts Service ☎️ 6378878445 ( Sakshi Sinha ) High Profile Call Girls...
valsad Escorts Service ☎️ 6378878445 ( Sakshi Sinha ) High Profile Call Girls...
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 

II-SDV 2015, 20 - 21 April, in Nice

  • 1. Richard Resnick CEO II-SDV 2015, Nice, France Integrated Keyword and Biological Sequence Searching in the Life Sciences
  • 2. KEYWORD SEARCHING IN THE LIFE SCIENCES IS CHALLENGING How do you spell “somatostatin”? Ala-Gly-Cys-Lys-Asn-Phe-Phe-Trp-Lys-Thr-Phe-Thr-Ser-Cys somato* AND (Mus musculus OR mouse) TGAACCTCACAGC ATGGAGCCCCTCT CTTTGGCTTCCAC ACCTAGCTGGAAT GCCTCAGCTGCT 100%/4.2%/100% is not aa
  • 3. GOAL: HIGHLY RELEVANT RESULTS FROM BROAD PATENT AUTHORITY COVERAGE Chart axes: relevance of results to life sciences; completeness of patent authority coverage. Size of bubble corresponds to the number of hits returned.
  • 4. SEQUENCE SEARCHING PRESENTS CHALLENGES CCCTCCATCATTTCACCATCCACACTCATAATAATCATATATATTCATCAATCATCTATATAAGTAGTGGCAGGAGCAATGAGAGGGAGG GTTTCTCCACTGATGCTGTTGCTAGGGATCCTTGTCCTGGCTTCAGTTTCTGCAACGCATGCCAAGTCATCACCTTACCAGAAGAAAACA GAGAACCCCTGCGCCCAGAGGTGCCTCCAGAGTTGTCAACAGGAACCGGATGACTTGAAGCAAAAGGCATGCGAGTCTCGCTGCACCAAG CTCGAGTATGATCCTCGTTGTGTCTATGATCCTCGAGGACACACTGGCACCACCAACCAACGTTCCCCTCCAGGGGAGCGGACACGTGGC CGCCAACCCGGAGACTACGATGATGACCGCCGTCAACCCCGAAGAGAGGAAGGAGGCCGATGGGGACCAGCTGGACCGAGGGAGCGTGAA AGAGAAGAAGACTGGAGACAACCAAGAGAAGATTGGAGGCGACCAAGTCATCAGCAGCCACGGAAAATAAGGCCCGAAGGAAGAGAAGGA GAACAAGAGTGGGGAACACCAGGTAGCCATGTGAGGGAAGAAACATCTCGGAACAACCCTTTCTACTTCCCGTCAAGGCGGTTTAGCACC CGCTACGGGAACCAAAACGGTAGGATCCGGGTCCTGCAGAGGTTTGACCAAAGGTCAAGGCAGTTTCAGAATCTCCAGAATCACCGTATT GTGCAGATCGAGGCCAAACCTAACACTCTTGTTCTTCCCAAGCACGCTGATGCTGATAACATCCTTGTTATCCAGCAAGGTATCAAATCT AATTCTATTCTAAACTACATATATTTTGTTGCTTGATACATATGATTCATTGGATTGCAGGGCAAGCCACCGTGACCGTAGCAAATGGCA ATAACAGAAGAGCTTTAATCTTGACGAGGGCCATGCACTCAGAATCCCATCCGTTTCATTTCCTACATCTTGACGACATGACACCAGAAC TCAGAGTAGCTAAATCTCATGCCGTTAACACACCCGGCCAGTTTGAGGTAGGTACCTCTTTCTTCTCACATATATATTCAATTCTCAATT ATCATCTTACATGTTGTGGGTGTTGCTTCACAGGATTTCTTCCCGGCGAGCAGCCGAGACCAATCATCCTACTTGCAGGGATTCAGCAGG AATACTTTGGAGGCCGCCTTCAATGTAAGCAAATGTGTCATAATTATGGAATTAAAAGAACGATCATGTTATAAACTTATAATATATATA TACATAGGCGGAATTCAATGAGATACGGAGGGTGCTGTTAGAAGAGAATGCAGGAGGTGAGCAAGAGGAGAGAGGGCAGAGGCGATGGAG TACTCGGAGTAGTGAGAACAATGAAGGAGTGATAGTCGAAGTGTCAAAGGAGCACGTTGAAGAACTTACTAAGCACGCTAAATCCGTCTC AAAGAAAGGCTCCGAAGAAGAGGGAGATATCACCAACCCAATCAACTTGAGAGAAGGCGAGCCCGATCTTTCTGACAACTTTGGGAGGTT ATTTGAGGTGAAGCCAGACAAGAAGAACCCCCAGCTTCAGGACCTGGACATGATGCTCACCTGTGTAGAGATCAAAGAAGGAGCTTTGAT GCTCCCACACTTCAACTCAAAGGCCATGGTCATCGTCGTCATCAACAAAGGAACTGGAAACCTTGAACTCGTAGCTGTAAGAAAAGAGCA ACAACAGAGGGGACGGCGGGAACAAGAGTGGGAAGAAGAGGAGGAAGATGAAGAAGAGGAGGGAAGTAACAGAGAGGTGCGTAGGTACAC AGCGAGGTTGAAGGAAGGCGATGTGTTCATCATGCCAGCAGCTCATCCAGTAGCCATCAACGCTTCCTCCGAACTCCATCTGCTTGGCTT 
CGGTATCAACGCTGAAAACAACCACAGAATCTTCCTTGCAGGTGATAAGGACAATGTGGTAGACCAGATAGAGAAGCAAGCGAAGGATTT AGCATTCCCTGGTTCGGGTGAACAAGTTGAGAAGCTCATCAAAAACCAGAGGGAGTCTCACTTTGTGAGTGCTCGTCCTCAATCTCAATC TCCGTCGTCTCCTGAAAAAGAGGACCAAGAGGAGGAAAACCAGGGAGGGAAGGGTCCACTCCTTTCAATTTTGAAGGCTTTTAACTGAGA ATGGAGGAAACTTGTTATGTATCCATAATAAGATCACGCTTTTGTAATCTACTATCCAAAAACTTATCAATAAATAAAAACGTTTGTGCG TTGTTTCTCCAAGAAATACGGGTGGCGCTTATGGTTGTTTATTTATACGAAACTAATTAAATACATCATAACGGCAACGACCTCTTATTT TGTAATTTTCTT   BLAST? 90% ID? Do I want total query coverage or total subject coverage? Global alignment? What word size? How do my sequence hits relate to my text search results? Fragment? Motif?
  • 5. KEYWORD SEARCHING IN THE LIFE SCIENCES PRESENTS CHALLENGES pn:EP* AND somato*^5 AND [mus musculus] AND clm:[transgenic animal ~3] AND pd:[19950101 TO 20140215] How do my text search results relate to my sequence hits? How do I figure out this system’s query syntax? What if a keyword is misspelled in a patent claim? How can I exclude patents unrelated to my domain easily? How do I build and maintain reliable synonym lists? Can I be sure that all of the documents I need to review exist in the underlying database?
  • 6. BUILDING A REPORT FROM DIFFERENT PLATFORMS IS CHALLENGING Lack of life science specificity in search platforms creates multiple false-positive hits that require additional user review. Varying underlying algorithms can create an apples-to-oranges comparison. Different output formats make it difficult to analyze and compare results. Little cross-platform integration necessitates downloading multiple files for manual collation.
  • 7. CASE STUDY Identify prior art surrounding gene modification in peanut for gene families implicated in food allergies. “Ara h 1” is a seed storage protein from Arachis hypogaea. It is well known because sensitization to it was found in 95% of peanut-allergic patients from North America. We’re seeking prior art that describes vaccines related to these allergies or sequences that hit the Ara h 1 gene.
  • 8. SOLUTION: INTEGRATED LIFE SCIENCE SEARCH PLATFORMS Text Search: Identify relevant documents related to peanuts and claiming transgenic modification of plants that decrease allergy risks, limited to documents published after January 1st 2010. Sequence Search: Run a sequence search against the prior art for the peanut “ara h 1” gene sequence: Arachis hypogaea cultivar LUHUA 8 Ara h 1 allergen (ara h 1) gene (cds) CCCTCCATCATTTCACCATCCACACTCATAATAATCATATATA TTCATCAATCATCTATATAAGTAGTGGCAGGAGCAATGAGA GGGAGGGTTTCTCCACTGATGCTGTTGCT… Union: Combine into a single, unique workfile.
  • 9. A COMPLETE REPORT FOR ANALYSIS Claims contain vaccin* in green. Bioinformatics-related patents in red. Sequence search results in blue. A single, unified report for analyzing results.
  • 10. STANDARD KEYWORDS AND BOOLEAN SYNTAX AREN’T ENOUGH Life science applications are more than collections of discrete, specific keywords. They include field-specific ontological terms that can have synonyms, alternate spellings, and varying word order. Building a single query that addresses all of these issues, plus allows the flexibility of Boolean, proximity, wildcard, field grouping, range searches, and term boosting, can be difficult.
  • 11. USE EXISTING ONTOLOGY TERMS OR DEFINE YOUR OWN As you type, suggested matching terms appear, based on the ontologies you choose. Simply typing “transgenic” with the NCBI ontology list offers “Transgenic Plants” as one option. At any time, type the ? symbol for a complete list of field choices. Specify words in claims, date ranges, and many more options to further refine your query. Define your own ontologies and synonyms that are relevant for your specific search area; the example includes synonyms and alternate spellings for the genus and species of peanut. Hit “Search” or <return> to run the search.
  • 12. INSTANT RESULTS A result preview is shown, and we save it as a workfile called “TEXT SEARCH”.
  • 13. THE “TEXT SEARCH” WORKFILE Sort by any column. Rank for priority. Color code to categorize. Quickly assign colors/ranks using keyboard shortcuts: 3 (for 3 stars), O (for orange).
  • 14. NAVIGATE A WORKFILE All the results seem relevant, but we want to annotate the documents that mention vaccines in the claims with a green color. Easily apply bulk annotations for future workfile manipulation. Keyboard shortcuts allow fast workfile evaluation: (next record) (close preview) (previous record).
  • 15. FILTER A WORKFILE Type in free text, use wildcards, or type in “?” to filter by terms in a specific field.
  • 16. FILTER A WORKFILE Apply the filter to pull out the subset of documents that match your query. 12 documents contain the word “vaccine”, or related terms, in the claims.
  • 17. MAKING DOCUMENTS WITH VACCINES IN THE CLAIMS GREEN Let’s annotate these in green.
  • 18. MAKING DOCUMENTS WITH VACCINES IN THE CLAIMS GREEN Here is what our subset (vaccine in claims) looks like. You can reset the filter to see other documents that are in the workfile.
  • 19. MAKING BIOINFORMATICS DOCUMENTS RED Let’s annotate in red the documents that are probably not relevant. Notice that “Bio-informatics” is a synonym list and includes multiple spellings. 40 documents relate to bioinformatics methods.
  • 20. HERE IS OUR TEXT SEARCH WORKFILE Legend: vaccin* in claims; bioinformatics related.
  • 21. PREPARE YOUR SEQUENCE SEARCH RESULTS Now it’s time to complete the analysis with sequence search results. Query: ara h 1 CDS sequence. Algorithm: GenePast, 90% ID over the length of the query or the subject (1000 results).
  • 22. FILTER YOUR SEQUENCE SEARCH RESULTS & EXPORT Apply a filter to keep the patents where the patent sequence location of the hits is in the claims: that leads to 81 results in 25 patents. We export these results to a LifeQuest workfile.
  • 23. EXPORT YOUR SEARCH RESULTS TO A WORKFILE Save it as a new “SEQ search” workfile, and open it to analyze.
  • 24. MARK ALL OF THE SEQUENCE SEARCH DOCUMENTS BLUE In the “SEQ search” workfile, color code all records as blue.
  • 25. SOLUTION: INTEGRATED LIFE SCIENCE SEARCH PLATFORMS Text Search: Identify relevant documents related to peanuts and claiming transgenic modification of plants that decrease allergy risks, limited to documents published after January 1st 2010. Sequence Search: Run a sequence search against the prior art for the peanut “ara h 1” gene sequence: Arachis hypogaea cultivar LUHUA 8 Ara h 1 allergen (ara h 1) gene (cds) CCCTCCATCATTTCACCATCCACACTCATAATAATCATATATA TTCATCAATCATCTATATAAGTAGTGGCAGGAGCAATGAGA GGGAGGGTTTCTCCACTGATGCTGTTGCT… Union: Combine into a single, unique workfile.
  • 26. CONSOLIDATE TEXT SEARCH AND SEQUENCE SEARCH RESULTS Merge the two workfiles together (union) to get a complete set for final analysis.
  • 27. EVALUATE THE MERGED DATA SETS Sort, filter, analyze, and export! Legend: vaccin* in claims; bioinformatics related; sequence hit in claims.
  • 29. GENERATE A COMPLETE REPORT FOR ANALYSIS Includes results from both sequence and text searches. Create color codes for your specific categories. Merge with other outputs or export to any format. Sort or filter by any field. Rank hits (1, 2, 3 stars) to easily identify priority. Legend: claims contain vaccin*; bioinformatics related; found using the “ara h 1” DNA sequence. A single, unified report for analyzing results.
  • 30. PLEASE COME BY OUR BOOTH FOR MORE INFORMATION.
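
The workflow in slides 14 to 26 (filter a workfile on a wildcard in the claims, color code the matches, mark every sequence hit blue, then take the union of the two workfiles) can be sketched in a few lines of Python. This is only an illustration of the idea, not the LifeQuest implementation or its API: the `Record` class, the helper functions, the patent numbers, and the claim texts are all hypothetical, and a workfile is assumed to be a dictionary keyed by patent number.

```python
import fnmatch
from dataclasses import dataclass, field


@dataclass
class Record:
    """One patent document in a workfile (hypothetical structure)."""
    patent_number: str
    claims: str
    colors: set = field(default_factory=set)


def claims_match(record, pattern):
    """True if any word in the claims matches a wildcard such as 'vaccin*'."""
    words = (w.strip(".,;:()") for w in record.claims.lower().split())
    return any(fnmatch.fnmatchcase(w, pattern) for w in words)


def annotate(workfile, predicate, color):
    """Bulk-annotate every record that satisfies the predicate."""
    for record in workfile.values():
        if predicate(record):
            record.colors.add(color)


def union(first, second):
    """Merge two workfiles; a patent present in both keeps all its colors."""
    merged = {}
    for workfile in (first, second):
        for number, record in workfile.items():
            if number in merged:
                merged[number].colors |= record.colors
            else:
                merged[number] = Record(number, record.claims, set(record.colors))
    return merged


# Made-up example data; the patent numbers and claims are placeholders.
text_search = {
    "EP0000001": Record("EP0000001", "A vaccine composition comprising a modified Ara h 1 protein."),
    "EP0000002": Record("EP0000002", "A transgenic peanut plant with reduced allergen expression."),
}
seq_search = {
    "EP0000002": Record("EP0000002", "A transgenic peanut plant with reduced allergen expression."),
    "EP0000003": Record("EP0000003", "An isolated nucleic acid encoding Ara h 1."),
}

annotate(text_search, lambda r: claims_match(r, "vaccin*"), "green")  # slides 16-18
annotate(seq_search, lambda r: True, "blue")                          # slide 24
merged = union(text_search, seq_search)                               # slide 26

for number in sorted(merged):
    print(number, sorted(merged[number].colors))
```

Keying the merge on the patent number is what makes the union "unique": a document found by both searches appears once and carries the annotations from each workfile.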