Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 1
Enriching Content with Semantic Tagging
Molecular Connections, Bangalore, India
www.molecularconnections.com
ICIC 2013, Vienna
Outline
• Introduction to MC
• Content Enrichment – Concept
• Content Enrichment Use Case
• Key Takeaways
About MC
OPERATIONS
• Information curation and annotation expertise
• Works with leading R&D institutions, STM publishers, and IP search and law firms
• Right mix of human resources and scale
• Life science (bio, chem), engineering, IP, and information and technology backgrounds
• Established workflows and processes to ensure quality and on-time delivery
• ISO 27001:2005 certified knowledge management platforms and workflow systems
CORPORATE
• Established in 2001
• Executive team backed by renowned informaticians and a strong advisory board; ~1000 strong
• Scalable, state-of-the-art infrastructure
• Global footprint
• Core values: customer focus, quality, ethics, excellence, accountability
Verticals
• Life Sciences companies
• Text mining & Informatics
• IP
• Publishing, R & D Institutions
Products and services
• MCPaIRS
• MCDESiGN
• Patent Search Services
MC - Solutions
Highly Customized Services
CONTENT MINING, CONTENT REPRESENTATION / DELIVERY, CONTENT MANAGEMENT
• App Development
• User Interface Design
• Visualization
• Analytics
• Indexing (automatic and semi-automatic)
• Abstraction (manual and semi-automatic)
• Open Access Data Mining
• Content Enrichment
• Semantic Tagging & systematic review of literature
• MC Outlink - Text Mining & Discovery
• Developing customized text mining engines
• Ontology Building
• Custom Database Creation
• Content Normalization
End-to-End Solutions
Over 3500 Man Years of expertise
Enriching Content
CONTENT ENRICHMENT
• Semantic Tagging
• Text Mining
• Ontology Mapping
• Augmented Reference
• Outlinking
Why CE?
• Enables deeper knowledge discovery from diverse sources such as patents, databases, and journals.
• Semantic tagging maps the different names of an entity to one standard name, so the entity is searchable by any of its names.
For instance: discoverability is a challenge in pharma patents, because entities of interest may be named differently in different patents by different authors.
• Publishers have been quick to adopt CE; is it time to adopt it for patents?
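The name-mapping idea above can be illustrated with a minimal Python sketch. The dictionary, the function names, and the two sample patent snippets are assumptions made for illustration; the slides do not describe how Molecular Connections implements tagging.

```python
import re

# Hypothetical synonym dictionary: every surface form maps to one standard name.
SYNONYMS = {
    "salbutamol": "salbutamol",
    "albuterol": "salbutamol",
    "paracetamol": "paracetamol",
    "acetaminophen": "paracetamol",
    "amethocaine": "tetracaine",
    "tetracaine": "tetracaine",
}


def standard_name(mention):
    """Return the standard name for a mention, or None if it is unknown."""
    return SYNONYMS.get(mention.strip().lower())


def tag_document(text):
    """Return the set of standard entity names mentioned in the text."""
    tokens = re.findall(r"[A-Za-z0-9]+", text)
    return {standard_name(token) for token in tokens} - {None}


# Two invented patent snippets that name the same drug differently.
patents = {
    "P1": "A formulation comprising albuterol sulphate",
    "P2": "Inhaler delivering salbutamol",
}
index = {pid: tag_document(text) for pid, text in patents.items()}

# Searching by either name resolves to the same standard name.
wanted = standard_name("Albuterol")
hits = sorted(pid for pid, entities in index.items() if wanted in entities)
print(hits)
```

Because both documents are indexed under the standard name, a query for either synonym retrieves both.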
Unlocking Small Data to Big Data
[Figure: number of articles (diamonds) and patents (open boxes) abstracted annually by Chemical Abstracts Service. Source: Bachrach, Journal of Cheminformatics 2009, 1:2, doi:10.1186/1758-2946-1-2]
Need Smarter Content
Leveraging Linked Data
Implementation - Content Enrichment Levels
What kind of Content Enrichment can be done?
• Entity
- Author/Assignee, Protein, Gene, Drug, Chemical, Disease, Reaction, Organism, Technology, Organization
• Document
- Journal article
- Patent
- Book chapter
• Others
- Image
- Table
- Multimedia
- News links
Content Enrichment – Use Case
MCPaIRS TM (Proprietary Indian Patent Database)
Molecular Connections Patent Information Retrieval System
• An "Expertly, Manually Curated, Fully Searchable, Value Added Knowledgebase" of the full text of Indian granted patents and patent applications
• Caters to a diverse user base of bench scientists, engineers, R&D managers, and business professionals
MCPaIRS TM – Homepage
MCPaIRS TM – Search
MCPaIRS TM – View Patent
Demo of actual full text document
Benefits of Semantic Search Cartridge Enabled MCPaIRS TM
• All results in a single query
• Automatic expansion of the query with all possible synonyms
• Broadening of the search query
• Complex search queries possible
• All the synonyms highlighted
Automatic expansion of the query with all possible synonyms
Multiple keywords highlighted for the search: VEGF
Complex queries can be performed by using operators
Boolean search is performed
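The behaviour described on these slides (synonym expansion, Boolean operators, and highlighting of every synonym) can be sketched roughly as follows. This is only an illustration: the synonym groups, the function names, and the sample documents are hypothetical, and the actual Semantic Search Cartridge is not described in the slides.

```python
import re

# Hypothetical synonym groups; a real system would draw on a curated ontology.
SYNONYM_GROUPS = [
    {"vegf", "vascular endothelial growth factor"},
    {"salbutamol", "albuterol"},
]


def expand(term):
    """Return the term together with all of its known synonyms (lowercased)."""
    term = term.lower()
    for group in SYNONYM_GROUPS:
        if term in group:
            return set(group)
    return {term}


def matches(text, term):
    """True if the text contains the term or any synonym as a whole phrase."""
    return any(
        re.search(r"\b" + re.escape(synonym) + r"\b", text, re.IGNORECASE)
        for synonym in expand(term)
    )


def search_and(docs, terms):
    """Boolean AND: keep documents that match every (expanded) term."""
    return [doc_id for doc_id, text in docs.items()
            if all(matches(text, term) for term in terms)]


def highlight(text, term):
    """Wrap every occurrence of the term or its synonyms in [[...]]."""
    # Longest synonyms first so a long phrase is not split by a shorter match.
    alternatives = sorted(expand(term), key=len, reverse=True)
    pattern = r"\b(" + "|".join(re.escape(s) for s in alternatives) + r")\b"
    return re.sub(pattern, r"[[\1]]", text, flags=re.IGNORECASE)


# Invented sample documents.
docs = {
    "D1": "Vascular endothelial growth factor inhibitors",
    "D2": "VEGF and albuterol co-therapy",
    "D3": "Aspirin tablets",
}
print(search_and(docs, ["VEGF"]))
print(search_and(docs, ["VEGF", "salbutamol"]))
print(highlight(docs["D2"], "vegf"))
```

A single query for VEGF returns documents that use either the abbreviation or the full name, and the AND query only needs one name per entity.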
Sample queries with Semantic Search Cartridge (number of results)
No  Query                 iPairs  MCPaIRS  MCPaIRS with semantic search cartridge
1   Salbutamol                27     1560   2548
2   Amethocaine                0       58    954
3   Diazepam                   4     1725   2146
4   Valsartan                 84     1372   1429
5   Imatinib                  65     1703   1999
6   Tamoxifen                 16     3950   4190
7   Aspirin                   61     5679   6427
8   Paracetamol               74     1161   3696
9   MyoD                       2      130    138
10  Pax3                       1       49     56
11  Sox9                       0       39     58
12  FGF10                      0       43    131
13  VEGF                     192     4808   6058
14  BMP2                       5      137    214
15  Salbutamol AND CD48        0        0      4
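To read the sample-query counts as gains, the short Python sketch below computes, for each query, how many additional results the semantic search cartridge returns over plain MCPaIRS. The numbers are copied from the slide; the function name and the choice of a percentage relative to plain MCPaIRS are assumptions made for illustration.

```python
# (query, iPairs, MCPaIRS, MCPaIRS with semantic search cartridge),
# copied from the sample-query slide.
RESULTS = [
    ("Salbutamol", 27, 1560, 2548),
    ("Amethocaine", 0, 58, 954),
    ("Diazepam", 4, 1725, 2146),
    ("Valsartan", 84, 1372, 1429),
    ("Imatinib", 65, 1703, 1999),
    ("Tamoxifen", 16, 3950, 4190),
    ("Aspirin", 61, 5679, 6427),
    ("Paracetamol", 74, 1161, 3696),
    ("MyoD", 2, 130, 138),
    ("Pax3", 1, 49, 56),
    ("Sox9", 0, 39, 58),
    ("FGF10", 0, 43, 131),
    ("VEGF", 192, 4808, 6058),
    ("BMP2", 5, 137, 214),
    ("Salbutamol AND CD48", 0, 0, 4),
]


def cartridge_gain(plain, enriched):
    """Extra hits from the cartridge: absolute count and percent of the plain count."""
    extra = enriched - plain
    # A percentage is undefined when the plain search returned nothing.
    percent = round(100 * extra / plain, 1) if plain else None
    return extra, percent


for query, _ipairs, plain, enriched in RESULTS:
    extra, percent = cartridge_gain(plain, enriched)
    relative = "n/a" if percent is None else f"+{percent}%"
    print(f"{query:<22}{extra:>6}  {relative}")
```

The last row is the case where only the enriched search finds anything, so it is reported as an absolute gain without a percentage.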
Benefit - Identifying Related Patents
Patent A: Proteins, Chemicals, Indications, ...
Patent B: Proteins, Chemicals, Indications, ...
Similarity score between A and B indicates relatedness
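One way such a similarity score could be computed from tagged entities is sketched below. The slides do not say which measure is used; Jaccard similarity per entity type, the equal default weights, and the two example patents are all assumptions chosen for illustration.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_score(patent_a, patent_b, weights=None):
    """Weighted mean of per-type Jaccard similarities over tagged entities."""
    types = sorted(set(patent_a) | set(patent_b))
    # Assumption: every entity type counts equally unless weights are given.
    weights = weights or {entity_type: 1.0 for entity_type in types}
    total = sum(weights.get(t, 0.0) for t in types)
    if total == 0:
        return 0.0
    weighted = sum(
        weights.get(t, 0.0) * jaccard(patent_a.get(t, set()), patent_b.get(t, set()))
        for t in types
    )
    return weighted / total


# Invented entity sets for two patents, keyed by entity type.
patent_a = {
    "proteins": {"VEGF"},
    "chemicals": {"imatinib", "aspirin"},
    "indications": {"cancer"},
}
patent_b = {
    "proteins": {"VEGF", "BMP2"},
    "chemicals": {"imatinib"},
    "indications": {"cancer"},
}
score = similarity_score(patent_a, patent_b)
print(round(score, 3))
```

Because the comparison runs on standard entity names rather than raw text, two patents can score as related even when they use different synonyms for the same protein or chemical.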
Content Enrichment Approaches
• Manual: high quality, costly, not scalable, slow
• Automated: fast, quality below par, cost effective, scalable
• Hybrid: high quality, cost effective, scalable, reasonable speed
Molecular Connections is a pioneer in the use of a hybrid approach to content enrichment
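A hybrid workflow is often organised so that automated annotations above a confidence threshold are accepted directly and the rest go to a human curator. The sketch below shows that routing step only; it is an assumption about how such a pipeline might look, not a description of the Molecular Connections workflow, and the class, threshold, and sample scores are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    mention: str       # text as it appears in the document
    entity: str        # standard name proposed by the automated tagger
    confidence: float  # tagger score in [0, 1] (hypothetical)


def route(annotations, threshold=0.9):
    """Split annotations into auto-accepted ones and ones queued for a curator."""
    accepted = [a for a in annotations if a.confidence >= threshold]
    review = [a for a in annotations if a.confidence < threshold]
    return accepted, review


# Invented tagger output for illustration.
proposed = [
    Annotation("albuterol", "salbutamol", 0.98),
    Annotation("VEGF", "vascular endothelial growth factor", 0.95),
    Annotation("CD48", "CD48 antigen", 0.62),
]
accepted, review = route(proposed)
print([a.mention for a in accepted])
print([a.mention for a in review])
```

Raising the threshold sends more annotations to manual review (higher quality, slower); lowering it moves the workflow toward the fully automated end of the trade-off listed above.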
Key Takeaways
• Content Enrichment can improve search and retrieval immensely
• CE can be looked at on various levels
- Biology / chemistry / both / authors etc.
• You can bring the Web into the document through CE
- e.g. Augmented reference cards
• Growing adoption of Content Enrichment
- Publishing (early adopters)
- Patents
Thank You
Molecular Connections
www.molecularconnections.com

ICIC 2013 Conference Proceedings Krishna Molecular Connections
