InfoCodex Semantic Technologies
Turning Information into Knowledge
Scientific Discovery by Machine Intelligence:
A New Avenue for Drug Research?
Dr. Carlo A. Trugenberger
Co-Founder and Chief Scientific Officer
InfoCodex Semantic Technologies AG, CH-9470 Buchs
September 2, 2015
www.InfoCodex.com
Semantics 2015
Big changes in pharmaceutical research
The end of the blockbuster era?
Challenges
➢ Costs are exploding
➢ Regulatory pressure
➢ Personalized medicine
➢ Outsourcing of critical processes
Critical for survival:
➢ Shorten time-to-market
➢ Early recognition of dead ends
Opportunities
➢ Genomics / Proteomics
➢ Big data / data mining
➪ structure-based design
➢ Drugs are “computed” rather than discovered
Critical to beat competition:
➢ Data + data analysis power
➢ Machine intelligence
The data deluge as an opportunity for eDiscovery
Traditional bioinformatics: structured data
sequence alignment, gene finding, genome assembly, protein structure prediction, gene expression…
New idea: exploit unstructured data
PubMed: 22 million citations, growing at a rate of 1.7 papers per minute
Experiment: Merck + Thomson Reuters + InfoCodex
Is it possible to drive drug research by text mining large pools of biomedical documents?
The Experiment of Merck & Co with InfoCodex
The tasks:
➢ Discover novel biomarkers for diabetes and obesity (D&O) by analyzing 120,000 medical publications (PubMed + ClinicalTrials.org + internal)
➢ Blind experiment, no human feedback
The aim:
➢ Test pure machine intelligence for “semantic drug research”
Biomarkers: a $13.6 billion market in 2011, growing to $25 billion by 2016.
Semantic technologies in the pharma industry
Most existing projects use NLP to extract triples “entity 1 - relation - entity 2” sentence by sentence ➪ this helps to curate ontologies / libraries
However: this is not a discovery approach
Relations found this way have been written down explicitly by human authors and are thus already known in one way or another
Going beyond triples: analyze text collections globally to identify small, seemingly unrelated and unnoticed facts dispersed over isolated texts, assembling the scattered pieces of a puzzle
Critical: machine intelligence
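As a rough illustration of the sentence-by-sentence triple extraction described above, here is a minimal Python sketch. The relation list, the regular expression, and the sample text are hypothetical and chosen only for illustration; real NLP pipelines use parsers and entity recognizers rather than a single pattern. The point is that such an extractor can only return relations an author has already written down in one sentence.

```python
import re

# Hypothetical, deliberately tiny relation vocabulary (illustration only).
RELATION_PATTERN = re.compile(
    r"^(?P<subject>[\w-]+) (?P<relation>is a|inhibits|activates) (?P<object>[\w -]+?)\.?$",
    re.IGNORECASE,
)


def extract_triples(text):
    """Return (entity 1, relation, entity 2) triples found sentence by sentence."""
    triples = []
    # Naive sentence split: break after each period followed by whitespace.
    for sentence in re.split(r"(?<=\.)\s+", text.strip()):
        match = RELATION_PATTERN.match(sentence)
        if match:
            triples.append(
                (match["subject"], match["relation"].lower(), match["object"])
            )
    return triples


# Toy input, not taken from the experiment's corpus.
TEXT = (
    "Metformin inhibits gluconeogenesis. "
    "Hydrochlorothiazide is a diuretic drug. "
    "Results were mixed."
)

triples = extract_triples(TEXT)
for triple in triples:
    print(triple)
```

Every triple returned here is stated verbatim in a single sentence, which is why this style of extraction suits curation rather than discovery.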
The Technology: eDiscovery by InfoCodex
Linguistics + Information Theory + Self-Organization
➢ Completely automatic semantic analysis of content
➢ Designed for uncovering unnoticed correlations among information distributed over document groups and collections (in contrast to NLP)
➢ “Assemble the pieces of a puzzle”
➢ Knowledge discovery as opposed to information extraction
Step 1: establish reference models for biomarkers / phenotypes
➢ Cluster documents describing known biomarkers (224 references found)
➢ Reference model for each cluster → meanings for “biomarkers diabetes” …
Step 2: determine the meaning of unknown words by machine inference.
Step 3: analyze documents and generate a list of potential D&O biomarkers/phenotypes by comparison with the reference models.
Step 4: establish confidence levels
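The slides do not disclose how InfoCodex implements these steps, so the following Python sketch is only one possible reading, under loudly stated assumptions: the reference model is a plain bag-of-words built from a single cluster of documents about known biomarkers (Step 1, with clustering skipped), candidates are scored by the cosine similarity between their textual context and that model (Step 3), and the similarity itself serves as a crude confidence level (Step 4). Step 2 is left out. All function names, documents, and candidate terms are hypothetical toy data.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased word counts for one document."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def reference_model(reference_docs):
    """Step 1 (simplified): merge one cluster of known-biomarker documents."""
    model = Counter()
    for doc in reference_docs:
        model.update(bag_of_words(doc))
    return model


def context_vector(term, corpus):
    """Words that share a document with `term`, excluding the term itself."""
    context = Counter()
    for doc in corpus:
        words = bag_of_words(doc)
        if term in words:
            del words[term]
            context.update(words)
    return context


def rank_candidates(candidates, corpus, model):
    """Steps 3 and 4 (simplified): score candidates, highest confidence first."""
    scored = [(term, cosine(context_vector(term, corpus), model)) for term in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data, invented for illustration.
REFERENCE_DOCS = [
    "hba1c is an established biomarker of glycemic control in diabetes",
    "fasting glucose is a biomarker for diabetes risk",
]
CORPUS = [
    "serum proteinx levels correlate with glycemic control and diabetes risk",
    "the committee discussed budgetx and travel plans",
]

model = reference_model(REFERENCE_DOCS)
ranked = rank_candidates(["budgetx", "proteinx"], CORPUS, model)
for term, confidence in ranked:
    print(f"{term}: {confidence:.2f}")
```

In this toy run the made-up term `proteinx` ranks first because its context shares vocabulary with the reference model, while `budgetx` scores zero. A hybrid workflow would hand such a ranked list to human experts for curation.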
(Diagram label: encoded meanings)
Determination of the meaning of unknown words: machine inference
Example:
“Hctz” is a “diuretic drug” and is a synonym of “hydrochlorothiazide”
Such relations are established solely on the basis of machine intelligence combined with an internal knowledge base
Co-occurrences with words in the internal knowledge base
→ most probable hypernym → “is a”, “has to do with”
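The co-occurrence idea above can be sketched in a few lines of Python. This is a simplified stand-in, not InfoCodex's actual inference: the knowledge base, the sentences, and the voting rule (each known word in the same sentence casts one vote for its hypernym) are all assumptions made for illustration. The sketch covers only the “is a” relation, not the detection of synonyms such as “hydrochlorothiazide”.

```python
import re
from collections import Counter

# Hypothetical mini knowledge base: known word -> hypernym (toy entries).
KNOWLEDGE_BASE = {
    "furosemide": "diuretic drug",
    "chlorthalidone": "diuretic drug",
    "diuretic": "diuretic drug",
    "metformin": "antidiabetic drug",
    "insulin": "hormone",
}


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def infer_hypernym(unknown, sentences, kb=KNOWLEDGE_BASE):
    """Return (most probable hypernym, share of votes), or None without evidence."""
    unknown = unknown.lower()
    votes = Counter()
    for sentence in sentences:
        tokens = tokenize(sentence)
        if unknown not in tokens:
            continue  # only sentences mentioning the unknown word count
        for token in tokens:
            if token in kb:
                votes[kb[token]] += 1  # one vote per co-occurring known word
    if not votes:
        return None
    hypernym, count = votes.most_common(1)[0]
    return hypernym, count / sum(votes.values())


# Toy sentences, invented for illustration.
SENTENCES = [
    "Patients received Hctz or furosemide as diuretic therapy.",
    "Hctz and chlorthalidone lowered blood pressure.",
    "Metformin was added when Hctz raised glucose levels.",
    "Insulin doses were unchanged.",
]

result = infer_hypernym("Hctz", SENTENCES)
print(result)
```

The vote share gives a rough indication of how strongly the evidence favors one hypernym over the alternatives; a production system would need weighting and a much larger knowledge base.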
The output
The Results
Many uninteresting candidates
Too much noise (the problem has been identified and corrected)
Lots of “needles in the haystack”
Tens of extremely interesting and valuable candidates with very high potential
Conclusion
✓ The approach has high potential for discovery
✓ The approach has the potential to impact pharma research
  ❑ Speed up time-to-market
  ❑ Early recognition of dead ends
✗ Improvements in the process are needed: problems have been identified and corrected.
➢ Most promising is a hybrid approach
  ❑ Human expertise in the formulation of reference models
  ❑ Human curation of candidates prior to passing them to the laboratory
✓ Possibly an inevitable development
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 

Carlo Trugenberger: Scientific Discovery by Machine Intelligence: A New Avenue for Drug Research

  • 1. InfoCodex Semantic Technologies: Turning Information into Knowledge. Scientific Discovery by Machine Intelligence: A New Avenue for Drug Research? Dr. Carlo A. Trugenberger, Co-Founder and Chief Scientific Officer, InfoCodex Semantic Technologies AG, CH-9470 Buchs. Semantics 2015, September 2, 2015. www.InfoCodex.com
  • 2. Big changes in pharmaceutical research: the end of the blockbuster era? Challenges: costs are exploding; regulatory pressure; personalized medicine; outsourcing of critical processes. Opportunities: genomics / proteomics; big data / data mining ➪ structure-based design; drugs are “computed” rather than discovered. Critical for survival: shorten time-to-market; early recognition of dead ends. Critical to beat competition: data + data analysis power; machine intelligence.
  • 3. The data deluge as an opportunity for eDiscovery. Traditional bioinformatics: structured data (sequence alignment, gene finding, genome assembly, protein structure prediction, gene expression…). New idea: exploit unstructured data. PubMed: 22 million citations, growing at the rate of 1.7 papers/minute. Experiment: Merck + Thomson Reuters + InfoCodex. Is it possible to drive drug research by text mining large pools of biomedical documents?
  • 4. The Experiment of Merck & Co with InfoCodex. The tasks: discover novel biomarkers for diabetes and obesity (D&O) by analyzing 120’000 medical publications (PubMed + ClinicalTrials.org + internal); blind experiment, no human feedback. The aim: test pure machine intelligence for “semantic drug research”. Biomarker: $13.6 billion market in 2011, growing to $25 billion by 2016.
  • 5. Semantic technologies in the pharma industry. Most existing projects use NLP to extract triples “entity 1-relation-entity 2” sentence by sentence ➪ help to curate ontologies / libraries. However: this is not a discovery approach. Relations found this way have been explicitly written by human authors and are thus known in one way or another. Going beyond triples: analyze text collections globally to identify small, seemingly unrelated and unnoticed facts dispersed over isolated texts, assembling the scattered pieces of a puzzle. Critical: machine intelligence.
  • 6. The Technology: eDiscovery by InfoCodex. Linguistics + Information Theory + Self-Organization. Completely automatic semantic analysis of content. Designed for uncovering unnoticed correlations amongst information distributed over document groups and collections (contrary to NLP). “Assemble the pieces of a puzzle”. Knowledge discovery as opposed to information extraction.
  • 7. [Image-only slide; no text recovered]
  • 8. Step 1: establish reference models for biomarkers / phenotypes: cluster documents describing known biomarkers (224 references found); reference model for each cluster → meanings for “biomarkers diabetes” … Step 2: determine the meaning of unknown words by machine inference. Step 3: analyze documents and generate a list of potential D&O biomarkers/phenotypes by comparison with the reference models. Step 4: establish confidence levels. [Figure label: Encoded meanings]
  • 9. Determination of the meaning of unknown words: machine inference. Co-occurrences with words in the internal knowledge base → most probable hypernym → “is a”, “has to do”. Example: “Hctz” is a “diuretic drug” and is a synonym of “hydrochlorothiazide”. Such relations are established only on the basis of machine intelligence combined with the internal knowledge base.
  • 10. The output [image-only slide]
  • 11. The Results. Many uninteresting candidates: too much noise (the problem has been identified and corrected). Lots of “needles in the haystack”: tens of extremely interesting and valuable candidates with very high potential.
  • 12. Conclusion. ✓ Approach has high potential for discovery. ✓ Approach has potential to impact pharma research: speed up time-to-market; early recognition of dead ends. ✗ Improvements in the process are needed: problems have been identified and corrected. Most promising is a hybrid approach: human expertise in formulation of reference models; human curation of candidates prior to passing to the laboratory. ✓ Possibly inevitable development.
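
Slides 8 and 9 describe inferring the meaning of an unknown word from its co-occurrences with known words, and comparing documents with reference models. The sketch below is one possible toy reading of those two ideas, not InfoCodex's actual method: the knowledge base, the corpus sentences, the function names, the vote-share confidence and the use of cosine similarity are all assumptions made for illustration.

```python
from collections import Counter
import math

# Hypothetical mini knowledge base: known word -> hypernym ("is a" category).
KNOWLEDGE_BASE = {
    "furosemide": "diuretic drug",
    "chlorthalidone": "diuretic drug",
    "metformin": "antidiabetic drug",
    "insulin": "antidiabetic drug",
}

# Invented toy corpus, for illustration only.
DOCUMENTS = [
    "patients received hctz or furosemide for fluid retention",
    "hctz and chlorthalidone lowered blood pressure",
    "metformin and insulin improved glucose control",
    "furosemide was replaced by hctz after two weeks",
]


def infer_hypernym(unknown, documents, knowledge_base):
    """Return (hypernym, vote share) for an unknown word, or None if no evidence."""
    votes = Counter()
    for doc in documents:
        tokens = set(doc.lower().split())
        if unknown not in tokens:
            continue
        # Every known word sharing a document with the unknown word
        # casts one vote for its own hypernym.
        for token in tokens:
            if token in knowledge_base:
                votes[knowledge_base[token]] += 1
    if not votes:
        return None
    hypernym, count = votes.most_common(1)[0]
    # The vote share serves as a crude confidence level (assumption).
    return hypernym, count / sum(votes.values())


def cosine_similarity(doc_text, reference_text):
    """Bag-of-words cosine similarity between a document and a reference model."""
    a = Counter(doc_text.lower().split())
    b = Counter(reference_text.lower().split())
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    print(infer_hypernym("hctz", DOCUMENTS, KNOWLEDGE_BASE))
    print(cosine_similarity(DOCUMENTS[2], "insulin glucose biomarker"))
```

In this toy setting "hctz" only ever appears next to known diuretics, so the diuretic hypernym collects every vote. A real system would need weighting, stemming and a far larger knowledge base, which is presumably where the noise reported on slide 11 comes from.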