SlideShare a Scribd company logo
1 of 58
Download to read offline
Big Data Analyses in Pharma
An Overview
Josef Scheiber, PhD
Managing Director
July 2015
Geographie
Startup Center in Waldsassen
Main site
Data Analyses and Software
Development
Westpark Center
Garmischer Str. in Munich
Scientific ActivitiesSince Jan 1, 2015
Basel/Switzerland
Data Curation and customer-
related activities
Prag
150 km
München
200 km
Berlin
300 km
Frankfurt
250 km
BioVariance at a Glance –
Get most out of your complex data
Curate.Integrate
Analyze.Model
Visualize.Explore
DECIDE
Overview
• Background
• Strategy
• Examples
Background
Courtesy: M. Zeinab, slideshare
What do we need out of Big Data?
1. What are the inhibitors of kinase X and the five most similar
kinases with IC50 < 1 μM and with MW < 500 from all internal and
external data sources?
2. What assay technologies have been used against my kinase?
Which cell lines?
3. What other proteins are in the same kinase branch as target X,
where there were validated chemical hits from external or
internal sources?
4. If I hit a particular kinase, what would the potential side-effect
profile look like? Which known inhibitor of this kinase has the
best safety profile and the fewest known IC50s?
5. Have I identified other compounds with a bioactivity profile
similar to compound X and with the same core substructure?
6. Can we create a phylochemical tree of kinases and for a new
kinase target place it into the tree on the basis of activity against a
reference panel of compounds?
7. Have I identified all kinases with an x-ray structure (in-house or
external) that are in pathway X?
Bridging Chemical and Biological Data: Implications for Pharmaceutical Drug Discovery
JL Jenkins, J Scheiber, D Mikhailov, A Bender, A Schuffenhauer, B Cornett, V Chan, J
Kondracki, B Rohde, JW Davies (2012) In: Computational Approaches in Cheminformatics and
Bioinformatics Edited by:A Bender, R Guha. 25-56 John Wiley & Sons, Inc.
ANSWERS
Context matters!
metabolites
drugs
targets pathways
diseases (phenotypes)
Context matters
RNADNA
It´s not that simple …
Descriptive:
What happened?
Diagnostic:
Why did it happen?
Predictive:
What will happen?
Prescriptive:
How can we make it
happen?
Better data for better analytics
Hindsight Insight Foresight
Need for interpretation
33,3
10
20
30
70
33,3 80
70
60
10
33,3
10 10 10
20
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Before molecular
biology
Molecular biology
golden age
Genomics age Deep sequencing
age
Very soon
Data Analysis Experiment Experimental Design
Big Data?
Volume
Genome Sequencing
Slide adapted from George Church
Genome Sequencing
Slide adapted from George Church
Cost Reduction - Example
458 Ferrari Spider - $398,000 in 2006 –
40 cents now!
 Much more data for way less
money
Challenges for Informatics? –
1 genome is roughly 500 GB/data
2011 – several 100 exomes
Drug Discovery Pipeline
Target
finding
Lead Finding
Lead
Optimization
… Phase 1 … Market
Drug candidates Patients
Velocity
Velocity
• Mutations in tumor
• Resistance mechanisms in patients
• long term/short term AE
• compliance
• Nutrition and microbiome
• Data from wearables relevant for drugs
For each patient
Variety
Variety
Variety
• Bioinformatics
• Clinical
• Social network
• E-health
• Also text/patents
A simplified overview –
Molecules in Man
Adapted from Gohlke JM, Portier CJ.
Environ. Health Perspect. 115:1261-1263 (2007)
A question of complexity –They all
interact …
Biology
Chemistry
Physics
Dealing with a very complex environment –
i.e. many opportunities
 DNA
 RNA
 Protein
 Interactions
 Clinical parameters
 Treatment History
 Tissue anatomy
 Surgical History
 Epigenetic Profiles from many
patients at different
timeponits
 Target
 Off-targets
 Metabolites
 Additional indications
 Unspecific effects
 Similar drugs
Adapted from: J. Scheiber; How can we enable drug discovery informatics for personalized healthcare?
Expert Opinion on Drug Discovery, 1-6; 2/2011
… individual polypharmacology
Sequences Expression Proteomics Biological networks
(but also: Cells, Tissues, Organs)
POPULATION
Veracity
Veracity
• Chemogenomics data
• Gene expression data
 Imputation?
Veracity - Chemogenomics
Adapted from Tanrikulu et al. Missing
Value Estimation for Compound-
Target Activity Data, J. Mol. Inf
Veracity - Interactomics
A Proteome-Scale Map of the Human
Interactome Network
Rolland, Thomas et al.
Cell , Volume 159 , Issue 5 , 1212 - 1226
Veracity – Social Media
Strategy
Biological/Pharmacological
Understanding
drugs
targets pathways
diseases (phenotypes)
Data integration strategy
a) A central vocabulary/pointer server (information
stored are preferred names and synonyms plus
pointers to data servers, where to find what)
b)  semantic integration layer with domain-specific
terminology and referential data
c) A database for each datatype collected, storing only
preferred names along with raw measurements
d) Clearly defined APIs for further integration with
public data sources and to enable large-scale
analyses
Vocabularies needed
• Genes, Drugs, Proteins
• Diseases
• Organisms
• Microbiome species & genes
• Localization & source
• Phenotype
• Metabolite common names
Answering workflow
Vocabulary
Vocabulary server acts as
translator, aggregator and
locator, i.e. knows where
the respective facts can be
found
Firmicutes produce alpha-Linolein and thereby cause gut irritation
species
metabolite
Further
Data of each type is
stored in a specific
database to
enhance
performance of
large-scale analyses
Expert tools talk to
data directly or via
webservices
API
API
API
API
Enduserinterfaceand
visualization
Examples
Genome data at scale
Workflow
Identify drug targets
(primary and off-targets,
from DrugBank)
Call variations on a per-
individuum basis
Workflow
Analyse mutation rates in
the targets and in
particular drug binding
pockets
Example: Donepezil /
Acetylcholinesterase
• PDB 4EY7
Image extracted from Cheung et al.,
2012 [2]
Example: Donepezil /
Acetylcholinesterase
Example: Acetylcholinesterase
Integrative Genomics Viewer
Not very successful
Alignment of the 3D
structures of mutant
number 52 (yellow) and
PDB 4EY7 AChE protein
(green). The only changed
residue is the Y150
(magenta) to H150 (red).
The white surface
represents the molecular
surface of donepezil.
Why is this a bad example?
AChE a key enzyme in human biology  these are
the most highly conserved, even interspecies
 Learning: Look at that stuff before investing
time 
Generating
Vocabularies
Vocabulary generation
Extensive mapping of terms from various sources
Vocabulary generation
397211
preferred
names
598532
synonyms
102086
identifiers
The chevron diagram shows the number of samples annotated
with names. Already by looking at the numbers you can see tha
mapping everything is non-trivial.
A Big Data exercise in itself …
Tweet mining
Mining Twitter for side effects
Needed Drug Name
and synonyms:
Adalimumab
Humira
Exemptia
331731-18-1
L04AB04
MedDRA vocabulary
Many birds tweet lots of noise …
BUT …
• [1] "Lipitor headache 0"
[1] "Lipitor rash 1"
[1] "Lipitor pain 27"
[1] "Lipitor bleeding 0"
[1] "Lipitor cough 0"
[1] "Lisinopril headache 0"
[1] "Lisinopril rash 0"
[1] "Lisinopril pain 8"
[1] "Lisinopril bleeding 0"
[1] "Lisinopril cough 7"
[1] "Simvastatin headache 0"
[1] "Simvastatin rash 0"
[1] "Simvastatin pain 0"
[1] "Simvastatin bleeding 0"
[1] "Simvastatin cough 0"
[1] "Plavix headache 0"
[1] "Plavix rash 0"
[1] "Plavix pain 0"
[1] "Plavix bleeding 1"
[1] "Plavix cough 0"
[1] "Crestor headache 0"
[1] "Crestor rash 0"
[1] "Crestor pain 0"
[1] "Crestor bleeding 0"
[1] "Crestor cough 0"
Top 200 drugs
- Cutoff is at 1500 tweets that a
few drugs easily surpass (although
it's mostly only pharmacies
advertizing)
- Others are not mentioned once
(probably a synonym issue as I
restricted to English as language). -
- top drugs are tweeted more
often, but e.g. Tarceva (in 2006) at
the very bottom also reaches the
top number of tweets (109 on list).
089 – 189 6582 – 80
Garmischer Str. 4/V
80339 München
josef.scheiber@biovariance.com:
09632 – 9248 325
Konnersreuther Str. 6g
95652 Waldsassen
Questions?

More Related Content

What's hot

William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...Flink Forward
 
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesMT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesDell EMC World
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big datahktripathy
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Hellmuth Broda
 
Application of data science in healthcare
Application of data science in healthcareApplication of data science in healthcare
Application of data science in healthcareShreyaPai7
 
Business impact without data governance
Business impact without data governanceBusiness impact without data governance
Business impact without data governanceJohn Bao Vuu
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)DATAVERSITY
 
Data Quality
Data QualityData Quality
Data QualityVijaya K
 
Data science in health care
Data science in health careData science in health care
Data science in health careChetan Khanzode
 
Dbms and it infrastructure
Dbms and  it infrastructureDbms and  it infrastructure
Dbms and it infrastructureprojectandppt
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDatabricks
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisEva Durall
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data managementCunera Buys
 
Creating a Data Culture
Creating a Data CultureCreating a Data Culture
Creating a Data CulturePipa Unsworth
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Cloudera, Inc.
 

What's hot (20)

William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
 
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesMT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
 
Application of data science in healthcare
Application of data science in healthcareApplication of data science in healthcare
Application of data science in healthcare
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 
Business impact without data governance
Business impact without data governanceBusiness impact without data governance
Business impact without data governance
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)
 
Data Quality
Data QualityData Quality
Data Quality
 
Data Quality Presentation
Data Quality PresentationData Quality Presentation
Data Quality Presentation
 
Data science in health care
Data science in health careData science in health care
Data science in health care
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Dbms and it infrastructure
Dbms and  it infrastructureDbms and  it infrastructure
Dbms and it infrastructure
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge Graphs
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
 
Machine Learning in Healthcare and Life Science
Machine Learning in Healthcare and Life ScienceMachine Learning in Healthcare and Life Science
Machine Learning in Healthcare and Life Science
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
 
Creating a Data Culture
Creating a Data CultureCreating a Data Culture
Creating a Data Culture
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360
 

Viewers also liked

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
Improving pharmaceutical marketing using big data solutions
Improving pharmaceutical marketing using big data solutionsImproving pharmaceutical marketing using big data solutions
Improving pharmaceutical marketing using big data solutionsPaul Grant
 
Data mining (DM) in the pharmaceutical industry
Data mining (DM) in the pharmaceutical industryData mining (DM) in the pharmaceutical industry
Data mining (DM) in the pharmaceutical industrylurdhu agnes
 
New Pharma Market Reality - Predictive Analytics is the Solution
New Pharma Market Reality - Predictive Analytics is the SolutionNew Pharma Market Reality - Predictive Analytics is the Solution
New Pharma Market Reality - Predictive Analytics is the SolutionDr. Sandeep Juneja
 
Application of BI in pharmaceutical industry
Application of BI in pharmaceutical industryApplication of BI in pharmaceutical industry
Application of BI in pharmaceutical industryBiBoard.Org
 
Bio variance j_scheiber_bioit_repurposingworkshop2013_draft
Bio variance j_scheiber_bioit_repurposingworkshop2013_draftBio variance j_scheiber_bioit_repurposingworkshop2013_draft
Bio variance j_scheiber_bioit_repurposingworkshop2013_draftJosef Scheiber
 
BioVariance Research Services - Target Profile Prediction
BioVariance Research Services - Target Profile PredictionBioVariance Research Services - Target Profile Prediction
BioVariance Research Services - Target Profile PredictionJosef Scheiber
 
Conference presentation from #iccs2014 in Noordwijkerhout
Conference presentation from #iccs2014 in NoordwijkerhoutConference presentation from #iccs2014 in Noordwijkerhout
Conference presentation from #iccs2014 in NoordwijkerhoutJosef Scheiber
 
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...Josef Scheiber
 
BioVariance - Pediatric Pharmacogenomics in Drug Discovery
BioVariance - Pediatric Pharmacogenomics in Drug DiscoveryBioVariance - Pediatric Pharmacogenomics in Drug Discovery
BioVariance - Pediatric Pharmacogenomics in Drug DiscoveryJosef Scheiber
 
Mobile Health Forum Frankfurt - Therapieempfehlung per Smartphone
Mobile Health Forum Frankfurt - Therapieempfehlung per SmartphoneMobile Health Forum Frankfurt - Therapieempfehlung per Smartphone
Mobile Health Forum Frankfurt - Therapieempfehlung per SmartphoneJosef Scheiber
 
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s GoingBig Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s GoingHealth Catalyst
 
Digital Asset Management in Pharma
Digital Asset Management in PharmaDigital Asset Management in Pharma
Digital Asset Management in Pharmaphillycaferacer
 
Legal Content Management on SharePoint 2010
Legal Content Management on SharePoint 2010Legal Content Management on SharePoint 2010
Legal Content Management on SharePoint 2010phillycaferacer
 
Big Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineBig Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineSAP Technology
 
Zeller Edm Summit Agile Deployment Of Predictive Analytics
Zeller Edm Summit   Agile Deployment Of Predictive AnalyticsZeller Edm Summit   Agile Deployment Of Predictive Analytics
Zeller Edm Summit Agile Deployment Of Predictive AnalyticsRonald.Ramos
 
20160512 predictive and adaptive approach
20160512   predictive and adaptive approach20160512   predictive and adaptive approach
20160512 predictive and adaptive approachSilvia Fragola
 
Agile 2013 presentation, tom grant
Agile 2013 presentation, tom grantAgile 2013 presentation, tom grant
Agile 2013 presentation, tom grantTom Grant
 
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...Society of Women Engineers
 

Viewers also liked (20)

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Analytics in Pharmaceutical Industry
Analytics in Pharmaceutical IndustryAnalytics in Pharmaceutical Industry
Analytics in Pharmaceutical Industry
 
Improving pharmaceutical marketing using big data solutions
Improving pharmaceutical marketing using big data solutionsImproving pharmaceutical marketing using big data solutions
Improving pharmaceutical marketing using big data solutions
 
Data mining (DM) in the pharmaceutical industry
Data mining (DM) in the pharmaceutical industryData mining (DM) in the pharmaceutical industry
Data mining (DM) in the pharmaceutical industry
 
New Pharma Market Reality - Predictive Analytics is the Solution
New Pharma Market Reality - Predictive Analytics is the SolutionNew Pharma Market Reality - Predictive Analytics is the Solution
New Pharma Market Reality - Predictive Analytics is the Solution
 
Application of BI in pharmaceutical industry
Application of BI in pharmaceutical industryApplication of BI in pharmaceutical industry
Application of BI in pharmaceutical industry
 
Bio variance j_scheiber_bioit_repurposingworkshop2013_draft
Bio variance j_scheiber_bioit_repurposingworkshop2013_draftBio variance j_scheiber_bioit_repurposingworkshop2013_draft
Bio variance j_scheiber_bioit_repurposingworkshop2013_draft
 
BioVariance Research Services - Target Profile Prediction
BioVariance Research Services - Target Profile PredictionBioVariance Research Services - Target Profile Prediction
BioVariance Research Services - Target Profile Prediction
 
Conference presentation from #iccs2014 in Noordwijkerhout
Conference presentation from #iccs2014 in NoordwijkerhoutConference presentation from #iccs2014 in Noordwijkerhout
Conference presentation from #iccs2014 in Noordwijkerhout
 
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...
BioVariance Research Services - Mapping Pharmaceutical patents to Biological ...
 
BioVariance - Pediatric Pharmacogenomics in Drug Discovery
BioVariance - Pediatric Pharmacogenomics in Drug DiscoveryBioVariance - Pediatric Pharmacogenomics in Drug Discovery
BioVariance - Pediatric Pharmacogenomics in Drug Discovery
 
Mobile Health Forum Frankfurt - Therapieempfehlung per Smartphone
Mobile Health Forum Frankfurt - Therapieempfehlung per SmartphoneMobile Health Forum Frankfurt - Therapieempfehlung per Smartphone
Mobile Health Forum Frankfurt - Therapieempfehlung per Smartphone
 
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s GoingBig Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
 
Digital Asset Management in Pharma
Digital Asset Management in PharmaDigital Asset Management in Pharma
Digital Asset Management in Pharma
 
Legal Content Management on SharePoint 2010
Legal Content Management on SharePoint 2010Legal Content Management on SharePoint 2010
Legal Content Management on SharePoint 2010
 
Big Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineBig Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized Medicine
 
Zeller Edm Summit Agile Deployment Of Predictive Analytics
Zeller Edm Summit   Agile Deployment Of Predictive AnalyticsZeller Edm Summit   Agile Deployment Of Predictive Analytics
Zeller Edm Summit Agile Deployment Of Predictive Analytics
 
20160512 predictive and adaptive approach
20160512   predictive and adaptive approach20160512   predictive and adaptive approach
20160512 predictive and adaptive approach
 
Agile 2013 presentation, tom grant
Agile 2013 presentation, tom grantAgile 2013 presentation, tom grant
Agile 2013 presentation, tom grant
 
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...
WE Europe 2015: Innovating in disruptive ecosystems: lessons from the life sc...
 

Similar to Big Data in Pharma - Overview and Use Cases

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.Elena Sügis
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsJTADrexel
 
Artificial Intelligence for Discovery
Artificial Intelligence for DiscoveryArtificial Intelligence for Discovery
Artificial Intelligence for DiscoveryDayOne
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptxHussainTaqi1
 
acs talk open source drug discovery
acs talk open source drug discoveryacs talk open source drug discovery
acs talk open source drug discoverySean Ekins
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challengesinside-BigData.com
 
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...DATAVERSITY
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Pistoia Alliance
 
TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)jmoore89
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p collegeSKUASTKashmir
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical DataPaul Agapow
 
2019-06-21 YC Preso V5.pdf
2019-06-21 YC Preso V5.pdf2019-06-21 YC Preso V5.pdf
2019-06-21 YC Preso V5.pdfYue Cathy Chang
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBigData_Europe
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08Russ Altman
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Sage Base
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 

Similar to Big Data in Pharma - Overview and Use Cases (20)

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Artificial Intelligence for Discovery
Artificial Intelligence for DiscoveryArtificial Intelligence for Discovery
Artificial Intelligence for Discovery
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
 
acs talk open source drug discovery
acs talk open source drug discoveryacs talk open source drug discovery
acs talk open source drug discovery
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
 
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
 
TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
2019-06-21 YC Preso V5.pdf
2019-06-21 YC Preso V5.pdf2019-06-21 YC Preso V5.pdf
2019-06-21 YC Preso V5.pdf
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 

Recently uploaded

Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 

Recently uploaded (20)

Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 

Big Data in Pharma - Overview and Use Cases

  • 1. Big Data Analyses in Pharma An Overview Josef Scheiber, PhD Managing Director July 2015
  • 2. Geographie Startup Center in Waldsassen Main site Data Analyses and Software Development Westpark Center Garmischer Str. in Munich Scientific ActivitiesSince Jan 1, 2015 Basel/Switzerland Data Curation and customer- related activities Prag 150 km München 200 km Berlin 300 km Frankfurt 250 km
  • 3. BioVariance at a Glance – Get most out of your complex data Curate.Integrate Analyze.Model Visualize.Explore DECIDE
  • 6. Courtesy: M. Zeinab, slideshare
  • 7. What do we need out of Big Data? 1. What are the inhibitors of kinase X and the five most similar kinases with IC50 < 1 μM and with MW < 500 from all internal and external data sources? 2. What assay technologies have been used against my kinase? Which cell lines? 3. What other proteins are in the same kinase branch as target X, where there were validated chemical hits from external or internal sources? 4. If I hit a particular kinase, what would the potential side-effect profile look like? Which known inhibitor of this kinase has the best safety profile and the fewest known IC50s? 5. Have I identified other compounds with a bioactivity profile similar to compound X and with the same core substructure? 6. Can we create a phylochemical tree of kinases and for a new kinase target place it into the tree on the basis of activity against a reference panel of compounds? 7. Have I identified all kinases with an x-ray structure (in-house or external) that are in pathway X? Bridging Chemical and Biological Data: Implications for Pharmaceutical Drug Discovery JL Jenkins, J Scheiber, D Mikhailov, A Bender, A Schuffenhauer, B Cornett, V Chan, J Kondracki, B Rohde, JW Davies (2012) In: Computational Approaches in Cheminformatics and Bioinformatics Edited by:A Bender, R Guha. 25-56 John Wiley & Sons, Inc. ANSWERS
  • 10. Descriptive: What happened? Diagnostic: Why did it happen? Predictive: What will happen? Prescriptive: How can we make it happen? Better data for better analytics Hindsight Insight Foresight
  • 11. Need for interpretation 33,3 10 20 30 70 33,3 80 70 60 10 33,3 10 10 10 20 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Before molecular biology Molecular biology golden age Genomics age Deep sequencing age Very soon Data Analysis Experiment Experimental Design
  • 14. Genome Sequencing Slide adapted from George Church
  • 15. Genome Sequencing Slide adapted from George Church
  • 16. Cost Reduction - Example 458 Ferrari Spider - $398,000 in 2006 – 40 cents now!
  • 17.  Much more data for way less money
  • 18. Challenges for Informatics? – 1 genome is roughly 500 GB/data 2011 – several 100 exomes
  • 19. Drug Discovery Pipeline Target finding Lead Finding Lead Optimization … Phase 1 … Market Drug candidates Patients
  • 21. Velocity • Mutations in tumor • Resistance mechanisms in patients • long term/short term AE • compliance • Nutrition and microbiome • Data from wearables relevant for drugs
  • 25. Variety • Bioinformatics • Clinical • Social network • E-health • Also text/patents
  • 26. A simplified overview – Molecules in Man Adapted from Gohlke JM, Portier CJ. Environ. Health Perspect. 115:1261-1263 (2007)
  • 27. A question of complexity –They all interact … Biology Chemistry Physics
  • 28. Dealing with a very complex environment – i.e. many opportunities  DNA  RNA  Protein  Interactions  Clinical parameters  Treatment History  Tissue anatomy  Surgical History  Epigenetic Profiles from many patients at different timeponits  Target  Off-targets  Metabolites  Additional indications  Unspecific effects  Similar drugs Adapted from: J. Scheiber; How can we enable drug discovery informatics for personalized healthcare? Expert Opinion on Drug Discovery, 1-6; 2/2011
  • 30. Sequences Expression Proteomics Biological networks (but also: Cells, Tissues, Organs) POPULATION
  • 32. Veracity • Chemogenomics data • Gene expression data  Imputation?
  • 33. Veracity - Chemogenomics Adapted from Tanrikulu et al. Missing Value Estimation for Compound- Target Activity Data, J. Mol. Inf
  • 34. Veracity - Interactomics A Proteome-Scale Map of the Human Interactome Network Rolland, Thomas et al. Cell , Volume 159 , Issue 5 , 1212 - 1226
  • 36.
  • 39. Data integration strategy a) A central vocabulary/pointer server (information stored are preferred names and synonyms plus pointers to data servers, where to find what) b)  semantic integration layer with domain-specific terminology and referential data c) A database for each datatype collected, storing only preferred names along with raw measurements d) Clearly defined APIs for further integration with public data sources and to enable large-scale analyses
  • 40. Vocabularies needed • Genes, Drugs, Proteins • Diseases • Organisms • Microbiome species & genes • Localization & source • Phenotype • Metabolite common names
  • 41. Answering workflow Vocabulary Vocabulary server acts as translator, aggregator and locator, i.e. knows where the respective facts can be found Firmicutes produce alpha-Linolein and thereby cause gut irritation species metabolite Further Data of each type is stored in a specific database to enhance performance of large-scale analyses Expert tools talk to data directly or via webservices API API API API Enduserinterfaceand visualization
  • 43. Genome data at scale
  • 44. Workflow Identify drug targets (primary and off-targets, from DrugBank) Call variations on a per- individuum basis
  • 45. Workflow Analyse mutation rates in the targets and in particular drug binding pockets
  • 46. Example: Donepezil / Acetylcholinesterase • PDB 4EY7 Image extracted from Cheung et al., 2012 [2]
  • 49. Not very successful Alignment of the 3D structures of mutant number 52 (yellow) and PDB 4EY7 AChE protein (green). The only changed residue is the Y150 (magenta) to H150 (red). The white surface represents the molecular surface of donepezil.
  • 50. Why is this a bad example? AChE a key enzyme in human biology  these are the most highly conserved, even interspecies  Learning: Look at that stuff before investing time 
  • 52. Vocabulary generation Extensive mapping of terms from various sources
  • 53. Vocabulary generation 397211 preferred names 598532 synonyms 102086 identifiers The chevron diagram shows the number of samples annotated with names. Already by looking at the numbers you can see tha mapping everything is non-trivial. A Big Data exercise in itself …
  • 55. Mining Twitter for side effects Needed Drug Name and synonyms: Adalimumab Humira Exemptia 331731-18-1 L04AB04 MedDRA vocabulary
  • 56. Many birds tweet lots of noise … BUT … • [1] "Lipitor headache 0" [1] "Lipitor rash 1" [1] "Lipitor pain 27" [1] "Lipitor bleeding 0" [1] "Lipitor cough 0" [1] "Lisinopril headache 0" [1] "Lisinopril rash 0" [1] "Lisinopril pain 8" [1] "Lisinopril bleeding 0" [1] "Lisinopril cough 7" [1] "Simvastatin headache 0" [1] "Simvastatin rash 0" [1] "Simvastatin pain 0" [1] "Simvastatin bleeding 0" [1] "Simvastatin cough 0" [1] "Plavix headache 0" [1] "Plavix rash 0" [1] "Plavix pain 0" [1] "Plavix bleeding 1" [1] "Plavix cough 0" [1] "Crestor headache 0" [1] "Crestor rash 0" [1] "Crestor pain 0" [1] "Crestor bleeding 0" [1] "Crestor cough 0"
  • 57. Top 200 drugs - Cutoff is at 1500 tweets that a few drugs easily surpass (although it's mostly only pharmacies advertizing) - Others are not mentioned once (probably a synonym issue as I restricted to English as language). - - top drugs are tweeted more often, but e.g. Tarceva (in 2006) at the very bottom also reaches the top number of tweets (109 on list).
  • 58. 089 – 189 6582 – 80 Garmischer Str. 4/V 80339 München josef.scheiber@biovariance.com: 09632 – 9248 325 Konnersreuther Str. 6g 95652 Waldsassen Questions?