Genomics Crash Course for Data Engineers
Allen Day, PhD
1.
© 2014 MapR Technologies
2.
Biomedical & Advertising Tech Overarching Themes*
• Eugenics & Determinism
• Free will vs. Determinism
• Media Tech & Privacy
*Obligatory movie references… shout-out to my hometown LA
3.
Biomedical Research Goal: Therapeutics => Diagnostics => Prognostics
• Therapeutics => traditional medicine
• Diagnostics => personalized medicine
  – NextGen public health
  – Requires hi-res mechanical knowledge
  – Reverse engineer how genetic variation leads to (un)desired traits
• Prognostics => GATTACA (dys/eu)topia
  – Managed populations / NextGen eugenics
4.
Star Wars III: Revenge of the Sith
5.
Star Wars V: The Empire Strikes Back
6.
Genetic Basis of Facial Features
self-reported values of {sex, ancestry}
+ observer scores {race, sex}
+ 3D facial scan
+ genome scan
______________________________
Allelic model of 20 genes that determine facial characteristics
Claes, et al. 2014. Modeling 3D Facial Shape from DNA
7.
Genetic Basis of Facial Features
Claes, et al. 2014. Modeling 3D Facial Shape from DNA
8.
So Get Ready…
www.theness.com
9.
Genomics Crash Course for Data Engineers
10.
Me, Us
• Allen Day, Principal Data Scientist, MapR
  – 5yr Hadoop Dev, R project contributor
  – PhD, Human Genetics, UCLA Medicine
• MapR
  – Distributes open source components for Hadoop
  – Adds major technology for performance, HA, industry standard APIs
• See Also
  – “allenday” most places (twitter, github, etc.)
  – @mapR
11.
Clinical Sequencing Business Process Workflow
[Diagram labels: Physician; Patient; Clinic; blood/saliva; Clinical Lab; Analytics; extract]
12.
One Bad MTHFR
MTHFR C677T
Methylfolate helps make neurotransmitters in your brain. When methylfolate levels are low, so are your neurotransmitters. Low production of neurotransmitters may cause conditions of addictive behavior, depression, anxiety, ADHD, mania, irritability, insomnia, learning disorders and others.
Everyone should get tested. Why? Because 1 in 2 people are affected and if one knows they have a MTHFR polymorphism, they know they have to be very proactive in taking care of themselves.
http://thyroid.about.com/od/MTHFR-Gene-Mutations-and-Polymorphisms/fl/The-Link-Between-MTHFR-Gene-Mutations-and-Disease-Including-Thyroid-Health.htm
16.
Clinical Sequencing Business Process Workflow
[Diagram labels: Physician; Patient; Clinic; blood/saliva; Clinical Lab; Analytics; extract]
17.
Clinical Genomics, Information Systems Perspective
[Diagram labels: Compressed Structured Base4 Data; Uncompressed Unstructured Base2 Data; extract; Base4=>Base2 Converter [[ DE-STRUCTURES ]]; “BI” Reporting and Visualization tools; Physician; Patient; Analyst; Stakeholder]
18.
Clinical Genomics, Information Systems Perspective
[Diagram labels: Physician; Patient; Analyst; Stakeholder; ETL; Reporting and Viz; Data Store; Analytics]
19.
Sequencing “Even Moore’s Law”
Stein. 2010. The case for cloud computing in genome informatics
20.
The Evolving Genomics Workload
Sboner, et al, 2011. The real cost of sequencing: higher than you think!
<= 1º analytics “current high ROI use cases”
<= 2º analytics “next-gen high ROI use cases”
21.
Clinical Genomics, Information Systems Perspective
[Diagram labels: Physician; Patient; Analyst; Stakeholder; ETL; Reporting and Viz; Data Store; Analytics]
1º analytics
2º analytics: Not much in this presentation, see also: http://slidesha.re/1sC2BOX
22.
Sequence Analysis, Quick Partial Details
fragment1: […] G A C T A G A
fragment2: A C A G T T T A C A
fragment3: A G A T A - - A G A
fragment4: A A C A G C T T A C A […]
fragment5: C T A T A G A T A A
referenceDNA: […] G A T T A C A G A T T A C A G A T T A C A […]
patient__DNA: […] G A C T A C A G A T A A C A G A T T A C A […]
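A minimal Python sketch of the comparison this slide sets up. It assumes the referenceDNA and patient__DNA rows are already aligned column for column; the fragments are left out because their offsets are not recoverable from the transcript. The name `mismatches` is illustrative and does not come from the talk.

```python
# Sequences copied from the referenceDNA and patient__DNA rows of the slide.
REFERENCE = "GATTACAGATTACAGATTACA"
PATIENT = "GACTACAGATAACAGATTACA"


def mismatches(reference, sample):
    """Return (column, reference_base, sample_base) for every differing column."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (column, ref_base, sample_base)
        for column, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


if __name__ == "__main__":
    for column, ref_base, sample_base in mismatches(REFERENCE, PATIENT):
        print(f"column {column}: {ref_base} > {sample_base}")
```

On these two rows the sketch reports a T > C difference at column 2 and a T > A difference at column 10 (0-based).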
23.
What is the (Probable) Color of Each Column?
24.
Which Columns are (probably) Not White?
Strategy 1: examine foreach column, foreach row
O(rows*cols) + O(1 col) memory
25.
Which Columns are (probably) Not White?
Strategy 2: examine foreach row. keep running tallies
O(rows) + O(rows*cols) memory
26.
Which Columns are (probably) Not White?
Strategy 3: rotate matrix. examine foreach column
O(rows log rows) + O(cols) + O(1 col) memory
27.
Comparison of Strategies
Strategy 1: O(rows*cols) + O(1 col) memory
• Low mem req
• Random access pattern, many ops
Strategy 2: O(rows) + O(rows*cols) memory
• High mem req
• Sequential access pattern
Strategy 3: O(rows log rows) + O(cols) + O(1 col) memory
• Low mem req
• Sequential access pattern
• Requires Sort
28.
Comparison of Strategies
Strategy 1: O(rows*cols) + O(1 col) memory
• Low mem req
• Random access pattern, many ops
Strategy 2: O(rows) + O(rows*cols) memory
• High mem req
• Sequential access pattern
Strategy 3: O(rows log rows) ÷ shards + O(cols) ÷ shards + O(1 col) memory
• Low mem req
• Sequential access pattern
• Requires Sort
As # of rows & columns increases Strategy 3 becomes more attractive
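The three strategies can be sketched in Python as follows. This is an illustration under stated assumptions, not code from the talk: the matrix is a hypothetical toy (rows are reads, columns are positions, "W" is white), "probably not white" is taken to mean that the most common cell in a column is not white, and the function names are invented.

```python
from collections import Counter
from itertools import groupby

WHITE = "W"

# Hypothetical toy matrix: one string per row (read), one character per column.
MATRIX = [
    "WWRWW",
    "WWRWB",
    "WWRWW",
    "WWWWW",
]


def majority(cells):
    """Most common value among the given cells."""
    return Counter(cells).most_common(1)[0][0]


def strategy1(matrix):
    """For each column, visit every row: one column in memory, strided access."""
    result = []
    for col in range(len(matrix[0])):
        column = [row[col] for row in matrix]
        if majority(column) != WHITE:
            result.append(col)
    return result


def strategy2(matrix):
    """One sequential pass over the rows, holding a tally for every column at once."""
    tallies = [Counter() for _ in matrix[0]]
    for row in matrix:
        for col, cell in enumerate(row):
            tallies[col][cell] += 1
    return [col for col, tally in enumerate(tallies)
            if tally.most_common(1)[0][0] != WHITE]


def strategy3(matrix):
    """Rotate: emit (column, cell) pairs, sort by column, then stream each column."""
    pairs = sorted((col, cell) for row in matrix for col, cell in enumerate(row))
    result = []
    for col, group in groupby(pairs, key=lambda pair: pair[0]):
        if majority(cell for _, cell in group) != WHITE:
            result.append(col)
    return result


if __name__ == "__main__":
    print(strategy1(MATRIX), strategy2(MATRIX), strategy3(MATRIX))
```

All three return the same columns; they differ in access pattern and in what must be held in memory. In strategy 3 the sort is the step that can be split across shards.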
29.
1º Sequence Analysis (ETL), MapReduce style
[Diagram: .fastq => short read alignment => .bam => genotype calling => .vcf; stage labels: MAP; MAP; REDUCE, rotate matrix 90º]
(O(mn)) / 1
(O(mn) + O(n log n)) / s
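A small in-memory Python sketch of the map, shuffle, and reduce shape described on this slide. It is a toy under loud assumptions: the reads and their alignment offsets are hypothetical (a real pipeline would obtain offsets from a short-read aligner), the genotype call is a simple majority vote, and none of the function names come from the talk or from Hadoop.

```python
from collections import Counter, defaultdict

REFERENCE = "GATTACAGATTACAGATTACA"  # reference row from the earlier alignment slide

# Hypothetical (offset, read) pairs, i.e. what the alignment step would produce.
ALIGNED_READS = [
    (0, "GACTACAG"),
    (1, "ACTACAGAT"),
    (6, "AGATAACA"),
    (8, "ATAACAGATT"),
    (13, "AGATTACA"),
]


def map_read(offset, read):
    """MAP: turn one aligned read into (position, base) pairs."""
    for i, base in enumerate(read):
        yield offset + i, base


def shuffle(pairs):
    """SHUFFLE: group bases by position and sort by position (the 90-degree rotation)."""
    groups = defaultdict(list)
    for position, base in pairs:
        groups[position].append(base)
    return sorted(groups.items())


def reduce_position(position, bases):
    """REDUCE: call the majority base; report it only if it differs from the reference."""
    call = Counter(bases).most_common(1)[0][0]
    if call != REFERENCE[position]:
        return position, REFERENCE[position], call
    return None


def call_variants(aligned_reads):
    pairs = (pair for offset, read in aligned_reads for pair in map_read(offset, read))
    calls = (reduce_position(position, bases) for position, bases in shuffle(pairs))
    return [call for call in calls if call is not None]


if __name__ == "__main__":
    print(call_variants(ALIGNED_READS))
```

Because each position is reduced independently after the shuffle, the reduce work can be divided across s shards, which is where the division by s in the slide's cost expression comes from.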
30.
Crossbow (MapReduce Strategy, implemented)
Langmead, et al. 2009. Searching for SNPs with cloud computing
31.
Ion Flux (MapReduce Strategy, implemented for Enterprise)
• Sequencing workflow in MapReduce (Hadoop, Cascading, Amazon Elastic M/R)
• Integrated with Ion Torrent as a plugin to stream sequence to the cloud
• Emphasis on scalability and latency
  – assay->clinical report turnaround in < 24h
• Compare to fast-follower stack ILMN MiSeq+BaseSpace
http://aws.amazon.com/solutions/case-studies/ion-flux/
http://ionflux.com
32.
Non-Genomics Digression, 1 of 2
Data Warehouse ETL Offload
33.
The Problem
• Major telecom vendor
• Key step in billing pipeline handled by data warehouse (EDW)
• EDW at maximum capacity
• Multiple rounds of software optimization already done
• Revenue limiting (= career limiting) bottleneck
34.
Three Options
1. No more revenue growth
2. Increase EDW size
  – Expensive
  – Known to not scale well
3. Find a more scalable solution
35.
Original Flow – ELTL
[Diagram labels: ETL; CDR billing records; Billing reports; Data Warehouse; Customer bills]
36.
Simplified Analysis – EDW Strategy
• 70% of EDW consumed by ELTL processing
  – Caused by 10% of code (CDR transformations)
• Growing the EDW to 200% capacity adds capital cost of ~X
• Indirect costs non-trivial (floor space, power)
• Performance at 150% of current, i.e. 1.5x (poor division of labor)
37.
With ETL Offload
[Diagram labels: ETL; CDR billing records; Billing reports; Data Warehouse; Customer billing]
38.
Simplified Analysis – MapR Strategy
• Hardware + MapR cost ~1/20X
• ETL replacement development costs ~1/20X
• Performance at 300% of current, i.e. 3x
39.
Price Performance
• EDW strategy
  – 1.5x performance
  – Cost is X
• MapR Strategy
  – 3x performance
  – Cost is 1/10X
• 20x cost/performance advantage for MapR strategy
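The 20x figure can be checked with a few lines of Python. This sketch assumes that "cost/performance advantage" means the ratio of performance per unit cost between the two strategies, and that the 1/10X cost is the sum of the two ~1/20X items (hardware + MapR, and ETL replacement development) from the preceding analysis slide. Costs are expressed as fractions of X, and the function name is illustrative.

```python
from fractions import Fraction


def performance_per_cost(speedup, cost_in_x):
    """Performance delivered per unit of cost, with cost expressed as a fraction of X."""
    return Fraction(speedup) / Fraction(cost_in_x)


# EDW strategy: 1.5x performance at cost X.
edw = performance_per_cost(Fraction(3, 2), 1)

# MapR strategy: 3x performance; cost is ~X/20 (hardware + MapR) plus ~X/20 (ETL rewrite).
mapr_cost = Fraction(1, 20) + Fraction(1, 20)
mapr = performance_per_cost(3, mapr_cost)

advantage = mapr / edw

if __name__ == "__main__":
    print(f"MapR cost: {mapr_cost} X, advantage: {advantage}x")
```

Exact fractions are used so the result is not affected by floating-point rounding: (3 / (1/10)) / (1.5 / 1) = 30 / 1.5 = 20.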
40.
Platform Advantages
• Standard Hadoop ecosystem components allow efficient CDR parsing and ETL
• MapR platform provides high availability, disaster recovery
• MapR NFS interface allows direct load of transformed data
41.
Non-Genomics Digression, 2 of 2
42.
<Recommendation System. Redacted>
43.
Hybrid Use-Cases
44.
MapR Data Platform Advantage, Telecommunications
[Diagram labels: CO-OCCURRENCE (MAHOUT); SOLR INDEXING; ETL; BILLING REPORTS; WEB TIER; DATA WAREHOUSE; CDR BILLING RECORDS; CUSTOMER BILLING; USER HISTORY; QUERY / CONTEXT; RECOMMENDATIONS; COMPLETE HISTORY (all users); ITEM META-DATA; INDEX SHARDS]
45.
MapR Data Platform Advantage, Clinical Genomics
[Diagram labels: Epidemiological, Actuarial Analyses; Denormalization for Search, Viz, Research; ETL; Clinical Reporting; WEB TIER; Clinical Reporting Systems; CLINICAL TREATMENT OF PATIENTS; RESEARCHERS; National Pop. Database; INDEX SHARDS; Prognostic Capability]
46.
Bonus Round: 2º Analytics
47.
Clinical Genomics, Information Systems Perspective
[Diagram labels: Physician; Patient; Analyst; Stakeholder; ETL; Reporting and Viz; Data Store; Analytics]
2º analytics: Not much in this presentation, see also: http://slidesha.re/1sC2BOX
48.
Matrices A (U*Q) and B (U*V)
Query Term = Clicked Term
[Diagram: matrix A, Users x Query Terms; matrix B, Users x Clicked Videos]
49.
Relate Q to V
[Diagram labels: Users; Query Terms]
50.
Relate Q to V
[Diagram labels: Users; Query Terms]
51.
Relate Q to V: it’s a Cross-Recommender
[Diagram labels: Query Terms; Videos]
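One way to picture relating Q to V is the cross-co-occurrence product of the two matrices, sketched below in plain Python. This is an assumption: the slides name the matrices A (users x query terms) and B (users x clicked videos) but do not spell out the computation, and the product A-transpose times B is simply the usual formulation of a cross-recommender. The toy data and function names are hypothetical.

```python
# Hypothetical toy data: 3 users, 3 query terms, 2 videos (1 = user issued/clicked it).
A = [  # users x query terms
    [1, 0, 1],
    [1, 1, 0],
    [0, 1, 0],
]
B = [  # users x clicked videos
    [1, 0],
    [1, 1],
    [0, 1],
]


def transpose(matrix):
    """Swap rows and columns."""
    return [list(column) for column in zip(*matrix)]


def matmul(left, right):
    """Plain matrix product of two lists-of-lists."""
    right_columns = transpose(right)
    return [
        [sum(a * b for a, b in zip(row, column)) for column in right_columns]
        for row in left
    ]


def cross_cooccurrence(a, b):
    """Q x V matrix: entry [q][v] counts users who used query term q and clicked video v."""
    return matmul(transpose(a), b)


if __name__ == "__main__":
    for query_term, row in enumerate(cross_cooccurrence(A, B)):
        print(f"query term {query_term}: {row}")
```

Each row of the result scores every video for one query term, so users drop out of the result and only the Q to V relationship remains. A production system would typically weight or threshold these raw counts rather than use them directly.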
52.
[Diagram labels: Users; Query Terms]
53.
If they were unlabeled, would you know which is which?
Friend. 2010. The Need for Precompetitive Integrative Bionetwork Disease Model Building
NPR. 2011. The Search For Analysts To Make Sense Of 'Big Data'
http://www.npr.org/2011/11/30/142893065
54.
If they were unlabeled, would you know which is which?
Friend. 2010. The Need for Precompetitive Integrative Bionetwork Disease Model Building
• Identify network structures
• Label them
• Observe stimulus=>response space mapping
• Purposefully target
• PROFIT!!!!