SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
How to make bioinformatics
accessible to normal people! 

                               Mik Black
               Department of Biochemistry
                      University of Otago
Some musings…	

•  Accessibility - two aspects:	

   1.  Methodology development  distribution
       (can you get it?)	

   2.  Methodology uptake (can you use it?)	

•  “Normal people”:	

   1.  Who are they?	

   2.  What do they want? What do they
       need? Is there a happy medium?
My background
                        	

•  Statistical design and analysis of
   microarray experiments:	

   –  Methodology development	

   –  Applying existing bioinformatics techniques	

   –  Adapting “standard” statistical methods	

   –  “Forensic” analysis	

•  Technologies:	

   –  Microarrays (mRNA, SNPs, CNV/CGH)	

   –  Second generation sequencing (SNP/CNV)
http://www.r-project.org/
The joys of the command line…	

•  Large amounts of statistical genomics
   methodology available via R	

  –  Accessible? 	

  –  Uptake? 	

  –  Who are the end users?	

•  Can’t we just teach EVERYONE to use R?
http://www.broad.mit.edu/cancer/software/genepattern/	



              Reich et al. (2006) GenePattern 2.0., Nature Genetics, 38, 500-501.
                                                                                	



   •  GenePattern provides a web-based
      method for analysing microarray (and
      other) data.	

   •  Provides a simple interface to tools
      developed in Java, R, Matlab and other
      languages.	

   •  Analysis performed on server:	

       –  no compute resources required by users.	

       –  Facilitates sharing of results.
Using GenePattern	

•  User friendly	

   –  Third year bioinformatics course at Otago.	

   –  Workshop for lab personnel at TGen.	

•  Guided analysis 	

   –  Facilitates use of standard analysis methods.	

   –  Pipeline creation and “versioned” analysis.	

•  End users?	

   –  At Otago: 3rd  4th year Biochemistry students	

   –  At TGen: lab techs, iterns, bench scientists, PIs
Common analysis tasks	

•  Basic data analysis/exploration:	

   –  Heatmap creation	

   –  Hierarchical clustering	

   –  Identifying differentially expressed genes	

   –  Gene set analysis	

   –  Survival analysis	

•  GenePattern provides these tools in a
   modular format.
GenePattern interface
Simple analysis pipeline
Power calculations	

         Δ
               P
Network analysis
BeSTGRID: Broadband-enabled
Science and Technology GRID




                   http://www.bestgrid.org
GenePattern on BeSTGRID	


•  Services:	

   –  GenePattern server	

   –  Development environment (server and SVN)	

•  GenePattern training (coming soon):	

   –  Basic usage	

   –  Module development (uptake path for
      bioinformatics tool developers)
Next steps…	

•  Full GenePattern deployment:	

   –  Transfer of development modules to public server	

   –  Documentation and training	

   –  Use of ROCKS cluster for job submission	

•  Modules for Second Gen Sequencing data:	

   –  DNAseq, RNAseq, ChIPseq	

   –  R/Bioconductor (e.g., ShortRead, Biostrings,
      RSamTools, GenomeGraphs…)	

   –  Analysis, visualization and quality assurance
Community effort
                        	

•  Some current examples:	

  –  VISG/MapNet: statisticians  geneticists	

  –  BeSTGRID: middleware development 
     deployment through to end users	

  –  CTCR: cancer researchers  clinicians	

•  Each group has the goal of placing
   powerful (and useful, and usable) tools
   into the hands of end users.
Bioinformatics community
                          	

•  NZGL provides opportunity for
   community-based effort.	

  –  National infrastructure for genomics research	

  –  Includes strong bioinformatics component	

•  Key issue: engagement with end users	

  –  Methodology development and distribution	

  –  Uptake, interaction and training
Bioinformatics community
                          	

•  NZGL provides opportunity for
   community-based effort.	

  –  National infrastructure for genomics research	

  –  Includes strong bioinformatics component	

•  Key issue: engagement with end users	

  –  Methodology development and distribution	

  –  Uptake and interaction	

LETS GO FIND US SOME “NORMAL” PEOPLE!
Acknowledgements	

University of Otago	

        The University of Auckland	

Marcus Davy	

                Nick Jones	

Tim Molteno	

                Mark Gahegan	

Thomas Allen	

               Yuriy Halytskyy	

Sarah Song	

                 Cristin Print 	

Chris Brown	

                Daniel Hurley	

Anthony Reeve	

              Christoff Knapp	

Tony Merriman	


University of Canterbury	

Vladimir Mencl

Weitere ähnliche Inhalte

Was ist angesagt?

Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen ARDC
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems BiologyMike Hucka
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Alexander Pico
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallAlexander Pico
 
Systems Biology Systems
Systems Biology SystemsSystems Biology Systems
Systems Biology SystemsMike Hucka
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicAlexander Pico
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsRai University
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyMrinal Vashisth
 
Technology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential NetworksTechnology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential NetworksAlexander Pico
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonAlexander Pico
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsChristoph Steinbeck
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesRothamsted Research, UK
 
Technology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network RepresentationsTechnology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network RepresentationsAlexander Pico
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBenjamin Good
 
NetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-vizNetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-vizAlexander Pico
 
CORBEL West-Life webinar slides
CORBEL West-Life webinar slidesCORBEL West-Life webinar slides
CORBEL West-Life webinar slidesCORBEL
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016Philippa Griffin
 

Was ist angesagt? (20)

Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: Overall
 
Systems Biology Systems
Systems Biology SystemsSystems Biology Systems
Systems Biology Systems
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana Milenkovic
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
NRNB EAC Meeting 2012
NRNB EAC Meeting 2012NRNB EAC Meeting 2012
NRNB EAC Meeting 2012
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive Networks
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
 
Technology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential NetworksTechnology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential Networks
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use Cases
 
Technology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network RepresentationsTechnology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network Representations
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiers
 
NetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-vizNetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-viz
 
CORBEL West-Life webinar slides
CORBEL West-Life webinar slidesCORBEL West-Life webinar slides
CORBEL West-Life webinar slides
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
 

Andere mochten auch

Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...
Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...
Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...FromDoppler
 
Peter smith evaluation 4th part
Peter smith   evaluation 4th partPeter smith   evaluation 4th part
Peter smith evaluation 4th partguestb2370ab
 
EMMS 2012 - Caso de éxito de Koala Buy
EMMS 2012 - Caso de éxito de Koala BuyEMMS 2012 - Caso de éxito de Koala Buy
EMMS 2012 - Caso de éxito de Koala BuyFromDoppler
 
Estrategias de Email Marketing
Estrategias de Email MarketingEstrategias de Email Marketing
Estrategias de Email MarketingFromDoppler
 
Doppler Academy: Cómo usar Redes Sociales para generar ventas
Doppler Academy: Cómo usar Redes Sociales para generar ventasDoppler Academy: Cómo usar Redes Sociales para generar ventas
Doppler Academy: Cómo usar Redes Sociales para generar ventasFromDoppler
 
Welcome to economy feud!
Welcome to economy feud!Welcome to economy feud!
Welcome to economy feud!gueste8d5e25
 

Andere mochten auch (9)

Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...
Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...
Doppler Academy: SEO 2016, cambios a tener en cuenta en tu estrategia de Posi...
 
Peter smith evaluation 4th part
Peter smith   evaluation 4th partPeter smith   evaluation 4th part
Peter smith evaluation 4th part
 
EMMS 2012 - Caso de éxito de Koala Buy
EMMS 2012 - Caso de éxito de Koala BuyEMMS 2012 - Caso de éxito de Koala Buy
EMMS 2012 - Caso de éxito de Koala Buy
 
Fact
FactFact
Fact
 
Estrategias de Email Marketing
Estrategias de Email MarketingEstrategias de Email Marketing
Estrategias de Email Marketing
 
Doppler Academy: Cómo usar Redes Sociales para generar ventas
Doppler Academy: Cómo usar Redes Sociales para generar ventasDoppler Academy: Cómo usar Redes Sociales para generar ventas
Doppler Academy: Cómo usar Redes Sociales para generar ventas
 
Welcome to economy feud!
Welcome to economy feud!Welcome to economy feud!
Welcome to economy feud!
 
1 24-10
1 24-101 24-10
1 24-10
 
Training presentation
Training presentationTraining presentation
Training presentation
 

Ähnlich wie Mik Black bioinformatics symposium

Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsShikha Thakur
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astrowebuploader
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsGigaScience, BGI Hong Kong
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...David Peyruc
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterNeuro, McGill University
 
Data is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesData is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
 
Data are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesData are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
 
MOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPMOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPAnkita Jadhao
 
MOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPMOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPAnkita Jadhao
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128GenomeInABottle
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015IRIDA_community
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William HsiaoWilliam Hsiao
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNADaniel S. Katz
 
Natusfera Lifewatch Competence Center EGI amsterdam 2016 small
Natusfera Lifewatch Competence Center EGI amsterdam 2016  smallNatusfera Lifewatch Competence Center EGI amsterdam 2016  small
Natusfera Lifewatch Competence Center EGI amsterdam 2016 smallFrancisco Pando
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
BEACON's Cyberinfrastructure Needs
BEACON's Cyberinfrastructure NeedsBEACON's Cyberinfrastructure Needs
BEACON's Cyberinfrastructure Needsc.titus.brown
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
 

Ähnlich wie Mik Black bioinformatics symposium (20)

Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
 
Data is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesData is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniques
 
Data are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesData are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniques
 
MOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPMOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLP
 
MOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLPMOBILE DEVICE FORENSICS USING NLP
MOBILE DEVICE FORENSICS USING NLP
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNA
 
Natusfera Lifewatch Competence Center EGI amsterdam 2016 small
Natusfera Lifewatch Competence Center EGI amsterdam 2016  smallNatusfera Lifewatch Competence Center EGI amsterdam 2016  small
Natusfera Lifewatch Competence Center EGI amsterdam 2016 small
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
BEACON's Cyberinfrastructure Needs
BEACON's Cyberinfrastructure NeedsBEACON's Cyberinfrastructure Needs
BEACON's Cyberinfrastructure Needs
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 

Mik Black bioinformatics symposium

  • 1. How to make bioinformatics accessible to normal people! Mik Black Department of Biochemistry University of Otago
  • 2. Some musings… •  Accessibility - two aspects: 1.  Methodology development distribution (can you get it?) 2.  Methodology uptake (can you use it?) •  “Normal people”: 1.  Who are they? 2.  What do they want? What do they need? Is there a happy medium?
  • 3. My background •  Statistical design and analysis of microarray experiments: –  Methodology development –  Applying existing bioinformatics techniques –  Adapting “standard” statistical methods –  “Forensic” analysis •  Technologies: –  Microarrays (mRNA, SNPs, CNV/CGH) –  Second generation sequencing (SNP/CNV)
  • 5.
  • 6. The joys of the command line… •  Large amounts of statistical genomics methodology available via R –  Accessible? –  Uptake? –  Who are the end users? •  Can’t we just teach EVERYONE to use R?
  • 7. http://www.broad.mit.edu/cancer/software/genepattern/ Reich et al. (2006) GenePattern 2.0., Nature Genetics, 38, 500-501. •  GenePattern provides a web-based method for analysing microarray (and other) data. •  Provides a simple interface to tools developed in Java, R, Matlab and other languages. •  Analysis performed on server: –  no compute resources required by users. –  Facilitates sharing of results.
  • 8. Using GenePattern •  User friendly –  Third year bioinformatics course at Otago. –  Workshop for lab personnel at TGen. •  Guided analysis –  Facilitates use of standard analysis methods. –  Pipeline creation and “versioned” analysis. •  End users? –  At Otago: 3rd 4th year Biochemistry students –  At TGen: lab techs, iterns, bench scientists, PIs
  • 9. Common analysis tasks •  Basic data analysis/exploration: –  Heatmap creation –  Hierarchical clustering –  Identifying differentially expressed genes –  Gene set analysis –  Survival analysis •  GenePattern provides these tools in a modular format.
  • 12.
  • 13.
  • 14.
  • 17. BeSTGRID: Broadband-enabled Science and Technology GRID http://www.bestgrid.org
  • 18. GenePattern on BeSTGRID •  Services: –  GenePattern server –  Development environment (server and SVN) •  GenePattern training (coming soon): –  Basic usage –  Module development (uptake path for bioinformatics tool developers)
  • 19. Next steps… •  Full GenePattern deployment: –  Transfer of development modules to public server –  Documentation and training –  Use of ROCKS cluster for job submission •  Modules for Second Gen Sequencing data: –  DNAseq, RNAseq, ChIPseq –  R/Bioconductor (e.g., ShortRead, Biostrings, RSamTools, GenomeGraphs…) –  Analysis, visualization and quality assurance
  • 20. Community effort •  Some current examples: –  VISG/MapNet: statisticians geneticists –  BeSTGRID: middleware development deployment through to end users –  CTCR: cancer researchers clinicians •  Each group has the goal of placing powerful (and useful, and usable) tools into the hands of end users.
  • 21. Bioinformatics community •  NZGL provides opportunity for community-based effort. –  National infrastructure for genomics research –  Includes strong bioinformatics component •  Key issue: engagement with end users –  Methodology development and distribution –  Uptake, interaction and training
  • 22. Bioinformatics community •  NZGL provides opportunity for community-based effort. –  National infrastructure for genomics research –  Includes strong bioinformatics component •  Key issue: engagement with end users –  Methodology development and distribution –  Uptake and interaction LETS GO FIND US SOME “NORMAL” PEOPLE!
  • 23. Acknowledgements University of Otago The University of Auckland Marcus Davy Nick Jones Tim Molteno Mark Gahegan Thomas Allen Yuriy Halytskyy Sarah Song Cristin Print Chris Brown Daniel Hurley Anthony Reeve Christoff Knapp Tony Merriman University of Canterbury Vladimir Mencl