Bioinformatics Career Day
24 May 2012




Felix Klein
Background


    • physics diploma, University of Heidelberg



    • diploma thesis in radiation dosimetry
      at DKFZ


    • measurements at HIT




2      24.05.2012     Felix Klein
Why bioinformatics?


    • interdisciplinary

    • programmed in R

    • worked on data analysis




Progress in science is driven by technology




Chromatin loops




Investigation of chromatin 3D structure
    • role of chromatin 3D structure in gene regulation

    • 4C to investigate detailed interactions of
      cis-regulatory modules (CRMs)

    • global chromatin interactome using Hi-C




Investigation of chromatin 3D structure




Automated analysis of microscopy based RNAi screens

[Figure: analysis pipeline.
 Steps: Imaging, Segmentation, Features extraction, Classification, Summary.
 Panels: Source image, Calibrated image, Segmentation mask, Object features, Objects labels, Phenotypic profile.
 Class labels: aft, apt, neg, int, pos.]

Object features:

           g.x        g.y  g.s g.p     g.pdm
 [1,] 123.1391   3.288660  194  67  9.241719
 [2,] 206.7460   9.442248  961 153 20.513190
 [3,] 502.9589   7.616438  219  60  8.286918
 [4,]  20.1919  22.358418 1568 157 22.219461
 [5,] 344.7959  45.501992 2259 233 35.158966
 [6,] 188.2611  50.451863 2711 249 28.732680
 [7,] 269.7996  46.404036 2131 180 26.419631
 [8,] 106.6127  58.364243 1348 143 21.662879
 [9,] 218.5582  77.299007 1913 215 25.724580
[10,]  19.1766  81.840147 1908 209 26.303760
[11,]   6.3558  62.017647  340  68 10.314127
[12,]  58.9873  86.034128 2139 214 27.463158
[13,] 245.1087  94.387405 1048 123 18.280901
[14,] 411.2741 109.198678 2572 225 28.660816
[15,] 167.8151 107.966014 1942 160 24.671533
[16,] 281.7084 121.609892 2871 209 31.577270
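The feature-extraction step on this slide can be illustrated with a minimal sketch. It assumes, from the column names only, that g.x and g.y are an object's centre coordinates, g.s its size in pixels, g.p its perimeter and g.pdm its mean centre-to-boundary distance; the slide does not define them. The function `object_features` and its boundary definition (a pixel with at least one 4-neighbour outside the object) are hypothetical choices for illustration, not the method used to produce the numbers above.

```python
from math import hypot


def object_features(mask):
    """Compute simple shape features per labelled object.

    mask: list of rows; 0 is background, k > 0 marks pixels of object k.
    Returns {label: {"x", "y", "size", "perimeter", "mean_radius"}}.
    """
    height, width = len(mask), len(mask[0])

    # Collect the pixel coordinates belonging to each object label.
    pixels = {}
    for y, row in enumerate(mask):
        for x, label in enumerate(row):
            if label:
                pixels.setdefault(label, []).append((x, y))

    def is_boundary(x, y, label):
        # Assumption: a boundary pixel has a 4-neighbour outside the object.
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nx, ny = x + dx, y + dy
            outside = not (0 <= nx < width and 0 <= ny < height)
            if outside or mask[ny][nx] != label:
                return True
        return False

    features = {}
    for label, pts in sorted(pixels.items()):
        size = len(pts)
        cx = sum(x for x, _ in pts) / size
        cy = sum(y for _, y in pts) / size
        boundary = [(x, y) for x, y in pts if is_boundary(x, y, label)]
        mean_radius = sum(hypot(x - cx, y - cy) for x, y in boundary) / len(boundary)
        features[label] = {
            "x": cx,
            "y": cy,
            "size": size,
            "perimeter": len(boundary),
            "mean_radius": mean_radius,
        }
    return features


# Toy segmentation mask: one 3 x 3 object (label 1) on a 5 x 5 background.
toy_mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
toy_features = object_features(toy_mask)
print(toy_features[1])
```

Each object yields one row of features, which is the shape of input a classifier in the next pipeline step would consume.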
What was important for me?
    • bioinformatics group with
      members of diverse
      backgrounds

    • PI who successfully
      trained bioinformaticians

    • well-established group in
      bioinformatics




What might be interesting for you
     • turn data into biology

     • interaction with people from biology groups

     • communication skills !!!

     • workload divides mainly into:
        • programming (50 %)
        • reports, meetings, email




Acknowledgements
Wolfgang Huber
Simon Anders
Joseph Barry
Bernd Fischer
Julian Gehring
Aleksandra Pekowska
Paul Theodor Pyl
Alejandro Reyes
Maria Secrier

Collaborators:
Michael Boutros
Christian Volz

Eileen Furlong
Yad Ghavi Helm



Data production rates
LHC: 1.8 GB/s at peak capacity (i.e. while actively conducting a
primary aspect of the LHC's four main experiments: ATLAS,
ALICE, CMS, and LHCb).
These experiments will take roughly a decade to complete, and
each of them is expected to produce over 1 PB of data per
year.

One Illumina HiSeq: up to 600 Gb (gigabases) per run, i.e. ~600 GB per 10 days ≈
18 TB/year (not including derived data, e.g. BAM files)
One Digital Embryo (2008): 3.5 TB (2048 x 2048 x 370 x 1226)
EMBL-EBI: in 9/2011, data storage capacity was 14 PB
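The arithmetic behind two of these figures can be checked with a short sketch. The assumptions are not stated on the slide: that 600 GB per run uses decimal units, that the Digital Embryo dimensions are 2048 x 2048 pixels x 370 planes x 1226 time points, and that each voxel takes 2 bytes (16-bit). The runs-per-year values are illustrative.

```python
# Figures taken from the slide.
GB_PER_RUN = 600      # approximate data volume of one HiSeq run
DAYS_PER_RUN = 10     # approximate duration of one run


def hiseq_tb_per_year(runs_per_year):
    """Yearly HiSeq output in decimal TB for a given number of runs."""
    return GB_PER_RUN * runs_per_year / 1000


# Back-to-back runs all year (assumption: no downtime).
tb_continuous = hiseq_tb_per_year(365 / DAYS_PER_RUN)
# About 30 runs per year (assumption: some downtime between runs).
tb_30_runs = hiseq_tb_per_year(30)

# Digital Embryo: total voxels from the dimensions on the slide.
voxels = 2048 * 2048 * 370 * 1226
BYTES_PER_VOXEL = 2   # assumption: 16-bit intensity values
embryo_bytes = voxels * BYTES_PER_VOXEL
embryo_tib = embryo_bytes / 2**40    # binary units
embryo_tb = embryo_bytes / 10**12    # decimal units

print(f"HiSeq, continuous runs: {tb_continuous:.1f} TB/year")
print(f"HiSeq, 30 runs per year: {tb_30_runs:.1f} TB/year")
print(f"Digital Embryo: {embryo_tib:.2f} TiB ({embryo_tb:.2f} TB)")
```

Under these assumptions the 18 TB/year figure corresponds to about 30 runs per year rather than uninterrupted operation, and the 3.5 TB figure matches 16-bit voxels counted in binary units.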
