SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
The past, present, and future of HPC in life sciences
Erich Birngruber, Ümit Seren
Gregor Mendel Institute for Molecular Plant Biology (GMI)
AHPC17
Who we are
- Basic research institute in plant sciences
- 9 independent research groups
- Employees 100 + 20 (scientific + admin)
- HPC Operations Team: 2 + 1 (engineer + lead)
Past: Beginnings as traditional HPC
Scientific computing at GMI
- Started in 2010
- SGI ICE-X since 2013 (MENDEL)
(72 nodes, 144 today)
- SGI UV2000
- Rich software environment
(EasyBuild, lmod)
- Keeping up with current
developments
Machine specs
3 generations of nodes:
- 72x 16c E5-2609, 192gb mem
- 18x 20c E5-2680, 256gb mem
- 54x 24c E5-2650, 256gb mem,
230gb ssd
UV2000: 96c E5-4617, 2tb mem
IB FDR interconnect (1 fabric)
Storage: Lustre 300tb, NetApp >1pb
Past: System architecture
Present: GMI site specifics
- Services: customers are biologists
- On campus initial training
- Consulting and support (w/ ticket system, intranet wiki)
- Software installations
- Provided as modules: different versions, repeatability
- This is getting harder with the demand for more complex software
- Monitoring software usage
Present: Monitoring software usage
- Software in env modules
- 460 software packages
in 1297 versions
- Monitoring module usage
(load, unload)
- Reporting by user, job, project
Present: Monitoring system activity
Monitoring and metrics
The foundation for all future decisions
- Resource consumption
- Capacity planning
- Software, technology usage
- Auditing
Alerting
Nodestatus
Jobresources
Present: Applications & Appliances
Phenobox (in development)
- Web-interface, API
- MySQL (DB)
- DSLR, RaspberryPi
- HPC (computer vision, storage)
GWA-Portal (https://gwas.gmi.oeaw.ac.at)
- Web-interface, API
- Elasticsearch (fulltext search)
- PostgreSQL (DB)
- Docker (Python microservices)
- HPC (analysis, storage)
Galaxy (https://galaxyproject.org/)
- Web-interface, API
- MySQL (DB)
- Visualization
- HPC (analysis, storage)
PacBio SMRT Link
(https://github.com/PacificBiosciences/SMRT-Link)
- Web-interface, API
- MySQL (DB)
- HPC (analysis, storage)
Own developments: 3rd party software:
Present: new developments
Deployment of OpenStack (IaaS):
- Cross-vendor open source project
- On-premises cloud
- Provision VMs and containers
- Deploy classic application services
- Enables self-service for customers
Consequences:
- More heterogeneous use-cases
- Customer base is increasing
- Non-human “customers” of HPC
- Services are more complex and
distributed over subsystems
Past: System architecture
Present: MENDEL, Openstack
Present: Problem 1: maintenance
- VMs are difficult to maintain
- Wrong abstraction for the use-case
- What is the next step?
- Containers?
- Container Orchestration Engines?
- Provide Software as a Service (SaaS)?
Fact is: the field is evolving
Present: MENDEL, Openstack
Future: Problem 2: integration
Applications sit on different islands:
HPC vs. Cloud
Drawbacks:
- Hard to maintain (infra)
- Hard to debug (app)
Vision: converged compute platform.
Unified infrastructure to schedule all
types of tasks
New challenges:
- Networking - Storage
- IDM - Accounting
What do others do?
Container Orchestration Engine (Google
Kubernetes, Docker Swarm, Apache Mesos)
First steps:
- Containers for HPC
- Biocontainers http://biocontainers.pro
- Singularity http://singularity.lbl.gov
- Current status: test deployment
Contact / References:
Erich Birngruber <erich.birngruber@gmi.oeaw.ac.at>, @ebirn
Ümit Seren <uemit.seren@gmi.oeaw.ac.at>, @timeu_s
GMI on Github:
https://github.com/Gregor-Mendel-Institute
Total recall: holistic metrics for broad systems performance and user experience visibility in a
data-intensive computing environment
https://dl.acm.org/citation.cfm?id=2835001
Acknowledgements
Gregor Mendel Institute
of Molecular Plant
Biology
Dr Bohr-Gasse 3
1030 Vienna, Austria
EOF

Weitere ähnliche Inhalte

Andere mochten auch

Gospel of hip hop
Gospel of hip hopGospel of hip hop
Gospel of hip hopJalen Terry
 
Important Personalities of Mahabharata
Important Personalities of Mahabharata Important Personalities of Mahabharata
Important Personalities of Mahabharata Abhishek Sharma
 
Pengolahan Limbah Cair dengan metode Elektrokoagulasi
Pengolahan Limbah Cair dengan metode Elektrokoagulasi Pengolahan Limbah Cair dengan metode Elektrokoagulasi
Pengolahan Limbah Cair dengan metode Elektrokoagulasi ansyahrobi
 
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshellFast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshellVictor Haydin
 
FIEV report on steel sector
FIEV report on steel sectorFIEV report on steel sector
FIEV report on steel sectorSourav Mahato
 
Kathryn Gregg Resume
Kathryn Gregg ResumeKathryn Gregg Resume
Kathryn Gregg ResumeKaydee Gregg
 
CASO CLÍNICO ORTO ADRIÁN QUIZHPE
CASO CLÍNICO ORTO ADRIÁN QUIZHPECASO CLÍNICO ORTO ADRIÁN QUIZHPE
CASO CLÍNICO ORTO ADRIÁN QUIZHPEAAQQ91
 
HPC in healthcare
HPC in healthcareHPC in healthcare
HPC in healthcareluckyanup
 
Introduction To Apache Mesos
Introduction To Apache MesosIntroduction To Apache Mesos
Introduction To Apache MesosJoe Stein
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC ConvergenceGeoffrey Fox
 
ODCA Board Best Practice: High Performance Computing at BMW
ODCA Board Best Practice: High Performance Computing at BMWODCA Board Best Practice: High Performance Computing at BMW
ODCA Board Best Practice: High Performance Computing at BMWOpen Data Center Alliance
 
Big Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudBig Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudAmazon Web Services
 
Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudAmazon Web Services
 
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)Amazon Web Services
 

Andere mochten auch (20)

Shooting schedule
Shooting scheduleShooting schedule
Shooting schedule
 
Gospel of hip hop
Gospel of hip hopGospel of hip hop
Gospel of hip hop
 
Annual-Report-2013
Annual-Report-2013Annual-Report-2013
Annual-Report-2013
 
Mood board
Mood boardMood board
Mood board
 
Doc1
Doc1Doc1
Doc1
 
Important Personalities of Mahabharata
Important Personalities of Mahabharata Important Personalities of Mahabharata
Important Personalities of Mahabharata
 
Pengolahan Limbah Cair dengan metode Elektrokoagulasi
Pengolahan Limbah Cair dengan metode Elektrokoagulasi Pengolahan Limbah Cair dengan metode Elektrokoagulasi
Pengolahan Limbah Cair dengan metode Elektrokoagulasi
 
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshellFast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
 
FIEV report on steel sector
FIEV report on steel sectorFIEV report on steel sector
FIEV report on steel sector
 
Kathryn Gregg Resume
Kathryn Gregg ResumeKathryn Gregg Resume
Kathryn Gregg Resume
 
CASO CLÍNICO ORTO ADRIÁN QUIZHPE
CASO CLÍNICO ORTO ADRIÁN QUIZHPECASO CLÍNICO ORTO ADRIÁN QUIZHPE
CASO CLÍNICO ORTO ADRIÁN QUIZHPE
 
HPC in healthcare
HPC in healthcareHPC in healthcare
HPC in healthcare
 
Digital pen
Digital penDigital pen
Digital pen
 
Introduction To Apache Mesos
Introduction To Apache MesosIntroduction To Apache Mesos
Introduction To Apache Mesos
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC Convergence
 
ODCA Board Best Practice: High Performance Computing at BMW
ODCA Board Best Practice: High Performance Computing at BMWODCA Board Best Practice: High Performance Computing at BMW
ODCA Board Best Practice: High Performance Computing at BMW
 
Big Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudBig Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS Cloud
 
Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS Cloud
 
HPC Market Update from IDC
HPC Market Update from IDCHPC Market Update from IDC
HPC Market Update from IDC
 
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
 

Ähnlich wie Past, present, and future of HPC in life sciences

General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school ISSGC Summer School
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsFerenc Szalai
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesAccelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesIntel® Software
 
NIIF Grid Development portfolio
NIIF Grid Development portfolioNIIF Grid Development portfolio
NIIF Grid Development portfolioFerenc Szalai
 
20160201_resume_Vladimir_Chesnokov
20160201_resume_Vladimir_Chesnokov20160201_resume_Vladimir_Chesnokov
20160201_resume_Vladimir_ChesnokovVladimir Chesnokov
 
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale SystemsDesigning HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systemsinside-BigData.com
 
Scallable Distributed Deep Learning on OpenPOWER systems
Scallable Distributed Deep Learning on OpenPOWER systemsScallable Distributed Deep Learning on OpenPOWER systems
Scallable Distributed Deep Learning on OpenPOWER systemsGanesan Narayanasamy
 
Pathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationPathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationEOSC-hub project
 
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataBest pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataXing Xu
 
EGI Cloud Compute service for EOSC-hub
EGI Cloud Compute service for EOSC-hub EGI Cloud Compute service for EOSC-hub
EGI Cloud Compute service for EOSC-hub EOSC-hub project
 
Designing High-Performance and Scalable Middleware for HPC, AI and Data Science
Designing High-Performance and Scalable Middleware for HPC, AI and Data ScienceDesigning High-Performance and Scalable Middleware for HPC, AI and Data Science
Designing High-Performance and Scalable Middleware for HPC, AI and Data ScienceObject Automation
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systemsinside-BigData.com
 
Designing High performance & Scalable Middleware for HPC
Designing High performance & Scalable Middleware for HPCDesigning High performance & Scalable Middleware for HPC
Designing High performance & Scalable Middleware for HPCObject Automation
 
Opentelemetry - From frontend to backend
Opentelemetry - From frontend to backendOpentelemetry - From frontend to backend
Opentelemetry - From frontend to backendSebastian Poxhofer
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spacejsvetter
 

Ähnlich wie Past, present, and future of HPC in life sciences (20)

General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school General Introduction to technologies that will be seen in the school
General Introduction to technologies that will be seen in the school
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesAccelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing Technologies
 
NIIF Grid Development portfolio
NIIF Grid Development portfolioNIIF Grid Development portfolio
NIIF Grid Development portfolio
 
20160201_resume_Vladimir_Chesnokov
20160201_resume_Vladimir_Chesnokov20160201_resume_Vladimir_Chesnokov
20160201_resume_Vladimir_Chesnokov
 
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale SystemsDesigning HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
 
Scallable Distributed Deep Learning on OpenPOWER systems
Scallable Distributed Deep Learning on OpenPOWER systemsScallable Distributed Deep Learning on OpenPOWER systems
Scallable Distributed Deep Learning on OpenPOWER systems
 
Pathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationPathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaboration
 
NWU and HPC
NWU and HPCNWU and HPC
NWU and HPC
 
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataBest pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
 
H2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional DesignH2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional Design
 
optimizing_ceph_flash
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flash
 
EGI Cloud Compute service for EOSC-hub
EGI Cloud Compute service for EOSC-hub EGI Cloud Compute service for EOSC-hub
EGI Cloud Compute service for EOSC-hub
 
Available HPC Resources at CSUC
Available HPC Resources at CSUCAvailable HPC Resources at CSUC
Available HPC Resources at CSUC
 
Designing High-Performance and Scalable Middleware for HPC, AI and Data Science
Designing High-Performance and Scalable Middleware for HPC, AI and Data ScienceDesigning High-Performance and Scalable Middleware for HPC, AI and Data Science
Designing High-Performance and Scalable Middleware for HPC, AI and Data Science
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
 
Designing High performance & Scalable Middleware for HPC
Designing High performance & Scalable Middleware for HPCDesigning High performance & Scalable Middleware for HPC
Designing High performance & Scalable Middleware for HPC
 
Opentelemetry - From frontend to backend
Opentelemetry - From frontend to backendOpentelemetry - From frontend to backend
Opentelemetry - From frontend to backend
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
 

Kürzlich hochgeladen

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 

Kürzlich hochgeladen (20)

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 

Past, present, and future of HPC in life sciences

  • 1. The past, present, and future of HPC in life sciences Erich Birngruber, Ümit Seren Gregor Mendel Institute for Molecular Plant Biology (GMI) AHPC17
  • 2. Who we are - Basic research institute in plant sciences - 9 independent research groups - Employees 100 + 20 (scientific + admin) - HPC Operations Team: 2 + 1 (engineer + lead)
  • 3. Past: Beginnings as traditional HPC Scientific computing at GMI - Started in 2010 - SGI ICE-X since 2013 (MENDEL) (72 nodes, 144 today) - SGI UV2000 - Rich software environment (EasyBuild, lmod) - Keeping up with current developments Machine specs 3 generations of nodes: - 72x 16c E5-2609, 192gb mem - 18x 20c E5-2680, 256gb mem - 54x 24c E5-2650, 256gb mem, 230gb ssd UV2000: 96c E5-4617, 2tb mem IB FDR interconnect (1 fabric) Storage: Lustre 300tb, NetApp >1pb
  • 5. Present: GMI site specifics - Services: customers are biologists - On campus initial training - Consulting and support (w/ ticket system, intranet wiki) - Software installations - Provided as modules: different versions, repeatability - This is getting harder with the demand for more complex software - Monitoring software usage
  • 6. Present: Monitoring software usage - Software in env modules - 460 software packages in 1297 versions - Monitoring module usage (load, unload) - Reporting by user, job, project
  • 7. Present: Monitoring system activity Monitoring and metrics The foundation for all future decisions - Resource consumption - Capacity planning - Software, technology usage - Auditing Alerting
  • 10. Present: Applications & Appliances Phenobox (in development) - Web-interface, API - MySQL (DB) - DSLR, RaspberryPi - HPC (computer vision, storage) GWA-Portal (https://gwas.gmi.oeaw.ac.at) - Web-interface, API - Elasticsearch (fulltext search) - PostgreSQL (DB) - Docker (Python microservices) - HPC (analysis, storage) Galaxy (https://galaxyproject.org/) - Web-interface, API - MySQL (DB) - Visualization - HPC (analysis, storage) PacBio SMRT Link (https://github.com/PacificBiosciences/SMRT-Link) - Web-interface, API - MySQL (DB) - HPC (analysis, storage) Own developments: 3rd party software:
  • 11. Present: new developments Deployment of OpenStack (IaaS): - Cross-vendor open source project - On-premises cloud - Provision VMs and containers - Deploy classic application services - Enables self-service for customers Consequences: - More heterogeneous use-cases - Customer base is increasing - Non-human “customers” of HPC - Services are more complex and distributed over subsystems
  • 14. Present: Problem 1: maintenance - VMs are difficult to maintain - Wrong abstraction for the use-case - What is the next step? - Containers? - Container Orchestration Engines? - Provide Software as a Service (SaaS)? Fact is: the field is evolving
  • 16. Future: Problem 2: integration Applications sit on different islands: HPC vs. Cloud Drawbacks: - Hard to maintain (infra) - Hard to debug (app) Vision: converged compute platform. Unified infrastructure to schedule all types of tasks New challenges: - Networking - Storage - IDM - Accounting What do others do? Container Orchestration Engine (Google Kubernetes, Docker Swarm, Apache Mesos) First steps: - Containers for HPC - Biocontainers http://biocontainers.pro - Singularity http://singularity.lbl.gov - Current status: test deployment
  • 17. Contact / References: Erich Birngruber <erich.birngruber@gmi.oeaw.ac.at>, @ebirn Ümit Seren <uemit.seren@gmi.oeaw.ac.at>, @timeu_s GMI on Github: https://github.com/Gregor-Mendel-Institute Total recall: holistic metrics for broad systems performance and user experience visibility in a data-intensive computing environment https://dl.acm.org/citation.cfm?id=2835001
  • 18. Acknowledgements Gregor Mendel Institute of Molecular Plant Biology Dr Bohr-Gasse 3 1030 Vienna, Austria
  • 19. EOF