Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and Bring-Your-Own-Resources

Larry Smarr
Larry SmarrInstitute Director, Calit2 um California Institute for Telecommunications and Information Technology
Open Infrastructure for an Open Society:
OSG, Commercial Clouds, and
Bring-Your-Own-Resources
4NRP
February 9th, 2023
• James Deaton
• Executive Director, Great Plains Network
• Derek Weitzel
• Research Assistant Professor, University of Nebraska-Lincoln,
OSG, PATh, PNRP
• Jeremy Evert
• Associate Professor, Computer Science, Southwestern
Oklahoma State University
• Igor Sfiligoi
• Lead Scientific Software Developer and Researcher, San Diego
Supercomputer Center
Open Infrastructure
Derek Weitzel – University of Nebraska-Lincoln
(Strictly Derek’s Opinions)
This project is supported by the National Science Foundation under Cooperative
Agreements OAC-2112167,. Any opinions, findings, conclusions or
recommendations expressed in this material are those of the authors and do not
necessarily reflect the views of the National Science Foundation.
National Research Platform
How is NRP “Open Infrastructure”
•All components are Open Source
• Kubernetes and containers
•Anyone can contribute resources
•Anyone can use the resources
•Documented Interfaces
•Resources were ”seeded” through various grants
• But grew with contributions from users
OSG
How is OSG “Open Infrastructure”
•All components are Open Source
• HTCondor and various tools
•Anyone can contribute resources
•Anyone can utilize the resources
•Interfaces are documented: osg-htc.org/docs
•Resources are ”seeded” by organizations such as LHC, and now CC*.
• But have grown through contributions of users
Open Science Data Federation
osdf.osg-htc.org
How is OSDF “Open Infrastructure”
• All components are Open Source
• Anyone can contribute resources
• Interfaces are documented
• Resources were “seeded” by various grants and Internet2
• But have grown by contributions from users, and soon CC*
Leveraging NRP on a smaller
campus
Jeremy Evert
Associate Professor, Southwestern Oklahoma State University
February 9th, 2023
About Southwestern Oklahoma State University
● 10th in the state in enrollment behind 2 community colleges
● 5,000 students across two campuses
○ Formerly a teaching college
○ Formerly a tribal serving institution
● Non-PhD Granting
● Serves a portion of the minorities in the area
● Around 200 full time faculty and about 60% hold a terminal degree
Bringing Our Own Resource
● SWOSU had: 200 Sq. Ft. Server closet, 5 ton A/C, 42U rack
○ NSF CC* switch
● Dell Server, 96 AMD cores, some memory, spinning disk, small gpu
● San Diego team guided SWOSU through NVMe storage upgrade
● Faculty installed Ubuntu for a base OS
● OneNet (State ISP) helped troubleshoot network
● San Diego deployed Nautilus node
● James Deaton enabled user authentication through
github.com/SWOSU
● OneNet (state ISP) and SWOSU central IT provided an alias for
jupyter.swosu.edu
Engage and empower every SWOSU student
● SWOSU Computer Science Discrete Structures assignment: join
GitHub.com/swosu
● Students are pointed to our server as soon as they start running codes
that heat up their laptop
● Promoted on every syllabus I have
Engage and empower every elementary and high
school student and researcher
● SWOSU invites area technology teachers for a weeklong camp
○ Esports, graphic design, Microsoft, and programming
● Full day on teaching programming
● Teachers run jobs on jupyter.swosu.edu
Supporting SWOSU for the next 10 years
● Enable more science drivers
○ Physics, Math, Biology, and other Compute Science faculty
● Partner with SWOSU Education Department to integrate more of the
Campus Champions / Carpentries type trainings into new primary
education curriculum
● Leverage mentors from NRP / Great Plains Network / OneNet /
OneOklahoma Cyber Infrastructure Initiative to keep growing
○ Look to NSF CC* or small school MRI to expand current platform
Please consider a weekly statewide call
● Set up a email list
● Encourage key players to join
● Allowing staff to show up and make connections
● Look for ways to add value to the individuals and larger community
● Connection to a larger community enables faculty at smaller schools
Open Infrastructure for an Open Society:
Commercial Clouds
Igor Sfiligoi
University of California San Diego
San Diego Supercomputer Center
Fourth National Research Platform (4NRP) – Feb 9th, 2023 1
Who cares about Commercial Clouds?
• Seems like everyone in industry is moving there!
• Not really, but it does look like it
• The big players have huge compute capacity
• Personally verified I can access 50k GPUs
• Others demonstrated access to several million CPU cores
• They have a large variety of compute resources
• Many x86 variants and several ARM CPUs
• Many GPU variants
• AI accelerators and FPGAs
• Great networking setups (both WAN and HPC-class LAN/Infiniband)
2
Our “famous” Cloud burst
3
From 0 to 50k GPUs
in about 2 hours
Who cares about Commercial Clouds?
• Seems like everyone in industry is moving there!
• Not really, but it does look like it
• The big players have huge compute capacity
• Personally verified I can access 50k GPUs
• Others demonstrated access to several million CPU cores
• They have a large variety of compute resources
• Many x86 variants and several ARM CPUs
• Many GPU variants
• AI accelerators and FPGAs
• Great networking setups (both WAN and HPC-class LAN/Infiniband)
4
Often have new HW
available before
you can buy it
5
Also,
Cloud-exclusive
HW variants
• CPUs
• INTEL Saphire Rapids available
on Google Cloud now
• AMD EPYC Milan-X available on
Azure now
• AMD EPYC Genoa in preview
• NVIDIA GPUs
• A10s were available in AWS
in 2021
• ARM CPUs
• AWS has its own ARM CPU
• Azure and Google regular one
• AI Accelerators
• AWS has Inferentia
• Google has TPUs
• AWS also offers Habana Gaudi
• FPGAs
• AWS had FPGAs since forever
Who cares about Commercial Clouds?
• Seems like everyone in industry is moving there!
• Not really, but it does look like it
• The big players have huge compute capacity
• Personally verified I can access 50k GPUs
• Others demonstrated access to several million CPU cores
• They have a large variety of compute resources
• Many x86 variants and several ARM CPUs
• Many GPU variants
• AI accelerators and FPGAs
• Great networking setups (both WAN and HPC-class LAN/Infiniband)
6
7
Azure HPC instances offer 1.6 Tbps Infiniband per-node networking
A couple
extremes
Pros and cons of Commercial Clouds
• Pros:
• See previous slide
• No need to go through allocation processes… all you need is money
• Cons:
• You need money
• And lots of it
• ”Regular”, on-demand Cloud computing is expensive
• Anywhere between 3x and 10x what you would pay on-prem on 24/7 basis
• Spot pricing is almost comparable to on-prem, but only useful for preemptible work
• Easy to get in, hard to get out
• Pricing optimized to let data get in cheaply, but expensive to move out
• No automatic price caps, easy to overspend
8
9
vs
Uber/Lyft
Public transit
Both will get you from A to B
Which one would you pick?
10
vs
Private jet
Commercial airline
Ticket bought 2 months in advance
in economy class
through your travel department
Both will get you from A to B
Which one would you pick?
Who should consider Commercial Cloud?
• Flexible/urgent computing
• Hard to beat the scalability of the clouds
• Costs acceptable for short spikes
• Prototyping, R&D
• The variety of HW available in the clouds is hard to match
• Instant access, no-contention drastically raises productivity
• Ultra-High-Availability services
• Hard to beat the breath of Cloud deployments
• Many large datacenters, proven track record
11
Is Commercial Cloud easy to use?
• Yes and no
• Provide enormous flexibility
• You can do virtually everything you could do with your personal server
• But that can be daunting for non-IT users
• Lots of support services
• No need to reinvent the wheel, just pick one
• Finding what you need can be a challenge, lots of competing options
• Cloud providers invest a lot in the user interfaces
• More intuitive than anything you will find on-prem
• But each provider has its own flavor
• How do you mix on-prem and Cloud resources?
12
Facilitating Cloud access for science users
• CloudBank
• Account management and monitoring (I love their spend/budget tracking!)
• Extensive documentation/training
• Integrate with OSG/PATh/HTCondor ecosystem
• IT-savvy support staff can easily add cloud resources to a HTCondor pool
• Users see only HTCondor, cloud HW no different that on-prem HW
• Kubernetes (k8s) to the rescue
• All Cloud Providers expose a Kubernetes interface, too
• Cloud k8s feels like on-prem k8s (at least for compute)
• Kubernetes federation can make it completely transparent, e.g. from Nautilus
13
14
Open for discussion
1 von 30

Recomendados

The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data von
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data DataCentred
217 views24 Folien
An Introduction to Red Hat Enterprise Linux OpenStack Platform von
An Introduction to Red Hat Enterprise Linux OpenStack PlatformAn Introduction to Red Hat Enterprise Linux OpenStack Platform
An Introduction to Red Hat Enterprise Linux OpenStack PlatformYandex
691 views14 Folien
Coding Secure Infrastructure in the Cloud using the PIE framework von
Coding Secure Infrastructure in the Cloud using the PIE frameworkCoding Secure Infrastructure in the Cloud using the PIE framework
Coding Secure Infrastructure in the Cloud using the PIE frameworkJames Wickett
3K views72 Folien
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314) von
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)Amazon Web Services
2.3K views51 Folien
How to Build a Compute Cluster von
How to Build a Compute ClusterHow to Build a Compute Cluster
How to Build a Compute ClusterRamsay Key
57 views35 Folien
e-Infrastructure available for research, using the right tool for the right job von
e-Infrastructure available for research, using the right tool for the right jobe-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right jobDavid Wallom
283 views64 Folien

Más contenido relacionado

Similar a Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and Bring-Your-Own-Resources

UI Dev in Big data world using open source von
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open sourceTech Triveni
113 views56 Folien
Supporting Research through "Desktop as a Service" models of e-infrastructure... von
Supporting Research through "Desktop as a Service" models of e-infrastructure...Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...David Wallom
125 views24 Folien
Hpc lunch and learn von
Hpc lunch and learnHpc lunch and learn
Hpc lunch and learnJohn D Almon
709 views59 Folien
Application Virtualization, University of New Hampshire von
Application Virtualization, University of New HampshireApplication Virtualization, University of New Hampshire
Application Virtualization, University of New HampshireTony Austwick
602 views28 Folien
Yow Conference Dec 2013 Netflix Workshop Slides with Notes von
Yow Conference Dec 2013 Netflix Workshop Slides with NotesYow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with NotesAdrian Cockcroft
49.3K views187 Folien
Utilising Cloud Computing for Research through Infrastructure, Software and D... von
Utilising Cloud Computing for Research through Infrastructure, Software and D...Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...David Wallom
304 views32 Folien

Similar a Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and Bring-Your-Own-Resources(20)

UI Dev in Big data world using open source von Tech Triveni
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open source
Tech Triveni113 views
Supporting Research through "Desktop as a Service" models of e-infrastructure... von David Wallom
Supporting Research through "Desktop as a Service" models of e-infrastructure...Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...
David Wallom125 views
Application Virtualization, University of New Hampshire von Tony Austwick
Application Virtualization, University of New HampshireApplication Virtualization, University of New Hampshire
Application Virtualization, University of New Hampshire
Tony Austwick602 views
Yow Conference Dec 2013 Netflix Workshop Slides with Notes von Adrian Cockcroft
Yow Conference Dec 2013 Netflix Workshop Slides with NotesYow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
Adrian Cockcroft49.3K views
Utilising Cloud Computing for Research through Infrastructure, Software and D... von David Wallom
Utilising Cloud Computing for Research through Infrastructure, Software and D...Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...
David Wallom304 views
SolidFire + Platform9: Simply Faster OpenStack von Platform9
SolidFire + Platform9: Simply Faster OpenStackSolidFire + Platform9: Simply Faster OpenStack
SolidFire + Platform9: Simply Faster OpenStack
Platform9477 views
Sanger, upcoming Openstack for Bio-informaticians von Peter Clapham
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
Peter Clapham271 views
2022-Devnexus-StatefulMicroservices.pptx.pdf von Grace Jansen
2022-Devnexus-StatefulMicroservices.pptx.pdf2022-Devnexus-StatefulMicroservices.pptx.pdf
2022-Devnexus-StatefulMicroservices.pptx.pdf
Grace Jansen129 views
A Summary about Hykes' Keynote on Dockercon 2015 von Henry Huang
A Summary about Hykes' Keynote on Dockercon 2015A Summary about Hykes' Keynote on Dockercon 2015
A Summary about Hykes' Keynote on Dockercon 2015
Henry Huang1.1K views
What ya gonna do? von CQD
What ya gonna do?What ya gonna do?
What ya gonna do?
CQD364 views
Technical standards & the RDTF Vision: some considerations von Paul Walk
Technical standards & the RDTF Vision: some considerationsTechnical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerations
Paul Walk658 views
2016 05 sanger von Chris Dwan
2016 05 sanger2016 05 sanger
2016 05 sanger
Chris Dwan545 views

Más de Larry Smarr

Panel: Reaching More Minority Serving Institutions von
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
80 views100 Folien
Global Network Advancement Group - Next Generation Network-Integrated Systems von
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsLarry Smarr
109 views72 Folien
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... von
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...Larry Smarr
98 views13 Folien
Panel Discussion: Engaging underrepresented technologists, researchers, and e... von
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Larry Smarr
84 views12 Folien
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon von
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonLarry Smarr
93 views22 Folien
Panel: Reaching More Minority Serving Institutions von
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
8 views100 Folien

Más de Larry Smarr(20)

Panel: Reaching More Minority Serving Institutions von Larry Smarr
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
Larry Smarr80 views
Global Network Advancement Group - Next Generation Network-Integrated Systems von Larry Smarr
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
Larry Smarr109 views
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... von Larry Smarr
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Larry Smarr98 views
Panel Discussion: Engaging underrepresented technologists, researchers, and e... von Larry Smarr
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Larry Smarr84 views
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon von Larry Smarr
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
Larry Smarr93 views
Panel: Reaching More Minority Serving Institutions von Larry Smarr
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
Larry Smarr8 views
Panel: The Global Research Platform: An Overview von Larry Smarr
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
Larry Smarr94 views
Panel: Future Wireless Extensions of Regional Optical Networks von Larry Smarr
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical Networks
Larry Smarr119 views
Global Research Platform Workshops - Maxine Brown von Larry Smarr
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine Brown
Larry Smarr92 views
Built around answering questions von Larry Smarr
Built around answering questionsBuilt around answering questions
Built around answering questions
Larry Smarr101 views
Panel: NRP Science Impacts​ von Larry Smarr
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
Larry Smarr92 views
Democratizing Science through Cyberinfrastructure - Manish Parashar von Larry Smarr
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
Larry Smarr114 views
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses; von Larry Smarr
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Larry Smarr92 views
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je... von Larry Smarr
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Larry Smarr101 views
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B... von Larry Smarr
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Larry Smarr193 views
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B... von Larry Smarr
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Larry Smarr7 views
Frank Würthwein - NRP and the Path forward von Larry Smarr
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forward
Larry Smarr130 views
Global Network Advancement Group Next Generation Network-Integrated Sys... von Larry Smarr
      Global Network Advancement GroupNext Generation Network-Integrated Sys...      Global Network Advancement GroupNext Generation Network-Integrated Sys...
Global Network Advancement Group Next Generation Network-Integrated Sys...
Larry Smarr42 views
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks von Larry Smarr
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical NetworksRobert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Larry Smarr5 views
Larry Smarr - NRP Application Drivers von Larry Smarr
Larry Smarr - NRP Application DriversLarry Smarr - NRP Application Drivers
Larry Smarr - NRP Application Drivers
Larry Smarr141 views

Último

Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure von
Astera Labs:  Intelligent Connectivity for Cloud and AI InfrastructureAstera Labs:  Intelligent Connectivity for Cloud and AI Infrastructure
Astera Labs: Intelligent Connectivity for Cloud and AI InfrastructureCXL Forum
125 views16 Folien
The Importance of Cybersecurity for Digital Transformation von
The Importance of Cybersecurity for Digital TransformationThe Importance of Cybersecurity for Digital Transformation
The Importance of Cybersecurity for Digital TransformationNUS-ISS
25 views26 Folien
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM von
Samsung: CMM-H Tiered Memory Solution with Built-in DRAMSamsung: CMM-H Tiered Memory Solution with Built-in DRAM
Samsung: CMM-H Tiered Memory Solution with Built-in DRAMCXL Forum
105 views7 Folien
Business Analyst Series 2023 - Week 3 Session 5 von
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5DianaGray10
165 views20 Folien
Photowave Presentation Slides - 11.8.23.pptx von
Photowave Presentation Slides - 11.8.23.pptxPhotowave Presentation Slides - 11.8.23.pptx
Photowave Presentation Slides - 11.8.23.pptxCXL Forum
126 views16 Folien
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa... von
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...The Digital Insurer
28 views18 Folien

Último(20)

Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure von CXL Forum
Astera Labs:  Intelligent Connectivity for Cloud and AI InfrastructureAstera Labs:  Intelligent Connectivity for Cloud and AI Infrastructure
Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure
CXL Forum125 views
The Importance of Cybersecurity for Digital Transformation von NUS-ISS
The Importance of Cybersecurity for Digital TransformationThe Importance of Cybersecurity for Digital Transformation
The Importance of Cybersecurity for Digital Transformation
NUS-ISS25 views
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM von CXL Forum
Samsung: CMM-H Tiered Memory Solution with Built-in DRAMSamsung: CMM-H Tiered Memory Solution with Built-in DRAM
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM
CXL Forum105 views
Business Analyst Series 2023 - Week 3 Session 5 von DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10165 views
Photowave Presentation Slides - 11.8.23.pptx von CXL Forum
Photowave Presentation Slides - 11.8.23.pptxPhotowave Presentation Slides - 11.8.23.pptx
Photowave Presentation Slides - 11.8.23.pptx
CXL Forum126 views
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa... von The Digital Insurer
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...
Webinar : Competing for tomorrow’s leaders – How MENA insurers can win the wa...
Understanding GenAI/LLM and What is Google Offering - Felix Goh von NUS-ISS
Understanding GenAI/LLM and What is Google Offering - Felix GohUnderstanding GenAI/LLM and What is Google Offering - Felix Goh
Understanding GenAI/LLM and What is Google Offering - Felix Goh
NUS-ISS39 views
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu... von NUS-ISS
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
NUS-ISS32 views
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor... von Vadym Kazulkin
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
Vadym Kazulkin70 views
Spesifikasi Lengkap ASUS Vivobook Go 14 von Dot Semarang
Spesifikasi Lengkap ASUS Vivobook Go 14Spesifikasi Lengkap ASUS Vivobook Go 14
Spesifikasi Lengkap ASUS Vivobook Go 14
Dot Semarang35 views
Empathic Computing: Delivering the Potential of the Metaverse von Mark Billinghurst
Empathic Computing: Delivering  the Potential of the MetaverseEmpathic Computing: Delivering  the Potential of the Metaverse
Empathic Computing: Delivering the Potential of the Metaverse
Mark Billinghurst449 views
AMD: 4th Generation EPYC CXL Demo von CXL Forum
AMD: 4th Generation EPYC CXL DemoAMD: 4th Generation EPYC CXL Demo
AMD: 4th Generation EPYC CXL Demo
CXL Forum126 views
GigaIO: The March of Composability Onward to Memory with CXL von CXL Forum
GigaIO: The March of Composability Onward to Memory with CXLGigaIO: The March of Composability Onward to Memory with CXL
GigaIO: The March of Composability Onward to Memory with CXL
CXL Forum126 views
The details of description: Techniques, tips, and tangents on alternative tex... von BookNet Canada
The details of description: Techniques, tips, and tangents on alternative tex...The details of description: Techniques, tips, and tangents on alternative tex...
The details of description: Techniques, tips, and tangents on alternative tex...
BookNet Canada110 views
CXL at OCP von CXL Forum
CXL at OCPCXL at OCP
CXL at OCP
CXL Forum208 views
Future of Learning - Khoong Chan Meng von NUS-ISS
Future of Learning - Khoong Chan MengFuture of Learning - Khoong Chan Meng
Future of Learning - Khoong Chan Meng
NUS-ISS31 views

Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and Bring-Your-Own-Resources

  • 1. Open Infrastructure for an Open Society: OSG, Commercial Clouds, and Bring-Your-Own-Resources 4NRP February 9th, 2023
  • 2. • James Deaton • Executive Director, Great Plains Network • Derek Weitzel • Research Assistant Professor, University of Nebraska-Lincoln, OSG, PATh, PNRP • Jeremy Evert • Associate Professor, Computer Science, Southwestern Oklahoma State University • Igor Sfiligoi • Lead Scientific Software Developer and Researcher, San Diego Supercomputer Center
  • 3. Open Infrastructure Derek Weitzel – University of Nebraska-Lincoln (Strictly Derek’s Opinions) This project is supported by the National Science Foundation under Cooperative Agreements OAC-2112167,. Any opinions, findings, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.
  • 5. How is NRP “Open Infrastructure” •All components are Open Source • Kubernetes and containers •Anyone can contribute resources •Anyone can use the resources •Documented Interfaces •Resources were ”seeded” through various grants • But grew with contributions from users
  • 6. OSG
  • 7. How is OSG “Open Infrastructure” •All components are Open Source • HTCondor and various tools •Anyone can contribute resources •Anyone can utilize the resources •Interfaces are documented: osg-htc.org/docs •Resources are ”seeded” by organizations such as LHC, and now CC*. • But have grown through contributions of users
  • 8. Open Science Data Federation osdf.osg-htc.org
  • 9. How is OSDF “Open Infrastructure” • All components are Open Source • Anyone can contribute resources • Interfaces are documented • Resources were “seeded” by various grants and Internet2 • But have grown by contributions from users, and soon CC*
  • 10. Leveraging NRP on a smaller campus Jeremy Evert Associate Professor, Southwestern Oklahoma State University February 9th, 2023
  • 11. About Southwestern Oklahoma State University ● 10th in the state in enrollment behind 2 community colleges ● 5,000 students across two campuses ○ Formerly a teaching college ○ Formerly a tribal serving institution ● Non-PhD Granting ● Serves a portion of the minorities in the area ● Around 200 full time faculty and about 60% hold a terminal degree
  • 12. Bringing Our Own Resource ● SWOSU had: 200 Sq. Ft. Server closet, 5 ton A/C, 42U rack ○ NSF CC* switch ● Dell Server, 96 AMD cores, some memory, spinning disk, small gpu ● San Diego team guided SWOSU through NVMe storage upgrade ● Faculty installed Ubuntu for a base OS ● OneNet (State ISP) helped troubleshoot network ● San Diego deployed Nautilus node ● James Deaton enabled user authentication through github.com/SWOSU ● OneNet (state ISP) and SWOSU central IT provided an alias for jupyter.swosu.edu
  • 13. Engage and empower every SWOSU student ● SWOSU Computer Science Discrete Structures assignment: join GitHub.com/swosu ● Students are pointed to our server as soon as they start running codes that heat up their laptop ● Promoted on every syllabus I have
  • 14. Engage and empower every elementary and high school student and researcher ● SWOSU invites area technology teachers for a weeklong camp ○ Esports, graphic design, Microsoft, and programming ● Full day on teaching programming ● Teachers run jobs on jupyter.swosu.edu
  • 15. Supporting SWOSU for the next 10 years ● Enable more science drivers ○ Physics, Math, Biology, and other Compute Science faculty ● Partner with SWOSU Education Department to integrate more of the Campus Champions / Carpentries type trainings into new primary education curriculum ● Leverage mentors from NRP / Great Plains Network / OneNet / OneOklahoma Cyber Infrastructure Initiative to keep growing ○ Look to NSF CC* or small school MRI to expand current platform
  • 16. Please consider a weekly statewide call ● Set up a email list ● Encourage key players to join ● Allowing staff to show up and make connections ● Look for ways to add value to the individuals and larger community ● Connection to a larger community enables faculty at smaller schools
  • 17. Open Infrastructure for an Open Society: Commercial Clouds Igor Sfiligoi University of California San Diego San Diego Supercomputer Center Fourth National Research Platform (4NRP) – Feb 9th, 2023 1
  • 18. Who cares about Commercial Clouds? • Seems like everyone in industry is moving there! • Not really, but it does look like it • The big players have huge compute capacity • Personally verified I can access 50k GPUs • Others demonstrated access to several million CPU cores • They have a large variety of compute resources • Many x86 variants and several ARM CPUs • Many GPU variants • AI accelerators and FPGAs • Great networking setups (both WAN and HPC-class LAN/Infiniband) 2
  • 19. Our “famous” Cloud burst 3 From 0 to 50k GPUs in about 2 hours
  • 20. Who cares about Commercial Clouds? • Seems like everyone in industry is moving there! • Not really, but it does look like it • The big players have huge compute capacity • Personally verified I can access 50k GPUs • Others demonstrated access to several million CPU cores • They have a large variety of compute resources • Many x86 variants and several ARM CPUs • Many GPU variants • AI accelerators and FPGAs • Great networking setups (both WAN and HPC-class LAN/Infiniband) 4
  • 21. Often have new HW available before you can buy it 5 Also, Cloud-exclusive HW variants • CPUs • INTEL Saphire Rapids available on Google Cloud now • AMD EPYC Milan-X available on Azure now • AMD EPYC Genoa in preview • NVIDIA GPUs • A10s were available in AWS in 2021 • ARM CPUs • AWS has its own ARM CPU • Azure and Google regular one • AI Accelerators • AWS has Inferentia • Google has TPUs • AWS also offers Habana Gaudi • FPGAs • AWS had FPGAs since forever
  • 22. Who cares about Commercial Clouds? • Seems like everyone in industry is moving there! • Not really, but it does look like it • The big players have huge compute capacity • Personally verified I can access 50k GPUs • Others demonstrated access to several million CPU cores • They have a large variety of compute resources • Many x86 variants and several ARM CPUs • Many GPU variants • AI accelerators and FPGAs • Great networking setups (both WAN and HPC-class LAN/Infiniband) 6
  • 23. 7 Azure HPC instances offer 1.6 Tbps Infiniband per-node networking A couple extremes
  • 24. Pros and cons of Commercial Clouds • Pros: • See previous slide • No need to go through allocation processes… all you need is money • Cons: • You need money • And lots of it • ”Regular”, on-demand Cloud computing is expensive • Anywhere between 3x and 10x what you would pay on-prem on 24/7 basis • Spot pricing is almost comparable to on-prem, but only useful for preemptible work • Easy to get in, hard to get out • Pricing optimized to let data get in cheaply, but expensive to move out • No automatic price caps, easy to overspend 8
  • 25. 9 vs Uber/Lyft Public transit Both will get you from A to B Which one would you pick?
  • 26. 10 vs Private jet Commercial airline Ticket bought 2 months in advance in economy class through your travel department Both will get you from A to B Which one would you pick?
  • 27. Who should consider Commercial Cloud? • Flexible/urgent computing • Hard to beat the scalability of the clouds • Costs acceptable for short spikes • Prototyping, R&D • The variety of HW available in the clouds is hard to match • Instant access, no-contention drastically raises productivity • Ultra-High-Availability services • Hard to beat the breath of Cloud deployments • Many large datacenters, proven track record 11
  • 28. Is Commercial Cloud easy to use? • Yes and no • Provide enormous flexibility • You can do virtually everything you could do with your personal server • But that can be daunting for non-IT users • Lots of support services • No need to reinvent the wheel, just pick one • Finding what you need can be a challenge, lots of competing options • Cloud providers invest a lot in the user interfaces • More intuitive than anything you will find on-prem • But each provider has its own flavor • How do you mix on-prem and Cloud resources? 12
  • 29. Facilitating Cloud access for science users • CloudBank • Account management and monitoring (I love their spend/budget tracking!) • Extensive documentation/training • Integrate with OSG/PATh/HTCondor ecosystem • IT-savvy support staff can easily add cloud resources to a HTCondor pool • Users see only HTCondor, cloud HW no different that on-prem HW • Kubernetes (k8s) to the rescue • All Cloud Providers expose a Kubernetes interface, too • Cloud k8s feels like on-prem k8s (at least for compute) • Kubernetes federation can make it completely transparent, e.g. from Nautilus 13