SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Accelerating Science Discovery
with AI Inference-as-a-Service
High Energy Physics and Gravitational Wave Showcases
Shih-Chieh Hsu
University of Washington
Fourth National Research Platform (4NRP)
Feb 9 2023
San Diego Supercomputer Center
https://a3d3.ai/
OAC-2117997
NSF HDR Institute A3D3
Accelerated Artificial Intelligence Algorithms for Data-Driven Discovery
Our vision is to establish a tightly coupled organization of
domain scientists, computer scientists, and engineers that
unite three core components which are essential to achieve real-
time AI to transform science and engineering discoveries.
NSF Harnessing the Data Revolution (HDR)
● A3D3 is among a much bigger Ecosystem
● A national-scale activity to enable new
modes of data-driven discovery that will
address fundamental questions at the
frontiers of science and engineering.
● Three parallel tracks (~70 awards,~$200M)
○ Institutes
■ Ideas Labs+Framework (28)
[NSF 19-543][NSF 19-549] $52.8M
■ Institutes (5) [NSF 21-519] $78.5M
○ TRIPODS
■ Phase I (15) [NSF 19-550] $22.2M
■ Phase II (2) [NSF 21-604] $20M
○ DSC (19)
■ [NSF 19-518] [NSF 21-604] $25.4M
Amy Walton Oct 26
HDR
A Nationwide Institute
79 Members/10 institutions:
● 17 Senior Personnel
● 3 Research Scientists
● 11 Postdocs
● 27 PhD
● 3 Master
● 12 Undergrad
● 4 Postbacs (Sum ‘22)
● 1 High School
$15M for 5 years since 2021
$1.25M supplement to empower
HDR Ecosystem
trainees.html
S. Hauck
S.-C. Hsu
A. Orsborn E. Shlizerman
M. Coughlin P. Harris E. Katsavounidis
S. Han
K. Hanson
M. Neubauer D. Chen
K.Scholberg
J. Duart
M. Graham
M. Liu M. D. Makin
P. Li
Director Deputy Dir.
co-PI
co-PI
co-PI
Trending of big data volume
• Next-generation experiments will outpace industry data volumes
APS/Alan Stonebraker and V. Gülzow/DESY
HL-LHC projection
CMS Twiki
ATLAS twiki
● Substantial continued software R&D improvements starting to fit into optimistic resource regions
● Similar story for disk and tape
● Memory, network projects are uncertain but undeniably finite resources
Increasing complexity of data
CMS High Granularity Calorimeter
w/ 200 simultaneous pp collisions
Global LIGO-VIRGO-KAGRA Gravitational Wave
detection and parameter inference analysis
Cross-disciplinary challenge
Revolution of AI
AI algorithms have the ability to go beyond algorithms
- Using low level features with deep neural networks and more
advanced data structures lead to long latency
Revolution of AI
E. Moreno et al. Phys. Rev. D 102, 012010 (2020)
AI algorithms have the ability to go beyond algorithms
- Using low level features with deep neural networks and more
advanced data structures lead to long latency
AI algorithms can naturally be accelerated by coprocessors.
The question is HOW!
Direct connect
● Simplest form of coprocessor
implementation
● Difficult to scale up
● User needs to know coprocessor knowledge
in some details
as-a-Service Connection
● Simplest support for mixed hardware
● Scalable
● Throughput optimizations for multiple-client
● Simple client side
FastML Lab
https://fastmachinelearning.org/
Machine Learning as-a-Service
D Rankin’s talk
J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13
D. Rankin et. al., H2RC51942.2020.00010
Brainwave
EC2 F1
OAC-1904444
Machine Learning as-a-Service
D Rankin’s talk
J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13
D. Rankin et. al., H2RC51942.2020.00010
Brainwave
EC2 F1
M. Wang et al., fdata.2020.604083
J. Krupa, MLST 2 (2021) 035005
OAC-1904444
Machine Learning as-a-Service
D Rankin’s talk
J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13
D. Rankin et. al., H2RC51942.2020.00010
Brainwave
EC2 F1
M. Wang et al., fdata.2020.604083
J. Krupa, MLST 2 (2021) 035005
Y. Feng et. al., ML Acce@MIT 2023
OAC-1904444
Machine Learning as-a-Service
Use NVidia triton inference server
for GPU + Customized GCP
Kubernetes
SONIC for HEP
https://github.com/hls-fpga-machine-learning/SonicCMS
Hermes for Gravitational Wave
https://github.com/ML4GW/hermes
aaS for High-Level Trigger and Offline Reconstruction
19
Scalability Test
Benchmarks
Calorimeter
Energy regression
3 layer MLP
2k parameters
Top jet classification
Large CNN
10M parameters
Heterogeneous computing performance comparison
● FPGA-aaST greatly outperfoms GPU-aaS for FACILE
○ Small network, large batch is ideally suited for FPGA
Heterogeneous computing performance comparison
● FPGA-aaST greatly outperfoms GPU-aaS for FACILE
○ Small network, large batch is ideally suited for FPGA
● Comparable performance between FPGA-aaST and GPU-aaS for ResNet
CMS MiniAOD
MiniAOD derivation = step in offline
processing
● Large-scale tests of SONIC-
enabled workflows in Google
Cloud :~100 GPUs/
~10,000 CPU cores
● Achieved ~10% speed up relative
to running all jobs on CPU
● Optimized CPU:GPU ratio of 32:1
Three ML algorithms
in workflow (10% per-
event latency):Jet
tagger, MET
regression, Tau ID
P. McCormack
ProtoDune-Single Phase 1kt LAr-TPC
● SONIC framework has been implemented
to enable use of CNN for track vs. EM
cascade discrimination (EmTrkMichelId)
● A 100 GPU cloud server with Kubernetes
load balancer was used to process 6.4
million out of 7.2 million events from 2018
● SONIC accelerates ML inference for
ProtoDUNE reconstruction
○ – 2.7x speed up of full ProtoDUNE workflow
● Optimal: 1 T4 GPU per 68 CPU
T. Yang
A real-time monitoring view
Gravitational Wave
Nature Astronomy 6(2022) 529
NVIDIA V100 32GB GPUs+32
vCPUs+6 concurrent execution
Deploy DeepClean to remove noise from roughly a month’s worth of strain data from the O3
observing run of the LIGO-Virgo instruments
X10
X5
IaaS CPU
IaaS GPU
FACILE @ NRP with FaaST [2010.08556]
● Original studies with FaaST leveraged
large DDR4 memory banks
● NRP equipped with Alveo U55Cs,
contain new high bandwidth memory
(HBM)
○ Understanding how to most efficiently use
HBM memory to transfer data is vital for
high throughput applications
○ Only really possible with physical cards at
NRP!
NRP - a wonderful playground for aaS R&D
● CMS: High-Level Trigger / HPC:
● miniAOD-as-a-Service (offline integration)
● HLT-as-a-service(concept demo)
● ATLAS:
○ AthenaTriton
○ Simulation: FastCaloSimGPU-as-a-Serve
● Clustering: SPVCNN Calorimeter and Vertex
● Tracking: ACTSExaTrkX-as-a-Service
● LIGO-Virgo-KAGRA
○ ML4GW/HERMES: Inference-as-a-Service in the upcoming 4th run for hardware denoising and Gravitational
Wave detection
● Zwicky Transient Facility
○ NMMA: ML-assisted follow-up for GW counterparts due to kilonovae or Gamma-Ray Burst afterglow
● Neural science
○ Large-scale brain recording & behavioral monitoring
Accelerating Physics with ML@MIT, Jan 2023
Summary
● AI as a service shifts paradigm of real time AI processing and offline
processing
○ We have demonstrate promising acceleration for LHC experiment and LIGO
○ It has been used in the ProtoDune-SP data reprocessing
● Influence on system like at NRP is crucial for future tests (multiple cards,
multiple algorithms)
29
https://a3d3.ai/jobs.html
Backup
Community engagement and training
● bring together developers and stakeholders with an interest in fully integrating
machine learning-based tools from experiment to physics analyses and results
Accelerating Physics with ML@MIT, Jan
2023
https://indico.cern.ch/event/1224718/
Fast Machine Learning workshop, Oct 2022
A3D3 for Machine Learning Challenge
● A3D3 receives $1.25M supplement grant. One of the activities is to lead Machine Learning Challenge
for the NSF HDR Ecosystem and looking for collaboration with HEP community
● Aim is to make a series of datasets released to public and explore common ML and data approaches
a. Use these datasets to make a set of ML Challenges
b. Use for education, training and outreach
c. Engagement with industry partners to ensure challenges are aligned with real-world applications
(training and professional development pipeline)
● We are lacking a clear framework for testing and validation
a. There are potentially a few options:
i. Hugging Face
ii. https://www.modelshare.org/
● We are looking for building strong connections with MLCommons Science, FAIR4HEP and FAIR-
Universe.
FACILE @ NRP with FaaST [2010.08556]
● FACILE algorithm for calorimeter
energy reconstruction from
overlapping pulses in CMS
hadronic calorimeter (HCAL)
● 3-layer MLP, 2k parameters,
necessary to run 16k times per
event (large inherent batch)
● 15 ms latency on CPU, 2 ms
on GPU (8x), 0.2 ms on FPGA
(80x)
D. Rankin
FACILE @ NRP with FaaST [2010.08556]
● FACILE algorithm for calorimeter
energy reconstruction from
overlapping pulses in CMS
hadronic calorimeter (HCAL)
● 3-layer MLP, 2k parameters,
necessary to run 16k times per
event (large inherent batch)
● 15 ms latency on CPU, 2 ms
on GPU (8x), 0.2 ms on FPGA
(80x)

Weitere ähnliche Inhalte

Ähnlich wie Panel: NRP Science Impacts​

Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitGanesan Narayanasamy
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...Larry Smarr
 
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...Larry Smarr
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformLarry Smarr
 
Convergence of Machine Learning, Big Data and Supercomputing
Convergence of Machine Learning, Big Data and SupercomputingConvergence of Machine Learning, Big Data and Supercomputing
Convergence of Machine Learning, Big Data and SupercomputingDESMOND YUEN
 
The Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPCThe Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPCinside-BigData.com
 
OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22marpierc
 
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
OpenACC and Open Hackathons Monthly Highlights May  2023.pdfOpenACC and Open Hackathons Monthly Highlights May  2023.pdf
OpenACC and Open Hackathons Monthly Highlights May 2023.pdfOpenACC
 
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIArm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIinside-BigData.com
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC
 
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...Facultad de Informática UCM
 
OpenACC and Open Hackathons Monthly Highlights June 2022.pdf
OpenACC and Open Hackathons Monthly Highlights June 2022.pdfOpenACC and Open Hackathons Monthly Highlights June 2022.pdf
OpenACC and Open Hackathons Monthly Highlights June 2022.pdfOpenACC
 
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptxOpenACC and Open Hackathons Monthly Highlights: September 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptxOpenACC
 
Performance Characterization and Optimization of In-Memory Data Analytics on ...
Performance Characterization and Optimization of In-Memory Data Analytics on ...Performance Characterization and Optimization of In-Memory Data Analytics on ...
Performance Characterization and Optimization of In-Memory Data Analytics on ...Ahsan Javed Awan
 
OpenACC and Hackathons Monthly Highlights
OpenACC and Hackathons Monthly HighlightsOpenACC and Hackathons Monthly Highlights
OpenACC and Hackathons Monthly HighlightsOpenACC
 
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC
 

Ähnlich wie Panel: NRP Science Impacts​ (20)

AI Super computer update
AI Super computer update AI Super computer update
AI Super computer update
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on Summit
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...
Andrew Wiedlea - Wireless FasterData and Distributed Open Compute Opportuniti...
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research Platform
 
Convergence of Machine Learning, Big Data and Supercomputing
Convergence of Machine Learning, Big Data and SupercomputingConvergence of Machine Learning, Big Data and Supercomputing
Convergence of Machine Learning, Big Data and Supercomputing
 
The Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPCThe Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPC
 
OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22
 
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
OpenACC and Open Hackathons Monthly Highlights May  2023.pdfOpenACC and Open Hackathons Monthly Highlights May  2023.pdf
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
 
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIArm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020
 
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
 
OpenACC and Open Hackathons Monthly Highlights June 2022.pdf
OpenACC and Open Hackathons Monthly Highlights June 2022.pdfOpenACC and Open Hackathons Monthly Highlights June 2022.pdf
OpenACC and Open Hackathons Monthly Highlights June 2022.pdf
 
grid computing
grid computinggrid computing
grid computing
 
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptxOpenACC and Open Hackathons Monthly Highlights: September 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: September 2022.pptx
 
Performance Characterization and Optimization of In-Memory Data Analytics on ...
Performance Characterization and Optimization of In-Memory Data Analytics on ...Performance Characterization and Optimization of In-Memory Data Analytics on ...
Performance Characterization and Optimization of In-Memory Data Analytics on ...
 
OpenACC and Hackathons Monthly Highlights
OpenACC and Hackathons Monthly HighlightsOpenACC and Hackathons Monthly Highlights
OpenACC and Hackathons Monthly Highlights
 
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
 

Mehr von Larry Smarr

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsLarry Smarr
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Larry Smarr
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsLarry Smarr
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Larry Smarr
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonLarry Smarr
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewLarry Smarr
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksLarry Smarr
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownLarry Smarr
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questionsLarry Smarr
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharLarry Smarr
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Larry Smarr
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardLarry Smarr
 
Global Network Advancement Group Next Generation Network-Integrated Sys...
      Global Network Advancement GroupNext Generation Network-Integrated Sys...      Global Network Advancement GroupNext Generation Network-Integrated Sys...
Global Network Advancement Group Next Generation Network-Integrated Sys...Larry Smarr
 
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical NetworksRobert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical NetworksLarry Smarr
 

Mehr von Larry Smarr (20)

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 Years
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical Networks
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine Brown
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questions
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forward
 
Global Network Advancement Group Next Generation Network-Integrated Sys...
      Global Network Advancement GroupNext Generation Network-Integrated Sys...      Global Network Advancement GroupNext Generation Network-Integrated Sys...
Global Network Advancement Group Next Generation Network-Integrated Sys...
 
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical NetworksRobert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
Robert Kwon: Panel - Future Wireless Extensions of Regional Optical Networks
 

Kürzlich hochgeladen

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 

Kürzlich hochgeladen (20)

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 

Panel: NRP Science Impacts​

  • 1. Accelerating Science Discovery with AI Inference-as-a-Service High Energy Physics and Gravitational Wave Showcases Shih-Chieh Hsu University of Washington Fourth National Research Platform (4NRP) Feb 9 2023 San Diego Supercomputer Center https://a3d3.ai/ OAC-2117997
  • 2. NSF HDR Institute A3D3 Accelerated Artificial Intelligence Algorithms for Data-Driven Discovery
  • 3. Our vision is to establish a tightly coupled organization of domain scientists, computer scientists, and engineers that unite three core components which are essential to achieve real- time AI to transform science and engineering discoveries.
  • 4. NSF Harnessing the Data Revolution (HDR) ● A3D3 is among a much bigger Ecosystem ● A national-scale activity to enable new modes of data-driven discovery that will address fundamental questions at the frontiers of science and engineering. ● Three parallel tracks (~70 awards,~$200M) ○ Institutes ■ Ideas Labs+Framework (28) [NSF 19-543][NSF 19-549] $52.8M ■ Institutes (5) [NSF 21-519] $78.5M ○ TRIPODS ■ Phase I (15) [NSF 19-550] $22.2M ■ Phase II (2) [NSF 21-604] $20M ○ DSC (19) ■ [NSF 19-518] [NSF 21-604] $25.4M Amy Walton Oct 26 HDR
  • 5. A Nationwide Institute 79 Members/10 institutions: ● 17 Senior Personnel ● 3 Research Scientists ● 11 Postdocs ● 27 PhD ● 3 Master ● 12 Undergrad ● 4 Postbacs (Sum ‘22) ● 1 High School $15M for 5 years since 2021 $1.25M supplement to empower HDR Ecosystem trainees.html S. Hauck S.-C. Hsu A. Orsborn E. Shlizerman M. Coughlin P. Harris E. Katsavounidis S. Han K. Hanson M. Neubauer D. Chen K.Scholberg J. Duart M. Graham M. Liu M. D. Makin P. Li Director Deputy Dir. co-PI co-PI co-PI
  • 6. Trending of big data volume • Next-generation experiments will outpace industry data volumes APS/Alan Stonebraker and V. Gülzow/DESY
  • 7. HL-LHC projection CMS Twiki ATLAS twiki ● Substantial continued software R&D improvements starting to fit into optimistic resource regions ● Similar story for disk and tape ● Memory, network projects are uncertain but undeniably finite resources
  • 8. Increasing complexity of data CMS High Granularity Calorimeter w/ 200 simultaneous pp collisions Global LIGO-VIRGO-KAGRA Gravitational Wave detection and parameter inference analysis
  • 10. Revolution of AI AI algorithms have the ability to go beyond algorithms - Using low level features with deep neural networks and more advanced data structures lead to long latency
  • 11. Revolution of AI E. Moreno et al. Phys. Rev. D 102, 012010 (2020) AI algorithms have the ability to go beyond algorithms - Using low level features with deep neural networks and more advanced data structures lead to long latency AI algorithms can naturally be accelerated by coprocessors. The question is HOW!
  • 12. Direct connect ● Simplest form of coprocessor implementation ● Difficult to scale up ● User needs to know coprocessor knowledge in some details
  • 13. as-a-Service Connection ● Simplest support for mixed hardware ● Scalable ● Throughput optimizations for multiple-client ● Simple client side FastML Lab https://fastmachinelearning.org/
  • 14. Machine Learning as-a-Service D Rankin’s talk J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13 D. Rankin et. al., H2RC51942.2020.00010 Brainwave EC2 F1 OAC-1904444
  • 15. Machine Learning as-a-Service D Rankin’s talk J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13 D. Rankin et. al., H2RC51942.2020.00010 Brainwave EC2 F1 M. Wang et al., fdata.2020.604083 J. Krupa, MLST 2 (2021) 035005 OAC-1904444
  • 16. Machine Learning as-a-Service D Rankin’s talk J. Duarte et al., Com. Soft. Big Sci. 3 (2019) 13 D. Rankin et. al., H2RC51942.2020.00010 Brainwave EC2 F1 M. Wang et al., fdata.2020.604083 J. Krupa, MLST 2 (2021) 035005 Y. Feng et. al., ML Acce@MIT 2023 OAC-1904444
  • 17. Machine Learning as-a-Service Use NVidia triton inference server for GPU + Customized GCP Kubernetes SONIC for HEP https://github.com/hls-fpga-machine-learning/SonicCMS Hermes for Gravitational Wave https://github.com/ML4GW/hermes
  • 18. aaS for High-Level Trigger and Offline Reconstruction
  • 20. Benchmarks Calorimeter Energy regression 3 layer MLP 2k parameters Top jet classification Large CNN 10M parameters
  • 21. Heterogeneous computing performance comparison ● FPGA-aaST greatly outperfoms GPU-aaS for FACILE ○ Small network, large batch is ideally suited for FPGA
  • 22. Heterogeneous computing performance comparison ● FPGA-aaST greatly outperfoms GPU-aaS for FACILE ○ Small network, large batch is ideally suited for FPGA ● Comparable performance between FPGA-aaST and GPU-aaS for ResNet
  • 23. CMS MiniAOD MiniAOD derivation = step in offline processing ● Large-scale tests of SONIC- enabled workflows in Google Cloud :~100 GPUs/ ~10,000 CPU cores ● Achieved ~10% speed up relative to running all jobs on CPU ● Optimized CPU:GPU ratio of 32:1 Three ML algorithms in workflow (10% per- event latency):Jet tagger, MET regression, Tau ID P. McCormack
  • 24. ProtoDune-Single Phase 1kt LAr-TPC ● SONIC framework has been implemented to enable use of CNN for track vs. EM cascade discrimination (EmTrkMichelId) ● A 100 GPU cloud server with Kubernetes load balancer was used to process 6.4 million out of 7.2 million events from 2018 ● SONIC accelerates ML inference for ProtoDUNE reconstruction ○ – 2.7x speed up of full ProtoDUNE workflow ● Optimal: 1 T4 GPU per 68 CPU T. Yang A real-time monitoring view
  • 25. Gravitational Wave Nature Astronomy 6(2022) 529 NVIDIA V100 32GB GPUs+32 vCPUs+6 concurrent execution Deploy DeepClean to remove noise from roughly a month’s worth of strain data from the O3 observing run of the LIGO-Virgo instruments X10 X5 IaaS CPU IaaS GPU
  • 26. FACILE @ NRP with FaaST [2010.08556] ● Original studies with FaaST leveraged large DDR4 memory banks ● NRP equipped with Alveo U55Cs, contain new high bandwidth memory (HBM) ○ Understanding how to most efficiently use HBM memory to transfer data is vital for high throughput applications ○ Only really possible with physical cards at NRP!
  • 27. NRP - a wonderful playground for aaS R&D ● CMS: High-Level Trigger / HPC: ● miniAOD-as-a-Service (offline integration) ● HLT-as-a-service(concept demo) ● ATLAS: ○ AthenaTriton ○ Simulation: FastCaloSimGPU-as-a-Serve ● Clustering: SPVCNN Calorimeter and Vertex ● Tracking: ACTSExaTrkX-as-a-Service ● LIGO-Virgo-KAGRA ○ ML4GW/HERMES: Inference-as-a-Service in the upcoming 4th run for hardware denoising and Gravitational Wave detection ● Zwicky Transient Facility ○ NMMA: ML-assisted follow-up for GW counterparts due to kilonovae or Gamma-Ray Burst afterglow ● Neural science ○ Large-scale brain recording & behavioral monitoring Accelerating Physics with ML@MIT, Jan 2023
  • 28. Summary ● AI as a service shifts paradigm of real time AI processing and offline processing ○ We have demonstrate promising acceleration for LHC experiment and LIGO ○ It has been used in the ProtoDune-SP data reprocessing ● Influence on system like at NRP is crucial for future tests (multiple cards, multiple algorithms)
  • 31. Community engagement and training ● bring together developers and stakeholders with an interest in fully integrating machine learning-based tools from experiment to physics analyses and results Accelerating Physics with ML@MIT, Jan 2023 https://indico.cern.ch/event/1224718/ Fast Machine Learning workshop, Oct 2022
  • 32. A3D3 for Machine Learning Challenge ● A3D3 receives $1.25M supplement grant. One of the activities is to lead Machine Learning Challenge for the NSF HDR Ecosystem and looking for collaboration with HEP community ● Aim is to make a series of datasets released to public and explore common ML and data approaches a. Use these datasets to make a set of ML Challenges b. Use for education, training and outreach c. Engagement with industry partners to ensure challenges are aligned with real-world applications (training and professional development pipeline) ● We are lacking a clear framework for testing and validation a. There are potentially a few options: i. Hugging Face ii. https://www.modelshare.org/ ● We are looking for building strong connections with MLCommons Science, FAIR4HEP and FAIR- Universe.
  • 33. FACILE @ NRP with FaaST [2010.08556] ● FACILE algorithm for calorimeter energy reconstruction from overlapping pulses in CMS hadronic calorimeter (HCAL) ● 3-layer MLP, 2k parameters, necessary to run 16k times per event (large inherent batch) ● 15 ms latency on CPU, 2 ms on GPU (8x), 0.2 ms on FPGA (80x) D. Rankin
  • 34. FACILE @ NRP with FaaST [2010.08556] ● FACILE algorithm for calorimeter energy reconstruction from overlapping pulses in CMS hadronic calorimeter (HCAL) ● 3-layer MLP, 2k parameters, necessary to run 16k times per event (large inherent batch) ● 15 ms latency on CPU, 2 ms on GPU (8x), 0.2 ms on FPGA (80x)

Hinweis der Redaktion

  1. Large Hadron Collider (LHC) and the Square Kilometre Array (SKA)
  2. Are there things around you in the ecosystem that do not exist, but if they were there it would make your project more impactful or increase its chances of success? Kyle has good connections through iris-hep with Hugging-Face and ModelShare This could connect well with Reana