HIGH-PERFORMANCE INPUT PIPELINES FOR SCALABLE DEEP LEARNING
Joshua Robinson
Pure Storage
© 2019 PURE STORAGE INC.
QUESTION ON EVERYONE’S MIND:
WHY IS A STORAGE COMPANY HERE?
“We don’t have better algorithms, we just have more data”
PETER NORVIG, Engineering Director, Google
The AI “Hierarchy of Needs”
credit: Monica Rogati
ML algorithms: linear & logistic regression, k-means clustering, decision trees, etc.
Validation: A/B testing, detecting model drift over time
Data preparation: cleaning, feature identification, exploration, etc.
Data acquisition: ingest, transformation, and representation of data for analysis
THIS IS NOT THE FIRST AI HYPE WAVE
[Timeline, 1950 to 2020: Birth of AI, AI winter I, Re-birth I, AI winter II, Re-birth II]
Common themes: compute and data couldn’t match needs of problems being hyped
Common themes: focus on specific problems where available compute & data are sufficient
DEEP LEARNING = MASSIVE DATA & COMPUTE
[Chart: accuracy versus data & compute, comparing deep learning with previous methods]
STATE-OF-THE-ART RESULTS ACROSS VISION, SPEECH, LANGUAGE, AND MORE
Sources: https://arxiv.org/abs/1506.01497; https://arxiv.org/abs/1703.06870; https://shubhangdesai.github.io/blog/Neural-Style.html; https://cs.stanford.edu/people/karpathy/cnnembed/
THE INTUITION BEHIND DEEP LEARNING
[Diagram: an image passes through a deep neural net whose layers pick out primitives, rough shapes, then macro features; outputs Pr{dog} = 0.903, Pr{cat} = 0.072, …, giving the label “dog”]
TRAINING A DEEP NEURAL NETWORK
[Diagram: training loop of evaluate, compute gradients, apply gradients, over layers for primitives, rough shapes, and macro features; example outputs Pr{dog} = 0.903, Pr{cat} = 0.072, …]
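The three steps named on this slide (evaluate, compute gradients, apply gradients) can be sketched on a toy problem. This is a minimal illustration, not the network from the talk: the one-weight model, the data, the learning rate, and all function names are assumptions chosen to keep the loop readable.

```python
# Toy model: predict y = w * x. Everything here is illustrative.

def evaluate(w, batch):
    """Forward pass: mean squared error of the model on a batch."""
    return sum((w * x - y) ** 2 for x, y in batch) / len(batch)

def compute_gradient(w, batch):
    """Backward pass: derivative of the mean squared error with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def apply_gradient(w, grad, learning_rate=0.05):
    """Update step: move the weight against the gradient."""
    return w - learning_rate * grad

def train(batch, steps=200, w=0.0):
    """Repeat evaluate -> compute gradients -> apply gradients."""
    loss = evaluate(w, batch)
    for _ in range(steps):
        grad = compute_gradient(w, batch)
        w = apply_gradient(w, grad)
        loss = evaluate(w, batch)
    return w, loss

data = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0)]  # generated by y = 3x
w, loss = train(data)
print(f"learned w = {w:.4f}, final loss = {loss:.2e}")
```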
DISTRIBUTED TRAINING
[Diagram: three parallel workers, each running evaluate, compute gradients, apply gradients, with a shared merge gradients step between computing and applying; axis label: # GPUs]
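The "merge gradients" step can be sketched as an average of per-worker gradients. This is a plain-Python imitation under stated assumptions: the workers are simulated in one process, the toy model and data are hypothetical, and the merge is a simple mean rather than any specific library's all-reduce.

```python
# Simulated data-parallel step: each "GPU" sees one shard of the batch.

def compute_gradient(w, shard):
    """Per-worker gradient of mean squared error for the model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in shard) / len(shard)

def merge_gradients(grads):
    """Average the per-worker gradients (the result an all-reduce mean would give)."""
    return sum(grads) / len(grads)

def distributed_step(w, shards, learning_rate=0.05):
    grads = [compute_gradient(w, shard) for shard in shards]  # one per worker
    merged = merge_gradients(grads)
    return w - learning_rate * merged  # every worker applies the same update

data = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0), (4.0, 12.0)]
shards = [data[:2], data[2:]]          # two equally sized shards

w_distributed = distributed_step(0.0, shards)
w_single = distributed_step(0.0, [data])
print(w_distributed, w_single)
```

With equally sized shards, the averaged gradient matches the gradient of the full batch, so the two calls take the same step.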
MORE, FASTER GPUs + MORE DATA
CAN WE KEEP GPUs FED WITH DATA?
INPUT PIPELINE = POTENTIAL BOTTLENECK
INPUT PIPELINES
CAN IT BE THAT SIMPLE?
Source: K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition, CVPR 2016
REAL INPUT PIPELINES
CAN YOU SPOT THE BOTTLENECK?
Source: K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition, CVPR 2016
FROM IMAGES TO TENSORS
[Figure, built up over six slides: example images labeled PLANE, DOG, BOAT, and CAT move through the steps below]
1. Enumerate
2. Associate labels
3. Shuffle
4. Read, crop, distort
5. Copy to GPU
ANY OF THESE STEPS CAN BE
A POTENTIAL BOTTLENECK
Other domains (NLP, speech, etc.)
will follow a similar(ish) flow
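The five steps can be sketched end to end in plain Python. This is a stand-in under loud assumptions: the file names are hypothetical, "decoding" an image is faked with a short list of numbers, and the copy to GPU is not modeled at all. It shows only where each step sits in the flow, not how the benchmark implements it.

```python
import random

def enumerate_files(listing):
    """Step 1: enumerate. The 'filesystem' is a dict of class name -> file names."""
    for class_name, names in listing.items():
        for name in names:
            yield class_name, name

def associate_labels(items, class_ids):
    """Step 2: attach an integer label to each file."""
    return [(name, class_ids[class_name]) for class_name, name in items]

def shuffle(examples, seed):
    """Step 3: shuffle the example order (seeded so runs repeat)."""
    examples = list(examples)
    random.Random(seed).shuffle(examples)
    return examples

def read_crop_distort(name, rng, size=4):
    """Step 4 (stand-in): fake decode, random crop, random flip."""
    pixels = [(sum(map(ord, name)) + i) % 256 for i in range(8)]  # fake decode
    start = rng.randrange(len(pixels) - size + 1)                 # random crop
    crop = pixels[start:start + size]
    if rng.random() < 0.5:                                        # random flip
        crop.reverse()
    return crop

def batches(examples, batch_size, seed=0):
    """Steps 4-5: preprocess and group into batches. A real pipeline would
    copy each batch to the GPU here; this sketch just yields a list."""
    rng = random.Random(seed)
    batch = []
    for name, label in examples:
        batch.append((read_crop_distort(name, rng), label))
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch

listing = {"plane": ["p0.jpg", "p1.jpg"], "dog": ["d0.jpg", "d1.jpg"],
           "boat": ["b0.jpg"], "cat": ["c0.jpg"]}
class_ids = {"plane": 0, "dog": 1, "boat": 2, "cat": 3}

examples = shuffle(associate_labels(enumerate_files(listing), class_ids), seed=42)
all_batches = list(batches(examples, batch_size=4))
```

Each function is a separate stage, so any one of them can be timed or swapped out when looking for the slow step.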
EVALUATION METHODOLOGY
1.3M images, 1000 categories
HARDWARE STACK
AIRI
4x NVIDIA DGX-1, each with:
8x Tesla V100 GPUs (SXM2)
2x Intel E5-2698 v4 @ 2.20GHz
4x Mellanox MT27700 100Gb/s VPI adapters
512GB DDR4-2400
Pure Storage FlashBlade: 15x17TB
179 TB usable before data reduction
Arista DCS-7060CX2-32S
32x 100Gb/s QSFP100 ports
Network links: 40Gb Ethernet; 100Gb Ethernet w/ RDMA (RoCE)
SOFTWARE STACK
nvcr.io/nvidia/tensorflow:17.12
DGX-OS (Ubuntu 16.04)
CUDA 9.0, NCCL 2.1.2, CUDNN v7
OpenMPI 3.0
TensorFlow 1.4.0+
Horovod
alsrgv/tf_cnn_benchmarks
Using TensorFlow “Datasets” API for input pipelines
TRAINING WITH 1 GPU
Images per second when training Inception3 (batch size = 64)
How do we know what good looks like? Replace the input pipeline with synthetic data.
“Default” training pipeline (input pipeline, forward, backward): 216 i/s
Synthetic (forward, backward): 228 i/s
TRAINING WITH 1 GPU
Images per second when training Inception3 (batch size = 64)
“Default” training pipeline (input pipeline, forward, backward): 216 i/s
Defaults + Prefetch: 225 i/s
Synthetic: 228 i/s
Adding a prefetch queue improves scheduler behavior
SHOULD WE CARE ABOUT 5%?
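A prefetch queue can be sketched with a bounded queue and a background thread, so the next batch is prepared while the current one is consumed. This is a generic illustration, not the benchmark's code: `prefetch`, `input_pipeline`, and the buffer size are hypothetical names and values. In TensorFlow's `tf.data` API the corresponding call is `Dataset.prefetch`.

```python
import queue
import threading

def prefetch(iterable, buffer_size=2):
    """Yield items from `iterable`, produced ahead of time by a background thread.

    Sketch only: errors raised inside the producer are not forwarded, and the
    producer stays blocked if the consumer stops reading early.
    """
    buf = queue.Queue(maxsize=buffer_size)
    done = object()  # sentinel marking the end of the stream

    def producer():
        for item in iterable:
            buf.put(item)   # blocks while the buffer is full
        buf.put(done)

    threading.Thread(target=producer, daemon=True).start()
    while True:
        item = buf.get()    # waits only if the producer has fallen behind
        if item is done:
            return
        yield item

def input_pipeline(n):
    """Stand-in for read/crop/distort: yields n small 'batches'."""
    for i in range(n):
        yield [i] * 4

results = [sum(batch) for batch in prefetch(input_pipeline(5))]
print(results)
```

The bounded buffer keeps the producer from running arbitrarily far ahead, which caps memory use at `buffer_size` batches.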
SCALING TO 32 GPUs (4x DGX-1s)
Images per second when training Inception3 (batch size = 64/GPU)
Defaults: 4143 i/s
+ Prefetch: 5335 i/s
+ Thread Pool Limit: 5527 i/s
- Distortions: 6440 i/s
Synthetic: 6580 i/s
Linear: 7200 i/s
Thread pool limits: Avoid over-subscribing CPU with too many threads (inter_op_parallelism_threads).
No Distortions: Skip preprocessing step from input pipeline. This is an unrealistic configuration, but it shows the bottleneck.
EXCELLENT SCALABILITY, BUT STILL MORE WORK TO BE DONE
42% gap!
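The slide does not say which two numbers the 42% gap compares. The arithmetic below is one reading, stated as an assumption: that "Linear" (7200 i/s) is the single-GPU prefetch rate of 225 i/s times 32 GPUs, that 6580 i/s is the synthetic rate, and that the gap is measured from the default pipeline to linear. The numbers are from the slides; the pairing is inferred.

```python
PER_GPU_WITH_PREFETCH = 225   # i/s on one GPU with prefetch (from the 1-GPU slide)
NUM_GPUS = 32

# Measured rates at 32 GPUs, in images per second.
measured = {
    "Defaults": 4143,
    "+ Prefetch": 5335,
    "+ Thread Pool Limit": 5527,
    "- Distortions": 6440,
    "Synthetic": 6580,
}

linear = PER_GPU_WITH_PREFETCH * NUM_GPUS  # assumed origin of the 7200 figure

def gap_percent(rate, ideal):
    """Shortfall of `rate` relative to `ideal`, as a whole-number percentage."""
    return round(100 * (1 - rate / ideal))

gaps = {name: gap_percent(rate, linear) for name, rate in measured.items()}
for name, gap in gaps.items():
    print(f"{name:>20}: {gap}% below linear")
```

Under this reading the default pipeline lands at the slide's 42%, and each tuning step narrows the shortfall.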
2.5X Performance Improvement
SCALE OF REAL-WORLD DATA
ImageNet: 143 GB
Zenuity: 20 PB
SINGLE-GPU TRAINING
[Diagram: training loop of evaluate, compute gradients, apply gradients; example outputs Pr{dog} = 0.903, Pr{cat} = 0.072, …]
LINEAR SCALING FOR CONVNETS
Model        1 DGX-1     2 DGX-1     4 DGX-1
RESNET-50    2540 i/s    4870 i/s    10244 i/s
INCEPTION3   1600 i/s    3160 i/s    6440 i/s
VGG16        1640 i/s    3110 i/s    6300 i/s
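One way to check the "linear" claim is to divide each multi-node rate by the node count times the single-node rate. The sketch below does that with the slide's numbers; the function name and the choice of the 1 DGX-1 result as the baseline are assumptions made for illustration.

```python
# Images per second by number of DGX-1 systems, taken from the slide.
rates = {
    "RESNET-50":  {1: 2540, 2: 4870, 4: 10244},
    "INCEPTION3": {1: 1600, 2: 3160, 4: 6440},
    "VGG16":      {1: 1640, 2: 3110, 4: 6300},
}

def scaling_efficiency(by_nodes):
    """Rate at n nodes divided by n times the single-node rate."""
    base = by_nodes[1]
    return {n: rate / (n * base) for n, rate in by_nodes.items()}

efficiency = {model: scaling_efficiency(r) for model, r in rates.items()}
for model, eff in efficiency.items():
    print(model, {n: f"{e:.0%}" for n, e in eff.items()})
```

A value near 1.0 means throughput grew in proportion to the hardware; values slightly above 1.0 are within what run-to-run variation in the baseline could explain.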
RDMA OVER ETHERNET
RDMA is essential for peak performance
KEEPING GPUs FED WITH DATA
Input queue is full - need more/faster GPUs?
FROM IMAGES TO TENSORS
[Figure: example images labeled PLANE, DOG, BOAT, and CAT]
1. Enumerate
2. Associate labels
3. Crop and distort

 
Applying Noisy Knowledge Graphs to Real Problems
Applying Noisy Knowledge Graphs to Real ProblemsApplying Noisy Knowledge Graphs to Real Problems
Applying Noisy Knowledge Graphs to Real Problems
 


High-Performance Input Pipelines for Scalable Deep Learning

  • 1. HIGH-PERFORMANCE INPUT PIPELINES FOR SCALABLE DEEP LEARNING. Joshua Robinson, Pure Storage
  • 2. © 2019 PURE STORAGE INC. QUESTION ON EVERYONE’S MIND: WHY IS A STORAGE COMPANY HERE?
  • 3. “We don’t have better algorithms, we just have more data” (PETER NORVIG, Engineering Director, Google)
  • 4. The AI “Hierarchy of Needs” (credit: Monica Rogati). ML algorithms: linear & logistic regression, k-means clustering, decision trees, etc. Validation: A/B testing, detecting model drift over time. Data preparation: cleaning, feature identification, exploration, etc. Data acquisition: ingest, transformation, and representation of data for analysis.
  • 5. THIS IS NOT THE FIRST AI HYPE WAVE. Timeline (1950 to 2020): Birth of AI, Re-birth I, Re-birth II; AI winter I, AI winter II. Common themes: compute and data couldn’t match the needs of the problems being hyped. Common themes: focus on specific problems where available compute & data are sufficient.
  • 6. DEEP LEARNING = MASSIVE DATA & COMPUTE. Chart: Accuracy versus Data & Compute, Deep Learning compared with Previous methods. STATE-OF-THE-ART RESULTS ACROSS VISION, SPEECH, LANGUAGE, AND MORE. Sources: https://arxiv.org/abs/1506.01497; https://arxiv.org/abs/1703.06870; https://shubhangdesai.github.io/blog/Neural-Style.html; https://cs.stanford.edu/people/karpathy/cnnembed/
  • 7. THE INTUITION BEHIND DEEP LEARNING. Deep neural net layers: Primitives, Rough shapes, Macro features. Output: Pr{dog} = 0.903, Pr{cat} = 0.072, …, so the prediction is “dog”.
  • 8. TRAINING A DEEP NEURAL NETWORK. Loop: evaluate, compute gradients, apply gradients. Pr{dog} = 0.903, Pr{cat} = 0.072, … Layers: Primitives, Rough shapes, Macro features.
  • 9. DISTRIBUTED TRAINING. Each GPU: evaluate, compute gradients, apply gradients (shown for three GPUs), with a merge gradients step between computing and applying. Axis: # GPUs.
  • 10. MORE, FASTER GPUs + MORE DATA
  • 11. CAN WE KEEP GPUs FED WITH DATA? INPUT PIPELINE = POTENTIAL BOTTLENECK
  • 12. INPUT PIPELINES: CAN IT BE THAT SIMPLE? Source: K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition, CVPR 2016
  • 13. REAL INPUT PIPELINES: CAN YOU SPOT THE BOTTLENECK? Source: K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition, CVPR 2016
  • 14. FROM IMAGES TO TENSORS. Steps: 1. Enumerate. Images: PLANE DOG BOAT CAT
  • 15. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels. Labels: PLANE PLANE PLANE PLANE DOG DOG DOG DOG BOAT BOAT BOAT BOAT CAT CAT CAT CAT
  • 16. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels; 3. Shuffle. Labels: BOAT CAT PLANE PLANE PLANE PLANE DOG DOG DOG DOG BOAT BOAT BOAT CAT CAT CAT
  • 17. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels; 3. Shuffle; 4. Read, crop, distort. Labels: PLANE PLANE PLANE DOG DOG DOG DOG BOAT BOAT BOAT BOAT CAT CAT CAT CAT PLANE
  • 18. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels; 3. Shuffle; 4. Read, crop, distort; 5. Copy to GPU. Labels: PLANE PLANE PLANE DOG DOG DOG DOG BOAT BOAT BOAT BOAT CAT CAT CAT CAT PLANE
  • 19. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels; 3. Shuffle; 4. Read, crop, distort; 5. Copy to GPU. ANY OF THESE STEPS CAN BE A POTENTIAL BOTTLENECK. Other domains (NLP, speech, etc.) will follow a similar(ish) flow.
  • 20. EVALUATION METHODOLOGY. 1.3M images, 1000 categories
  • 21. HARDWARE STACK (AIRI). 4x NVIDIA DGX-1, each with 8x Tesla V100 GPUs (SXM2), 2x Intel E5-2698 v4 @ 2.20GHz, 4x Mellanox MT27700 100Gb/s VPI adapters, 512GB DDR4-2400; Pure Storage FlashBlade: 15x17TB, 179T usable before data reduction; Arista DCS-7060CX2-32S, 32x 100Gb/s QSFP100 ports; 40Gb Ethernet; 100Gb Ethernet w/ RDMA (RoCE)
  • 22. SOFTWARE STACK. nvcr.io/nvidia/tensorflow:17.12; DGX-OS (Ubuntu 16.04); CUDA 9.0; NCCL 2.1.2; CUDNN v7; OpenMPI 3.0; TensorFlow 1.4.0+ Horovod; alsrgv/tf_cnn_benchmarks. Using TensorFlow “Datasets” API for input pipelines.
  • 23. TRAINING WITH 1 GPU. Images per second when training Inception3 (batch size = 64): Defaults 216 i/s; Synthetic 228 i/s. How do we know what good looks like? Replace the input pipeline with synthetic data. “Default” training pipeline: input pipeline, forward, backward. With synthetic data: forward, backward.
  • 24. TRAINING WITH 1 GPU. Images per second when training Inception3 (batch size = 64): Defaults 216 i/s; Defaults + Prefetch 225 i/s; Synthetic 228 images/s. Adding a prefetch queue improves scheduler behavior. SHOULD WE CARE ABOUT 5%?
  • 25. SCALING TO 32 GPUs (4x DGX-1s). Images per second when training Inception3 (batch size = 64/GPU): Defaults 4143 i/s; + Prefetch 5335 i/s; + Thread Pool Limit 5527 i/s; - Distortions 6440 i/s; Synthetic 6580 images/s; Linear 7200 images/s. 42% gap! Thread pool limits: avoid over-subscribing the CPU with too many threads (inter_op_parallelism_threads). No Distortions: skip the preprocessing step in the input pipeline. This is an unrealistic configuration, but it shows the bottleneck. EXCELLENT SCALABILITY, BUT STILL MORE WORK TO BE DONE
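The "42% gap" on the 32-GPU slide can be reproduced from the quoted throughput numbers. The sketch below is only illustrative arithmetic. It assumes that 7200 images/s is the linear-scaling line and 6580 images/s is the synthetic-data run, since the slide text does not pair those labels and values explicitly. That reading matches 32 GPUs times the 225 i/s single-GPU prefetch figure. The function name `gap_to_linear` is introduced here for illustration.

```python
# Throughput figures (images/s) quoted on the 32-GPU Inception3 slide.
# ASSUMPTION: 7200 is the "Linear" reference line, 6580 the "Synthetic" run.
LINEAR = 7200

measured = {
    "Defaults": 4143,
    "+ Prefetch": 5335,
    "+ Thread Pool Limit": 5527,
    "- Distortions": 6440,
    "Synthetic": 6580,
}


def gap_to_linear(images_per_sec, linear=LINEAR):
    """Fraction of the linear-scaling target that is still missing."""
    return 1.0 - images_per_sec / linear


gaps = {name: gap_to_linear(rate) for name, rate in measured.items()}

for name, gap in gaps.items():
    print(f"{name:<22}{measured[name]:>6} i/s   gap to linear: {gap:.1%}")
```

Under these assumptions the default pipeline is about 42% below linear, and each tuning step (prefetch, thread pool limit) narrows that gap.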
  • 26. 2.5X Performance Improvement
  • 27.
  • 28. SCALE OF REAL-WORLD DATA. ImageNet: 143 GB; Zenuity: 20 PB
  • 29. SINGLE-GPU TRAINING. Loop: evaluate, compute gradients, apply gradients. Pr{dog} = 0.903, Pr{cat} = 0.072, …
  • 30. DISTRIBUTED TRAINING. Each GPU: evaluate, compute gradients, apply gradients (shown for three GPUs), with a merge gradients step between computing and applying. Axis: # GPUs.
  • 31. LINEAR SCALING FOR CONVNETS. RESNET-50: 2540 i/s (1 DGX-1), 4870 i/s (2 DGX-1), 10244 i/s (4 DGX-1). INCEPTION3: 1600 i/s (1 DGX-1), 3160 i/s (2 DGX-1), 6440 i/s (4 DGX-1). VGG16: 1640 i/s (1 DGX-1), 3110 i/s (2 DGX-1), 6300 i/s (4 DGX-1).
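To make "linear scaling" concrete, the throughput figures on the slide can be turned into a scaling efficiency: measured throughput on N DGX-1 systems divided by N times the single-system throughput. This is a small sketch using only the numbers quoted above. The helper name `scaling_efficiency` is introduced here for illustration and is not from the talk.

```python
# Images/s per model, keyed by number of DGX-1 systems (values from the slide).
throughput = {
    "ResNet-50": {1: 2540, 2: 4870, 4: 10244},
    "Inception3": {1: 1600, 2: 3160, 4: 6440},
    "VGG16": {1: 1640, 2: 3110, 4: 6300},
}


def scaling_efficiency(rates):
    """Measured throughput relative to perfect linear scaling from one system."""
    base = rates[1]
    return {n: rate / (n * base) for n, rate in rates.items()}


efficiency = {model: scaling_efficiency(rates) for model, rates in throughput.items()}

for model, eff in efficiency.items():
    row = ", ".join(f"{n} DGX-1: {value:.1%}" for n, value in sorted(eff.items()))
    print(f"{model:<11}{row}")
```

A value of 100% means throughput grew exactly in proportion to the number of systems. With the quoted figures every configuration stays within roughly 5% of that line.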
  • 32. RDMA OVER ETHERNET. RDMA is essential for peak performance
  • 33. KEEPING GPUs FED WITH DATA. Input queue is full - need more/faster GPUs?
  • 34. FROM IMAGES TO TENSORS. Steps: 1. Enumerate; 2. Associate labels; 3. Crop and distort. Labels: PLANE PLANE PLANE DOG DOG DOG DOG BOAT BOAT BOAT BOAT CAT CAT CAT CAT PLANE
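The "FROM IMAGES TO TENSORS" steps (enumerate, associate labels, shuffle, read/crop/distort) and the prefetch queue from the single-GPU results can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the TensorFlow "Datasets" pipeline used in the talk. The file names are hypothetical, image decoding is replaced by random numbers, and the copy-to-GPU step is omitted. All function names are introduced here for illustration.

```python
import queue
import random
import threading

CLASSES = ["plane", "dog", "boat", "cat"]


def enumerate_files(per_class=4):
    """Step 1, enumerate: hypothetical paths stand in for a directory listing."""
    return [f"{name}/{i:03d}.jpg" for name in CLASSES for i in range(per_class)]


def associate_labels(paths):
    """Step 2: derive an integer label from the (assumed) class directory name."""
    return [(path, CLASSES.index(path.split("/")[0])) for path in paths]


def read_crop_distort(path, rng):
    """Step 4 placeholder: a real pipeline would decode and augment an image."""
    pixels = [rng.random() for _ in range(8)]  # stand-in for decoded image data
    start = rng.randrange(0, 5)
    crop = pixels[start:start + 4]             # random crop of fixed size 4
    if rng.random() < 0.5:
        crop.reverse()                         # random flip as the "distortion"
    return crop


def batches(examples, batch_size, seed=0):
    """Step 3 (shuffle), then preprocess and group examples into batches."""
    rng = random.Random(seed)
    examples = list(examples)
    rng.shuffle(examples)
    for i in range(0, len(examples), batch_size):
        chunk = examples[i:i + batch_size]
        images = [read_crop_distort(path, rng) for path, _ in chunk]
        labels = [label for _, label in chunk]
        yield images, labels


def prefetch(iterator, depth=2):
    """Produce items in a background thread so preprocessing overlaps training."""
    buffer = queue.Queue(maxsize=depth)
    done = object()

    def worker():
        for item in iterator:
            buffer.put(item)
        buffer.put(done)

    threading.Thread(target=worker, daemon=True).start()
    while True:
        item = buffer.get()
        if item is done:
            return
        yield item


examples = associate_labels(enumerate_files())
seen = list(prefetch(batches(examples, batch_size=4)))
print(f"{len(seen)} batches, labels of first batch: {seen[0][1]}")
```

The bounded queue is the part that corresponds to prefetching. The producer can run up to `depth` batches ahead of the consumer, so a training step does not have to wait for the next batch to be preprocessed.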