SlideShare ist ein Scribd-Unternehmen logo
1 von 12
AI Pipeline Optimization
… using Kubeflow
© 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Muneer Ahmad (muneer.ahmad@netapp.com) Steve Guhr (steve.guhr@netapp.com)
AI Solutions Architect Solutions Engineer
Agenda
1) AI Pipeline Optimization & Architecture
2) Demo
3) Q&A
What’s it all about?
2 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
What it is and why did we do that?
AI Pipeline Optimization
3 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
AI Pipeline in general
… workflow across different sites
4 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Data
Ingestion
Data
Analysis
Data
Transformation
Data
Validation
Training
Data
Splitting
Monitoring
Training
Model
Validation
Training At
Scale
ServingRoll-Out Monitoring Logging
Site 1
Site 2Site 3Site 4
Site 5 Site 6
Model
AI Pipeline Portability
... working with multiple premises
5 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Laptop Training Rig Cloud
Model
AI Pipeline Optimization
… using Kubernetes & Kubeflow
6 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Laptop Training Rig Cloud
Architectural Overview
What did we do (in a nutshell)?
7 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Kubeflow
Kubernetes
JupyterHub
…
Trident
Pipeline Katib
How did we do it?
Everyone loves demos, right?!
8 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Explaining the pipeline
 Pre-processing
 Training (classification, CIFAR10 dataset)
 Deploying and serving trained models
 TensorRT Inference engine
 Web-application
… using actual data and training
9 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
 What about „Machine Learning Version Control“?
 How do you „Lift and Shift“ the whole AI application stack across hybrid clouds?
 How to manage (c)old trained models and data?
 […]
10 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Closing Thoughts – What‘s next?
11 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Resources
 “Trident” for persistent volumes inside of containers:
 https://github.com/NetApp/trident
 https://netapp-trident.readthedocs.io/en/latest/
 Kubernetes for container orchestration:
 https://kubernetes.io/de/
 Kubeflow as a „Data Science Toolchest“
 https://www.kubeflow.org/
 Articles about „DataScience as a Service“ and „Machine Learning Version Control“:
 https://www.linkedin.com/pulse/simplify-machine-learning-version-control-muneer-ahmad-dedmari/
 https://www.linkedin.com/pulse/part-2-simplifying-dataops-datascience-service-jupyter-steve-guhr/
12
May the Data be with you!
© 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud MLScaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
 
Hadoop + GPU
Hadoop + GPUHadoop + GPU
Hadoop + GPU
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
IBM BOA for POWER
IBM BOA for POWER IBM BOA for POWER
IBM BOA for POWER
 
JMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AIJMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AI
 
Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)
 
SigOpt at GTC - Tuning the Untunable
SigOpt at GTC - Tuning the UntunableSigOpt at GTC - Tuning the Untunable
SigOpt at GTC - Tuning the Untunable
 
Large-Scale Training with GPUs at Facebook
Large-Scale Training with GPUs at FacebookLarge-Scale Training with GPUs at Facebook
Large-Scale Training with GPUs at Facebook
 
Exploiting GPUs in Spark
Exploiting GPUs in SparkExploiting GPUs in Spark
Exploiting GPUs in Spark
 
TinyML as-a-Service
TinyML as-a-ServiceTinyML as-a-Service
TinyML as-a-Service
 
Distributed Deep Learning on Spark
Distributed Deep Learning on SparkDistributed Deep Learning on Spark
Distributed Deep Learning on Spark
 
TFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platformTFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platform
 
Deep Dive on Amazon EC2 Accelerated Computing
Deep Dive on Amazon EC2 Accelerated ComputingDeep Dive on Amazon EC2 Accelerated Computing
Deep Dive on Amazon EC2 Accelerated Computing
 
AI Bridging Cloud Infrastructure (ABCI) and its communication performance
AI Bridging Cloud Infrastructure (ABCI) and its communication performanceAI Bridging Cloud Infrastructure (ABCI) and its communication performance
AI Bridging Cloud Infrastructure (ABCI) and its communication performance
 
Deploying deep learning models with Docker and Kubernetes
Deploying deep learning models with Docker and KubernetesDeploying deep learning models with Docker and Kubernetes
Deploying deep learning models with Docker and Kubernetes
 
GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用
 
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated ComputingAWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
 
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDL
 
Kaz Sato, Evangelist, Google at MLconf ATL 2016
Kaz Sato, Evangelist, Google at MLconf ATL 2016Kaz Sato, Evangelist, Google at MLconf ATL 2016
Kaz Sato, Evangelist, Google at MLconf ATL 2016
 

Ähnlich wie AI Pipeline Optimization using Kubeflow

Ähnlich wie AI Pipeline Optimization using Kubeflow (20)

DevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
DevOps@Scale- IBM Cloud and NetAp-Insight-BerlinDevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
DevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
 
NetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebulaNetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebula
 
Leveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation PlatformLeveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation Platform
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...
 
NetApp IT Data Center Strategies to Enable Digital Transformation
NetApp IT Data Center Strategies to Enable Digital TransformationNetApp IT Data Center Strategies to Enable Digital Transformation
NetApp IT Data Center Strategies to Enable Digital Transformation
 
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019
IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019
 
The rise of microservices
The rise of microservicesThe rise of microservices
The rise of microservices
 
End User Computing with NetApp
End User Computing with NetAppEnd User Computing with NetApp
End User Computing with NetApp
 
TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform
 
Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
 Facilitez votre transition DevOps grâce à l'automatisation de votre infras... Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
 
CWIN16 UK Event - The Future of Infrastructure
CWIN16 UK Event - The Future of Infrastructure CWIN16 UK Event - The Future of Infrastructure
CWIN16 UK Event - The Future of Infrastructure
 
Instantaneous Replication of Build Artifacts with NetApp
Instantaneous Replication of Build Artifacts with NetAppInstantaneous Replication of Build Artifacts with NetApp
Instantaneous Replication of Build Artifacts with NetApp
 
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from NetappTransform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
 
Sydney mule soft meetup #8 1 August 2019 - all slides
Sydney mule soft meetup #8   1 August 2019 - all slidesSydney mule soft meetup #8   1 August 2019 - all slides
Sydney mule soft meetup #8 1 August 2019 - all slides
 
apidays LIVE Australia 2020 - Data with a Mission by Matt McLarty
apidays LIVE Australia 2020 -  Data with a Mission by Matt McLarty apidays LIVE Australia 2020 -  Data with a Mission by Matt McLarty
apidays LIVE Australia 2020 - Data with a Mission by Matt McLarty
 
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into production
 
End To End Machine Learning With Google Cloud
End To End Machine Learning With Google Cloud End To End Machine Learning With Google Cloud
End To End Machine Learning With Google Cloud
 

Kürzlich hochgeladen

%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 

Kürzlich hochgeladen (20)

%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
 
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 

AI Pipeline Optimization using Kubeflow

  • 1. AI Pipeline Optimization … using Kubeflow © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Muneer Ahmad (muneer.ahmad@netapp.com) Steve Guhr (steve.guhr@netapp.com) AI Solutions Architect Solutions Engineer
  • 2. Agenda 1) AI Pipeline Optimization & Architecture 2) Demo 3) Q&A What’s it all about? 2 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 3. What it is and why did we do that? AI Pipeline Optimization 3 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 4. AI Pipeline in general … workflow across different sites 4 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Data Ingestion Data Analysis Data Transformation Data Validation Training Data Splitting Monitoring Training Model Validation Training At Scale ServingRoll-Out Monitoring Logging Site 1 Site 2Site 3Site 4 Site 5 Site 6
  • 5. Model AI Pipeline Portability ... working with multiple premises 5 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Laptop Training Rig Cloud
  • 6. Model AI Pipeline Optimization … using Kubernetes & Kubeflow 6 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Laptop Training Rig Cloud
  • 7. Architectural Overview What did we do (in a nutshell)? 7 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Kubeflow Kubernetes JupyterHub … Trident Pipeline Katib
  • 8. How did we do it? Everyone loves demos, right?! 8 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 9. Explaining the pipeline  Pre-processing  Training (classification, CIFAR10 dataset)  Deploying and serving trained models  TensorRT Inference engine  Web-application … using actual data and training 9 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 10.  What about „Machine Learning Version Control“?  How do you „Lift and Shift“ the whole AI application stack across hybrid clouds?  How to manage (c)old trained models and data?  […] 10 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Closing Thoughts – What‘s next?
  • 11. 11 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Resources  “Trident” for persistent volumes inside of containers:  https://github.com/NetApp/trident  https://netapp-trident.readthedocs.io/en/latest/  Kubernetes for container orchestration:  https://kubernetes.io/de/  Kubeflow as a „Data Science Toolchest“  https://www.kubeflow.org/  Articles about „DataScience as a Service“ and „Machine Learning Version Control“:  https://www.linkedin.com/pulse/simplify-machine-learning-version-control-muneer-ahmad-dedmari/  https://www.linkedin.com/pulse/part-2-simplifying-dataops-datascience-service-jupyter-steve-guhr/
  • 12. 12 May the Data be with you! © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —