SlideShare a Scribd company logo
1 of 23
Notebook-based AI Pipelines
with Elyra and Kubeflow
Nick Pentreath
Principal Engineer, IBM
@MLnick
About
DEG / Nov 18, 2020 / © 2020 IBM Corporation
–@MLnick on Twitter, Github, LinkedIn
–Principal Engineer, IBM CODAIT (Center
for Open-Source Data & AI Technologies)
–Machine Learning & AI
–Apache Spark committer & PMC
–Author of Machine Learning with Spark
–Various conferences & meetups
2
Improving the Enterprise AI Lifecycle in Open Source
DEG / Nov 18, 2020 / © 2020 IBM Corporation 3
–CODAIT aims to make AI solutions
dramatically easier to create,
deploy, and manage in the
enterprise.
–We contribute to and advocate for
the open-source technologies that
are foundational to IBM’s AI
offerings.
–30+ open-source developers!
Center for Open Source Data & AI Technologies
codait.org
CODAIT
Open Source @ IBM
Agenda
4
–Machine learning workflow
–JupyerLab & Elyra
–Demo
–Conclusion
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Machine Learning
Workflow
5
Data Analyze Process Train Deploy
Predict
&
Maintain
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Workflow spans teams …
6
Data Analyze Process Train Deploy
Predict
&
Maintain
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Data Engineers Data Scientists & Researchers
Machine Learning &
Production Engineers
… and tools
7
Data Analyze Process Train Deploy
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Data formats
• CSV, SQL
• JSON,
Parquet,
AVRO
• Binary
(image,
audio)
• …
Data Engineers Data Scientists & Researchers
Machine Learning &
Production Engineers
Analysis & data
viz
• ggplot
• dplyr
• matplotlib
• Pandas
• SparkSQL
• …
Pre-processing
& pipelines
• dplyr
• pandas
• scikit-learn
• SparkSQL /
SparkML
• …
Frameworks
• R, scikit-
learn
• SparkML
• TensorFlow
• PyTorch
• LightGBM,
XGBoost
• …
Formats &
mechanisms
• Variety of
formats
• Containers
• …
Iteration &
Experimentation
8
Data Analyze Process Train Deploy
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Data Scientists & Researchers
Load Clean Explore Interpret
Refine
Iteration &
Experimentation
9
Data Process Train Deploy
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Data Scientists & Researchers
Extract
features
Pre-
process
Train Evaluate
Refine
Analyze
Interactive Notebooks
DEG / Nov 18, 2020 / © 2020 IBM Corporation 10
Notebooks have become
the de-facto standard for
content-rich, interactive
& iterative work
* Logos trademarks of their respective projects
Elyra Overview
DEG / Nov 18, 2020 / © 2020 IBM Corporation 11
Elyra is a set of AI-
centric extensions to
JupyterLab Notebooks
* Logos trademarks of their respective projects
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 12
–Visual Pipeline Editor
Visual editor for building AI pipelines,
enabling the conversion of multiple
notebooks into batch jobs or workflows.
–Notebooks as batch jobs
–Python script execution
–Automated Table of Contents
–Code Snippets
–Git integration
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 13
–Visual Pipeline Editor
–Notebooks as batch jobs
Extends the notebook UI to simplify the
submission of notebooks as a batch job
for model training
–Python script execution
–Automated Table of Contents
–Code Snippets
–Git integration
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 14
–Visual Pipeline Editor
–Notebooks as batch jobs
–Python script execution
Edit and execute python scripts against
local or cloud-based resources
–Automated Table of Contents
–Code Snippets
–Git integration
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 15
–Visual Pipeline Editor
–Notebooks as batch jobs
–Python script execution
–Automated Table of Contents
Generate & navigate table of contents
from notebooks & python scripts
–Code Snippets
–Git integration
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 16
–Visual Pipeline Editor
–Notebooks as batch jobs
–Python script execution
–Automated Table of Contents
–Code Snippets
Easy creation and insertion of reusable
code snippets for various languages
–Git integration
Elyra Key Features
DEG / Nov 18, 2020 / © 2020 IBM Corporation 17
–Visual Pipeline Editor
–Notebooks as batch jobs
–Python script execution
–Automated Table of Contents
–Code Snippets
–Git integration
Track project changes and share among
teammates
DEG / Nov 18, 2020 / © 2020 IBM Corporation
Getting started with Elyra
1. Try Elyra from Binder
ibm.biz/elyra-demo
2. Run Elyra from Docker
ibm.biz/elyra-docker-installation
3. Install Elyra on your local machine
ibm.biz/elyra-installation
18
DEG / Nov 18, 2020 / © 2020 IBM Corporation 19
Start using Elyra today!
Getting started with Elyra
ibm.biz/elyra-installation
Elyra on Github
github.com/elyra-ai/elyra
Elyra Notebook projects on Github
github.com/CODAIT/flight-delay-notebooks
github.com/CODAIT/covid-notebooks
Contributing to the projects
• Star and fork, submit bug reports, suggest improvements,
help with code reviews, join our community meetings
ibm.biz/elyra-demo
gitter.im/elyra-ai/community
DEG / Nov 18, 2020 / © 2020 IBM Corporation 20
Thank you
codait.org
twitter.com/codait_org
github.com/CODAIT
developer.ibm.com
21DEG / Nov 18, 2020 / © 2020 IBM Corporation
Check out the Data Asset Exchange
https://ibm.biz/data-exchange
Sign up for IBM Cloud
https://ibm.biz/Bdqkfg
DEG / Nov 18, 2020 / © 2020 IBM Corporation 22
Feedback
Your feedback is important to us.
Don’t forget to rate
and review the sessions.

More Related Content

What's hot

Keynote: A Roadmap for Domain-Specific Low-Code Platforms
Keynote: A Roadmap for Domain-Specific Low-Code PlatformsKeynote: A Roadmap for Domain-Specific Low-Code Platforms
Keynote: A Roadmap for Domain-Specific Low-Code PlatformsObeo
 
A Smart Development Environment for Infrastructure as Code
A Smart Development Environment for Infrastructure as CodeA Smart Development Environment for Infrastructure as Code
A Smart Development Environment for Infrastructure as CodeObeo
 
GDG Cloud meetup november 2019 - kubeflow pipelines
GDG Cloud meetup november 2019 -  kubeflow pipelinesGDG Cloud meetup november 2019 -  kubeflow pipelines
GDG Cloud meetup november 2019 - kubeflow pipelinesSven Degroote
 
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...Databricks
 
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB's
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB'sBuilding Agile and Resilient Schema Transformations using Apache Kafka and ESB's
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB'sRicardo Ferreira
 
How to build high frequency trading with our matlab secrets with c++ and mysql
How to build high frequency trading with our matlab secrets with c++ and mysqlHow to build high frequency trading with our matlab secrets with c++ and mysql
How to build high frequency trading with our matlab secrets with c++ and mysqlBryan Downing
 
Using the Java Client Library by Noah Crowley, DevRel | InfluxData
Using the Java Client Library by Noah Crowley, DevRel | InfluxDataUsing the Java Client Library by Noah Crowley, DevRel | InfluxData
Using the Java Client Library by Noah Crowley, DevRel | InfluxDataInfluxData
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youMárton Kodok
 
EGL Conference 2011 - EGL Open
EGL Conference 2011 - EGL OpenEGL Conference 2011 - EGL Open
EGL Conference 2011 - EGL OpenWill Smythe
 
Resume - ERF - 2015-12-15
Resume - ERF - 2015-12-15Resume - ERF - 2015-12-15
Resume - ERF - 2015-12-15Eric Foertsch
 
EGL Conference 2011 - Application Migration
EGL Conference 2011 - Application MigrationEGL Conference 2011 - Application Migration
EGL Conference 2011 - Application MigrationWill Smythe
 
Smallworld 5 Migration Experiences
Smallworld 5 Migration ExperiencesSmallworld 5 Migration Experiences
Smallworld 5 Migration ExperiencesGlobema
 
Active reports Training Session
Active reports Training SessionActive reports Training Session
Active reports Training SessionForziatech
 
Clean coding in plsql and sql, v2
Clean coding in plsql and sql, v2Clean coding in plsql and sql, v2
Clean coding in plsql and sql, v2Brendan Furey
 
raman_resume
raman_resumeraman_resume
raman_resumeraman1029
 
Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...
 Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e... Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...
Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...VMware Tanzu
 
Rhapsody reverseengineering
Rhapsody reverseengineeringRhapsody reverseengineering
Rhapsody reverseengineeringScott Althouse
 

What's hot (20)

Keynote: A Roadmap for Domain-Specific Low-Code Platforms
Keynote: A Roadmap for Domain-Specific Low-Code PlatformsKeynote: A Roadmap for Domain-Specific Low-Code Platforms
Keynote: A Roadmap for Domain-Specific Low-Code Platforms
 
A Smart Development Environment for Infrastructure as Code
A Smart Development Environment for Infrastructure as CodeA Smart Development Environment for Infrastructure as Code
A Smart Development Environment for Infrastructure as Code
 
GDG Cloud meetup november 2019 - kubeflow pipelines
GDG Cloud meetup november 2019 -  kubeflow pipelinesGDG Cloud meetup november 2019 -  kubeflow pipelines
GDG Cloud meetup november 2019 - kubeflow pipelines
 
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
 
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB's
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB'sBuilding Agile and Resilient Schema Transformations using Apache Kafka and ESB's
Building Agile and Resilient Schema Transformations using Apache Kafka and ESB's
 
How to build high frequency trading with our matlab secrets with c++ and mysql
How to build high frequency trading with our matlab secrets with c++ and mysqlHow to build high frequency trading with our matlab secrets with c++ and mysql
How to build high frequency trading with our matlab secrets with c++ and mysql
 
Using the Java Client Library by Noah Crowley, DevRel | InfluxData
Using the Java Client Library by Noah Crowley, DevRel | InfluxDataUsing the Java Client Library by Noah Crowley, DevRel | InfluxData
Using the Java Client Library by Noah Crowley, DevRel | InfluxData
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to you
 
EGL Conference 2011 - EGL Open
EGL Conference 2011 - EGL OpenEGL Conference 2011 - EGL Open
EGL Conference 2011 - EGL Open
 
TD/OMS and RPG Toolbox
TD/OMS and RPG ToolboxTD/OMS and RPG Toolbox
TD/OMS and RPG Toolbox
 
Resume - ERF - 2015-12-15
Resume - ERF - 2015-12-15Resume - ERF - 2015-12-15
Resume - ERF - 2015-12-15
 
EGL Conference 2011 - Application Migration
EGL Conference 2011 - Application MigrationEGL Conference 2011 - Application Migration
EGL Conference 2011 - Application Migration
 
Smallworld 5 Migration Experiences
Smallworld 5 Migration ExperiencesSmallworld 5 Migration Experiences
Smallworld 5 Migration Experiences
 
TheodoreWDennis-Resume
TheodoreWDennis-ResumeTheodoreWDennis-Resume
TheodoreWDennis-Resume
 
Active reports Training Session
Active reports Training SessionActive reports Training Session
Active reports Training Session
 
Sapphire
SapphireSapphire
Sapphire
 
Clean coding in plsql and sql, v2
Clean coding in plsql and sql, v2Clean coding in plsql and sql, v2
Clean coding in plsql and sql, v2
 
raman_resume
raman_resumeraman_resume
raman_resume
 
Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...
 Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e... Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...
Cloud-Native .Net des applications containerisées .Net sur Linux, Windows e...
 
Rhapsody reverseengineering
Rhapsody reverseengineeringRhapsody reverseengineering
Rhapsody reverseengineering
 

Similar to Notebook-based AI Pipelines with Elyra and Kubeflow

Building Notebook-based AI Pipelines with Elyra and Kubeflow
Building Notebook-based AI Pipelines with Elyra and KubeflowBuilding Notebook-based AI Pipelines with Elyra and Kubeflow
Building Notebook-based AI Pipelines with Elyra and KubeflowDatabricks
 
Ai pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksAi pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksLuciano Resende
 
Inteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeInteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeLuciano Resende
 
Open Source AI - News and examples
Open Source AI - News and examplesOpen Source AI - News and examples
Open Source AI - News and examplesLuciano Resende
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3DataWorks Summit
 
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayStrata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayLuciano Resende
 
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...Luciano Resende
 
Scaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsScaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsLuciano Resende
 
End-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNXEnd-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNXNick Pentreath
 
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDE
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDESAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDE
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDEMarkus Van Kempen
 
AD308: XPages in a Social World
AD308: XPages in a Social WorldAD308: XPages in a Social World
AD308: XPages in a Social Worldpaidi_ed
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into productionDataWorks Summit
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDatabricks
 
Integrating Service Mesh with Kubernetes-based connected vehicle platform
Integrating Service Mesh with Kubernetes-based connected vehicle platformIntegrating Service Mesh with Kubernetes-based connected vehicle platform
Integrating Service Mesh with Kubernetes-based connected vehicle platformJun Kai Yong
 
MLOps implemented - how we combine the cloud & open-source to boost data scie...
MLOps implemented - how we combine the cloud & open-source to boost data scie...MLOps implemented - how we combine the cloud & open-source to boost data scie...
MLOps implemented - how we combine the cloud & open-source to boost data scie...GetInData
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownDatabricks
 
A short introduction to Spark and its benefits
A short introduction to Spark and its benefitsA short introduction to Spark and its benefits
A short introduction to Spark and its benefitsJohan Picard
 
Introduction to Node-RED
Introduction to Node-REDIntroduction to Node-RED
Introduction to Node-REDnodered_ug_jp
 
IoT Development from Software Developer Perspective
IoT Development from Software Developer PerspectiveIoT Development from Software Developer Perspective
IoT Development from Software Developer PerspectiveAndri Yadi
 

Similar to Notebook-based AI Pipelines with Elyra and Kubeflow (20)

Building Notebook-based AI Pipelines with Elyra and Kubeflow
Building Notebook-based AI Pipelines with Elyra and KubeflowBuilding Notebook-based AI Pipelines with Elyra and Kubeflow
Building Notebook-based AI Pipelines with Elyra and Kubeflow
 
Ai pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksAi pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooks
 
Inteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeInteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for Code
 
Open Source AI - News and examples
Open Source AI - News and examplesOpen Source AI - News and examples
Open Source AI - News and examples
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3
 
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayStrata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
 
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
 
Scaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsScaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloads
 
Evolve18 | Carmen Sutter & Sarah Xu | Accelerate your Digital Experience with...
Evolve18 | Carmen Sutter & Sarah Xu | Accelerate your Digital Experience with...Evolve18 | Carmen Sutter & Sarah Xu | Accelerate your Digital Experience with...
Evolve18 | Carmen Sutter & Sarah Xu | Accelerate your Digital Experience with...
 
End-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNXEnd-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNX
 
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDE
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDESAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDE
SAPTechED 2015 UX114 -Building custom SAP Fiori Apps Using SAP Web IDE
 
AD308: XPages in a Social World
AD308: XPages in a Social WorldAD308: XPages in a Social World
AD308: XPages in a Social World
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into production
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNX
 
Integrating Service Mesh with Kubernetes-based connected vehicle platform
Integrating Service Mesh with Kubernetes-based connected vehicle platformIntegrating Service Mesh with Kubernetes-based connected vehicle platform
Integrating Service Mesh with Kubernetes-based connected vehicle platform
 
MLOps implemented - how we combine the cloud & open-source to boost data scie...
MLOps implemented - how we combine the cloud & open-source to boost data scie...MLOps implemented - how we combine the cloud & open-source to boost data scie...
MLOps implemented - how we combine the cloud & open-source to boost data scie...
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling Down
 
A short introduction to Spark and its benefits
A short introduction to Spark and its benefitsA short introduction to Spark and its benefits
A short introduction to Spark and its benefits
 
Introduction to Node-RED
Introduction to Node-REDIntroduction to Node-RED
Introduction to Node-RED
 
IoT Development from Software Developer Perspective
IoT Development from Software Developer PerspectiveIoT Development from Software Developer Perspective
IoT Development from Software Developer Perspective
 

More from Nick Pentreath

Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling downNick Pentreath
 
IBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeIBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeNick Pentreath
 
IBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneIBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneNick Pentreath
 
Search and Recommendations: 3 Sides of the Same Coin
Search and Recommendations: 3 Sides of the Same CoinSearch and Recommendations: 3 Sides of the Same Coin
Search and Recommendations: 3 Sides of the Same CoinNick Pentreath
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsNick Pentreath
 
RNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationRNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationNick Pentreath
 

More from Nick Pentreath (6)

Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling down
 
IBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeIBM Developer Model Asset eXchange
IBM Developer Model Asset eXchange
 
IBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneIBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for Everyone
 
Search and Recommendations: 3 Sides of the Same Coin
Search and Recommendations: 3 Sides of the Same CoinSearch and Recommendations: 3 Sides of the Same Coin
Search and Recommendations: 3 Sides of the Same Coin
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
RNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationRNNs for Recommendations and Personalization
RNNs for Recommendations and Personalization
 

Recently uploaded

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 

Recently uploaded (20)

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 

Notebook-based AI Pipelines with Elyra and Kubeflow

  • 1. Notebook-based AI Pipelines with Elyra and Kubeflow Nick Pentreath Principal Engineer, IBM @MLnick
  • 2. About DEG / Nov 18, 2020 / © 2020 IBM Corporation –@MLnick on Twitter, Github, LinkedIn –Principal Engineer, IBM CODAIT (Center for Open-Source Data & AI Technologies) –Machine Learning & AI –Apache Spark committer & PMC –Author of Machine Learning with Spark –Various conferences & meetups 2
  • 3. Improving the Enterprise AI Lifecycle in Open Source DEG / Nov 18, 2020 / © 2020 IBM Corporation 3 –CODAIT aims to make AI solutions dramatically easier to create, deploy, and manage in the enterprise. –We contribute to and advocate for the open-source technologies that are foundational to IBM’s AI offerings. –30+ open-source developers! Center for Open Source Data & AI Technologies codait.org CODAIT Open Source @ IBM
  • 4. Agenda 4 –Machine learning workflow –JupyerLab & Elyra –Demo –Conclusion DEG / Nov 18, 2020 / © 2020 IBM Corporation
  • 5. Machine Learning Workflow 5 Data Analyze Process Train Deploy Predict & Maintain DEG / Nov 18, 2020 / © 2020 IBM Corporation
  • 6. Workflow spans teams … 6 Data Analyze Process Train Deploy Predict & Maintain DEG / Nov 18, 2020 / © 2020 IBM Corporation Data Engineers Data Scientists & Researchers Machine Learning & Production Engineers
  • 7. … and tools 7 Data Analyze Process Train Deploy DEG / Nov 18, 2020 / © 2020 IBM Corporation Data formats • CSV, SQL • JSON, Parquet, AVRO • Binary (image, audio) • … Data Engineers Data Scientists & Researchers Machine Learning & Production Engineers Analysis & data viz • ggplot • dplyr • matplotlib • Pandas • SparkSQL • … Pre-processing & pipelines • dplyr • pandas • scikit-learn • SparkSQL / SparkML • … Frameworks • R, scikit- learn • SparkML • TensorFlow • PyTorch • LightGBM, XGBoost • … Formats & mechanisms • Variety of formats • Containers • …
  • 8. Iteration & Experimentation 8 Data Analyze Process Train Deploy DEG / Nov 18, 2020 / © 2020 IBM Corporation Data Scientists & Researchers Load Clean Explore Interpret Refine
  • 9. Iteration & Experimentation 9 Data Process Train Deploy DEG / Nov 18, 2020 / © 2020 IBM Corporation Data Scientists & Researchers Extract features Pre- process Train Evaluate Refine Analyze
  • 10. Interactive Notebooks DEG / Nov 18, 2020 / © 2020 IBM Corporation 10 Notebooks have become the de-facto standard for content-rich, interactive & iterative work * Logos trademarks of their respective projects
  • 11. Elyra Overview DEG / Nov 18, 2020 / © 2020 IBM Corporation 11 Elyra is a set of AI- centric extensions to JupyterLab Notebooks * Logos trademarks of their respective projects
  • 12. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 12 –Visual Pipeline Editor Visual editor for building AI pipelines, enabling the conversion of multiple notebooks into batch jobs or workflows. –Notebooks as batch jobs –Python script execution –Automated Table of Contents –Code Snippets –Git integration
  • 13. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 13 –Visual Pipeline Editor –Notebooks as batch jobs Extends the notebook UI to simplify the submission of notebooks as a batch job for model training –Python script execution –Automated Table of Contents –Code Snippets –Git integration
  • 14. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 14 –Visual Pipeline Editor –Notebooks as batch jobs –Python script execution Edit and execute python scripts against local or cloud-based resources –Automated Table of Contents –Code Snippets –Git integration
  • 15. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 15 –Visual Pipeline Editor –Notebooks as batch jobs –Python script execution –Automated Table of Contents Generate & navigate table of contents from notebooks & python scripts –Code Snippets –Git integration
  • 16. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 16 –Visual Pipeline Editor –Notebooks as batch jobs –Python script execution –Automated Table of Contents –Code Snippets Easy creation and insertion of reusable code snippets for various languages –Git integration
  • 17. Elyra Key Features DEG / Nov 18, 2020 / © 2020 IBM Corporation 17 –Visual Pipeline Editor –Notebooks as batch jobs –Python script execution –Automated Table of Contents –Code Snippets –Git integration Track project changes and share among teammates
  • 18. DEG / Nov 18, 2020 / © 2020 IBM Corporation Getting started with Elyra 1. Try Elyra from Binder ibm.biz/elyra-demo 2. Run Elyra from Docker ibm.biz/elyra-docker-installation 3. Install Elyra on your local machine ibm.biz/elyra-installation 18
  • 19. DEG / Nov 18, 2020 / © 2020 IBM Corporation 19
  • 20. Start using Elyra today! Getting started with Elyra ibm.biz/elyra-installation Elyra on Github github.com/elyra-ai/elyra Elyra Notebook projects on Github github.com/CODAIT/flight-delay-notebooks github.com/CODAIT/covid-notebooks Contributing to the projects • Star and fork, submit bug reports, suggest improvements, help with code reviews, join our community meetings ibm.biz/elyra-demo gitter.im/elyra-ai/community DEG / Nov 18, 2020 / © 2020 IBM Corporation 20
  • 21. Thank you codait.org twitter.com/codait_org github.com/CODAIT developer.ibm.com 21DEG / Nov 18, 2020 / © 2020 IBM Corporation Check out the Data Asset Exchange https://ibm.biz/data-exchange Sign up for IBM Cloud https://ibm.biz/Bdqkfg
  • 22. DEG / Nov 18, 2020 / © 2020 IBM Corporation 22
  • 23. Feedback Your feedback is important to us. Don’t forget to rate and review the sessions.