Cloud Native Night July 2019, Munich: Talk by Emil A. Siemes (@mesosphere, Principal Solution Engineer at Mesosphere)
=== Please download slides if blurred! ===
Abstract: Tired of managing infrastructure instead of creating exiting ml models? Learn what DC/OS can do for the data scientist.
Join us next time: https://www.meetup.com/Cloud-Native-muc/events
3. Transport
Message Queue
Store
Distributed Database
Analyze
AI / ML + Real-time
Serve
Container
Orchestration
Mesosphere Accelerates Innovation for New Digital Services
On Hybrid Cloud Infrastructures
Infrastructure
Datacenter &
Edge Clouds
AWS EC2 Microsoft Azure Google Cloud
• “As-a-service” Automation
IT adopts any platform technology quickly,
securely & efficiently
• Unified Multi-Cloud Operations
Consistent & resilient services across
cloud, datacenter & edge
3
4. PHYSICAL INFRASTRUCTURE
MICROSERVICES, CONTAINERS, &DEV TOOLS
VIRTUAL MACHINES PUBLIC CLOUDS
DATA SERVICES, MACHINE LEARNING, &AI
Security &
Compliance
Application-Aware
Automation
Multitenancy
Hybrid Cloud
Management
100+
MORE
DatacenterEdge
Datacenter and Cloud as a Single Computing Resource
Powered by Apache Mesos
20+
MORE
Unified multi-cloud operations
Securely manage cloud, datacenter, and edge
infrastructures from a single control plane
4
Mesosphere DC/OS Delivers “As-a-Service” Automation
to Any Application Technology on Any Infrastructure
Intelligent resource pooling
Optimize workload density for highest utilization with
resource guarantees
3
Broad workload coverage
Run today & tomorrow’s applications including traditional
J2EE, containers, analytics & ML
1
Application-aware automation
Automate workload-specific operating procedures to “as-a-
Service” anything from Kubernetes to data services
2
5. What you want to be doing
9
Data
(clean)
Write awesome ML code
Train
(once)
Deploy
(not me)
6. 10
Sculley, D., Holt, G., Golovin, D. et al. Hidden Technical Debt in Machine Learning Systems
What you’re actually doing
7. Open Source Pipeline
Operationalizing a Machine
Learning model can be super hard.
It is a stage where most enterprise
Machine Learning projects fail. I
cannot tell you how many
companies I've talked to, who have
said their innovation teams had
devised these cool ML projects, but
they were struggling getting the ML
models into production. In this set
of courses, we will talk about how
to train, deploy, and predict with ML
models in a way that their
production ready. And finally, we
delve back into Machine Learning
theory.
Valliappa Lakshmanan.
Tech Lead for Big Data and Machine
Learning Professional Services on Google
Cloud Platform.
8. 1. Model Engineering 2. Model Training 3. Monitoring 4. Debugging 5. Model Serving
9. 1. Data Preparation using
Spark
7. Streaming of requests
...
Public Cloud Pipeline
Model Engineering 2. Model Training 3. Monitoring 4. Debugging 5. Model Serving
10. 1. Data Preparation using
Spark
7. Kafka stream of
requests
DIY Open Source Pipeline
1. Model
Engineering
2. Model Training 3. Monitoring 4. Debugging 5. Model Serving
11. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard
12. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard
15. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard
18. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard
23. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard
25. Data Science Pipeline on DC/OS
Continuous Integration
Monitoring & Operations
Distributed Data
Storage and
Streaming
Data Preparation
and Analysis
Storage of trained
Models and
Metadata
Use trained Model
for Inference
Distributed
Training using
Machine Learning
Frameworks
Data & Streaming
Model
Engineering
Model
Management
Model Serving
Model
Training
Management
Tensorboard