SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
Developing and deploying AI solutions on
the cloud using Team Data Science Process
(TDSP) and Azure Machine Learning (AML)
Global Artificial
Intelligence Conference
January 18, 2018, Santa Clara, CA
Deck available @: https://aka.ms/tdsp-presentations
Agenda
Process
Tools (algos,
IDEs,
workbench)
Platform
(compute,
storage, ...)
Process, tools and
platforms for AI solutions
Agile and iterative process to develop,
deploy and manage AI applications in
the cloud
https://aka.ms/tdsp
tdsp-feedback@microsoft.com
Business
Understanding
Data
acquisition &
understanding
Modeling
(ML/AI)
Deployment
The opportunity and challenge of data science in
enterprises
Opportunity: 17% had a well-developed Predictive/Prescriptive Analytics
program in place, while 80% planned on implementing such a program within
five years – Dataversity 2015 Survey
Challenge: Only 27% of the big data projects are regarded as successful –
CapGenimi 2014
Tools & data platforms have matured -
Still a major gap in executing on the potential
One reason: Process challenge in Data Science
Organization
Collaboration
Quality
Knowledge Accumulation
Agility
Global Teams
• Geographic Locations
Team Growth
• Onboard New
Members Rapidly
Varied Use Cases
• Industries and Use
Cases
Diverse DS
Backgrounds
• DS have diverse
backgrounds,
experiences with
tools, languages
Why is a process useful for Data Science?
A process is a detailed sequence of activities necessary to
perform specific business tasks
It is used to standardize procedures and establish best practices
Technology and tools are changing rapidly. A standardized
process can provide continuity and stability of work-flow.
- Based on discussions with Luis Morinigo, Dir. IoT, NewSignature
Data Science can borrow processes from DevOps
TDSP objective
Integrate DevOps with data science workflows to improve
collaboration, quality, robustness and efficiency in data science projects
o Infrastructure as Code (IaC)
o Build
o Test
o CI / CD
o Release management
o Performance monitoring
TDSP features for data science teams
Standardized Data Science Lifecycle
Project Structure, Templates & Roles
Infrastructure & Toolkits
Re-usable Data Science Utilities
Project Execution (incl. DevOps components)
Data Science lifecycle
 Primary stages:
Lifecycle
TDSP lifecycle stages can be integrated with specific
deliverables & checkpoints
Business
Understanding
• Project Objective
• Data, Target &
Feature Definition
• Data Dictionary
Data acquisition and
understanding
Modeling Deployment
Lifecycle
Project roles & tasks
Personas: account managers, PMs, DSs, SWEs, architects
 Governance and Project
Management
 AI Developers
Structure,templates,roles
NOTE: Specific roles depend on organizations
Shared and distributed infrastructure & toolkits
Infrastructure&toolkits
Agile work planning and execution template
o Use Agile work planning & execution
template (data science specific)
https://commons.wikimedia.org
Scrum
framework
Projectexecutionguidelines
Collaborative development guidelines
Remote
Local LocalLocal Local
TDSP git Template
Integrated Agile planning &
code development
Projectexecutionguidelines
Integrating DevOps in projects
Projectexecutionguidelines
Re-usable data science utilities: Analytics
Interactive data exploration and reporting – IDEAR (Python, R, MRS)
o Data quality assessment
o Getting business insights from the data
o Association between variables
o Generating standardized data quality reports automatically
Clustering Distribution assessment
https://github.com/Azure/Azure-TDSP-Utilities
DSutilities
Re-usable data science utilities: Analytics - modeling
Automated modeling and reporting AMAR (R)
Predicted vs. Actual (multiple algorithms) Feature Importance (multiple algorithms)
https://github.com/Azure/Azure-TDSP-Utilities
DSutilities
TDSP documentation: https://aka.ms/tdsp
TDSP trainings
20
TDSP in action: E2E worked-out samples
 Azure Machine Learning
Adoption: How to stage (if needed)
Level1
- One git repository
per project
- Standard directory
structure
- Standardized
templates like
charter, exit reports
- Planning and
tracking of work
items
Level2 - Customize
templates to fit
team needs
- Create shared
team utility repo
(like IDEAR, AMAR)
Leve3
- Develop process
to graduate code
from projects to the
shared team utility
repo
- Develop E2E
worked-out
templates
- Use mature work
planning and
tracking system
(e.g. Agile)
Level4
- Link git branch
with work items
- Code review
- Manage and
version model and
data assets
- Develop
automated testing
framework
- Develop
automated CI/CD
Adoption: Customers
 Microsoft internal
 Microsoft consulting services (MCS)
 AI & R Cloud Platform: Algorithm and
data sciences team
 Windows Devices DS team
 …
 …
• External partners
• New Signature
• BlueGranite
https://docs.microsoft.com/azure/machine-learning/preview
Apps + insightsSocial
LOB
Graph
IoT
Image
CRM INGEST STORE PREP &
TRAIN
MODEL &
SERVE
Data orchestration
and monitoring
Data lake
and storage
Hadoop/Spark/SQL
and ML
Azure Machine Learning
IoT
Azure Machine Learning
25
Bring AI everywhere
Build with the tools and
platforms you know
Build, deploy, and
manage at scale
Boost productivity with
agile development
Benefit from the fastest AI developer cloud
26
Build, deploy,
and manage at scale
 Build and deploy everywhere – cloud,
on-premises, edge, and in-data
 Deploy in minutes with data driven
management and retraining of all your
models
 Prototype locally then scale up and out
with VMs, Spark clusters, and GPUs
 Real time, high through-put insights
everywhere, including Excel integration
27
Deployment and management of models as HTTP
services
Container-based hosting of real time and batch
processing
Management and monitoring through Azure
(e.g., AppInsights)
First class support for SparkML, Python, CNTK, TF, TLC, R,
extensible to support others (Caffe, MXnet)
Service authoring in Python and .NET Core
Manage models
28
DOCKER
Single node deployment
(cloud/on-prem)
Azure Container
Service
Azure IoT
Edge
Spark
clusters
Deploy everywhere
29
Boost productivity
with agile development
collaboration and sharing with
notebooks and Git
version control and
reproducibility
metrics, lineage, run history, asset
management
30
Build with tools and
platforms you know
 Choose between visual drag-and-drop or
code-first authoring
your favorite IDEs
any framework or library with the
most popular languages
 Train quicker and easier with industry-leading
Spark and GPUs
31
Use your favorite IDE
Leverage all types of platforms and
tools/libraries
Use what you want
U S E T H E M O S T P O P U L A R I N N O VAT I O N S
U S E A N Y TO O L
U S E A N Y F R A M E W O R K O R L I B R A RY
32
Using TDSP with Azure Machine Learning
TDSP – AML worked-out samples in AI
35
https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/walkthroughs
Sample: Sentiment-specific word embeddings (SSWE)
improves classification of sentiments
https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/predict-twitter-sentiment
http://www.aclweb.org/anthology/P14-1146
Summary
 A well-defined data science process is critical to bridge the gap between
opportunities and challenges, and put AI models in production to impact businesses
 A combination of appropriate process (e.g. TDSP), tools (e.g. AML) and data-
platforms (e.g. DSVM) is important for efficient development and deployment of AI
solutions
 TDSP – process
 AML – tool
 A process such as TDSP can be used with a variety of tools and data platforms
 Resources are available to use TDSP and AML together, along with other data
platforms on Azure to efficiently build and deploy AI solutions on the cloud
https://aka.ms/tdsp
tdsp-feedback@microsoft.com
Deck available @: https://aka.ms/tdsp-presentations

Weitere ähnliche Inhalte

Was ist angesagt?

TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...Seldon
 
ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 RecapSri Ambati
 
Machine Learning Project Lifecycle
Machine Learning Project LifecycleMachine Learning Project Lifecycle
Machine Learning Project LifecycleAbdelhak MAHMOUDI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
Revolutionizing Enterprise Software Development through Continuous Delivery &...
Revolutionizing Enterprise Software Development through Continuous Delivery &...Revolutionizing Enterprise Software Development through Continuous Delivery &...
Revolutionizing Enterprise Software Development through Continuous Delivery &...People10 Technosoft Private Limited
 
CD4ML and the challenges of testing and quality in ML systems
CD4ML and the challenges of testing and quality in ML systemsCD4ML and the challenges of testing and quality in ML systems
CD4ML and the challenges of testing and quality in ML systemsSeldon
 
Data Management - Full Stack Deep Learning
Data Management - Full Stack Deep LearningData Management - Full Stack Deep Learning
Data Management - Full Stack Deep LearningSergey Karayev
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOpsCarl W. Handlin
 
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...Neo4j
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataWeCloudData
 
Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?Itai Yaffe
 
Scaling & Managing Production Deployments with H2O ModelOps
Scaling & Managing Production Deployments with H2O ModelOpsScaling & Managing Production Deployments with H2O ModelOps
Scaling & Managing Production Deployments with H2O ModelOpsSri Ambati
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?Neo4j
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab CreateTuri, Inc.
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowLviv Startup Club
 
Intuit - Machine learning platform lifecycle management 2018
Intuit - Machine learning platform lifecycle management  2018Intuit - Machine learning platform lifecycle management  2018
Intuit - Machine learning platform lifecycle management 2018Karthik Murugesan
 

Was ist angesagt? (20)

Monitoring AI with AI
Monitoring AI with AIMonitoring AI with AI
Monitoring AI with AI
 
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
 
ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 Recap
 
Machine Learning Project Lifecycle
Machine Learning Project LifecycleMachine Learning Project Lifecycle
Machine Learning Project Lifecycle
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Revolutionizing Enterprise Software Development through Continuous Delivery &...
Revolutionizing Enterprise Software Development through Continuous Delivery &...Revolutionizing Enterprise Software Development through Continuous Delivery &...
Revolutionizing Enterprise Software Development through Continuous Delivery &...
 
CD4ML and the challenges of testing and quality in ML systems
CD4ML and the challenges of testing and quality in ML systemsCD4ML and the challenges of testing and quality in ML systems
CD4ML and the challenges of testing and quality in ML systems
 
Data Management - Full Stack Deep Learning
Data Management - Full Stack Deep LearningData Management - Full Stack Deep Learning
Data Management - Full Stack Deep Learning
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOps
 
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...
GraphTour 2020 - BT: Use of Graph Database in P2P / P2MP Connectivity for Vid...
 
Resume (1)
Resume (1)Resume (1)
Resume (1)
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
 
Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?
 
Scaling & Managing Production Deployments with H2O ModelOps
Scaling & Managing Production Deployments with H2O ModelOpsScaling & Managing Production Deployments with H2O ModelOps
Scaling & Managing Production Deployments with H2O ModelOps
 
Architecting for Data Science
Architecting for Data ScienceArchitecting for Data Science
Architecting for Data Science
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab Create
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with Kubeflow
 
Intuit - Machine learning platform lifecycle management 2018
Intuit - Machine learning platform lifecycle management  2018Intuit - Machine learning platform lifecycle management  2018
Intuit - Machine learning platform lifecycle management 2018
 

Ähnlich wie Developing and deploying AI solutions on the cloud using Team Data Science Process (TDSP) and Azure Machine Learning (AML), Presented @ GAIC Jan 18 2018, Santa Clara

Developing and deploying NLP services on the cloud using Azure ML and the Tea...
Developing and deploying NLP services on the cloud using Azure ML and the Tea...Developing and deploying NLP services on the cloud using Azure ML and the Tea...
Developing and deploying NLP services on the cloud using Azure ML and the Tea...Debraj GuhaThakurta
 
Lviv Data Science Club (Sergiy Lunyakin)
Lviv Data Science Club (Sergiy Lunyakin)Lviv Data Science Club (Sergiy Lunyakin)
Lviv Data Science Club (Sergiy Lunyakin)Lviv Startup Club
 
How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists CCG
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureMark Tabladillo
 
Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Mark Tabladillo
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02BIWUG
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleatSistemas
 
The Eco-System of AI and How to Use It
The Eco-System of AI and How to Use ItThe Eco-System of AI and How to Use It
The Eco-System of AI and How to Use Itinside-BigData.com
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in ProductionDataWorks Summit
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceDatabricks
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Ali Alkan
 
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData Inc.
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big DataDataWorks Summit
 
Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Andy Lathrop
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLJordan Birdsell
 
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXT
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXTDriving Enterprise Adoption: Tragedies, Triumphs and Our NEXT
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXTDataWorks Summit
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategyJames Serra
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceeRic Choo
 

Ähnlich wie Developing and deploying AI solutions on the cloud using Team Data Science Process (TDSP) and Azure Machine Learning (AML), Presented @ GAIC Jan 18 2018, Santa Clara (20)

Developing and deploying NLP services on the cloud using Azure ML and the Tea...
Developing and deploying NLP services on the cloud using Azure ML and the Tea...Developing and deploying NLP services on the cloud using Azure ML and the Tea...
Developing and deploying NLP services on the cloud using Azure ML and the Tea...
 
Lviv Data Science Club (Sergiy Lunyakin)
Lviv Data Science Club (Sergiy Lunyakin)Lviv Data Science Club (Sergiy Lunyakin)
Lviv Data Science Club (Sergiy Lunyakin)
 
How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft Azure
 
Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
 
The Eco-System of AI and How to Use It
The Eco-System of AI and How to Use ItThe Eco-System of AI and How to Use It
The Eco-System of AI and How to Use It
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field Experience
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
 
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXT
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXTDriving Enterprise Adoption: Tragedies, Triumphs and Our NEXT
Driving Enterprise Adoption: Tragedies, Triumphs and Our NEXT
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 

Kürzlich hochgeladen

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...amitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 

Kürzlich hochgeladen (20)

Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 

Developing and deploying AI solutions on the cloud using Team Data Science Process (TDSP) and Azure Machine Learning (AML), Presented @ GAIC Jan 18 2018, Santa Clara

  • 1. Developing and deploying AI solutions on the cloud using Team Data Science Process (TDSP) and Azure Machine Learning (AML) Global Artificial Intelligence Conference January 18, 2018, Santa Clara, CA Deck available @: https://aka.ms/tdsp-presentations
  • 3. Agile and iterative process to develop, deploy and manage AI applications in the cloud https://aka.ms/tdsp tdsp-feedback@microsoft.com Business Understanding Data acquisition & understanding Modeling (ML/AI) Deployment
  • 4. The opportunity and challenge of data science in enterprises Opportunity: 17% had a well-developed Predictive/Prescriptive Analytics program in place, while 80% planned on implementing such a program within five years – Dataversity 2015 Survey Challenge: Only 27% of the big data projects are regarded as successful – CapGenimi 2014 Tools & data platforms have matured - Still a major gap in executing on the potential
  • 5. One reason: Process challenge in Data Science Organization Collaboration Quality Knowledge Accumulation Agility Global Teams • Geographic Locations Team Growth • Onboard New Members Rapidly Varied Use Cases • Industries and Use Cases Diverse DS Backgrounds • DS have diverse backgrounds, experiences with tools, languages
  • 6. Why is a process useful for Data Science? A process is a detailed sequence of activities necessary to perform specific business tasks It is used to standardize procedures and establish best practices Technology and tools are changing rapidly. A standardized process can provide continuity and stability of work-flow. - Based on discussions with Luis Morinigo, Dir. IoT, NewSignature
  • 7. Data Science can borrow processes from DevOps
  • 8. TDSP objective Integrate DevOps with data science workflows to improve collaboration, quality, robustness and efficiency in data science projects o Infrastructure as Code (IaC) o Build o Test o CI / CD o Release management o Performance monitoring
  • 9. TDSP features for data science teams Standardized Data Science Lifecycle Project Structure, Templates & Roles Infrastructure & Toolkits Re-usable Data Science Utilities Project Execution (incl. DevOps components)
  • 10. Data Science lifecycle  Primary stages: Lifecycle
  • 11. TDSP lifecycle stages can be integrated with specific deliverables & checkpoints Business Understanding • Project Objective • Data, Target & Feature Definition • Data Dictionary Data acquisition and understanding Modeling Deployment Lifecycle
  • 12. Project roles & tasks Personas: account managers, PMs, DSs, SWEs, architects  Governance and Project Management  AI Developers Structure,templates,roles NOTE: Specific roles depend on organizations
  • 13. Shared and distributed infrastructure & toolkits Infrastructure&toolkits
  • 14. Agile work planning and execution template o Use Agile work planning & execution template (data science specific) https://commons.wikimedia.org Scrum framework Projectexecutionguidelines
  • 15. Collaborative development guidelines Remote Local LocalLocal Local TDSP git Template Integrated Agile planning & code development Projectexecutionguidelines
  • 16. Integrating DevOps in projects Projectexecutionguidelines
  • 17. Re-usable data science utilities: Analytics Interactive data exploration and reporting – IDEAR (Python, R, MRS) o Data quality assessment o Getting business insights from the data o Association between variables o Generating standardized data quality reports automatically Clustering Distribution assessment https://github.com/Azure/Azure-TDSP-Utilities DSutilities
  • 18. Re-usable data science utilities: Analytics - modeling Automated modeling and reporting AMAR (R) Predicted vs. Actual (multiple algorithms) Feature Importance (multiple algorithms) https://github.com/Azure/Azure-TDSP-Utilities DSutilities
  • 21. TDSP in action: E2E worked-out samples  Azure Machine Learning
  • 22. Adoption: How to stage (if needed) Level1 - One git repository per project - Standard directory structure - Standardized templates like charter, exit reports - Planning and tracking of work items Level2 - Customize templates to fit team needs - Create shared team utility repo (like IDEAR, AMAR) Leve3 - Develop process to graduate code from projects to the shared team utility repo - Develop E2E worked-out templates - Use mature work planning and tracking system (e.g. Agile) Level4 - Link git branch with work items - Code review - Manage and version model and data assets - Develop automated testing framework - Develop automated CI/CD
  • 23. Adoption: Customers  Microsoft internal  Microsoft consulting services (MCS)  AI & R Cloud Platform: Algorithm and data sciences team  Windows Devices DS team  …  … • External partners • New Signature • BlueGranite
  • 25. Apps + insightsSocial LOB Graph IoT Image CRM INGEST STORE PREP & TRAIN MODEL & SERVE Data orchestration and monitoring Data lake and storage Hadoop/Spark/SQL and ML Azure Machine Learning IoT Azure Machine Learning 25
  • 26. Bring AI everywhere Build with the tools and platforms you know Build, deploy, and manage at scale Boost productivity with agile development Benefit from the fastest AI developer cloud 26
  • 27. Build, deploy, and manage at scale  Build and deploy everywhere – cloud, on-premises, edge, and in-data  Deploy in minutes with data driven management and retraining of all your models  Prototype locally then scale up and out with VMs, Spark clusters, and GPUs  Real time, high through-put insights everywhere, including Excel integration 27
  • 28. Deployment and management of models as HTTP services Container-based hosting of real time and batch processing Management and monitoring through Azure (e.g., AppInsights) First class support for SparkML, Python, CNTK, TF, TLC, R, extensible to support others (Caffe, MXnet) Service authoring in Python and .NET Core Manage models 28
  • 29. DOCKER Single node deployment (cloud/on-prem) Azure Container Service Azure IoT Edge Spark clusters Deploy everywhere 29
  • 30. Boost productivity with agile development collaboration and sharing with notebooks and Git version control and reproducibility metrics, lineage, run history, asset management 30
  • 31. Build with tools and platforms you know  Choose between visual drag-and-drop or code-first authoring your favorite IDEs any framework or library with the most popular languages  Train quicker and easier with industry-leading Spark and GPUs 31
  • 32. Use your favorite IDE Leverage all types of platforms and tools/libraries Use what you want U S E T H E M O S T P O P U L A R I N N O VAT I O N S U S E A N Y TO O L U S E A N Y F R A M E W O R K O R L I B R A RY 32
  • 33.
  • 34. Using TDSP with Azure Machine Learning
  • 35. TDSP – AML worked-out samples in AI 35 https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/walkthroughs
  • 36. Sample: Sentiment-specific word embeddings (SSWE) improves classification of sentiments https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/predict-twitter-sentiment http://www.aclweb.org/anthology/P14-1146
  • 37. Summary  A well-defined data science process is critical to bridge the gap between opportunities and challenges, and put AI models in production to impact businesses  A combination of appropriate process (e.g. TDSP), tools (e.g. AML) and data- platforms (e.g. DSVM) is important for efficient development and deployment of AI solutions  TDSP – process  AML – tool  A process such as TDSP can be used with a variety of tools and data platforms  Resources are available to use TDSP and AML together, along with other data platforms on Azure to efficiently build and deploy AI solutions on the cloud