SlideShare ist ein Scribd-Unternehmen logo
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
End-to-End Data Analytics using Azure Databricks
Chandler Stevens
VP - Microsoft BI & Analytics www.visualbi.com
Jiwon Jeon
Data Scientist
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 2
This presentation outlines our general product direction and should not be relied on in making a purchase decision. This
presentation is not subject to your license agreement or any other agreement with Visual BI Solutions. Visual BI Solutions has
no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality
mentioned in this presentation.
This presentation and Visual BI Solution’s strategy and possible future developments are subject to change and may be
changed by Visual BI Solutions at any time for any reason without notice. This presentation is provided without a warranty of
any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. Visual BI Solutions assumes no responsibility for errors or omissions in this
presentation, except if such damages were caused by Visual BI Solutions intentionally or grossly negligent.
VISUAL BI SOLUTIONS
Legal Disclaimer
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
Lead Data ScientistVP, Microsoft BI and Analytics
Jiwon Jeon
jiwonj@visualbi.com
3
Introducing
Today’s Presenters
Chandler Stevens
chandlers@visualbi.com
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 4
Today’s Agenda
DATA SCIENCE INTRODUCTION1
DATA SCIENCE IN AZURE2
AZURE DATABRICKS & DEMO3
SUMMARY4
Q&A (via IM Chat)5
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 5
ABOUT VISUAL BI
All-in-One Partner for End-to-End BI & Analytics Needs
Visual BI Solutions is a leading All-in-One Business Intelligence (BI) enablement firm specializing in BI &
Analytics services, solutions, trainings and products. We have proven expertise in enabling BI & Analytics
for 100+ world’s leading brands. We can help you achieve competitive advantage by effectively managing
the Plan - Build - Run spectrum for BI.
Trusted by the largest companies world-wide
Trusted by
the industry
Integration and Partnership with
SAP and Microsoft is our forte
CONSULTING SERVICES
• Strategy
• Architecture Implementation
• Training
• Managed Services
• Visualization
• Cloud Migrations
ANALYTICS SOLUTIONS
• Dashboards by LOB
• Advanced Analytics
• Big Data Solutions
TRAINING
• SAP Business Objects Training
§ SAP Lumira Discovery (2 Days)
§ SAP Lumira Designer (3 Days )
§ SAP Web Intelligence (2 Days)
§ SAP Analysis for Office (2 Days)
• SAP Analytics Cloud Training
• Microsoft Power BI Training
SOFTWARE PRODUCTS
• VBI View – One Portal for All BI Content
• Product Extensions for SAP Lumira / SAP Design Studio
§ Visual BI Extensions (VBX Suite)
§ Document Management and Change Control
• Value Driver Tree (VDT) for Planning and Simulations
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 7
Data Science in Azure
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 8
DATA SCIENCE IN AZURE
Need of Data Science
Data becomes BIG, COMPLEX, and stays EVERYWHERE à “DATA SCIENCE” is a need and no longer a want
§ Technology is driving data creation
§ Size of data is a factor
§ Maximizing information is critical
§ Emergence of unstructured data
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
What is Data Science?
9
Study of exploring all available forms of data and employing scientific methods to extract knowledge
and derive insights for actionable decision-making
• A series of actions/studies from acquiring data, processing data, modeling and deploying for
integration
• Exploring structured and unstructured data
• Using scientific methods, algorithms and systems and assisted by data visualization, machine
learning, and big data platforms to obtain knowledge and insights to make actionable decision
To present the
right offer to the
right user based
on the right-time
decision
Business encounters faster
changes
Data-driven decisions need to be
made accurately
Users/Customers require
prompt responses
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
Stochastic
optimization
How can we achieve the best outcome including
variability?
Optimization How can we achieve the best outcome?
10
CompetitiveAdvantage
Degree of complexity
PRESCRIPTIVE
DESCRIPTIVE
PREDICTIVE
Query/Drill down What exactly is the problem?
Ad hoc reporting How many, how often, where?
Standard reporting What happened?
Predictive modeling What will happen next if…?
Forecasting What if these trends continue?
Simulation What could happen?
Alerts What actions are needed?
Credited to: Competing on Analytics, Davenport and Harris, 2007
DATA SCIENCE IN AZURE
Journey to Data Monetization
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Data Science Process Lifecycle – CRISP-DM
11
Cross Industry Standard Process for Data
Mining
§ An open standard process model for common
approaches used by data mining experts.
CRISP-DM consists of the followings:
§ Project/Business understanding: understand
stakeholder motivations
§ Data understanding: finding data sources and
acquiring data
§ Data preparation: cleaning and transforming data
§ Modeling: understanding relationships in data
§ Evaluation: optimizing the model to achieve the
project goal
§ Deployment: delivering value from data
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Data Science Process Lifecycle – MS TDSP
12
Team Data Science Process Lifecycle
§ Originally designed for data science projects
as part of intelligent applications.
§ Employed machine learning or artificial
intelligence models for predictive analytics.
§ To avoid misunderstandings between teams
and customers by using a well-defined set of
artifacts
TDSP consists of the followings:
§ Business understanding
§ Data acquisition and understanding
§ Modeling
§ Deployment
§ Customer acceptance
Credited to: Microsoft Azure Documentation
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Data Science Process Lifecycle – MS TDSP
13
TDSP roles and tasks
Business
Understanding
Data Acquisition &
Understanding
Modeling Deployment Customer Acceptance
Project
Lead
DataScientist
Project
Manager
Solution
Architect
Create Template
Repository
Create
Project
Charter
Project
Charter
Provision Data
Infrastructure
Provision
Compute Assets
Data Ingest
& Explore
Data Summary
Report
Design Solution
Architecture
Solution Architecture
Diagram
Feature
Engineering
Model
Development
Model
Report
Develop Data
Pipeline
Deploy
Scoring
Process
Monitor
health &
metrics
Deploy
Pipeline
Check in
final
Artifacts
Finalize
Documentation
Project Final
Report
Decommission
Compute
Assets
Checkpoint
Project
Transition to
Production Support
Dashboard
Credited to: Microsoft Azure Documentation
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Azure Data Stack
14
INGEST STORE MODEL & SERVEPREP & TRAIN
PolyBase
Azure SQL
Data
Warehouse Azure
Analysis
Services
Azure
Databricks
Azure
Cosmos
DB
Azure
Data
Factory
Azure
Data Lake
Storage
Power
BI
Predictive
Applications
Logs, files
and media
(unstructured
data)
Business/custom
apps
(structured data)
§ Azure Data Factory
§ Azure Import Export
Service
§ Azure Data
Migration Service
§ Azure Event Hub
§ Azure IoT Hub
§ Azure CLI
§ Azure SDK
§ Azure Blob Storage
§ Azure Data Lake
§ Azure SQL DB
§ Azure Data
Warehouse
§ Azure Cosmos DB
§ Azure Databricks
§ Azure HDInsight
§ Azure ML Service
§ Azure ML Studio
§ Azure Data Science VM
§ Azure Cognitive
Services
§ Azure Data Lake
Analytics
§ Azure Bot Services
§ Azure Stream Analytics
§ Power BI
§ IoT Apps
§ Azure Analysis Service
Azure
Machine
Learning
Streaming
data
(unstructured
data)
Azure
Event
Hub
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Data Science Resources in Azure
15
Azure ML Studio Azure ML Service Azure HD Insight Azure Databricks
Whatitis
Drag–and–drop visual workspace
for ML
Managed cloud service for a
variety of open source big data
analytics workloads
Azure implementation of
Hadoop as a managed
service supporting a variety
of open-source analytics
engines
Spark-based analytics platform
Works
Build, test, and deploy models
using pre-built ML algorithms
Train, deploy, and manage ML
models at scale using Python
and CLI
Build, test and deploy ML
models with massive data
Build and deploy models and
data workflows
Features
• Publishes models as web
services for further use
• No programing is required
• Rich tools and packages
• Use external compute
engines including SQL Server
and Spark
• Auto-parameter tuning
• Developer tooling and
monitoring capabilities
• Orchestration via Azure
Data Factory
• Native Integration with Azure
for Security via AAD
• Single engine for Batch,
Streaming, ML and Graph
• Notebook-based
collaborative environment
• Autoscaling
Scenarios
• For quick exploration of data or
ML algorithms
• For testing the
operationalization of model with
least error
For integrated use of difference
resources for ML modeling and
deployment at scale
• When Hadoop
technologies are required
than Spark
• To stay in codebase
environment and/or ‘Lift
and Shift’ from on-prem
deployments
• When Spark and notebook
options are required
• When Auto-scaling is
required
• To build integrated and
performant data pipelines
For Advanced Analytics and Machine Learning solutions using Big Data in the Cloud
Azure Databricks
Spark-based analytics platform
Build and deploy models and
data workflows
• Native Integration with Azure
for Security via AAD
• Single engine for Batch,
Streaming, ML and Graph
• Notebook-based
collaborative environment
• Autoscaling
• When Spark and notebook
options are required
• When Auto-scaling is
required
• To build integrated and
performant data pipelines
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 16
Azure Databricks
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
Deep Learning
AI models using GPU-enabled
clusters with deep learning
frameworks
DATA SCIENCE IN AZURE
What is Azure Databricks?
17
Apache Spark-based unified analytics platform offering the best of Spark with collaborative notebooks and enterprise
features optimized for Azure.
Apache Spark environment
Databricks Runtime and
serverless compute model
Collaborative workspace
Shared notebook for data engineers,
data scientists and business users
One-click setup
Streamlined workflows
Autoscale & Autoterminate
Autoscaling up and down of
clusters & Autoterminating
inactive clusters
Integration w/ Azure services
Integration with Azure data services &
stores by SSO with Azure AD
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Workflow in Azure Databricks
19
LAUNCH
WORKSPACE
Log in to Azure Databricks in the
Azure portal using single sign-on
with Azure AD.
OPEN
CLUSTERS
Create a new cluster, configure
and start it with one click. The
autoscaling feature makes scaling
clusters fast and easy. The
autoterminating feature shuts
down inactive clusters as desired.
Both features help reduce
resources and costs associated
with manual operations.
COLLABORATE ON
NOTEBOOKS
Create custom access settings for data
engineers, data scientists, and
business users for shared projects to
cooperate on the notebooks based on
individual access level.
SCHEDULE
JOBS
Run notebooks as jobs by choosing
from existing streaming or machine
learning libraries. Schedule jobs in
advance to run automatically, and
monitor their performance.
BUILD DATA SCIENCE
MODELS
Build, train, and deploy AI models
at scale using any data languages
among SQL, Python, Scala, and R.
EXPLORE
DATA
Using SQL, Python, Scala, and R in
notebooks to easily mount storage and
collect observations to build machine
learning models. Business users can
see data in easy-to-read live data
displays.
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 20
Demo
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com
DATA SCIENCE IN AZURE
Workflow in Azure Databricks
21
LAUNCH
WORKSPACE
Log in to Azure Databricks in the
Azure portal using single sign-on
with Azure AD.
OPEN
CLUSTERS
Create a new cluster, configure
and start it with one click. The
autoscaling feature makes scaling
clusters fast and easy. The
autoterminating feature shuts
down inactive clusters as desired.
Both features help reduce
resources and costs associated
with manual operations.
COLLABORATE ON
NOTEBOOKS
Create custom access settings for data
engineers, data scientists, and
business users for shared projects to
cooperate on the notebooks based on
individual access level.
SCHEDULE
JOBS
Run notebooks as jobs by choosing
from existing streaming or machine
learning libraries. Schedule jobs in
advance to run automatically, and
monitor their performance.
BUILD DATA SCIENCE
MODELS
Build, train, and deploy AI models
at scale using any data languages
among SQL, Python, Scala, and R.
EXPLORE
DATA
Using SQL, Python, Scala, and R in
notebooks to easily mount storage and
collect observations to build machine
learning models. Business users can
see data in easy-to-read live data
displays.
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 22
2
INTRODUCING VISUAL BI MODERN ANALYTICS
§ Harness data sourced from online networks,
web pages, audio and video devices, social
media, logs and many other sources to uncover
insights and patterns
§ Refine and tune machine learning models to
boost prediction accuracy
§ Deliver big data solutions that can encompass
“lift and shift” and “cloud-native”
implementation models
§ Enable operational agility with enhanced
telemetry management
§ Move beyond understanding “what happened”
to “how can we achieve the best possible
outcome”
Modern
Analytics
Descriptive to
Predictive
Operations
with agility
Multi-stack
Implementatio
n
Boost ML
Accuracy
Harness
Diverse Data
Enable Insight Velocity with Big Data
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 23
§ Modern Data Science project requires handling big data to find
the hidden insights
§ Azure Databricks provides cloud-scale analytics platform with
fast and secured performance
§ Visual BI Solutions can provide end-to-end services for design,
implementation and end user enablement of a data science
solution for you
§ For more information reach us at solutions@visualbi.com
DATA SCIENCE IN AZURE
Summary
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 24
UPCOMING WEBINAR
CLICK HERE TO REGISTER NOW
© 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 25
www.visualbi.com
THANK YOU! gopal
@visua
lbi.com
www.visualbi.com
Chandler Stevens
chandlers@visualbi.com
VP – Microsoft BI & Analytics
Jiwon Jeon
jiwonj@visualbi.com
Data Scientist

Weitere ähnliche Inhalte

Was ist angesagt?

VBI View Your one stop solution to manage multiple BI Platforms
VBI View Your one stop solution to manage multiple BI PlatformsVBI View Your one stop solution to manage multiple BI Platforms
VBI View Your one stop solution to manage multiple BI Platforms
Visual_BI
 
ValQ Data Acquisition Transformation Techniques
ValQ Data Acquisition Transformation TechniquesValQ Data Acquisition Transformation Techniques
ValQ Data Acquisition Transformation Techniques
Visual_BI
 
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
Visual_BI
 
Value driver planning for mining using microsoft power bi webinar
Value driver planning for mining using microsoft power bi   webinarValue driver planning for mining using microsoft power bi   webinar
Value driver planning for mining using microsoft power bi webinar
Visual_BI
 
Why Customers need to upgrade to SAP Lumira 2.2?
Why Customers need to upgrade to SAP Lumira 2.2?Why Customers need to upgrade to SAP Lumira 2.2?
Why Customers need to upgrade to SAP Lumira 2.2?
Visual_BI
 
ValQ- A modern digital planning solution
ValQ- A modern digital planning solutionValQ- A modern digital planning solution
ValQ- A modern digital planning solution
Visual_BI
 
On-the-fly Material Requirement Planning using Microsoft Power BI
On-the-fly Material Requirement Planning using Microsoft Power BIOn-the-fly Material Requirement Planning using Microsoft Power BI
On-the-fly Material Requirement Planning using Microsoft Power BI
Visual_BI
 
Webinar - ValQ for Production Planning and Control
Webinar - ValQ for Production Planning and ControlWebinar - ValQ for Production Planning and Control
Webinar - ValQ for Production Planning and Control
Visual_BI
 
Data governance in a Cloud BI world
Data governance in a Cloud BI worldData governance in a Cloud BI world
Data governance in a Cloud BI world
Visual_BI
 
The Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BIThe Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BI
Waldemar Adams
 
What's New with SAP BusinessObjects Business Intelligence 4.1?
What's New with SAP BusinessObjects Business Intelligence 4.1?What's New with SAP BusinessObjects Business Intelligence 4.1?
What's New with SAP BusinessObjects Business Intelligence 4.1?
SAP Analytics
 
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
Edureka!
 
Qlikview for Beginners
Qlikview for BeginnersQlikview for Beginners
Qlikview for Beginners
Edureka!
 
Cognos demo.
Cognos demo.Cognos demo.
Cognos demo.
Vivek Raja
 
Getting Started with Qlikview
Getting Started with QlikviewGetting Started with Qlikview
Getting Started with Qlikview
Edureka!
 
Oh! Session on Introduction to Qlikview
Oh! Session on Introduction to QlikviewOh! Session on Introduction to Qlikview
Oh! Session on Introduction to Qlikview
Prakalp Agarwal
 
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
Dhiren Gala
 
What’s new in SAP BusinessObject BI 4.1? (part1)
What’s new in SAP BusinessObject BI 4.1? (part1)What’s new in SAP BusinessObject BI 4.1? (part1)
What’s new in SAP BusinessObject BI 4.1? (part1)
tasmc
 
IBM Cognos Analytics - Cognos Business Intelligence version 11
IBM Cognos Analytics - Cognos Business Intelligence version 11IBM Cognos Analytics - Cognos Business Intelligence version 11
IBM Cognos Analytics - Cognos Business Intelligence version 11
Cresco International
 
What makes QlikView unique
What makes QlikView unique  What makes QlikView unique
What makes QlikView unique
QlikView-India
 

Was ist angesagt? (20)

VBI View Your one stop solution to manage multiple BI Platforms
VBI View Your one stop solution to manage multiple BI PlatformsVBI View Your one stop solution to manage multiple BI Platforms
VBI View Your one stop solution to manage multiple BI Platforms
 
ValQ Data Acquisition Transformation Techniques
ValQ Data Acquisition Transformation TechniquesValQ Data Acquisition Transformation Techniques
ValQ Data Acquisition Transformation Techniques
 
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
xViz Advanced Custom Visuals for Microsoft Power BI - What's New?
 
Value driver planning for mining using microsoft power bi webinar
Value driver planning for mining using microsoft power bi   webinarValue driver planning for mining using microsoft power bi   webinar
Value driver planning for mining using microsoft power bi webinar
 
Why Customers need to upgrade to SAP Lumira 2.2?
Why Customers need to upgrade to SAP Lumira 2.2?Why Customers need to upgrade to SAP Lumira 2.2?
Why Customers need to upgrade to SAP Lumira 2.2?
 
ValQ- A modern digital planning solution
ValQ- A modern digital planning solutionValQ- A modern digital planning solution
ValQ- A modern digital planning solution
 
On-the-fly Material Requirement Planning using Microsoft Power BI
On-the-fly Material Requirement Planning using Microsoft Power BIOn-the-fly Material Requirement Planning using Microsoft Power BI
On-the-fly Material Requirement Planning using Microsoft Power BI
 
Webinar - ValQ for Production Planning and Control
Webinar - ValQ for Production Planning and ControlWebinar - ValQ for Production Planning and Control
Webinar - ValQ for Production Planning and Control
 
Data governance in a Cloud BI world
Data governance in a Cloud BI worldData governance in a Cloud BI world
Data governance in a Cloud BI world
 
The Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BIThe Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BI
 
What's New with SAP BusinessObjects Business Intelligence 4.1?
What's New with SAP BusinessObjects Business Intelligence 4.1?What's New with SAP BusinessObjects Business Intelligence 4.1?
What's New with SAP BusinessObjects Business Intelligence 4.1?
 
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
QlikView Tutorial For Beginners | What Is QlikView | Qlikview Tutorial | Qlik...
 
Qlikview for Beginners
Qlikview for BeginnersQlikview for Beginners
Qlikview for Beginners
 
Cognos demo.
Cognos demo.Cognos demo.
Cognos demo.
 
Getting Started with Qlikview
Getting Started with QlikviewGetting Started with Qlikview
Getting Started with Qlikview
 
Oh! Session on Introduction to Qlikview
Oh! Session on Introduction to QlikviewOh! Session on Introduction to Qlikview
Oh! Session on Introduction to Qlikview
 
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
Perspective on SAP Acquisition Of Business Objects on MAIA Business Intellige...
 
What’s new in SAP BusinessObject BI 4.1? (part1)
What’s new in SAP BusinessObject BI 4.1? (part1)What’s new in SAP BusinessObject BI 4.1? (part1)
What’s new in SAP BusinessObject BI 4.1? (part1)
 
IBM Cognos Analytics - Cognos Business Intelligence version 11
IBM Cognos Analytics - Cognos Business Intelligence version 11IBM Cognos Analytics - Cognos Business Intelligence version 11
IBM Cognos Analytics - Cognos Business Intelligence version 11
 
What makes QlikView unique
What makes QlikView unique  What makes QlikView unique
What makes QlikView unique
 

Ähnlich wie Data science in Azure

Azure advanced analytics for SAP customers
Azure advanced analytics for SAP customersAzure advanced analytics for SAP customers
Azure advanced analytics for SAP customers
Visual_BI
 
Learn why Microsoft Power BI is an Undisputed Market Leader?
Learn why Microsoft Power BI is an Undisputed Market Leader?Learn why Microsoft Power BI is an Undisputed Market Leader?
Learn why Microsoft Power BI is an Undisputed Market Leader?
Visual_BI
 
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Visual_BI
 
A deep dive session on Tableau
A deep dive session on TableauA deep dive session on Tableau
A deep dive session on Tableau
Visual_BI
 
Modern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and ImplementationsModern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and Implementations
David J Rosenthal
 
Power BI overview.pptx
Power BI overview.pptxPower BI overview.pptx
Power BI overview.pptx
HungPham381
 
Decoding SAP's BI Analytics SAP Statement of Direction
Decoding SAP's BI Analytics SAP Statement of Direction Decoding SAP's BI Analytics SAP Statement of Direction
Decoding SAP's BI Analytics SAP Statement of Direction
Visual_BI
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with Virtualization
Inside Analysis
 
Cloud Reporting With Oracle BICS – Project Costing And Fixed Assets
Cloud Reporting With Oracle BICS – Project Costing And Fixed AssetsCloud Reporting With Oracle BICS – Project Costing And Fixed Assets
Cloud Reporting With Oracle BICS – Project Costing And Fixed Assets
Jade Global
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
Nicolas Georgeault
 
Introduction To SQL Server 2014
Introduction To SQL Server 2014Introduction To SQL Server 2014
Introduction To SQL Server 2014
Vishal Pawar
 
powerBI_theguy.ppt
powerBI_theguy.pptpowerBI_theguy.ppt
powerBI_theguy.ppt
ssuser65fa31
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
James Serra
 
LEN - BIBO Overview v1 .pptx
LEN - BIBO Overview v1 .pptxLEN - BIBO Overview v1 .pptx
LEN - BIBO Overview v1 .pptx
ArsyanSyahir2
 
Modern Analytics with Microsoft PowerBI
Modern Analytics with Microsoft PowerBIModern Analytics with Microsoft PowerBI
Modern Analytics with Microsoft PowerBI
David J Rosenthal
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
Inside Analysis
 
SAP BI Roadmap
SAP BI RoadmapSAP BI Roadmap
SAP BI Roadmap
JC Raveneau
 
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics CloudHow To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
Wiiisdom
 
Webinar: SAP BW Dinosaur to Agile Analytics Powerhouse
Webinar: SAP BW Dinosaur to Agile Analytics PowerhouseWebinar: SAP BW Dinosaur to Agile Analytics Powerhouse
Webinar: SAP BW Dinosaur to Agile Analytics Powerhouse
Agilexi
 

Ähnlich wie Data science in Azure (20)

Azure advanced analytics for SAP customers
Azure advanced analytics for SAP customersAzure advanced analytics for SAP customers
Azure advanced analytics for SAP customers
 
Learn why Microsoft Power BI is an Undisputed Market Leader?
Learn why Microsoft Power BI is an Undisputed Market Leader?Learn why Microsoft Power BI is an Undisputed Market Leader?
Learn why Microsoft Power BI is an Undisputed Market Leader?
 
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!
 
A deep dive session on Tableau
A deep dive session on TableauA deep dive session on Tableau
A deep dive session on Tableau
 
Modern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and ImplementationsModern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and Implementations
 
Power BI overview.pptx
Power BI overview.pptxPower BI overview.pptx
Power BI overview.pptx
 
Decoding SAP's BI Analytics SAP Statement of Direction
Decoding SAP's BI Analytics SAP Statement of Direction Decoding SAP's BI Analytics SAP Statement of Direction
Decoding SAP's BI Analytics SAP Statement of Direction
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with Virtualization
 
Cloud Reporting With Oracle BICS – Project Costing And Fixed Assets
Cloud Reporting With Oracle BICS – Project Costing And Fixed AssetsCloud Reporting With Oracle BICS – Project Costing And Fixed Assets
Cloud Reporting With Oracle BICS – Project Costing And Fixed Assets
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
 
Introduction To SQL Server 2014
Introduction To SQL Server 2014Introduction To SQL Server 2014
Introduction To SQL Server 2014
 
powerBI_theguy.ppt
powerBI_theguy.pptpowerBI_theguy.ppt
powerBI_theguy.ppt
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
LEN - BIBO Overview v1 .pptx
LEN - BIBO Overview v1 .pptxLEN - BIBO Overview v1 .pptx
LEN - BIBO Overview v1 .pptx
 
Modern Analytics with Microsoft PowerBI
Modern Analytics with Microsoft PowerBIModern Analytics with Microsoft PowerBI
Modern Analytics with Microsoft PowerBI
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
 
SAP BI Roadmap
SAP BI RoadmapSAP BI Roadmap
SAP BI Roadmap
 
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics CloudHow To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
How To Convert Your SAP BusinessObjects Unused Licenses To SAP Analytics Cloud
 
Webinar: SAP BW Dinosaur to Agile Analytics Powerhouse
Webinar: SAP BW Dinosaur to Agile Analytics PowerhouseWebinar: SAP BW Dinosaur to Agile Analytics Powerhouse
Webinar: SAP BW Dinosaur to Agile Analytics Powerhouse
 

Kürzlich hochgeladen

原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
SaffaIbrahim1
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 

Kürzlich hochgeladen (20)

原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 

Data science in Azure

  • 1. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE End-to-End Data Analytics using Azure Databricks Chandler Stevens VP - Microsoft BI & Analytics www.visualbi.com Jiwon Jeon Data Scientist
  • 2. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 2 This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with Visual BI Solutions. Visual BI Solutions has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and Visual BI Solution’s strategy and possible future developments are subject to change and may be changed by Visual BI Solutions at any time for any reason without notice. This presentation is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. Visual BI Solutions assumes no responsibility for errors or omissions in this presentation, except if such damages were caused by Visual BI Solutions intentionally or grossly negligent. VISUAL BI SOLUTIONS Legal Disclaimer
  • 3. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com Lead Data ScientistVP, Microsoft BI and Analytics Jiwon Jeon jiwonj@visualbi.com 3 Introducing Today’s Presenters Chandler Stevens chandlers@visualbi.com
  • 4. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 4 Today’s Agenda DATA SCIENCE INTRODUCTION1 DATA SCIENCE IN AZURE2 AZURE DATABRICKS & DEMO3 SUMMARY4 Q&A (via IM Chat)5
  • 5. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 5 ABOUT VISUAL BI All-in-One Partner for End-to-End BI & Analytics Needs Visual BI Solutions is a leading All-in-One Business Intelligence (BI) enablement firm specializing in BI & Analytics services, solutions, trainings and products. We have proven expertise in enabling BI & Analytics for 100+ world’s leading brands. We can help you achieve competitive advantage by effectively managing the Plan - Build - Run spectrum for BI. Trusted by the largest companies world-wide Trusted by the industry Integration and Partnership with SAP and Microsoft is our forte CONSULTING SERVICES • Strategy • Architecture Implementation • Training • Managed Services • Visualization • Cloud Migrations ANALYTICS SOLUTIONS • Dashboards by LOB • Advanced Analytics • Big Data Solutions TRAINING • SAP Business Objects Training § SAP Lumira Discovery (2 Days) § SAP Lumira Designer (3 Days ) § SAP Web Intelligence (2 Days) § SAP Analysis for Office (2 Days) • SAP Analytics Cloud Training • Microsoft Power BI Training SOFTWARE PRODUCTS • VBI View – One Portal for All BI Content • Product Extensions for SAP Lumira / SAP Design Studio § Visual BI Extensions (VBX Suite) § Document Management and Change Control • Value Driver Tree (VDT) for Planning and Simulations
  • 6. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 7 Data Science in Azure
  • 7. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 8 DATA SCIENCE IN AZURE Need of Data Science Data becomes BIG, COMPLEX, and stays EVERYWHERE à “DATA SCIENCE” is a need and no longer a want § Technology is driving data creation § Size of data is a factor § Maximizing information is critical § Emergence of unstructured data
  • 8. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE What is Data Science? 9 Study of exploring all available forms of data and employing scientific methods to extract knowledge and derive insights for actionable decision-making • A series of actions/studies from acquiring data, processing data, modeling and deploying for integration • Exploring structured and unstructured data • Using scientific methods, algorithms and systems and assisted by data visualization, machine learning, and big data platforms to obtain knowledge and insights to make actionable decision To present the right offer to the right user based on the right-time decision Business encounters faster changes Data-driven decisions need to be made accurately Users/Customers require prompt responses
  • 9. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com Stochastic optimization How can we achieve the best outcome including variability? Optimization How can we achieve the best outcome? 10 CompetitiveAdvantage Degree of complexity PRESCRIPTIVE DESCRIPTIVE PREDICTIVE Query/Drill down What exactly is the problem? Ad hoc reporting How many, how often, where? Standard reporting What happened? Predictive modeling What will happen next if…? Forecasting What if these trends continue? Simulation What could happen? Alerts What actions are needed? Credited to: Competing on Analytics, Davenport and Harris, 2007 DATA SCIENCE IN AZURE Journey to Data Monetization
  • 10. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Data Science Process Lifecycle – CRISP-DM 11 Cross Industry Standard Process for Data Mining § An open standard process model for common approaches used by data mining experts. CRISP-DM consists of the followings: § Project/Business understanding: understand stakeholder motivations § Data understanding: finding data sources and acquiring data § Data preparation: cleaning and transforming data § Modeling: understanding relationships in data § Evaluation: optimizing the model to achieve the project goal § Deployment: delivering value from data
  • 11. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Data Science Process Lifecycle – MS TDSP 12 Team Data Science Process Lifecycle § Originally designed for data science projects as part of intelligent applications. § Employed machine learning or artificial intelligence models for predictive analytics. § To avoid misunderstandings between teams and customers by using a well-defined set of artifacts TDSP consists of the followings: § Business understanding § Data acquisition and understanding § Modeling § Deployment § Customer acceptance Credited to: Microsoft Azure Documentation
  • 12. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Data Science Process Lifecycle – MS TDSP 13 TDSP roles and tasks Business Understanding Data Acquisition & Understanding Modeling Deployment Customer Acceptance Project Lead DataScientist Project Manager Solution Architect Create Template Repository Create Project Charter Project Charter Provision Data Infrastructure Provision Compute Assets Data Ingest & Explore Data Summary Report Design Solution Architecture Solution Architecture Diagram Feature Engineering Model Development Model Report Develop Data Pipeline Deploy Scoring Process Monitor health & metrics Deploy Pipeline Check in final Artifacts Finalize Documentation Project Final Report Decommission Compute Assets Checkpoint Project Transition to Production Support Dashboard Credited to: Microsoft Azure Documentation
  • 13. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Azure Data Stack 14 INGEST STORE MODEL & SERVEPREP & TRAIN PolyBase Azure SQL Data Warehouse Azure Analysis Services Azure Databricks Azure Cosmos DB Azure Data Factory Azure Data Lake Storage Power BI Predictive Applications Logs, files and media (unstructured data) Business/custom apps (structured data) § Azure Data Factory § Azure Import Export Service § Azure Data Migration Service § Azure Event Hub § Azure IoT Hub § Azure CLI § Azure SDK § Azure Blob Storage § Azure Data Lake § Azure SQL DB § Azure Data Warehouse § Azure Cosmos DB § Azure Databricks § Azure HDInsight § Azure ML Service § Azure ML Studio § Azure Data Science VM § Azure Cognitive Services § Azure Data Lake Analytics § Azure Bot Services § Azure Stream Analytics § Power BI § IoT Apps § Azure Analysis Service Azure Machine Learning Streaming data (unstructured data) Azure Event Hub
  • 14. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Data Science Resources in Azure 15 Azure ML Studio Azure ML Service Azure HD Insight Azure Databricks Whatitis Drag–and–drop visual workspace for ML Managed cloud service for a variety of open source big data analytics workloads Azure implementation of Hadoop as a managed service supporting a variety of open-source analytics engines Spark-based analytics platform Works Build, test, and deploy models using pre-built ML algorithms Train, deploy, and manage ML models at scale using Python and CLI Build, test and deploy ML models with massive data Build and deploy models and data workflows Features • Publishes models as web services for further use • No programing is required • Rich tools and packages • Use external compute engines including SQL Server and Spark • Auto-parameter tuning • Developer tooling and monitoring capabilities • Orchestration via Azure Data Factory • Native Integration with Azure for Security via AAD • Single engine for Batch, Streaming, ML and Graph • Notebook-based collaborative environment • Autoscaling Scenarios • For quick exploration of data or ML algorithms • For testing the operationalization of model with least error For integrated use of difference resources for ML modeling and deployment at scale • When Hadoop technologies are required than Spark • To stay in codebase environment and/or ‘Lift and Shift’ from on-prem deployments • When Spark and notebook options are required • When Auto-scaling is required • To build integrated and performant data pipelines For Advanced Analytics and Machine Learning solutions using Big Data in the Cloud Azure Databricks Spark-based analytics platform Build and deploy models and data workflows • Native Integration with Azure for Security via AAD • Single engine for Batch, Streaming, ML and Graph • Notebook-based collaborative environment • Autoscaling • When Spark and notebook options are required • When Auto-scaling is required • To build integrated and performant data pipelines
  • 15. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 16 Azure Databricks
  • 16. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com Deep Learning AI models using GPU-enabled clusters with deep learning frameworks DATA SCIENCE IN AZURE What is Azure Databricks? 17 Apache Spark-based unified analytics platform offering the best of Spark with collaborative notebooks and enterprise features optimized for Azure. Apache Spark environment Databricks Runtime and serverless compute model Collaborative workspace Shared notebook for data engineers, data scientists and business users One-click setup Streamlined workflows Autoscale & Autoterminate Autoscaling up and down of clusters & Autoterminating inactive clusters Integration w/ Azure services Integration with Azure data services & stores by SSO with Azure AD
  • 17. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Workflow in Azure Databricks 19 LAUNCH WORKSPACE Log in to Azure Databricks in the Azure portal using single sign-on with Azure AD. OPEN CLUSTERS Create a new cluster, configure and start it with one click. The autoscaling feature makes scaling clusters fast and easy. The autoterminating feature shuts down inactive clusters as desired. Both features help reduce resources and costs associated with manual operations. COLLABORATE ON NOTEBOOKS Create custom access settings for data engineers, data scientists, and business users for shared projects to cooperate on the notebooks based on individual access level. SCHEDULE JOBS Run notebooks as jobs by choosing from existing streaming or machine learning libraries. Schedule jobs in advance to run automatically, and monitor their performance. BUILD DATA SCIENCE MODELS Build, train, and deploy AI models at scale using any data languages among SQL, Python, Scala, and R. EXPLORE DATA Using SQL, Python, Scala, and R in notebooks to easily mount storage and collect observations to build machine learning models. Business users can see data in easy-to-read live data displays.
  • 18. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 20 Demo
  • 19. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com DATA SCIENCE IN AZURE Workflow in Azure Databricks 21 LAUNCH WORKSPACE Log in to Azure Databricks in the Azure portal using single sign-on with Azure AD. OPEN CLUSTERS Create a new cluster, configure and start it with one click. The autoscaling feature makes scaling clusters fast and easy. The autoterminating feature shuts down inactive clusters as desired. Both features help reduce resources and costs associated with manual operations. COLLABORATE ON NOTEBOOKS Create custom access settings for data engineers, data scientists, and business users for shared projects to cooperate on the notebooks based on individual access level. SCHEDULE JOBS Run notebooks as jobs by choosing from existing streaming or machine learning libraries. Schedule jobs in advance to run automatically, and monitor their performance. BUILD DATA SCIENCE MODELS Build, train, and deploy AI models at scale using any data languages among SQL, Python, Scala, and R. EXPLORE DATA Using SQL, Python, Scala, and R in notebooks to easily mount storage and collect observations to build machine learning models. Business users can see data in easy-to-read live data displays.
  • 20. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 22 2 INTRODUCING VISUAL BI MODERN ANALYTICS § Harness data sourced from online networks, web pages, audio and video devices, social media, logs and many other sources to uncover insights and patterns § Refine and tune machine learning models to boost prediction accuracy § Deliver big data solutions that can encompass “lift and shift” and “cloud-native” implementation models § Enable operational agility with enhanced telemetry management § Move beyond understanding “what happened” to “how can we achieve the best possible outcome” Modern Analytics Descriptive to Predictive Operations with agility Multi-stack Implementatio n Boost ML Accuracy Harness Diverse Data Enable Insight Velocity with Big Data
  • 21. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 23 § Modern Data Science project requires handling big data to find the hidden insights § Azure Databricks provides cloud-scale analytics platform with fast and secured performance § Visual BI Solutions can provide end-to-end services for design, implementation and end user enablement of a data science solution for you § For more information reach us at solutions@visualbi.com DATA SCIENCE IN AZURE Summary
  • 22. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 24 UPCOMING WEBINAR CLICK HERE TO REGISTER NOW
  • 23. © 2019 Visual BI Solutions, Inc. All rights reserved. www.visualbi.com 25 www.visualbi.com THANK YOU! gopal @visua lbi.com www.visualbi.com Chandler Stevens chandlers@visualbi.com VP – Microsoft BI & Analytics Jiwon Jeon jiwonj@visualbi.com Data Scientist