This document provides an agenda and summary for a Data Analytics Meetup (DAM) on March 27, 2018. The agenda covers topics such as disruption opportunities in a changing data landscape, transitioning from traditional to modern BI architectures using Azure, Azure SQL Database vs Data Warehouse, data integration with Azure Data Factory and SSIS, Analysis Services, Power BI reporting, and a wrap-up. The document discusses challenges around data growth, digital transformation, and the shrinking time for companies to adapt to disruption. It provides overviews and comparisons of Azure SQL Database, Data Warehouse, and related Azure services to help modernize analytics architectures.
4. Widely available
unstructured data
Deliver powerful insights
over big data
Massive
data growth
Build data
into all your apps
50ZB
1010
0101
0010
{ }
Analytics driving
digital transformation
Modernize your apps
with built-in analytics
New Opportunities
in a Changing Data Landscape
5. SINCE 2000, 52% OF THE FORTUNE 500 COMPANIES HAVE
DISAPPEARED
0
20
40
60
80
1955 2015
Avg. Life
Expectancy
for a
Company
(years)
Source: Digital Transformation by Mark Baker
6. 67
Years
25
Years
15
Years
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010's 2020's
The Time to Adapt to Disruption is Shrinking
Source: BBC
A hundred years ago, the average lifespan of a company
listed on the S&P 500 index was 67 years
In the 2020s, 75% of the
S&P 500 will be new companies
7.
8. Digital Transformation is a Team Effort
CIO
Business
Program and Portfolio
Management
Sourcing and
Vendor Management
Applications
Enterprise
Architecture and
Technology Innovation
Data and Analytics
Security
and Risk
Infrastructure and
Operations
9. Invest in developing PEOPLE
to support analytic adoption &
create a data-driven culture
Embrace agile
PROCESS that supports
analytics at speed
Connect and collect
DATA up, across
and out
Integrated and Inclusive Strategy
Unlock VALUE
& Insights
Take advantage of the modern
TECHNOLOGY to drive self-
service, embedded and viral
analytics
12. Azure PaaS
Key Benefits
Less Administration - free up
Administrative resources
Scalability - When using SQL
Database, you pay-as-you-go
with options to scale up or
out for greater power with no
interruption
Faster Deployments
No Operating System
Cost/Updates Overhead
Backups are done by default
Development continues
without any major changes.
16. Azure SQL Database DTU â Database Transaction Unit
Number of Dedicated SQL database
resources (CPU, Memory, Read and
writes) available for SQL Database
Doubling the DTUs by increasing the
performance level of a database
equates to doubling the set of
resource available to that database.
For example, a Premium P11
database with 1750 DTUs provides
350x more DTU compute power than
a Basic database with 5 DTUs
17. Azure SQL Database eDTU â Elastic Database Transaction Unit
Rather than providing dedicated set of DTUs to a SQL database you can add
a eDTU pool shared by multiple databases to accommodate unpredictable
periods of usage by individual databases.
SQL Database elastic pools are a simple, cost-effective solution for
managing and scaling multiple databases that have varying and
unpredictable usage demands
While the eDTU unit price for a pool is 1.5x greater than the DTU unit price
for a single database, pool eDTUs can be shared by many databases and
fewer total eDTUs are needed.
Pool eDTU can be used by applications Databases during daily Operations
activities and Processes like ETL/Data load Databases can utilize the same
eDTU for Nightly Processing.
18. Application benefits and BI
SQL Database in Azure
Limitations â 4TB max is a sizing limitation in Single SQL database but can
overcome by Scaling out called Sharding.
Sharding â Scaling out - way to overcome single database size restriction by
distributing data across multiple databases
Azure SQL Database Managed Instance â New offering currently in preview
â Managed instance allows existing SQL Server customers to lift and shift
their on-premises applications to the cloud with minimal application and
database changes. At the same time, Managed Instance preserves all
PaaS capabilities (automatic patching and version updates, backup, high-
availability), that drastically reduces management overhead and total cost
of ownership
19. Azure SQL Data Warehouse
SQL Data Warehouse is a cloud-based Enterprise
Data Warehouse (EDW) that leverages Massively
Parallel Processing (MPP) to quickly run complex
queries across petabytes of data
Data is ingested into big data stores from a variety
of sources.
Once in a big data store, Hadoop, Spark, and
machine learning algorithms prepare and train the
data
When the data is ready for complex analysis, SQL
Data Warehouse uses PolyBase to query the big
data stores. PolyBase uses standard T-SQL queries
to bring the data into SQL Data Warehouse.
20. Azure SQL Data Warehouse (DWU)
Azure DW leverages Scale Out architecture to distribute computational
processing of data across multiple nodes. The unit of scale is an
abstraction of compute power that is known as data warehouse unit
(DWU)
Azure DW uses a Node based architecture. Applications connect and
issue Tsql commands to a control node. The control node runs the MPP
engine which optimizes queries for parallel processing.
Data Movement Service (DMS) Coordinates the data movements
between compute nodes.
Distribution is the basic unit of storage and processing for parallel
queries. Each compute node manages one or more of 60 distributions.
21. Azure DW Distributions
Types of Distributions â
Hash â Provides the highest query performance for joins and aggregation on Large tables
â Fact â use Hash distribution with clustered column store index. Performance improves when two has tables are
joined on the same distribution column. Large Dimensions or any large tables to store on each compute node
use hash distributed.
Round Robin â Simplest table to create and delivers fast Performance when used as staging table for loads
â Staging tables- Round robin is ideal for staging tables in Azure DW
Replicated â Provides fastest query performance for small tables
â Small Dimensions â Smaller tables can utilize replicated Distribution.
23. Traditional on premises installations of SQL server the OLTP and OLAP the systems sat on different server but
the software was the same. OLTP instance has more 3rd Normal form structures for
Operational/Transactional processing vs the OLAP install favored Dimensional Modelling for reporting
purposes.
When connecting to Azure SQL Database and Azure SQL data warehouse with SSMS might have the same
look and feel on surface. They both are PaaS offerings but under the covers they are very different
Azure SQL Database is a single database and the concept of SQL server in Azure is more of a container for
logically grouping the Azure SQL Databases
Azure SQL Data Warehouse was built for OLAP systems. Massively Parallel processing system or MPP are
made up of multiple nodes each with their own resources and they work together to provide increased
performance. Azure SQL Databases is a Paas offering of an OLTP database.
Azure SQL DB vs Azure SQL Data Warehouse
26. Development Tools Azure Data Factory V2
Azure Data Factory is a hybrid data integration service that allows you to create, schedule and orchestrate your ETL/ELT workflows at
scale wherever your data lives, in cloud or self-hosted network.
Schedule and manage your data transformation and analysis process, add a Hadoop processing step for big or semi-structured data, a
stored procedure invocation step for structured data, a machine-learning step for analytics, or insert your own custom code as a
processing step in any pipeline.
With Data Factory Version 2 you can lift and deploy your on perm SSIS/SSDT package to SQL Data Factory.
27. SSIS/SSDT Connection Managers for Azure
SSIS/SSDT and Azure
SSIS/SSDT Control Flow components for Azure
SSIS/SSDT Data Flow components for Azure
30. Azure Analysis Services â PaaS Offering
Azure Analysis Services supports tabular models at the 1200 and 1400 compatibility levels. Partitions, row-level security, bi-directional
relationships, and translations are all supported. In-memory and DirectQuery modes mean lightning fast queries over massive and
complex datasets.
For Scaling and Cost management you can change plans up or down within the same tier, or upgrade to a higher tier, but you cannot
downgrade from a higher tier to a lower tier. You pay for only what you use so you can pause the instance during non business hours.
Its easy to migrate on premises Tabular cubes using SSDT or using SSMS. If your cube uses On Premises data sources you need to
install and configure on premises data gateway.
32. Power BI Reporting
A Power BI report is a multi-perspective
view into a dataset, with visualizations
that represent different findings and
insights from that dataset. A report can
have a single visualization or pages full of
visualizations
Data sources pointed to Local Instance
and Azure Analysis services instance.
Simple change
34. On Premises Gateway for On Premises Data Sources
The on-premises data gateway acts as a bridge,
providing secure data transfer between on-premises
data sources and your Azure Analysis Services servers
in the cloud.
The gateway also works with Azure Logic Apps, Power
BI, Power Apps, and Microsoft Flow. You can associate
multiple services in the same region with a single
gateway.
35. On Premises Gateway for On Premises Data sources (Cont.)
Few steps in getting this setup:
â Download and run setup - Installs a gateway service on a computer in your organization using an account in your tenant's
Azure AD. Azure B2B (guest) accounts are not supported.
â Register your gateway - Specify a name and recovery key for your gateway and select a region, registering your gateway
with the Gateway Cloud Service.
â Create a gateway resource in Azure - create a gateway resource in your Azure subscription.
â Connect your servers to your gateway resource - Once you have a gateway resource in your subscription, you can begin
connecting your servers to it. You can connect multiple servers and other resources to it.
36. 1 2 3
BI Supporting Tools for development
Azure Storage Explorer
Easily manage the
contents of your storage
account with Azure
Storage Explorer. Upload,
download, and manage
blobs, files, queues,
tables, and Cosmos DB
entities. Gain easy access
to manage your virtual
machine disks.
Link to Azure Storage
Explorer
SSIS Azure Feature Pack
SQL Server Integration
Services (SSIS) Feature
Pack for Azure is an
extension that provides
the components listed on
this page for SSIS to
connect to Azure
services, transfer data
between Azure and on-
premises data sources,
and process data stored
in Azure
Link to Azure Feature
Pack for SSIS
On Premises Gateway
The on-premises data
gateway acts as a bridge,
providing quick and secure
data transfer between on-
premises data (data that is
not in the cloud) and the
Power BI, Microsoft Flow,
Logic Apps, and PowerApps
services.
Link to On Premises
Gateway
38. Compliance - HIPPA/HITECH, HITRUST and more.
â Link to compliance offerings
Privacy â Microsoft is governed by strict standards regarding the privacy and protection of customer data. Its now General Data
Protection Regulation (GDPR) compliant. More info Click link below
â Link to privacy information
Transparency â Where is my data? How is it Secured? Who can Access it?
â Link to transparency information
Azure Government - environment includes a unique cloud instance, exclusively for government customers and their solution
providers, and hardened US data centers that are operated by extensively screened personnel.
â Link to Azure government information
Azure Industries - Azure solutions are used by multiple industries, including banking, healthcare, manufacturing, and
telecommunications.
â Link to Azure industries information
Microsoft Azure
39. Additional Resources
Azure 101 -
â Interactive website to learn all Azure offerings
â Link to Azure 101
Azure SLA (Service Level Agreements)
â Link to Azure SLA
Azure Updates â Releases
â Link to Azure Updates
Azure Pricing Calculator
â Link to Pricing Calculator