More Related Content Similar to Zero to Snowflake Presentation (20) Zero to Snowflake Presentation 1. © 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights Reserved
ZERO TO
SNOWFLAKE IN
90 MINUTES
in partnership with
2. © 2018 Snowflake Computing Inc. All Rights
Reserved.
• Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365)
• Download Worksheets and unzip
• Partner Introduction
• Snowflake Introduction
• Data Warehousing Today
• What Users do with Snowflake
• Hands on Snowflake
• Data loading- sample data set is shared
• Integrations
• Multi-clustering
• And more!
• Scaling/Workload Isolation
• Snowflake in the Real World- Data Sharing and Customer Example
• Conclusion
AGENDA
3. © 2018DMICONFIDENTIAL&PROPRIETARY
3
DATA & ANALYTICS
Snowflake
Solution Integration
Partner
Creating an intelligent existence by linking physical
& digital worlds to unleash the power of connectivity
〉 DATA PLATFORM SOLUTIONS
〉 MODERN DATA SOLUTIONS
〉 VISUAL SOLUTIONS
〉 ADVANCED ANALYTICS
〉 ENTERPRISE DATA STRATEGY
〉 EXECUTIVE ADVISORY
〉 ANALYTICS AS A SERVICE
〉 AGILE ANALYTICS
5. 1,500+ CUSTOMERS
Building new analytic applications
Delivered new analytic application to
pharmacies using Snowflake
Moving to the cloud
Using Snowflake to move data analytics
to the cloud
Modernizing data platforms
Replaced data warehouse appliance +
Hadoop with Snowflake
Accelerating enterprise BI and analytics
Moved from legacy data warehouse
systems (appliance & cloud) to
Snowflake
© 2018 Snowflake Computing Inc. All Rights Reserved. 5
6. © 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing Today?
8. © 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
What would you like in your Data Warehouse?
8
Complete
SQL Database
Zero
Management
All of
Your
Data
All of
Your
Users
Pay Only for
What You
Use
Live Data
Sharing
9. © 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing with Snowflake?
10. © 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
11. © 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
13. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Snowflake’s Differentiating Architecture
14. © 2018 Snowflake Computing Inc. All Rights
Reserved.
SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE
Centralized storage
Instant, automatic scalability & elasticity
Service
Compute
Storage
15. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Why Scaling Compute Saves $$$$$$
16. © 2018 Snowflake Computing Inc. All Rights
Reserved.
How Does Snowflake Fit?
ADVANCED
ANALYTICS
INTEGRATION
TOOLS
ELT
Stream
17. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale compute and concurrency
ADF
18. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale Up (Response Time)
Queries that are:
--Large, Complex, many Calculations
Scale Out
(Throuhput)
Many users
or processes
concurrently
querying
S M L XL 2XXS 3X 4X
XS
XS
XS
XS
XS
XS
XS
XS
XS
L L
2X
2X2X2X
4X
4X
4X
19. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Differences from other EDW vendors
20. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Business Implications for Organizations
• Native ANSI-SQL database for leveraging existing skills
• Reducing expensive retraining
• Interoperability with existing tools (Power BI, Tableau, and others have SF
connectors)
• Simplified Migration
• Support/Documentation on Snowflake – Take a look at
23. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Protection w/o Restore?
24. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Sharing (Monetization)
Disaster Recovery
***Platform (AZURE AWS) in 2019
25. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Native Support for
structured and semi-
structured data
26. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Cross Region Data Replication
27. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Comprehensive data protection
28. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
BIO-BREAK
THEN
LET’S DIVE INTO
SNOWFLAKE!
https://tinyurl.com/yy9frfuw
29. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 1
Snowflake DB/WH/Object
Configuration
30. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 2
Queries and Performance
32. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Performance through Caching
Azure Storage
33. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Query
Profiler•Processing — time spent on data processing by the CPU.
•Local Disk IO — time when the processing was blocked by local disk access.
•Remote Disk IO — time when the processing was blocked by remote disk access.
•Network Communication — time when the processing was waiting for the network data
transfer.
•Synchronization — various synchronization activities between participating processes.
•Initialization — time spent setting up the query processing.
•IO — information about the input-output operations performed during the query:
• Scan progress — the percentage of data scanned for a given table so far.
• Bytes scanned — the number of bytes scanned so far.
• Percentage scanned from cache — the percentage of data scanned from the local disk
cache.
• Bytes written — bytes written (e.g. when loading into a table).
• Bytes written to result — bytes written to a result object.
• Bytes read from result — bytes read from a result object.
• External bytes scanned — bytes read from an external object, e.g. a stage.
•DML — statistics for Data Manipulation Language (DML) queries:
• Number of rows inserted — number of rows inserted into a table (or tables).
• Number of rows updated — number of rows updated in a table.
• Number of rows deleted — number of rows deleted from a table.
• Number of rows unloaded — number of rows unloaded during data export.
• Number of bytes deleted — number of bytes deleted from a table.
•Pruning — information on the effects of table pruning:
• Partitions scanned — number of partitions scanned so far.
• Partitions total — total number of partitions in a given table.
•Spilling — information about disk usage for operations where intermediate results do not fit in memory:
• Bytes spilled to local storage — volume of data spilled to local disk.
• Bytes spilled to remote storage — volume of data spilled to remote disk.
•Network — network communication:
• Bytes sent over the network — amount of data sent over the network.
•EXECUTION TIME
•STATISTICS
34. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
36. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Micropartion
s
&
Pruning
37. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Enterprise Grade Security
38. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 3
Unstructured Data
40. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 4
Loading Data
41. © 2018 Snowflake Computing Inc. All Rights
Reserved.
• 4 objects for data loading
1) Source
2) Warehouse
3) Database
4) File Format (default CSV)
• 100 mb file is optimum size
42. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 5
Cloning / Time Travel
43. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
44. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
THANK YOU
46. © 2018 Snowflake Computing Inc. All Rights
Reserved.
USE CASES FOR SNOWFLAKE DATA SHARING
• Nielsen is a global information, data, and
measurement company
• Nielsen knows “What People Watch,
Listen To, and Buy”
• Nielsen Marketing Cloud includes eXelate
DMP which provides unified consumer
profiles
• Nielsen sells selective slices of their DMP
data available to advertisers for particular
marketing campaigns
• Nielsen plans to use data sharing for
making detailed datasets available to
subscribers
• Lower friction, lower cost solution
• Scalable operations
47. 61© 2016 Snowflake Computing Inc. All Rights
Reserved.
CUSTOMER EXAMPLE: BLACKBOARD
Jay White
Director, Software Engineering
Scenario
Provide and perfect over 14 different data
products that help universities facilitate
learning online
Pain Points
Disparate data
Challenges integrating data
Semi-structured data
Solution
Replace existing Hadoop and RDBMS
system with Snowflake
Everything that we did left our jaw on the
table. ‘Wait – we’ve never done anything like
that.’ Or, ‘How did that just run so fast.’ We
are getting 16x performance from
Snowflake.
48. 62© 2016 Snowflake Computing Inc. All Rights
Reserved.
A NEW DATA PIPELINE FOR BLACKBOARD
Snowflake
S
3
Student data
Mobile data
Collaborative data
Intelsuite data
Kafka
Airflow for data orchestration Looker for internal dashboards
Learning Management System
Blackboard Predict
R Prediction Engine
Unified data
Simplified data transformation
Existing tools integrate seamlessly
• 16x performance improvement over SQL
• 1 PB by the end of 2017
49. © 2018 Snowflake Computing Inc. All Rights
Reserved.
Built-in disaster recover and high availabity