SlideShare a Scribd company logo
1 of 49
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights Reserved
ZERO TO
SNOWFLAKE IN
90 MINUTES
in partnership with
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365)
• Download Worksheets and unzip
• Partner Introduction
• Snowflake Introduction
• Data Warehousing Today
• What Users do with Snowflake
• Hands on Snowflake
• Data loading- sample data set is shared
• Integrations
• Multi-clustering
• And more!
• Scaling/Workload Isolation
• Snowflake in the Real World- Data Sharing and Customer Example
• Conclusion
AGENDA
© 2018DMICONFIDENTIAL&PROPRIETARY
3
DATA & ANALYTICS
Snowflake
Solution Integration
Partner
Creating an intelligent existence by linking physical
& digital worlds to unleash the power of connectivity
〉 DATA PLATFORM SOLUTIONS
〉 MODERN DATA SOLUTIONS
〉 VISUAL SOLUTIONS
〉 ADVANCED ANALYTICS
〉 ENTERPRISE DATA STRATEGY
〉 EXECUTIVE ADVISORY
〉 ANALYTICS AS A SERVICE
〉 AGILE ANALYTICS
© 2018DMICONFIDENTIAL&PROPRIETARY
4
DATA & ANALYTICS
PARTNERSHIPS
***8 MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M
Today’s Moderators
〉 Brett VanderPlaats
Sr. Data Architect, Data Platform & Analytics
〉 David Mellinger
Practice Lead, Data Platform & Analytics
1,500+ CUSTOMERS
Building new analytic applications
Delivered new analytic application to
pharmacies using Snowflake
Moving to the cloud
Using Snowflake to move data analytics
to the cloud
Modernizing data platforms
Replaced data warehouse appliance +
Hadoop with Snowflake
Accelerating enterprise BI and analytics
Moved from legacy data warehouse
systems (appliance & cloud) to
Snowflake
© 2018 Snowflake Computing Inc. All Rights Reserved. 5
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing Today?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
What would you like in your Data Warehouse?
8
Complete
SQL Database
Zero
Management
All of
Your
Data
All of
Your
Users
Pay Only for
What You
Use
Live Data
Sharing
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Snowflake’s Differentiating Architecture
© 2018 Snowflake Computing Inc. All Rights
Reserved.
SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE
Centralized storage
Instant, automatic scalability & elasticity
Service
Compute
Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Why Scaling Compute Saves $$$$$$
© 2018 Snowflake Computing Inc. All Rights
Reserved.
How Does Snowflake Fit?
ADVANCED
ANALYTICS
INTEGRATION
TOOLS
ELT
Stream
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale compute and concurrency
ADF
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale Up (Response Time)
Queries that are:
--Large, Complex, many Calculations
Scale Out
(Throuhput)
Many users
or processes
concurrently
querying
S M L XL 2XXS 3X 4X
XS
XS
XS
XS
XS
XS
XS
XS
XS
L L
2X
2X2X2X
4X
4X
4X
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Differences from other EDW vendors
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Business Implications for Organizations
• Native ANSI-SQL database for leveraging existing skills
• Reducing expensive retraining
• Interoperability with existing tools (Power BI, Tableau, and others have SF
connectors)
• Simplified Migration
• Support/Documentation on Snowflake – Take a look at
© 2018 Snowflake Computing Inc. All Rights
Reserved.
But Wait….
…There’s
© 2018 Snowflake Computing Inc. All Rights
Reserved.
DEV OPS?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Protection w/o Restore?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Sharing (Monetization)
Disaster Recovery
***Platform (AZURE AWS) in 2019
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Native Support for
structured and semi-
structured data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Cross Region Data Replication
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Comprehensive data protection
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
BIO-BREAK
THEN
LET’S DIVE INTO
SNOWFLAKE!
https://tinyurl.com/yy9frfuw
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 1
Snowflake DB/WH/Object
Configuration
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 2
Queries and Performance
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Performance through Caching
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Query
Profiler•Processing — time spent on data processing by the CPU.
•Local Disk IO — time when the processing was blocked by local disk access.
•Remote Disk IO — time when the processing was blocked by remote disk access.
•Network Communication — time when the processing was waiting for the network data
transfer.
•Synchronization — various synchronization activities between participating processes.
•Initialization — time spent setting up the query processing.
•IO — information about the input-output operations performed during the query:
• Scan progress — the percentage of data scanned for a given table so far.
• Bytes scanned — the number of bytes scanned so far.
• Percentage scanned from cache — the percentage of data scanned from the local disk
cache.
• Bytes written — bytes written (e.g. when loading into a table).
• Bytes written to result — bytes written to a result object.
• Bytes read from result — bytes read from a result object.
• External bytes scanned — bytes read from an external object, e.g. a stage.
•DML — statistics for Data Manipulation Language (DML) queries:
• Number of rows inserted — number of rows inserted into a table (or tables).
• Number of rows updated — number of rows updated in a table.
• Number of rows deleted — number of rows deleted from a table.
• Number of rows unloaded — number of rows unloaded during data export.
• Number of bytes deleted — number of bytes deleted from a table.
•Pruning — information on the effects of table pruning:
• Partitions scanned — number of partitions scanned so far.
• Partitions total — total number of partitions in a given table.
•Spilling — information about disk usage for operations where intermediate results do not fit in memory:
• Bytes spilled to local storage — volume of data spilled to local disk.
• Bytes spilled to remote storage — volume of data spilled to remote disk.
•Network — network communication:
• Bytes sent over the network — amount of data sent over the network.
•EXECUTION TIME
•STATISTICS
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Micropartion
s
&
Pruning
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Enterprise Grade Security
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 3
Unstructured Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 4
Loading Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• 4 objects for data loading
1) Source
2) Warehouse
3) Database
4) File Format (default CSV)
• 100 mb file is optimum size
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 5
Cloning / Time Travel
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
THANK YOU
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
USE CASES FOR SNOWFLAKE DATA SHARING
• Nielsen is a global information, data, and
measurement company
• Nielsen knows “What People Watch,
Listen To, and Buy”
• Nielsen Marketing Cloud includes eXelate
DMP which provides unified consumer
profiles
• Nielsen sells selective slices of their DMP
data available to advertisers for particular
marketing campaigns
• Nielsen plans to use data sharing for
making detailed datasets available to
subscribers
• Lower friction, lower cost solution
• Scalable operations
61© 2016 Snowflake Computing Inc. All Rights
Reserved.
CUSTOMER EXAMPLE: BLACKBOARD
Jay White
Director, Software Engineering
Scenario
Provide and perfect over 14 different data
products that help universities facilitate
learning online
Pain Points
Disparate data
Challenges integrating data
Semi-structured data
Solution
Replace existing Hadoop and RDBMS
system with Snowflake
Everything that we did left our jaw on the
table. ‘Wait – we’ve never done anything like
that.’ Or, ‘How did that just run so fast.’ We
are getting 16x performance from
Snowflake.
62© 2016 Snowflake Computing Inc. All Rights
Reserved.
A NEW DATA PIPELINE FOR BLACKBOARD
Snowflake
S
3
Student data
Mobile data
Collaborative data
Intelsuite data
Kafka
Airflow for data orchestration Looker for internal dashboards
Learning Management System
Blackboard Predict
R Prediction Engine
Unified data
Simplified data transformation
Existing tools integrate seamlessly
• 16x performance improvement over SQL
• 1 PB by the end of 2017
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Built-in disaster recover and high availabity

More Related Content

What's hot

Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
Snowflake Computing
 

What's hot (20)

Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
 
Snowflake Company Presentation
Snowflake Company PresentationSnowflake Company Presentation
Snowflake Company Presentation
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
 
Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
 
Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data Warehousing
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
 
Snowflake Architecture
Snowflake ArchitectureSnowflake Architecture
Snowflake Architecture
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
An overview of snowflake
An overview of snowflakeAn overview of snowflake
An overview of snowflake
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with Snowflake
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 

Similar to Zero to Snowflake Presentation

Introducing Technologies for Handling Big Data by Jaseela
Introducing Technologies for Handling Big Data by JaseelaIntroducing Technologies for Handling Big Data by Jaseela
Introducing Technologies for Handling Big Data by Jaseela
Student
 

Similar to Zero to Snowflake Presentation (20)

Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud
Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud
Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud
 
Laboratorio práctico: Data warehouse en la nube
Laboratorio práctico: Data warehouse en la nubeLaboratorio práctico: Data warehouse en la nube
Laboratorio práctico: Data warehouse en la nube
 
In-Memory Stream Processing with Hazelcast Jet @JEEConf
In-Memory Stream Processing with Hazelcast Jet @JEEConfIn-Memory Stream Processing with Hazelcast Jet @JEEConf
In-Memory Stream Processing with Hazelcast Jet @JEEConf
 
Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015
 
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
 
Make your data fly - Building data platform in AWS
Make your data fly - Building data platform in AWSMake your data fly - Building data platform in AWS
Make your data fly - Building data platform in AWS
 
Stream based Data Integration
Stream based Data IntegrationStream based Data Integration
Stream based Data Integration
 
Snowflake’s Cloud Data Platform and Modern Analytics
Snowflake’s Cloud Data Platform and Modern AnalyticsSnowflake’s Cloud Data Platform and Modern Analytics
Snowflake’s Cloud Data Platform and Modern Analytics
 
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
 
Data Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters
Data Works Summit Munich 2017 - Worldpay - Multi Tenancy ClustersData Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters
Data Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters
 
Introducing Technologies for Handling Big Data by Jaseela
Introducing Technologies for Handling Big Data by JaseelaIntroducing Technologies for Handling Big Data by Jaseela
Introducing Technologies for Handling Big Data by Jaseela
 
Big data processing with PubSub, Dataflow, and BigQuery
Big data processing with PubSub, Dataflow, and BigQueryBig data processing with PubSub, Dataflow, and BigQuery
Big data processing with PubSub, Dataflow, and BigQuery
 
KNIME Software Overview
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
 
Snowflake + Syncsort: Get Value from Your Mainframe Data
Snowflake + Syncsort: Get Value from Your Mainframe DataSnowflake + Syncsort: Get Value from Your Mainframe Data
Snowflake + Syncsort: Get Value from Your Mainframe Data
 
From Disaster to Recovery: Preparing Your IT for the Unexpected
From Disaster to Recovery: Preparing Your IT for the UnexpectedFrom Disaster to Recovery: Preparing Your IT for the Unexpected
From Disaster to Recovery: Preparing Your IT for the Unexpected
 
Event Sponsor NetApp - CSO- Jon Kissane
Event Sponsor NetApp - CSO- Jon Kissane  Event Sponsor NetApp - CSO- Jon Kissane
Event Sponsor NetApp - CSO- Jon Kissane
 
In-Memory Stream Processing with Hazelcast Jet @MorningAtLohika
In-Memory Stream Processing with Hazelcast Jet @MorningAtLohikaIn-Memory Stream Processing with Hazelcast Jet @MorningAtLohika
In-Memory Stream Processing with Hazelcast Jet @MorningAtLohika
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Zero to Snowflake Presentation

  • 1. © 2018 Snowflake Computing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights Reserved ZERO TO SNOWFLAKE IN 90 MINUTES in partnership with
  • 2. © 2018 Snowflake Computing Inc. All Rights Reserved. • Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365) • Download Worksheets and unzip • Partner Introduction • Snowflake Introduction • Data Warehousing Today • What Users do with Snowflake • Hands on Snowflake • Data loading- sample data set is shared • Integrations • Multi-clustering • And more! • Scaling/Workload Isolation • Snowflake in the Real World- Data Sharing and Customer Example • Conclusion AGENDA
  • 3. © 2018DMICONFIDENTIAL&PROPRIETARY 3 DATA & ANALYTICS Snowflake Solution Integration Partner Creating an intelligent existence by linking physical & digital worlds to unleash the power of connectivity 〉 DATA PLATFORM SOLUTIONS 〉 MODERN DATA SOLUTIONS 〉 VISUAL SOLUTIONS 〉 ADVANCED ANALYTICS 〉 ENTERPRISE DATA STRATEGY 〉 EXECUTIVE ADVISORY 〉 ANALYTICS AS A SERVICE 〉 AGILE ANALYTICS
  • 4. © 2018DMICONFIDENTIAL&PROPRIETARY 4 DATA & ANALYTICS PARTNERSHIPS ***8 MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M Today’s Moderators 〉 Brett VanderPlaats Sr. Data Architect, Data Platform & Analytics 〉 David Mellinger Practice Lead, Data Platform & Analytics
  • 5. 1,500+ CUSTOMERS Building new analytic applications Delivered new analytic application to pharmacies using Snowflake Moving to the cloud Using Snowflake to move data analytics to the cloud Modernizing data platforms Replaced data warehouse appliance + Hadoop with Snowflake Accelerating enterprise BI and analytics Moved from legacy data warehouse systems (appliance & cloud) to Snowflake © 2018 Snowflake Computing Inc. All Rights Reserved. 5
  • 6. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are Users Doing Today?
  • 7. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 8. © 2018 Snowflake Computing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved What would you like in your Data Warehouse? 8 Complete SQL Database Zero Management All of Your Data All of Your Users Pay Only for What You Use Live Data Sharing
  • 9. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are Users Doing with Snowflake?
  • 10. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 11. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 12. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 13. © 2018 Snowflake Computing Inc. All Rights Reserved. Snowflake’s Differentiating Architecture
  • 14. © 2018 Snowflake Computing Inc. All Rights Reserved. SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE Centralized storage Instant, automatic scalability & elasticity Service Compute Storage
  • 15. © 2018 Snowflake Computing Inc. All Rights Reserved. Why Scaling Compute Saves $$$$$$
  • 16. © 2018 Snowflake Computing Inc. All Rights Reserved. How Does Snowflake Fit? ADVANCED ANALYTICS INTEGRATION TOOLS ELT Stream
  • 17. © 2018 Snowflake Computing Inc. All Rights Reserved. Scale compute and concurrency ADF
  • 18. © 2018 Snowflake Computing Inc. All Rights Reserved. Scale Up (Response Time) Queries that are: --Large, Complex, many Calculations Scale Out (Throuhput) Many users or processes concurrently querying S M L XL 2XXS 3X 4X XS XS XS XS XS XS XS XS XS L L 2X 2X2X2X 4X 4X 4X
  • 19. © 2018 Snowflake Computing Inc. All Rights Reserved. Differences from other EDW vendors
  • 20. © 2018 Snowflake Computing Inc. All Rights Reserved. Business Implications for Organizations • Native ANSI-SQL database for leveraging existing skills • Reducing expensive retraining • Interoperability with existing tools (Power BI, Tableau, and others have SF connectors) • Simplified Migration • Support/Documentation on Snowflake – Take a look at
  • 21. © 2018 Snowflake Computing Inc. All Rights Reserved. But Wait…. …There’s
  • 22. © 2018 Snowflake Computing Inc. All Rights Reserved. DEV OPS?
  • 23. © 2018 Snowflake Computing Inc. All Rights Reserved. Data Protection w/o Restore?
  • 24. © 2018 Snowflake Computing Inc. All Rights Reserved. Data Sharing (Monetization) Disaster Recovery ***Platform (AZURE AWS) in 2019
  • 25. © 2018 Snowflake Computing Inc. All Rights Reserved. Native Support for structured and semi- structured data
  • 26. © 2018 Snowflake Computing Inc. All Rights Reserved. Cross Region Data Replication
  • 27. © 2018 Snowflake Computing Inc. All Rights Reserved. Comprehensive data protection
  • 28. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved BIO-BREAK THEN LET’S DIVE INTO SNOWFLAKE! https://tinyurl.com/yy9frfuw
  • 29. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 1 Snowflake DB/WH/Object Configuration
  • 30. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 2 Queries and Performance
  • 31. © 2018 Snowflake Computing Inc. All Rights Reserved. Azure Storage
  • 32. © 2018 Snowflake Computing Inc. All Rights Reserved. Performance through Caching Azure Storage
  • 33. © 2018 Snowflake Computing Inc. All Rights Reserved. Query Profiler•Processing — time spent on data processing by the CPU. •Local Disk IO — time when the processing was blocked by local disk access. •Remote Disk IO — time when the processing was blocked by remote disk access. •Network Communication — time when the processing was waiting for the network data transfer. •Synchronization — various synchronization activities between participating processes. •Initialization — time spent setting up the query processing. •IO — information about the input-output operations performed during the query: • Scan progress — the percentage of data scanned for a given table so far. • Bytes scanned — the number of bytes scanned so far. • Percentage scanned from cache — the percentage of data scanned from the local disk cache. • Bytes written — bytes written (e.g. when loading into a table). • Bytes written to result — bytes written to a result object. • Bytes read from result — bytes read from a result object. • External bytes scanned — bytes read from an external object, e.g. a stage. •DML — statistics for Data Manipulation Language (DML) queries: • Number of rows inserted — number of rows inserted into a table (or tables). • Number of rows updated — number of rows updated in a table. • Number of rows deleted — number of rows deleted from a table. • Number of rows unloaded — number of rows unloaded during data export. • Number of bytes deleted — number of bytes deleted from a table. •Pruning — information on the effects of table pruning: • Partitions scanned — number of partitions scanned so far. • Partitions total — total number of partitions in a given table. •Spilling — information about disk usage for operations where intermediate results do not fit in memory: • Bytes spilled to local storage — volume of data spilled to local disk. • Bytes spilled to remote storage — volume of data spilled to remote disk. •Network — network communication: • Bytes sent over the network — amount of data sent over the network. •EXECUTION TIME •STATISTICS
  • 34. © 2018 Snowflake Computing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 35. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 36. © 2018 Snowflake Computing Inc. All Rights Reserved. Micropartion s & Pruning
  • 37. © 2018 Snowflake Computing Inc. All Rights Reserved. Enterprise Grade Security
  • 38. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 3 Unstructured Data
  • 39. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 40. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 4 Loading Data
  • 41. © 2018 Snowflake Computing Inc. All Rights Reserved. • 4 objects for data loading 1) Source 2) Warehouse 3) Database 4) File Format (default CSV) • 100 mb file is optimum size
  • 42. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 5 Cloning / Time Travel
  • 43. © 2018 Snowflake Computing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 44. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved THANK YOU
  • 45. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 46. © 2018 Snowflake Computing Inc. All Rights Reserved. USE CASES FOR SNOWFLAKE DATA SHARING • Nielsen is a global information, data, and measurement company • Nielsen knows “What People Watch, Listen To, and Buy” • Nielsen Marketing Cloud includes eXelate DMP which provides unified consumer profiles • Nielsen sells selective slices of their DMP data available to advertisers for particular marketing campaigns • Nielsen plans to use data sharing for making detailed datasets available to subscribers • Lower friction, lower cost solution • Scalable operations
  • 47. 61© 2016 Snowflake Computing Inc. All Rights Reserved. CUSTOMER EXAMPLE: BLACKBOARD Jay White Director, Software Engineering Scenario Provide and perfect over 14 different data products that help universities facilitate learning online Pain Points Disparate data Challenges integrating data Semi-structured data Solution Replace existing Hadoop and RDBMS system with Snowflake Everything that we did left our jaw on the table. ‘Wait – we’ve never done anything like that.’ Or, ‘How did that just run so fast.’ We are getting 16x performance from Snowflake.
  • 48. 62© 2016 Snowflake Computing Inc. All Rights Reserved. A NEW DATA PIPELINE FOR BLACKBOARD Snowflake S 3 Student data Mobile data Collaborative data Intelsuite data Kafka Airflow for data orchestration Looker for internal dashboards Learning Management System Blackboard Predict R Prediction Engine Unified data Simplified data transformation Existing tools integrate seamlessly • 16x performance improvement over SQL • 1 PB by the end of 2017
  • 49. © 2018 Snowflake Computing Inc. All Rights Reserved. Built-in disaster recover and high availabity