SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
Amazon Redshift Overview & What’s Next
Rahul Pathak, Redshift PM (rapathak@amazon.com)
Anurag Gupta, Redshift GM (awgupta@amazon.com)
November 13, 2013

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Amazon Redshift

Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year
Amazon Redshift dramatically reduces I/O
ID

Amount

20

CA

500

345

25

WA

250

678

Data compression

State

123

•

Age

40

FL

125

37

WA

375

•

Zone maps

957

•

Direct-attached storage

•

With row storage you do
unnecessary I/O

•

To get total amount, you have
to read everything
Amazon Redshift dramatically reduces I/O
ID

Amount

20

CA

500

345

25

WA

250

678

Data compression

State

123

•

Age

40

FL

125

37

WA

375

•

Zone maps

957

•

Direct-attached storage

•

With column storage, you
only read the data you need
Amazon Redshift dramatically reduces I/O
analyze compression listing;

•

Data compression

•

Zone maps

•

Direct-attached storage

Table |
Column
| Encoding
---------+----------------+---------listing | listid
| delta
listing | sellerid
| delta32k
listing | eventid
| delta32k
listing | dateid
| bytedict
listing | numtickets
| bytedict
listing | priceperticket | delta32k
listing | totalprice
| mostly32
listing | listtime
| raw

•

COPY compresses
automatically on load

•

You can analyze and override

•

More performance, less cost

Slides not intended for redistribution.
Amazon Redshift dramatically reduces I/O
10
324

•

Data compression

375
623

10 | 13 | 14 | 26 |…
… | 100 | 245 | 324

375 | 393 | 417…
… 512 | 549 | 623

637

•

Zone maps

•

Direct-attached storage

637 | 712 | 809 …

959

… | 834 | 921 | 959

•

Track the minimum and
maximum value for each block

•

Skip over blocks that don’t
contain relevant data
Amazon Redshift dramatically reduces I/O
DW.HS1.XL:

•

Data compression

•

Zone maps

•

Direct-attached storage

DW.HS1.8XL:

•

> 2 GB/s scan rate

•

Optimized for data processing

•

High disk density
Amazon Redshift architecture
• Leader Node
–
–
–

JDBC/ODBC

SQL endpoint
Stores metadata
Coordinates query execution

• Compute Nodes
–
–
–
–

10 GigE
(HPC)

Local, columnar storage
Execute queries in parallel
Load, backup, restore via Amazon S3
Parallel load from Amazon DynamoDB

• Single node version available

Ingestion
Backup
Restore
Amazon Redshift parallelizes and distributes everything

•

Load

•

Backup/Restore

•

Resize
Amazon Redshift parallelizes and distributes everything

•

Load

•

Backup/Restore
•

•

Load in parallel from Amazon S3
or Amazon DynamoDB

•

Data automatically distributed and
sorted according to DDL

•

Scales linearly with number of
nodes

Resize
Amazon Redshift parallelizes and distributes everything

•

Load

•

Backup/Restore

•

Backups to Amazon S3 are automatic,
continuous and incremental

•

Resize

•

Configurable system snapshot
retention period

•

Take user snapshots on-demand

•

Streaming restores enable you to
resume querying faster
Amazon Redshift parallelizes and distributes everything

•

Load

•

Backup/Restore

•

Resize

•

Resize while remaining online

•

Provision a new cluster in the
background

•

Copy data in parallel from node to
node

•

Only charged for source cluster
Amazon Redshift parallelizes and distributes everything

•

Load

•

Backup/Restore
•

•

Automatic SQL endpoint switchover
via DNS

•

Decommission the source cluster

•

Simple operation via Console or API

Resize
Amazon Redshift lets you start small and grow big
Extra Large Node (DW.HS1.XL)
3 spindles, 2 TB, 16 GB RAM, 2 cores

Eight Extra Large Node (DW.HS1.8XL)
24 spindles, 16 TB, 128 GB RAM, 16 cores, 10 GigE

Single Node (2 TB)

Cluster 2-100 Nodes (32 TB – 1.6 PB)

Cluster 2-32 Nodes (4 TB – 64 TB)

Note: Nodes not to scale
Amazon Redshift is priced to let you analyze all your data
Price Per Hour for
HS1.XL Single Node

Effective Hourly
Price per TB

Effective Annual
Price per TB

On-Demand

$ 0.850

$ 0.425

$ 3,723

1 Year Reservation

$ 0.500

$ 0.250

$ 2,190

3 Year Reservation

$ 0.228

$ 0.114

$

999

Simple Pricing
Number of Nodes x Cost per Hour
No charge for Leader Node
No upfront costs
Pay as you go
Amazon Redshift has security built in
Customer VPC

• SSL to secure data in transit
JDBC/ODBC

• Encryption to secure data at rest
–
–

Internal
VPC

AES-256; hardware accelerated
All blocks on disk and in Amazon S3
encrypted

10 GigE
(HPC)

• No direct access to compute
nodes
• Amazon VPC support

Ingestion
Backup
Restore
Amazon Redshift automatically manages data
replication and hardware failures
•

Replication within the cluster and backup to Amazon S3 to maintain multiple
copies of data at all times

•

Backups to Amazon S3 are continuous, automatic, and incremental
–

Designed for eleven nines of durability

•

Continuous monitoring and automated recovery from failures of drives and
nodes

•

Able to restore snapshots to any Availability Zone within a region
Growing ecosystem
AWS Marketplace
• Find software to use with
Amazon Redshift
• One-click deployments
• Flexible pricing options
http://aws.amazon.com/marketplace
Over 40 new features since launch on Feb 14
•

Regions
–

•

Certifications
–

•

Snapshot sharing, backup/restore progress indicators

Query
–

•

Load/unload encrypted files, Resource-level IAM, Temporary credentials

Manageability
–

•

PCI, SOC 1/2/3

Security
–

•

N. Virginia, Oregon, Dublin, Tokyo, Singapore, Sydney

Regex, Cursors, MD5, SHA1, Time zone, workload queue timeout

Ingestion
–

S3 Manifest, LZOP/LZO, JSON built-ins, UTF-8 4byte, invalid character substitution, CSV, auto datetime format
detection, epoch
Amazon Redshift – What’s Next
Security, visibility and control
•

Audit logging
Redshift

•

SNS Alerts
Visibility and control

AWS
CloudTrail
System Activity
Creates, Changes,
Deletes, Resizes

•

Audit logging

•

SNS Alerts

Amazon Redshift

Database Activity
Logins, Login failures,
Queries, Loads
Amazon S3
Visibility and control
•
•

Audit logging
Monitoring
Security
Maintenance
Errors

SNS Alerts
Amazon
Redshift

SNS
Topic
Batch operations
•

Cluster Creation

•

Faster Resize

Amazon Corporate Amazon
EC2 Data Center EMR

Amazon
Redshift

Amazon S3
Batch operations
•

Cluster Creation

•

Faster Resize

Amazon Corporate Amazon
EC2 Data Center EMR

Amazon
Redshift

Amazon S3
Batch operations
•

Cluster Creation

•

Faster Resize

15-20 min

3 min
Batch operations
•

Cluster Creation

•

Faster Resize

29 hours

7 hours
Performance & Concurrency
Performance & Concurrency

692.8s
34.9s
< 2%
Performance & Concurrency

5,951.7s
2,151.9s
Performance & Concurrency

15
50
Feature Delivery
Unload logs (7/5)
Temp Credentials (4/11)

Sharing snapshots (7/18)

DUB (4/25)

Resource Level IAM (8/9)
SHA1 Builtin (7/15)

SOC1/2/3 (5/8)
Statement Timeout (7/22)
WLM Timeout/Wildcards (8/1)
UTF-8 Substitution (8/29)

JDBC Fetch Size (6/27)

EMR/HDFS/SSH copy, Distributed
Tables, Audit Logging/CloudTrail,
Concurrency, Resize Perf.,
Approximate Count Distinct, SNS
Alerts, WLM Memory Management
(11/13)

Service Launch (2/14)
Split_part, Audit tables (10/3)

6 weeks left
PCI (8/22)
SIN/SYD (10/8)
PDX (4/2)

JSON, Regex, Cursors (9/10)
NRT (6/5)

CRC32 Builtin, CSV, Restore
Progress (8/9)
Timezone, Epoch, Autoformat (7/25)
4 byte UTF-8 (7/18)

Unload Encrypted Files

HSM Support (11/11)
Redshift Customers at re:Invent
BDT 101: Big Data ‘State of the Union’
Earlier today

DAT 305: Getting Maximum Performance from Amazon Redshift
Wednesday 11/13: 3pm in Murano 3303
Redshift Customers at re:Invent
DAT 306: How Amazon.com is Leveraging Amazon Redshift
Thursday 11/14: 3pm in Murano 3303

DAT 205: Amazon Redshift in Action: Enterprise, Big Data, SaaS
Friday 11/15: 9am in Lido 3006
Please give us your feedback on this
presentation

DAT 103
As a thank you, we will select prize
winners daily for completed surveys!

Weitere ähnliche Inhalte

Was ist angesagt?

Migration to Redshift from SQL Server
Migration to Redshift from SQL ServerMigration to Redshift from SQL Server
Migration to Redshift from SQL Server
joeharris76
 

Was ist angesagt? (20)

Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon RedshiftUses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon Redshift
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 
Deep Dive Amazon Redshift for Big Data Analytics - September Webinar Series
Deep Dive Amazon Redshift for Big Data Analytics - September Webinar SeriesDeep Dive Amazon Redshift for Big Data Analytics - September Webinar Series
Deep Dive Amazon Redshift for Big Data Analytics - September Webinar Series
 
Redshift overview
Redshift overviewRedshift overview
Redshift overview
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
 
Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData
 
Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon Redshift Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon Redshift
 
Best Practices for Data Warehousing with Amazon Redshift | AWS Public Sector ...
Best Practices for Data Warehousing with Amazon Redshift | AWS Public Sector ...Best Practices for Data Warehousing with Amazon Redshift | AWS Public Sector ...
Best Practices for Data Warehousing with Amazon Redshift | AWS Public Sector ...
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Powering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon RedshiftPowering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon Redshift
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
 
Migration to Redshift from SQL Server
Migration to Redshift from SQL ServerMigration to Redshift from SQL Server
Migration to Redshift from SQL Server
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon Redshift
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
 
AWS July Webinar Series: Amazon Redshift Optimizing Performance
AWS July Webinar Series: Amazon Redshift Optimizing PerformanceAWS July Webinar Series: Amazon Redshift Optimizing Performance
AWS July Webinar Series: Amazon Redshift Optimizing Performance
 
Scalability of Amazon Redshift Data Loading and Query Speed
Scalability of Amazon Redshift Data Loading and Query SpeedScalability of Amazon Redshift Data Loading and Query Speed
Scalability of Amazon Redshift Data Loading and Query Speed
 
(BDT401) Amazon Redshift Deep Dive: Tuning and Best Practices
(BDT401) Amazon Redshift Deep Dive: Tuning and Best Practices(BDT401) Amazon Redshift Deep Dive: Tuning and Best Practices
(BDT401) Amazon Redshift Deep Dive: Tuning and Best Practices
 

Andere mochten auch

Netflix - Freedom & Responsibility
Netflix - Freedom & ResponsibilityNetflix - Freedom & Responsibility
Netflix - Freedom & Responsibility
Chris Ellis
 
twitch tv presentation
twitch tv presentationtwitch tv presentation
twitch tv presentation
Galen Gong
 

Andere mochten auch (18)

The EU Lightbulb Ban
The EU Lightbulb BanThe EU Lightbulb Ban
The EU Lightbulb Ban
 
State of the Video Game Industry
State of the Video Game IndustryState of the Video Game Industry
State of the Video Game Industry
 
Introduction to Amazon Redshift
Introduction to Amazon RedshiftIntroduction to Amazon Redshift
Introduction to Amazon Redshift
 
Netflix - Freedom & Responsibility
Netflix - Freedom & ResponsibilityNetflix - Freedom & Responsibility
Netflix - Freedom & Responsibility
 
DAT101 Understanding AWS Database Options - AWS re: Invent 2012
DAT101 Understanding AWS Database Options - AWS re: Invent 2012DAT101 Understanding AWS Database Options - AWS re: Invent 2012
DAT101 Understanding AWS Database Options - AWS re: Invent 2012
 
Building Event-Driven Services with Apache Kafka
Building Event-Driven Services with Apache KafkaBuilding Event-Driven Services with Apache Kafka
Building Event-Driven Services with Apache Kafka
 
Facebook chat architecture
Facebook chat architectureFacebook chat architecture
Facebook chat architecture
 
Twitch Presentation
Twitch PresentationTwitch Presentation
Twitch Presentation
 
Twitch Plays Pokémon: Twitch's Chat Architecture
Twitch Plays Pokémon: Twitch's Chat ArchitectureTwitch Plays Pokémon: Twitch's Chat Architecture
Twitch Plays Pokémon: Twitch's Chat Architecture
 
twitch tv presentation
twitch tv presentationtwitch tv presentation
twitch tv presentation
 
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
 
Facebook architecture presentation: scalability challenge
Facebook architecture presentation: scalability challengeFacebook architecture presentation: scalability challenge
Facebook architecture presentation: scalability challenge
 
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M users
 
How Game Developers Reach New Customers with Twitch
How Game Developers Reach New Customers with Twitch How Game Developers Reach New Customers with Twitch
How Game Developers Reach New Customers with Twitch
 
AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
 
(BDT310) Big Data Architectural Patterns and Best Practices on AWS
(BDT310) Big Data Architectural Patterns and Best Practices on AWS(BDT310) Big Data Architectural Patterns and Best Practices on AWS
(BDT310) Big Data Architectural Patterns and Best Practices on AWS
 

Ähnlich wie Introduction to Amazon Redshift and What's Next (DAT103) | AWS re:Invent 2013

Ähnlich wie Introduction to Amazon Redshift and What's Next (DAT103) | AWS re:Invent 2013 (20)

Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...
 
Selecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech TalksSelecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
 
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
 
AWS Analytics
AWS AnalyticsAWS Analytics
AWS Analytics
 
(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS
 
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
 
Aws summit 2014 redshift
Aws summit 2014 redshiftAws summit 2014 redshift
Aws summit 2014 redshift
 
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web ServicesAWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
 
[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
 
Introduction to Database Services
Introduction to Database ServicesIntroduction to Database Services
Introduction to Database Services
 
Amazon Redshift, Customer Acquisition Cost & Advertising ROI presented with A...
Amazon Redshift, Customer Acquisition Cost & Advertising ROI presented with A...Amazon Redshift, Customer Acquisition Cost & Advertising ROI presented with A...
Amazon Redshift, Customer Acquisition Cost & Advertising ROI presented with A...
 
Data & Analytics - Session 2 - Introducing Amazon Redshift
Data & Analytics - Session 2 - Introducing Amazon RedshiftData & Analytics - Session 2 - Introducing Amazon Redshift
Data & Analytics - Session 2 - Introducing Amazon Redshift
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Leveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data WarehouseLeveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data Warehouse
 
Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
 
Intro to AWS: Database Services
Intro to AWS: Database ServicesIntro to AWS: Database Services
Intro to AWS: Database Services
 
Amazon RDS with Amazon Aurora | AWS Public Sector Summit 2016
Amazon RDS with Amazon Aurora | AWS Public Sector Summit 2016Amazon RDS with Amazon Aurora | AWS Public Sector Summit 2016
Amazon RDS with Amazon Aurora | AWS Public Sector Summit 2016
 

Mehr von Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Introduction to Amazon Redshift and What's Next (DAT103) | AWS re:Invent 2013

  • 1. Amazon Redshift Overview & What’s Next Rahul Pathak, Redshift PM (rapathak@amazon.com) Anurag Gupta, Redshift GM (awgupta@amazon.com) November 13, 2013 © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 2. Amazon Redshift Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year
  • 3. Amazon Redshift dramatically reduces I/O ID Amount 20 CA 500 345 25 WA 250 678 Data compression State 123 • Age 40 FL 125 37 WA 375 • Zone maps 957 • Direct-attached storage • With row storage you do unnecessary I/O • To get total amount, you have to read everything
  • 4. Amazon Redshift dramatically reduces I/O ID Amount 20 CA 500 345 25 WA 250 678 Data compression State 123 • Age 40 FL 125 37 WA 375 • Zone maps 957 • Direct-attached storage • With column storage, you only read the data you need
  • 5. Amazon Redshift dramatically reduces I/O analyze compression listing; • Data compression • Zone maps • Direct-attached storage Table | Column | Encoding ---------+----------------+---------listing | listid | delta listing | sellerid | delta32k listing | eventid | delta32k listing | dateid | bytedict listing | numtickets | bytedict listing | priceperticket | delta32k listing | totalprice | mostly32 listing | listtime | raw • COPY compresses automatically on load • You can analyze and override • More performance, less cost Slides not intended for redistribution.
  • 6. Amazon Redshift dramatically reduces I/O 10 324 • Data compression 375 623 10 | 13 | 14 | 26 |… … | 100 | 245 | 324 375 | 393 | 417… … 512 | 549 | 623 637 • Zone maps • Direct-attached storage 637 | 712 | 809 … 959 … | 834 | 921 | 959 • Track the minimum and maximum value for each block • Skip over blocks that don’t contain relevant data
  • 7. Amazon Redshift dramatically reduces I/O DW.HS1.XL: • Data compression • Zone maps • Direct-attached storage DW.HS1.8XL: • > 2 GB/s scan rate • Optimized for data processing • High disk density
  • 8. Amazon Redshift architecture • Leader Node – – – JDBC/ODBC SQL endpoint Stores metadata Coordinates query execution • Compute Nodes – – – – 10 GigE (HPC) Local, columnar storage Execute queries in parallel Load, backup, restore via Amazon S3 Parallel load from Amazon DynamoDB • Single node version available Ingestion Backup Restore
  • 9. Amazon Redshift parallelizes and distributes everything • Load • Backup/Restore • Resize
  • 10. Amazon Redshift parallelizes and distributes everything • Load • Backup/Restore • • Load in parallel from Amazon S3 or Amazon DynamoDB • Data automatically distributed and sorted according to DDL • Scales linearly with number of nodes Resize
  • 11. Amazon Redshift parallelizes and distributes everything • Load • Backup/Restore • Backups to Amazon S3 are automatic, continuous and incremental • Resize • Configurable system snapshot retention period • Take user snapshots on-demand • Streaming restores enable you to resume querying faster
  • 12. Amazon Redshift parallelizes and distributes everything • Load • Backup/Restore • Resize • Resize while remaining online • Provision a new cluster in the background • Copy data in parallel from node to node • Only charged for source cluster
  • 13. Amazon Redshift parallelizes and distributes everything • Load • Backup/Restore • • Automatic SQL endpoint switchover via DNS • Decommission the source cluster • Simple operation via Console or API Resize
  • 14. Amazon Redshift lets you start small and grow big Extra Large Node (DW.HS1.XL) 3 spindles, 2 TB, 16 GB RAM, 2 cores Eight Extra Large Node (DW.HS1.8XL) 24 spindles, 16 TB, 128 GB RAM, 16 cores, 10 GigE Single Node (2 TB) Cluster 2-100 Nodes (32 TB – 1.6 PB) Cluster 2-32 Nodes (4 TB – 64 TB) Note: Nodes not to scale
  • 15. Amazon Redshift is priced to let you analyze all your data Price Per Hour for HS1.XL Single Node Effective Hourly Price per TB Effective Annual Price per TB On-Demand $ 0.850 $ 0.425 $ 3,723 1 Year Reservation $ 0.500 $ 0.250 $ 2,190 3 Year Reservation $ 0.228 $ 0.114 $ 999 Simple Pricing Number of Nodes x Cost per Hour No charge for Leader Node No upfront costs Pay as you go
  • 16. Amazon Redshift has security built in Customer VPC • SSL to secure data in transit JDBC/ODBC • Encryption to secure data at rest – – Internal VPC AES-256; hardware accelerated All blocks on disk and in Amazon S3 encrypted 10 GigE (HPC) • No direct access to compute nodes • Amazon VPC support Ingestion Backup Restore
  • 17. Amazon Redshift automatically manages data replication and hardware failures • Replication within the cluster and backup to Amazon S3 to maintain multiple copies of data at all times • Backups to Amazon S3 are continuous, automatic, and incremental – Designed for eleven nines of durability • Continuous monitoring and automated recovery from failures of drives and nodes • Able to restore snapshots to any Availability Zone within a region
  • 19. AWS Marketplace • Find software to use with Amazon Redshift • One-click deployments • Flexible pricing options http://aws.amazon.com/marketplace
  • 20. Over 40 new features since launch on Feb 14 • Regions – • Certifications – • Snapshot sharing, backup/restore progress indicators Query – • Load/unload encrypted files, Resource-level IAM, Temporary credentials Manageability – • PCI, SOC 1/2/3 Security – • N. Virginia, Oregon, Dublin, Tokyo, Singapore, Sydney Regex, Cursors, MD5, SHA1, Time zone, workload queue timeout Ingestion – S3 Manifest, LZOP/LZO, JSON built-ins, UTF-8 4byte, invalid character substitution, CSV, auto datetime format detection, epoch
  • 21. Amazon Redshift – What’s Next
  • 22. Security, visibility and control • Audit logging Redshift • SNS Alerts
  • 23. Visibility and control AWS CloudTrail System Activity Creates, Changes, Deletes, Resizes • Audit logging • SNS Alerts Amazon Redshift Database Activity Logins, Login failures, Queries, Loads Amazon S3
  • 24. Visibility and control • • Audit logging Monitoring Security Maintenance Errors SNS Alerts Amazon Redshift SNS Topic
  • 25. Batch operations • Cluster Creation • Faster Resize Amazon Corporate Amazon EC2 Data Center EMR Amazon Redshift Amazon S3
  • 26. Batch operations • Cluster Creation • Faster Resize Amazon Corporate Amazon EC2 Data Center EMR Amazon Redshift Amazon S3
  • 33. Feature Delivery Unload logs (7/5) Temp Credentials (4/11) Sharing snapshots (7/18) DUB (4/25) Resource Level IAM (8/9) SHA1 Builtin (7/15) SOC1/2/3 (5/8) Statement Timeout (7/22) WLM Timeout/Wildcards (8/1) UTF-8 Substitution (8/29) JDBC Fetch Size (6/27) EMR/HDFS/SSH copy, Distributed Tables, Audit Logging/CloudTrail, Concurrency, Resize Perf., Approximate Count Distinct, SNS Alerts, WLM Memory Management (11/13) Service Launch (2/14) Split_part, Audit tables (10/3) 6 weeks left PCI (8/22) SIN/SYD (10/8) PDX (4/2) JSON, Regex, Cursors (9/10) NRT (6/5) CRC32 Builtin, CSV, Restore Progress (8/9) Timezone, Epoch, Autoformat (7/25) 4 byte UTF-8 (7/18) Unload Encrypted Files HSM Support (11/11)
  • 34. Redshift Customers at re:Invent BDT 101: Big Data ‘State of the Union’ Earlier today DAT 305: Getting Maximum Performance from Amazon Redshift Wednesday 11/13: 3pm in Murano 3303
  • 35. Redshift Customers at re:Invent DAT 306: How Amazon.com is Leveraging Amazon Redshift Thursday 11/14: 3pm in Murano 3303 DAT 205: Amazon Redshift in Action: Enterprise, Big Data, SaaS Friday 11/15: 9am in Lido 3006
  • 36. Please give us your feedback on this presentation DAT 103 As a thank you, we will select prize winners daily for completed surveys!