SlideShare ist ein Scribd-Unternehmen logo
1 von 62
Downloaden Sie, um offline zu lesen
AWSプロダクトシリーズ

よくわかるAmazon Redshift in 大阪

2014/02/21
アマゾン データ サービス ジャパン株式会社
Big Data Platform on AWS
Collect

Store

Analyze

Direct Connect

S3

EMR

Import Export

DynamoDB

Redshift

Kinesis

Glacier

EC2
Lower costs

Ease of use
only pay for what you use
no capital investment

Lower costs
pay as you go
no subscriptions

Ease of use
integrate with
existing tools

programmable

Lower costs

Ease of use
zero admin

easy to
configure
Store anything
Object storage
Amazon S3

Scalable
99.999999999% durability
Real-time processing
High throughput; elastic
Amazon Kinesis

Easy to use
S3, Redshift, DynamoDB Integrations
Hadoop/HDFS clusters
Hive, Impala, MapReduce
Amazon EMR

Easy to use; fully managed
On-demand and spot pricing
NoSQL Database
Seamless scalability
Amazon
DynamoDB

Zero admin
Single digit millisecond latency
Relational data warehouse
Massively parallel
Amazon
Redshift

Petabyte scale
Fully managed

$1,000/TB/Year
Amazon Redshift
Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year
Rahul Pathak |Senior Product Manager
a lot faster
a lot cheaper
a whole lot simpler

Petabyte scale

Massively parallel
Amazon
Redshift

Relational data warehouse
Fully managed; zero admin
Amazon Redshift Quick Overview
Amazon Redshift 概要のおさらい
Amazon Redshift architecture
•

Leader Node
–
–
–

•

SQL endpoint
Stores metadata
Coordinates query execution

JDBC/ODBC

Compute Nodes
–
–
–
–

Local, columnar storage
Execute queries in parallel
Load, backup, restore via
Amazon S3
Parallel load from Amazon DynamoDB

•

Hardware optimized for data processing

•

Two hardware platforms

10 GigE
(HPC)

–
–

DW1: HDD; scale from 2TB to 1.6PB
DW2: SSD; scale from 160GB to 256TB

Ingestion
Backup
Restore
Amazon Redshift has security built-in
•
•

Customer VPC

SSL to secure data in transit
Encryption to secure data at rest
–
–
–

JDBC/ODBC

AES-256; hardware accelerated
All blocks on disks and in Amazon S3
encrypted
HSM Support

•

10 GigE
(HPC)

No direct access to compute nodes

•

Internal
VPC

Audit logging & AWS CloudTrail
integration

•

Amazon VPC support

Ingestion
Backup
Restore
Amazon Redshift is easy to use
•

Provision in minutes

•

Monitor query performance

•

Point and click resize

•

Built in security

•

Automatic backups
Provision a data warehouse in minutes
Monitor query performance
Point and click resize

•

Resize while remaining online via AWS
Console or API

•

Provision a new cluster in the background
and copy data in parallel from node to
node

•

Only charged for source cluster until SQL
endpoint has automatically been switched
over via DNS
Amazon Redshift continuously backs up your data and
recovers from failures
•

Replication within the cluster and backup to Amazon S3 to maintain multiple
copies of data at all times

•

Backups to Amazon S3 are continuous, automatic, and incremental
–

Designed for eleven nines of durability

•

Continuous monitoring and automated recovery from failures of drives and nodes

•

Able to restore snapshots to any Availability Zone within a region

•

Easily enable backups to a second region for disaster recovery
Amazon Redshift integrates with multiple data sources

Corporate Datacenter

DynamoDB

Amazon Redshift

Amazon S3

Amazon RDS

Amazon EMR
New Features That Introduced After re:Invent 2013
re:Invent 2013以降の主なアップデート
Feature Delivery in 2013
Unload logs (7/5)
Temp Credentials (4/11)

Sharing snapshots (7/18)

DUB (4/25)

Resource Level IAM (8/9)
SHA1 Builtin (7/15)

SOC1/2/3 (5/8)
Statement Timeout (7/22)
WLM Timeout/Wildcards (8/1)
UTF-8 Substitution (8/29)

JDBC Fetch Size (6/27)

Kinesis EMR/HDFS/SSH copy,
Distributed Tables, Audit
Logging/CloudTrail, Concurrency,
Resize Perf., Approximate Count
Distinct, SNS Alerts (11/13)

Service Launch (2/14)
Split_part, Audit tables (10/3)
EIP Support for VPC Clusters (12/28)

PCI (8/22)
SIN/SYD (10/8)
PDX (4/2)

Distributed Tables, Single Node Cursor
Support, Maximum Connections to 500
(12/13)

JSON, Regex, Cursors (9/10)

NRT (6/5)

CRC32 Builtin, CSV, Restore Progress
(8/9)
Timezone, Epoch, Autoformat (7/25)
4 byte UTF-8 (7/18)

Unload Encrypted Files

HSM Support (11/11)
Summary of Updates after re:Invent
•

Amazon Redshift - New Features Galore (2013/11/11)
–
–
–
–

–
–
–
–

•
•
•

Distributed Tables - You now have more control over the distribution of a table's rows across compute
nodes.
Remote Loading - You can now load data into Redshift from remote hosts across an SSH connection.
Approximate Count Distinct - You can now use a variant of the COUNT function to approximate the
number of matching rows.
Workload Queue Memory Management - You can now apportion available memory across work
queues.
Key Rotation - You can now direct Redshift to rotate keys for an encrypted cluster.
HSM Support - You can now direct Redshift to use an on-premises Hardware Security Module (HSM) or
AWS CloudHSM to manage the encryption master and cluster encryption keys.
Database Auditing and Logging - You can log connections and user activity to Amazon S3.
SNS Notification - Redshift can now issue notifications to an Amazon SNS topic when certain events
occur.

Automated Cross-Region Snapshot Copy for Amazon Redshift (2013/11/14)
Faster & More Cost-Effective SSD-Based Nodes for Amazon Redshift(2014/01/24)
AWS CloudFormation Adds Support for Redshift and More (2014/02/10)
Amazon Redshift Node Types
DW1.XL: 16 GB RAM, 2 Cores
3 Spindles, 2 TB compressed storage

•

Optimized for I/O intensive workloads

•

High disk density

DW1.8XL: 128 GB RAM, 16 Cores, 24
Spindles 16 TB compressed, 2 GB/sec scan
rate

•

On demand at $0.85/hour

•

As low as $1,000/TB/Year

•

Scale from 2TB to 1.6PB

DW2.L *New*: 16 GB RAM, 2 Cores,
160 GB compressed SSD storage

•

High performance at smaller storage size

•

High compute and memory density

•

On demand at $0.25/hour

•

As low as $5,500/TB/Year

•

Scale from 160GB to 256TB

DW2.8XL *New*: 256 GB RAM, 32 Cores,
2.56 TB of compressed SSD storage
Amazon Redshift is priced to let you analyze all your data
Price Per Hour for
DW1.XL Single Node

Effective Annual
Price per TB

On-Demand

$ 1.250

$ 5,475

1 Year Reservation

$ 0.750

$ 3,283

3 Year Reservation

$ 0.452

$ 1,981

DW1 (HDD)

Effective Annual
Price per TB

On-Demand

$ 0.330

$ 18,068

1 Year Reservation

$ 0.211

$ 11,570

3 Year Reservation

$ 0.130

$ 7,127

No charge for leader node

•
Price Per Hour for
DW2.L Single Node

Number of nodes x cost per
hour

•

DW2 (SSD)

•

No upfront costs

•

Pay as you go
Security, visibility and control
•

Audit logging
Redshift

•

SNS Alerts
Visibility and control

AWS
CloudTrail
System Activity
Creates, Changes,
Deletes, Resizes

•

Audit logging

•

SNS Alerts

Amazon Redshift

Database Activity
Logins, Login failures,
Queries, Loads

Amazon S3
Visibility and control
•
•

Audit logging
Monitoring
Security
Maintenance
Errors

SNS Alerts
Amazon
Redshift

SNS
Topic
Batch operations
•

Cluster Creation

•

Faster Resize

Amazon Corporate Amazon
EC2 Data Center EMR

Amazon
Redshift

Amazon S3
Batch operations
•

Cluster Creation

•

Faster Resize

Amazon Corporate Amazon
EC2 Data Center EMR

Amazon
Redshift

Amazon S3
Batch operations
•

Cluster Creation

•

Faster Resize

15-20 min

3 min
Batch operations
•

Cluster Creation

•

Faster Resize

29 hours

7 hours
Performance & Concurrency
Performance & Concurrency

692.8s

34.9s
< 2%
Performance & Concurrency

5,951.7s
2,151.9s
Performance & Concurrency

15

50
How Customers Leverage Amazon Redshift
Amazon Redshift 活用事例
Common Customer Use Cases

Traditional Enterprise DW

SaaS Companies

•

Improve performance by
an order of magnitude

•

Add analytic functionality
to applications

Make more data
available for analysis

•

Scale DW capacity as
demand grows

•

•

•

•

•

Reduce costs by
extending DW rather than
adding HW

Companies with Big Data

Access business data via
standard reporting tools

•

Reduce HW & SW costs
by an order of magnitude

Migrate completely from
existing DW systems
Respond faster to
business; provision in
minutes
Amazon Redshift Customers
Japanese Redshift Customer – ALBERT
•

Business Challenge
–

•

Why AWS?
–

–

•

Given their data volumes, RDBMS tuning and archiving was causing them a lot of
operational pain and costing them money

Amazon Redshift’s performance and ability to handle large data sets allowed them to
make it the core engine of their analytics, enabling them to provide a private DMP (Data
Management Platform) for their customers on the Cloud
PostgreSQL is their primary RDBMS, and connectivity by PostgreSQL drivers is big technical
advantage to choose Redshift.

Benefits for their business
–
–

Ability to start small and scale as needed
Scalability and flexibility dramatically lowered the cost of ownership
Japanese Redshift Customer – Sansan
•

Business Challenge
– Since “Eight” is business card management solution for consumers, they
needed infrastructure that could start small and scale as needed

•

Why AWS?
– When they tried out AWS first, they were surprised with the ease of use. AWS
functionality and elasticity were critical factors

•

Benefits for their business
– Lower costs substantially using reserved instances
– Automation is a key to reduce operational and administration costs. They
utilize services such as Amazon SES and Amazon SWF.
– They use Redshift for KPI analytics of their services.
Growing ecosystem
Multiple Data Loading Options
Data Integration

•

Parallel upload to Amazon S3

•

AWS Direct Connect

•

AWS Import/Export

•

ETL Software

•

Systems integrators

Systems Integrators
Customers on Performance
“Redshift is twenty times faster than Hive” (5x – 20x reduction in query times) link

…[Redshift] performance has blown away everyone here (we generally see 50-100x speedup
over Hive). link
“We saw…2x improvement in query times and a 50% reduction in costs”
We regularly process multibillion row datasets and we do that in a matter of hours. link
“Queries that used to take hours came back in seconds. Our analysts are orders of magnitude
more productive.” (20x – 40x reduction in query times) link
“Did I mention it's ridiculously fast? We'll be using it immediately to provide our analysts
an alternative to Hadoop.”
Customers on Cost
“We found that Amazon Redshift offers the performance we needed while freeing us from
the licensing costs of our previous solution” link
“[Redshift] cost saving is even more impressive…Our analysts like [Redshift] so much they
don’t want to go back.” (4x reduction in cost over HIVE) link
“We saw 50% reduction in costs”
“Not only did we avoid 3 months of development work [we] saved approximately $80,000 in
labor…Competitive Advantage realized with just a few clicks.”
“[Amazon Redshift] took an industry famous for its opaque pricing, high TCO and unreliable
results and completely turned it on its head.” link
“[Redshift] has reduced our storage and processing costs significantly, helping us to realize
another 60-70 percent savings.” link
Customer on Ease of Use
“With Amazon Redshift and Tableau, anyone in the company can set up any queries they
like…It’s very flexible.” link
“Compared to Hadoop [Redshift] is much easier for analysts to use. What may have been a
Hadoop project can become just a query in Redshift.” link

“We can spin up an Amazon Redshift cluster, take a snapshot, and scale servers in minutes
instead of days.” link
“…our team was able to provision Redshift in a matter hours vs. weeks with on-premises
servers.”
“Amazon Redshift is simple to use and reliable. With one click, we can rapidly scale up or down
in real time in alignment with business requirements.” link
“Customers can get consistent, accurate, and useful data fast - in weeks not months or years.”
link
AWS Marketplace
•

Find software to use with Amazon
Redshift

•

One-click deployments

•

Flexible pricing options

http://aws.amazon.com/marketplace
Questions?
APPENDIX
Resources
•

Detail Pages
–
–

•

New Features
–
–

•

http://docs.aws.amazon.com/redshift/latest/dg/doc-history.html
http://docs.aws.amazon.com/redshift/latest/mgmt/document-history.html

Best Practices
–
–
–

•

http://aws.amazon.com/redshift
https://aws.amazon.com/marketplace/redshift/

http://docs.aws.amazon.com/redshift/latest/dg/c_loading-data-best-practices.html
http://docs.aws.amazon.com/redshift/latest/dg/c_designing-tables-best-practices.html
http://docs.aws.amazon.com/redshift/latest/dg/c-optimizing-query-performance.html

Presentations & Webinars:
–
–
–

http://www.youtube.com/watch?v=JxLpj_TnisM (2013 SF Summit Presentation)
http://www.youtube.com/watch?v=R1m-fwzXMow (Best Practices 1 of 2)
http://www.youtube.com/watch?v=7ySzRTOyK6o (Best Practices 2 of 2)
Amazon Redshift dramatically reduces I/O
Column storage
Data compression

Age

State

Amount

20

CA

500

345

25

WA

250

678

•

ID
123

•

40

FL

125

37

WA

375

•

Zone maps

957

•

Direct-attached storage

•

With row storage you do
unnecessary I/O

•

To get total amount, you have to
read everything
Amazon Redshift dramatically reduces I/O
Column storage
Data compression

Age

State

Amount

20

CA

500

345

25

WA

250

678

•

ID
123

•

40

FL

125

37

WA

375

•

Zone maps

957

•

Direct-attached storage

•

With column storage, you only
read the data you need
Amazon Redshift dramatically reduces I/O
•

Column storage

analyze compression listing;
Table |
Column
| Encoding
---------+----------------+---------listing | listid
| delta
listing | sellerid
| delta32k
listing | eventid
| delta32k
listing | dateid
| bytedict
listing | numtickets
| bytedict
listing | priceperticket | delta32k
listing | totalprice
| mostly32
listing | listtime
| raw

•

Data compression

•

Zone maps

•

Direct-attached storage

•

COPY compresses automatically

•

You can analyze and override

•

More performance, less cost

Slides not intended for redistribution.
Amazon Redshift dramatically reduces I/O
•

Column storage

10
324

•

Data compression

375

623

•

Zone maps

•

Direct-attached storage

637
959

10 | 13 | 14 | 26 |…
… | 100 | 245 | 324

375 | 393 | 417…
… 512 | 549 | 623
637 | 712 | 809 …
… | 834 | 921 | 959

•

Track the minimum and maximum
value for each block

•

Skip over blocks that don’t contain
relevant data
Amazon Redshift dramatically reduces I/O
•

Column storage
•

Use local storage for performance

•

Maximize scan rates

•

Data compression

•

Zone maps

•

Automatic replication and
continuous backup

•

Direct-attached storage

•

HDD & SSD platforms
Amazon Redshift parallelizes and distributes everything
•

Query

•

Load

•

Backup/Restore

•

Resize
Amazon Redshift parallelizes and distributes everything
•

Query

•

Load

•

Backup/Restore

•

Resize

•

Load in parallel from Amazon S3 or
Amazon DynamoDB or any SSH
connection

•

Data automatically distributed and
sorted according to DDL

•

Scales linearly with number of nodes
Amazon Redshift parallelizes and distributes everything
•

Query

•

Load

•

Backup/Restore

•

Backups to Amazon S3 are automatic, continuous
and incremental

•

Resize

•

Configurable system snapshot retention period. Take
user snapshots on-demand

•

Cross region backups for disaster recovery

•

Streaming restores enable you to resume querying
faster
Amazon Redshift parallelizes and distributes everything
•

Query

•

Load

•

Backup/Restore

•

Resize

•

Resize while remaining online

•

Provision a new cluster in the background

•

Copy data in parallel from node to node

•

Only charged for source cluster
Amazon Redshift parallelizes and distributes everything
•

Query

•

Load

•

Backup/Restore
•

•

Automatic SQL endpoint switchover
via DNS

•

Decommission the source cluster

•

Simple operation via Console or API

Resize

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

2017 AWS DB Day | AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?
2017 AWS DB Day |  AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?2017 AWS DB Day |  AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?
2017 AWS DB Day | AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?
 
2017 AWS DB Day | Amazon Redshift 자세히 살펴보기
2017 AWS DB Day | Amazon Redshift 자세히 살펴보기2017 AWS DB Day | Amazon Redshift 자세히 살펴보기
2017 AWS DB Day | Amazon Redshift 자세히 살펴보기
 
BigData: AWS RedShift with S3, EC2
BigData: AWS RedShift with S3, EC2BigData: AWS RedShift with S3, EC2
BigData: AWS RedShift with S3, EC2
 
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
(DAT308) Yahoo! Analyzes Billions of Events a Day on Amazon Redshift
 
(ISM303) Migrating Your Enterprise Data Warehouse To Amazon Redshift
(ISM303) Migrating Your Enterprise Data Warehouse To Amazon Redshift(ISM303) Migrating Your Enterprise Data Warehouse To Amazon Redshift
(ISM303) Migrating Your Enterprise Data Warehouse To Amazon Redshift
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration Service
 
Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon RedshiftUses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon Redshift
 
What's New in Amazon Aurora
What's New in Amazon AuroraWhat's New in Amazon Aurora
What's New in Amazon Aurora
 
Redshift
RedshiftRedshift
Redshift
 
Intro to AWS: Database Services
Intro to AWS: Database ServicesIntro to AWS: Database Services
Intro to AWS: Database Services
 
(SOV202) Choosing Among AWS Managed Database Services | AWS re:Invent 2014
(SOV202) Choosing Among AWS Managed Database Services | AWS re:Invent 2014(SOV202) Choosing Among AWS Managed Database Services | AWS re:Invent 2014
(SOV202) Choosing Among AWS Managed Database Services | AWS re:Invent 2014
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
 
Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015
 
Building your data warehouse with Redshift
Building your data warehouse with RedshiftBuilding your data warehouse with Redshift
Building your data warehouse with Redshift
 
SRV404 Deep Dive on Amazon DynamoDB
SRV404 Deep Dive on Amazon DynamoDBSRV404 Deep Dive on Amazon DynamoDB
SRV404 Deep Dive on Amazon DynamoDB
 
Getting Started with Amazon EC2 and Compute Services
Getting Started with Amazon EC2 and Compute ServicesGetting Started with Amazon EC2 and Compute Services
Getting Started with Amazon EC2 and Compute Services
 
Deep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDBDeep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDB
 
AWS RDS Presentation - DOAG Conference
AWS RDS Presentation - DOAG Conference AWS RDS Presentation - DOAG Conference
AWS RDS Presentation - DOAG Conference
 
ENT306 Migrating large Scale Data Sets to the Cloud
ENT306 Migrating large Scale Data Sets to the CloudENT306 Migrating large Scale Data Sets to the Cloud
ENT306 Migrating large Scale Data Sets to the Cloud
 

Andere mochten auch

[AWS re:invent 2013 Report] AWS New EC2 Instance Types
[AWS re:invent 2013 Report] AWS New EC2 Instance Types[AWS re:invent 2013 Report] AWS New EC2 Instance Types
[AWS re:invent 2013 Report] AWS New EC2 Instance Types
Amazon Web Services Japan
 
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
Amazon Web Services Japan
 
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
Amazon Web Services Japan
 
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
Amazon Web Services Japan
 
AWS Simple Monthly Calculator 操作説明書
AWS Simple Monthly Calculator 操作説明書AWS Simple Monthly Calculator 操作説明書
AWS Simple Monthly Calculator 操作説明書
Amazon Web Services Japan
 
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
Amazon Web Services Japan
 
[AWSマイスターシリーズ] Amazon ElastiCache
[AWSマイスターシリーズ] Amazon ElastiCache[AWSマイスターシリーズ] Amazon ElastiCache
[AWSマイスターシリーズ] Amazon ElastiCache
Amazon Web Services Japan
 

Andere mochten auch (19)

[AWS re:invent 2013 Report] AWS New EC2 Instance Types
[AWS re:invent 2013 Report] AWS New EC2 Instance Types[AWS re:invent 2013 Report] AWS New EC2 Instance Types
[AWS re:invent 2013 Report] AWS New EC2 Instance Types
 
20120319 aws meister-reloaded-s3
20120319 aws meister-reloaded-s320120319 aws meister-reloaded-s3
20120319 aws meister-reloaded-s3
 
Application Development and Data Modeling on Amazon DynamoDB
Application Development and Data Modeling on Amazon DynamoDBApplication Development and Data Modeling on Amazon DynamoDB
Application Development and Data Modeling on Amazon DynamoDB
 
20120423 aws meister-reloaded-fm-son_aws
20120423 aws meister-reloaded-fm-son_aws20120423 aws meister-reloaded-fm-son_aws
20120423 aws meister-reloaded-fm-son_aws
 
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
[よくわかるクラウドデータベース] Amazon RDS for SQL Server導入事例
 
20120303 jaws summit-meister-03_s3
20120303 jaws summit-meister-03_s320120303 jaws summit-meister-03_s3
20120303 jaws summit-meister-03_s3
 
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
[AWSマイスターシリーズ] AWS Client Side SDK -Android,iOS & JavaScript-
 
20120409 aws meister-reloaded-dynamo-db
20120409 aws meister-reloaded-dynamo-db20120409 aws meister-reloaded-dynamo-db
20120409 aws meister-reloaded-dynamo-db
 
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
[AWS re:invent 2013 Report] Amazon RDS PostgreSQL
 
AWS Simple Monthly Calculator 操作説明書
AWS Simple Monthly Calculator 操作説明書AWS Simple Monthly Calculator 操作説明書
AWS Simple Monthly Calculator 操作説明書
 
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
[AWSマイスターシリーズ] リザーブドインスタンス&スポットインスタンス
 
[AWSマイスターシリーズ] Amazon ElastiCache
[AWSマイスターシリーズ] Amazon ElastiCache[AWSマイスターシリーズ] Amazon ElastiCache
[AWSマイスターシリーズ] Amazon ElastiCache
 
AWS Black Belt Online Seminar lumberyard
AWS Black Belt Online Seminar lumberyardAWS Black Belt Online Seminar lumberyard
AWS Black Belt Online Seminar lumberyard
 
[AWSマイスターシリーズ] Amazon Simple Email Service
[AWSマイスターシリーズ] Amazon Simple Email Service[AWSマイスターシリーズ] Amazon Simple Email Service
[AWSマイスターシリーズ] Amazon Simple Email Service
 
Big DataとContainerとStream - AWSでのクラスタ構成とストリーム処理 -
Big DataとContainerとStream - AWSでのクラスタ構成とストリーム処理 -Big DataとContainerとStream - AWSでのクラスタ構成とストリーム処理 -
Big DataとContainerとStream - AWSでのクラスタ構成とストリーム処理 -
 
Black Belt Online Seminar Amazon Cognito
Black Belt Online Seminar Amazon CognitoBlack Belt Online Seminar Amazon Cognito
Black Belt Online Seminar Amazon Cognito
 
AWS Black Belt Online Seminar 10 Years of AWS
AWS Black Belt Online Seminar 10 Years of AWSAWS Black Belt Online Seminar 10 Years of AWS
AWS Black Belt Online Seminar 10 Years of AWS
 
AWS初心者向けWebinar AWSクラウドにおけるVDIソリューション
AWS初心者向けWebinar AWSクラウドにおけるVDIソリューションAWS初心者向けWebinar AWSクラウドにおけるVDIソリューション
AWS初心者向けWebinar AWSクラウドにおけるVDIソリューション
 
Conformisme 2.0
Conformisme 2.0Conformisme 2.0
Conformisme 2.0
 

Ähnlich wie [よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介

Ähnlich wie [よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介 (20)

Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
RDS Postgres and Aurora Postgres | AWS Public Sector Summit 2017
RDS Postgres and Aurora Postgres | AWS Public Sector Summit 2017RDS Postgres and Aurora Postgres | AWS Public Sector Summit 2017
RDS Postgres and Aurora Postgres | AWS Public Sector Summit 2017
 
(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
 
Bases de datos en la nube con AWS
Bases de datos en la nube con AWSBases de datos en la nube con AWS
Bases de datos en la nube con AWS
 
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech TalksMigrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
 
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech TalksMigrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
 
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
 
Building Analytic Apps for SaaS: “Analytics as a Service”
Building Analytic Apps for SaaS: “Analytics as a Service”Building Analytic Apps for SaaS: “Analytics as a Service”
Building Analytic Apps for SaaS: “Analytics as a Service”
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - Datalake
 
Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
 
Amazon Elastic Map Reduce - Ian Meyers
Amazon Elastic Map Reduce - Ian MeyersAmazon Elastic Map Reduce - Ian Meyers
Amazon Elastic Map Reduce - Ian Meyers
 
Getting Started with Managed Database Services on AWS - AWS Summit Tel Aviv 2017
Getting Started with Managed Database Services on AWS - AWS Summit Tel Aviv 2017Getting Started with Managed Database Services on AWS - AWS Summit Tel Aviv 2017
Getting Started with Managed Database Services on AWS - AWS Summit Tel Aviv 2017
 
Databases - State of the Union
Databases - State of the UnionDatabases - State of the Union
Databases - State of the Union
 
AWS Analytics
AWS AnalyticsAWS Analytics
AWS Analytics
 
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWSMigrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
 

Mehr von Amazon Web Services Japan

Mehr von Amazon Web Services Japan (20)

202205 AWS Black Belt Online Seminar Amazon VPC IP Address Manager (IPAM)
202205 AWS Black Belt Online Seminar Amazon VPC IP Address Manager (IPAM)202205 AWS Black Belt Online Seminar Amazon VPC IP Address Manager (IPAM)
202205 AWS Black Belt Online Seminar Amazon VPC IP Address Manager (IPAM)
 
202205 AWS Black Belt Online Seminar Amazon FSx for OpenZFS
202205 AWS Black Belt Online Seminar Amazon FSx for OpenZFS202205 AWS Black Belt Online Seminar Amazon FSx for OpenZFS
202205 AWS Black Belt Online Seminar Amazon FSx for OpenZFS
 
202204 AWS Black Belt Online Seminar AWS IoT Device Defender
202204 AWS Black Belt Online Seminar AWS IoT Device Defender202204 AWS Black Belt Online Seminar AWS IoT Device Defender
202204 AWS Black Belt Online Seminar AWS IoT Device Defender
 
Infrastructure as Code (IaC) 談義 2022
Infrastructure as Code (IaC) 談義 2022Infrastructure as Code (IaC) 談義 2022
Infrastructure as Code (IaC) 談義 2022
 
202204 AWS Black Belt Online Seminar Amazon Connect を活用したオンコール対応の実現
202204 AWS Black Belt Online Seminar Amazon Connect を活用したオンコール対応の実現202204 AWS Black Belt Online Seminar Amazon Connect を活用したオンコール対応の実現
202204 AWS Black Belt Online Seminar Amazon Connect を活用したオンコール対応の実現
 
202204 AWS Black Belt Online Seminar Amazon Connect Salesforce連携(第1回 CTI Adap...
202204 AWS Black Belt Online Seminar Amazon Connect Salesforce連携(第1回 CTI Adap...202204 AWS Black Belt Online Seminar Amazon Connect Salesforce連携(第1回 CTI Adap...
202204 AWS Black Belt Online Seminar Amazon Connect Salesforce連携(第1回 CTI Adap...
 
Amazon Game Tech Night #25 ゲーム業界向け機械学習最新状況アップデート
Amazon Game Tech Night #25 ゲーム業界向け機械学習最新状況アップデートAmazon Game Tech Night #25 ゲーム業界向け機械学習最新状況アップデート
Amazon Game Tech Night #25 ゲーム業界向け機械学習最新状況アップデート
 
20220409 AWS BLEA 開発にあたって検討したこと
20220409 AWS BLEA 開発にあたって検討したこと20220409 AWS BLEA 開発にあたって検討したこと
20220409 AWS BLEA 開発にあたって検討したこと
 
202202 AWS Black Belt Online Seminar AWS Managed Rules for AWS WAF の活用
202202 AWS Black Belt Online Seminar AWS Managed Rules for AWS WAF の活用202202 AWS Black Belt Online Seminar AWS Managed Rules for AWS WAF の活用
202202 AWS Black Belt Online Seminar AWS Managed Rules for AWS WAF の活用
 
202203 AWS Black Belt Online Seminar Amazon Connect Tasks.pdf
202203 AWS Black Belt Online Seminar Amazon Connect Tasks.pdf202203 AWS Black Belt Online Seminar Amazon Connect Tasks.pdf
202203 AWS Black Belt Online Seminar Amazon Connect Tasks.pdf
 
SaaS テナント毎のコストを把握するための「AWS Application Cost Profiler」のご紹介
SaaS テナント毎のコストを把握するための「AWS Application Cost Profiler」のご紹介SaaS テナント毎のコストを把握するための「AWS Application Cost Profiler」のご紹介
SaaS テナント毎のコストを把握するための「AWS Application Cost Profiler」のご紹介
 
Amazon QuickSight の組み込み方法をちょっぴりDD
Amazon QuickSight の組み込み方法をちょっぴりDDAmazon QuickSight の組み込み方法をちょっぴりDD
Amazon QuickSight の組み込み方法をちょっぴりDD
 
マルチテナント化で知っておきたいデータベースのこと
マルチテナント化で知っておきたいデータベースのことマルチテナント化で知っておきたいデータベースのこと
マルチテナント化で知っておきたいデータベースのこと
 
機密データとSaaSは共存しうるのか!?セキュリティー重視のユーザー層を取り込む為のネットワーク通信のアプローチ
機密データとSaaSは共存しうるのか!?セキュリティー重視のユーザー層を取り込む為のネットワーク通信のアプローチ機密データとSaaSは共存しうるのか!?セキュリティー重視のユーザー層を取り込む為のネットワーク通信のアプローチ
機密データとSaaSは共存しうるのか!?セキュリティー重視のユーザー層を取り込む為のネットワーク通信のアプローチ
 
パッケージソフトウェアを簡単にSaaS化!?既存の資産を使ったSaaS化手法のご紹介
パッケージソフトウェアを簡単にSaaS化!?既存の資産を使ったSaaS化手法のご紹介パッケージソフトウェアを簡単にSaaS化!?既存の資産を使ったSaaS化手法のご紹介
パッケージソフトウェアを簡単にSaaS化!?既存の資産を使ったSaaS化手法のご紹介
 
202202 AWS Black Belt Online Seminar Amazon Connect Customer Profiles
202202 AWS Black Belt Online Seminar Amazon Connect Customer Profiles202202 AWS Black Belt Online Seminar Amazon Connect Customer Profiles
202202 AWS Black Belt Online Seminar Amazon Connect Customer Profiles
 
Amazon Game Tech Night #24 KPIダッシュボードを最速で用意するために
Amazon Game Tech Night #24 KPIダッシュボードを最速で用意するためにAmazon Game Tech Night #24 KPIダッシュボードを最速で用意するために
Amazon Game Tech Night #24 KPIダッシュボードを最速で用意するために
 
202202 AWS Black Belt Online Seminar AWS SaaS Boost で始めるSaaS開発⼊⾨
202202 AWS Black Belt Online Seminar AWS SaaS Boost で始めるSaaS開発⼊⾨202202 AWS Black Belt Online Seminar AWS SaaS Boost で始めるSaaS開発⼊⾨
202202 AWS Black Belt Online Seminar AWS SaaS Boost で始めるSaaS開発⼊⾨
 
[20220126] JAWS-UG 2022初頭までに葬ったAWSアンチパターン大紹介
[20220126] JAWS-UG 2022初頭までに葬ったAWSアンチパターン大紹介[20220126] JAWS-UG 2022初頭までに葬ったAWSアンチパターン大紹介
[20220126] JAWS-UG 2022初頭までに葬ったAWSアンチパターン大紹介
 
202111 AWS Black Belt Online Seminar AWSで構築するSmart Mirrorのご紹介
202111 AWS Black Belt Online Seminar AWSで構築するSmart Mirrorのご紹介202111 AWS Black Belt Online Seminar AWSで構築するSmart Mirrorのご紹介
202111 AWS Black Belt Online Seminar AWSで構築するSmart Mirrorのご紹介
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介

  • 1. AWSプロダクトシリーズ よくわかるAmazon Redshift in 大阪 2014/02/21 アマゾン データ サービス ジャパン株式会社
  • 5. only pay for what you use no capital investment Lower costs pay as you go no subscriptions Ease of use
  • 6. integrate with existing tools programmable Lower costs Ease of use zero admin easy to configure
  • 7. Store anything Object storage Amazon S3 Scalable 99.999999999% durability
  • 8. Real-time processing High throughput; elastic Amazon Kinesis Easy to use S3, Redshift, DynamoDB Integrations
  • 9. Hadoop/HDFS clusters Hive, Impala, MapReduce Amazon EMR Easy to use; fully managed On-demand and spot pricing
  • 10. NoSQL Database Seamless scalability Amazon DynamoDB Zero admin Single digit millisecond latency
  • 11. Relational data warehouse Massively parallel Amazon Redshift Petabyte scale Fully managed $1,000/TB/Year
  • 12. Amazon Redshift Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year Rahul Pathak |Senior Product Manager
  • 13. a lot faster a lot cheaper a whole lot simpler Petabyte scale Massively parallel Amazon Redshift Relational data warehouse Fully managed; zero admin
  • 14. Amazon Redshift Quick Overview Amazon Redshift 概要のおさらい
  • 15. Amazon Redshift architecture • Leader Node – – – • SQL endpoint Stores metadata Coordinates query execution JDBC/ODBC Compute Nodes – – – – Local, columnar storage Execute queries in parallel Load, backup, restore via Amazon S3 Parallel load from Amazon DynamoDB • Hardware optimized for data processing • Two hardware platforms 10 GigE (HPC) – – DW1: HDD; scale from 2TB to 1.6PB DW2: SSD; scale from 160GB to 256TB Ingestion Backup Restore
  • 16. Amazon Redshift has security built-in • • Customer VPC SSL to secure data in transit Encryption to secure data at rest – – – JDBC/ODBC AES-256; hardware accelerated All blocks on disks and in Amazon S3 encrypted HSM Support • 10 GigE (HPC) No direct access to compute nodes • Internal VPC Audit logging & AWS CloudTrail integration • Amazon VPC support Ingestion Backup Restore
  • 17. Amazon Redshift is easy to use • Provision in minutes • Monitor query performance • Point and click resize • Built in security • Automatic backups
  • 18. Provision a data warehouse in minutes
  • 20. Point and click resize • Resize while remaining online via AWS Console or API • Provision a new cluster in the background and copy data in parallel from node to node • Only charged for source cluster until SQL endpoint has automatically been switched over via DNS
  • 21. Amazon Redshift continuously backs up your data and recovers from failures • Replication within the cluster and backup to Amazon S3 to maintain multiple copies of data at all times • Backups to Amazon S3 are continuous, automatic, and incremental – Designed for eleven nines of durability • Continuous monitoring and automated recovery from failures of drives and nodes • Able to restore snapshots to any Availability Zone within a region • Easily enable backups to a second region for disaster recovery
  • 22. Amazon Redshift integrates with multiple data sources Corporate Datacenter DynamoDB Amazon Redshift Amazon S3 Amazon RDS Amazon EMR
  • 23. New Features That Introduced After re:Invent 2013 re:Invent 2013以降の主なアップデート
  • 24. Feature Delivery in 2013 Unload logs (7/5) Temp Credentials (4/11) Sharing snapshots (7/18) DUB (4/25) Resource Level IAM (8/9) SHA1 Builtin (7/15) SOC1/2/3 (5/8) Statement Timeout (7/22) WLM Timeout/Wildcards (8/1) UTF-8 Substitution (8/29) JDBC Fetch Size (6/27) Kinesis EMR/HDFS/SSH copy, Distributed Tables, Audit Logging/CloudTrail, Concurrency, Resize Perf., Approximate Count Distinct, SNS Alerts (11/13) Service Launch (2/14) Split_part, Audit tables (10/3) EIP Support for VPC Clusters (12/28) PCI (8/22) SIN/SYD (10/8) PDX (4/2) Distributed Tables, Single Node Cursor Support, Maximum Connections to 500 (12/13) JSON, Regex, Cursors (9/10) NRT (6/5) CRC32 Builtin, CSV, Restore Progress (8/9) Timezone, Epoch, Autoformat (7/25) 4 byte UTF-8 (7/18) Unload Encrypted Files HSM Support (11/11)
  • 25. Summary of Updates after re:Invent • Amazon Redshift - New Features Galore (2013/11/11) – – – – – – – – • • • Distributed Tables - You now have more control over the distribution of a table's rows across compute nodes. Remote Loading - You can now load data into Redshift from remote hosts across an SSH connection. Approximate Count Distinct - You can now use a variant of the COUNT function to approximate the number of matching rows. Workload Queue Memory Management - You can now apportion available memory across work queues. Key Rotation - You can now direct Redshift to rotate keys for an encrypted cluster. HSM Support - You can now direct Redshift to use an on-premises Hardware Security Module (HSM) or AWS CloudHSM to manage the encryption master and cluster encryption keys. Database Auditing and Logging - You can log connections and user activity to Amazon S3. SNS Notification - Redshift can now issue notifications to an Amazon SNS topic when certain events occur. Automated Cross-Region Snapshot Copy for Amazon Redshift (2013/11/14) Faster & More Cost-Effective SSD-Based Nodes for Amazon Redshift(2014/01/24) AWS CloudFormation Adds Support for Redshift and More (2014/02/10)
  • 26. Amazon Redshift Node Types DW1.XL: 16 GB RAM, 2 Cores 3 Spindles, 2 TB compressed storage • Optimized for I/O intensive workloads • High disk density DW1.8XL: 128 GB RAM, 16 Cores, 24 Spindles 16 TB compressed, 2 GB/sec scan rate • On demand at $0.85/hour • As low as $1,000/TB/Year • Scale from 2TB to 1.6PB DW2.L *New*: 16 GB RAM, 2 Cores, 160 GB compressed SSD storage • High performance at smaller storage size • High compute and memory density • On demand at $0.25/hour • As low as $5,500/TB/Year • Scale from 160GB to 256TB DW2.8XL *New*: 256 GB RAM, 32 Cores, 2.56 TB of compressed SSD storage
  • 27. Amazon Redshift is priced to let you analyze all your data Price Per Hour for DW1.XL Single Node Effective Annual Price per TB On-Demand $ 1.250 $ 5,475 1 Year Reservation $ 0.750 $ 3,283 3 Year Reservation $ 0.452 $ 1,981 DW1 (HDD) Effective Annual Price per TB On-Demand $ 0.330 $ 18,068 1 Year Reservation $ 0.211 $ 11,570 3 Year Reservation $ 0.130 $ 7,127 No charge for leader node • Price Per Hour for DW2.L Single Node Number of nodes x cost per hour • DW2 (SSD) • No upfront costs • Pay as you go
  • 28. Security, visibility and control • Audit logging Redshift • SNS Alerts
  • 29. Visibility and control AWS CloudTrail System Activity Creates, Changes, Deletes, Resizes • Audit logging • SNS Alerts Amazon Redshift Database Activity Logins, Login failures, Queries, Loads Amazon S3
  • 30. Visibility and control • • Audit logging Monitoring Security Maintenance Errors SNS Alerts Amazon Redshift SNS Topic
  • 31. Batch operations • Cluster Creation • Faster Resize Amazon Corporate Amazon EC2 Data Center EMR Amazon Redshift Amazon S3
  • 32. Batch operations • Cluster Creation • Faster Resize Amazon Corporate Amazon EC2 Data Center EMR Amazon Redshift Amazon S3
  • 39. How Customers Leverage Amazon Redshift Amazon Redshift 活用事例
  • 40. Common Customer Use Cases Traditional Enterprise DW SaaS Companies • Improve performance by an order of magnitude • Add analytic functionality to applications Make more data available for analysis • Scale DW capacity as demand grows • • • • • Reduce costs by extending DW rather than adding HW Companies with Big Data Access business data via standard reporting tools • Reduce HW & SW costs by an order of magnitude Migrate completely from existing DW systems Respond faster to business; provision in minutes
  • 42. Japanese Redshift Customer – ALBERT • Business Challenge – • Why AWS? – – • Given their data volumes, RDBMS tuning and archiving was causing them a lot of operational pain and costing them money Amazon Redshift’s performance and ability to handle large data sets allowed them to make it the core engine of their analytics, enabling them to provide a private DMP (Data Management Platform) for their customers on the Cloud PostgreSQL is their primary RDBMS, and connectivity by PostgreSQL drivers is big technical advantage to choose Redshift. Benefits for their business – – Ability to start small and scale as needed Scalability and flexibility dramatically lowered the cost of ownership
  • 43. Japanese Redshift Customer – Sansan • Business Challenge – Since “Eight” is business card management solution for consumers, they needed infrastructure that could start small and scale as needed • Why AWS? – When they tried out AWS first, they were surprised with the ease of use. AWS functionality and elasticity were critical factors • Benefits for their business – Lower costs substantially using reserved instances – Automation is a key to reduce operational and administration costs. They utilize services such as Amazon SES and Amazon SWF. – They use Redshift for KPI analytics of their services.
  • 45. Multiple Data Loading Options Data Integration • Parallel upload to Amazon S3 • AWS Direct Connect • AWS Import/Export • ETL Software • Systems integrators Systems Integrators
  • 46. Customers on Performance “Redshift is twenty times faster than Hive” (5x – 20x reduction in query times) link …[Redshift] performance has blown away everyone here (we generally see 50-100x speedup over Hive). link “We saw…2x improvement in query times and a 50% reduction in costs” We regularly process multibillion row datasets and we do that in a matter of hours. link “Queries that used to take hours came back in seconds. Our analysts are orders of magnitude more productive.” (20x – 40x reduction in query times) link “Did I mention it's ridiculously fast? We'll be using it immediately to provide our analysts an alternative to Hadoop.”
  • 47. Customers on Cost “We found that Amazon Redshift offers the performance we needed while freeing us from the licensing costs of our previous solution” link “[Redshift] cost saving is even more impressive…Our analysts like [Redshift] so much they don’t want to go back.” (4x reduction in cost over HIVE) link “We saw 50% reduction in costs” “Not only did we avoid 3 months of development work [we] saved approximately $80,000 in labor…Competitive Advantage realized with just a few clicks.” “[Amazon Redshift] took an industry famous for its opaque pricing, high TCO and unreliable results and completely turned it on its head.” link “[Redshift] has reduced our storage and processing costs significantly, helping us to realize another 60-70 percent savings.” link
  • 48. Customer on Ease of Use “With Amazon Redshift and Tableau, anyone in the company can set up any queries they like…It’s very flexible.” link “Compared to Hadoop [Redshift] is much easier for analysts to use. What may have been a Hadoop project can become just a query in Redshift.” link “We can spin up an Amazon Redshift cluster, take a snapshot, and scale servers in minutes instead of days.” link “…our team was able to provision Redshift in a matter hours vs. weeks with on-premises servers.” “Amazon Redshift is simple to use and reliable. With one click, we can rapidly scale up or down in real time in alignment with business requirements.” link “Customers can get consistent, accurate, and useful data fast - in weeks not months or years.” link
  • 49. AWS Marketplace • Find software to use with Amazon Redshift • One-click deployments • Flexible pricing options http://aws.amazon.com/marketplace
  • 52. Resources • Detail Pages – – • New Features – – • http://docs.aws.amazon.com/redshift/latest/dg/doc-history.html http://docs.aws.amazon.com/redshift/latest/mgmt/document-history.html Best Practices – – – • http://aws.amazon.com/redshift https://aws.amazon.com/marketplace/redshift/ http://docs.aws.amazon.com/redshift/latest/dg/c_loading-data-best-practices.html http://docs.aws.amazon.com/redshift/latest/dg/c_designing-tables-best-practices.html http://docs.aws.amazon.com/redshift/latest/dg/c-optimizing-query-performance.html Presentations & Webinars: – – – http://www.youtube.com/watch?v=JxLpj_TnisM (2013 SF Summit Presentation) http://www.youtube.com/watch?v=R1m-fwzXMow (Best Practices 1 of 2) http://www.youtube.com/watch?v=7ySzRTOyK6o (Best Practices 2 of 2)
  • 53. Amazon Redshift dramatically reduces I/O Column storage Data compression Age State Amount 20 CA 500 345 25 WA 250 678 • ID 123 • 40 FL 125 37 WA 375 • Zone maps 957 • Direct-attached storage • With row storage you do unnecessary I/O • To get total amount, you have to read everything
  • 54. Amazon Redshift dramatically reduces I/O Column storage Data compression Age State Amount 20 CA 500 345 25 WA 250 678 • ID 123 • 40 FL 125 37 WA 375 • Zone maps 957 • Direct-attached storage • With column storage, you only read the data you need
  • 55. Amazon Redshift dramatically reduces I/O • Column storage analyze compression listing; Table | Column | Encoding ---------+----------------+---------listing | listid | delta listing | sellerid | delta32k listing | eventid | delta32k listing | dateid | bytedict listing | numtickets | bytedict listing | priceperticket | delta32k listing | totalprice | mostly32 listing | listtime | raw • Data compression • Zone maps • Direct-attached storage • COPY compresses automatically • You can analyze and override • More performance, less cost Slides not intended for redistribution.
  • 56. Amazon Redshift dramatically reduces I/O • Column storage 10 324 • Data compression 375 623 • Zone maps • Direct-attached storage 637 959 10 | 13 | 14 | 26 |… … | 100 | 245 | 324 375 | 393 | 417… … 512 | 549 | 623 637 | 712 | 809 … … | 834 | 921 | 959 • Track the minimum and maximum value for each block • Skip over blocks that don’t contain relevant data
  • 57. Amazon Redshift dramatically reduces I/O • Column storage • Use local storage for performance • Maximize scan rates • Data compression • Zone maps • Automatic replication and continuous backup • Direct-attached storage • HDD & SSD platforms
  • 58. Amazon Redshift parallelizes and distributes everything • Query • Load • Backup/Restore • Resize
  • 59. Amazon Redshift parallelizes and distributes everything • Query • Load • Backup/Restore • Resize • Load in parallel from Amazon S3 or Amazon DynamoDB or any SSH connection • Data automatically distributed and sorted according to DDL • Scales linearly with number of nodes
  • 60. Amazon Redshift parallelizes and distributes everything • Query • Load • Backup/Restore • Backups to Amazon S3 are automatic, continuous and incremental • Resize • Configurable system snapshot retention period. Take user snapshots on-demand • Cross region backups for disaster recovery • Streaming restores enable you to resume querying faster
  • 61. Amazon Redshift parallelizes and distributes everything • Query • Load • Backup/Restore • Resize • Resize while remaining online • Provision a new cluster in the background • Copy data in parallel from node to node • Only charged for source cluster
  • 62. Amazon Redshift parallelizes and distributes everything • Query • Load • Backup/Restore • • Automatic SQL endpoint switchover via DNS • Decommission the source cluster • Simple operation via Console or API Resize