SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Downloaden Sie, um offline zu lesen
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Omair Gillani, S3 Product Manager, AWS
Lee Kear, Storage Specialist Solutions Architect, AWS
Jason Gluckman, Lead Software Engineer, Ring
Peter Levett, Storage Specialist Solutions Architect, AWS
Angela Wang, Solutions Architect, AWS
November 30, 2016
STG313
Amazon S3 Deep-Dive Hands-On Workshop:
Deploying and Managing a Global, Petabyte
Scale Storage Infrastructure
What to Expect from the SessionWhat to Expect from the Session
• How does a workshop differ from other sessions?
• S3 new features
• How we think about storage management for S3
• Storage Management Portfolio for S3
• Understand your data
• Discover your data
• Manage your data
• Pulling it all together
• Key naming schemes
• Group activity
How does a workshop differ from other sessions
Learn from AWS
45 minutes of lecture
Learn from each other
Group learning activity
New S3 Features
2012 2013 2014
Amazon storage usage
Trillions of objects
Millions of requests per second
Choice of storage classes on S3
Standard
Active data Archive dataInfrequently accessed data
Standard - Infrequent Access Amazon Glacier
File sync and share
+
consumer file
storage
Backup and archive +
disaster recovery
Long-retained
data
Use cases for Standard-Infrequent Access
Designed for 11 9s of
durability
Standard - Infrequent Access storage
Designed for
99.9% availability
Durable Available
Same as Standard storage
High performance
• Bucket policies
• AWS Identity and Access
Management (IAM) policies
• Many encryption options
Secure
• Lifecycle management
• Versioning
• Event notifications
• Metrics
Integrated
• No impact on user
experience
• Simple REST API
Easy to use
- Directly PUT to Standard - IA
- Transition Standard to Standard - IA
- Transition Standard - IA to Amazon Glacier
storage
- Expiration lifecycle policy
- Versioning support
Standard - Infrequent Access storage
Integrated: Lifecycle management
Standard - Infrequent Access
A comprehensive storage management
portfolio for S3
Storage Management for S3
Cross-Region
Replication Lifecycle
Policy
Data
Classification
& Management
Event
Notifications
S3 CloudWatch Metrics S3 Inventory Audit with object level
AWS CloudTrail Data Events
S3 Analytics
Standard Standard - Infrequent Access Amazon Glacier
Understand your storage usage
S3 InventoryAnalyze Logs with
Amazon EMR
S3 Analytics
S3 Inventory
Use case: trigger business workflows and applications such as secondary index garbage
collection, data auditing, and offline analytics
• More information about your objects than provided by LIST API such as replication
status, multipart upload flag, and delete marker
Save time Daily or Weekly delivery Delivery to S3 bucketCSV File Output
S3 Inventory
Eventually consistent rolling snapshot
• New objects may not be listed
• Removed objects may still be included
Name Value Type Description
Bucket String Bucket name. UTF-8 encoded.
Key String Object key name. UTF-8 encoded.
Version Id String Version Id of the object
Is Latest boolean true if object is the latest version (current version) of a versioned object, otherwise false
Delete Marker boolean true if object is a delete marker of a versioned object, otherwise false
Size long Object size in bytes
Last Modified String Last modified timestamp. Format in ISO: YYYY-MM-DDTHH:mm:ss.SSSZ
ETag String eTag in HEX encoded format
StorageClass String Valid values: STANDARD, REDUCED_REDUNDANCY, GLACIER, STANDARD_IA. UTF-8 encoded.
Multipart Uploaded boolean true if object is uploaded by using multipart, otherwise false
Replication Status String Valid values: REPLICA, COMPLETED, PENDING, FAILED. UTF-8 encoded.
Validate before you act!
• Use HEAD OBJECT
S3 Analytics – Storage Class Analysis
Analyze buckets,
prefixes, or tags
$0.10 per million
objects analyzed
Storage Class
Analysis
&
lifecycle
recommendation
Data-driven storage management for S3
Export Analysis to
your S3 bucket
S3 Analytics – Storage Class Analysis
Demo
S3 Analytics – Storage Class Analysis
S3 Analytics – Storage Class Analysis
Monitor your storage
Monitor and Alert with
Amazon CloudWatch
Audit your storage with
AWS CloudTrail
Server Access Logs
CloudWatch metrics for S3
Operational & Performance monitoring
• Generate metrics for data of your choice
• Entire bucket, prefixes, and tags
• Up to 1,000 object groups
• 1-minute CloudWatch metrics
• Alert and alarm on metrics
• Pay for what you use
CloudWatch metrics for S3
Price per metric
• $0.30 per metric per month
Metric Name Metric value
AllRequests Count
PutRequests Count
GetRequests Count
ListRequests Count
DeleteRequests Count
HeadRequests Count
PostRequests Count
BytesDownloaded MB
BytesUploaded MB
4xxErrors Count
5xxErrors Count
FirstByteLatency ms
TotalRequestLatency ms
CloudTrail data events for S3
Use case: Perform security analysis, meet your IT auditing and
compliance needs
API logs for bucket and object-level requests
• Creation/deletion of buckets
• Changes to bucket configuration (bucket policy, lifecycle policies,
replication policies, etc.)
• SNS notification for log file delivery (optional)
Manage your data
Cross-Region
Replication
Lifecycle Policies Event
Notifications
S3 Object Tags
Manage your data
S3 Object Tags
Manage storage based on object tags
• Classify your data
• Tag your objects with key-value pairs
• Write policies once based on the type of data
AnalyzeLifecycle PolicyAccess Control
Deep dive on tags
• Tags are key value pairs
• Maximum 10 tags per object
• Maximum key length—127 Unicode characters
• Maximum value length—255 Unicode characters
• Tag keys and values are case sensitive
2 ways to put tags via API
• Put objects with tag parameter, or
• Add tag API after object is created
What can I do with tags?
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": [
"s3:GetObject"
],
"Resource": "arn:aws:s3:::EXAMPLE-BUCKET-NAME/*"
"Condition": {"StringEquals": {"S3:ResourceTag/HIPAA":"True"}}
}
]
}
Manage permissions with tags
Lifecycle policies based on tags
<LifecycleConfiguration>
<Rule>
<ID>sample-rule</ID>
<Filter>
<And>
<Prefix>documents/</Prefix>
<Tag>
<Key>Project</Key>
<Value>Delta</Value>
</Tag>
<Tag>
<Key>Data type</Key>
<Value>HPI</Value>
</Tag>
</And>
</Filter>
<Status>Enabled</Status>
<Transition>
<Days>365</Days>
<StorageClass>GLACIER</StorageClass>
</Transition>
<Expiration>
<Days>3650</Days>
</Expiration>
</Rule>
</LifecycleConfiguration>
Putting it all together
Storage Management for S3
Cross-Region
Replication
Lifecycle Policy S3 Object TagsEvent
Notifications
Monitor and Alert with
CloudWatch
S3 Inventory Audit with
CloudTrail Data
Events
S3 Analytics
S3 Performance at Scale
Getting high throughput performance with S3
• S3 can scale to many thousands of requests per second
• Need a good key naming scheme
• Only at scale do you need to consider your key naming
scheme
• What are Partitions?
• Why?
• Spread keys lexigraphically
• Goal of partitioning is too spread the heat
• Prevent HotSpots
my-bucket/2013_11_13-164533125.jpg
my-bucket/2013_11_13-164533126.jpg
my-bucket/2013_11_13-164533127.jpg
my-bucket/2013_11_13-164533128.jpg
my-bucket/2013_11_12-164533129.jpg
my-bucket/2013_11_12-164533130.jpg
my-bucket/2013_11_12-164533131.jpg
my-bucket/2013_11_12-164533132.jpg
my-bucket/2013_11_11-164533133.jpg
my-bucket/2013_11_11-164533134.jpg
my-bucket/2013_11_11-164533135.jpg
my-bucket/2013_11_11-164533136.jpg
Use a key-naming scheme with randomness at the beginning for high
TPS
• Most important if you will regularly exceed 100 TPS on a bucket
• Avoid starting with a date or monotonically increasing numbers
• Consider adding a hash or reversed timestamp (ssmmhhddmmyy)
Don’t do this…
How to design for high Request Rates to S3
Partition:
my-bucket/2013_11_1
Distributing key names
Add randomness to the beginning of the key name…
my-bucket/6213-2013_11_13.jpg
my-bucket/4653-2013_11_13.jpg
my-bucket/9873-2013_11_13.jpg
my-bucket/4657-2013_11_13.jpg
my-bucket/1256-2013_11_13.jpg
my-bucket/8345-2013_11_13.jpg
my-bucket/0321-2013_11_13.jpg
my-bucket/5654-2013_11_13.jpg
my-bucket/2345-2013_11_13.jpg
my-bucket/7567-2013_11_13.jpg
my-bucket/3455-2013_11_13.jpg
my-bucket/4313-2013_11_13.jpg
Partitions:
my-bucket/0
my-bucket/1
my-bucket/2
my-bucket/3
my-bucket/4
my-bucket/5
my-bucket/6
my-bucket/7
my-bucket/8
my-bucket/9
Monotonically Increasing Customer ID
mycustdata/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user
mycustdata/7584312/app_data_1/2016-11-30-02:01:01:23/product_usage.csv
mycustdata/7584312/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/7584312/app_data_1/2016-11-30-02:01:01:32/wrk_user
mycustdata/8584312/app_data_1/2016-11-30-02:01:01:29/product_usage.csv
mycustdata/8584312/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/8584312/app_data_1/2016-11-30-02:01:01:14/wrk_user
mycustdata/9584312/app_data_1/2016-11-30-02:01:01:28/product_usage.csv
mycustdata/9584312/app_data_1/2016-11-30-02:01:01:45/log.txt
mycustdata/9584312/app_data_1/2016-11-30-02:01:01:34/wrk_user
Partition:
mycustdata/213485
Partitions:
mycustdata/7
mycustdata/8
mycustdata/9
Reverse Monotonically Increase prefix
If a single customer can push a higher
workload, they can cause a Hotspot.
Add A Hash to Beginning of Key – Best
mycustdata/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt
mycustdata/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user
mycustdata/1a/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/34/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user
mycustdata/a7/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv
mycustdata/58/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt
mycustdata/70/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user
mycustdata/02/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv
mycustdata/2b/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt
mycustdata/63/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user
Partition:
mycustdata/213485
Partitions:
mycustdata/0
mycustdata/1
mycustdata/2
mycustdata/3
mycustdata/4
mycustdata/5
mycustdata/6
mycustdata/7
Add a hash to evenly distribute the keys for all requests
mycustdata/8
mycustdata/9
mycustdata/a
mycustdata/b
mycustdata/c
mycustdata/d
mycustdata/e
mycustdata/f
Challenges of using a hash to create entropy
• Listing challenges/opportunities:
• A Secondary Index can be used to avoid listing
• Can be accomplished with Event Notification to AWS Lambda and
Amazon DynamoDB
• Blog Post - Building and Maintaining an Amazon S3 Metadata Index
without Servers
• Hash can be used to split work of LISTing objects
• Lifecycle constraints
• Max number of lifecycle rules – 1000
• Tagging can make this easier
Faster upload of large objects
Parallelize PUTs with Multipart Uploads
• Increase aggregate throughput by
parallelizing PUTs on high-bandwidth
networks
• Move the bottleneck to the network,
where it belongs
• Increase resiliency to network errors;
fewer large restarts on error-prone
networks
Best Practice
Faster download
You can parallelize GETs too
For large objects, use range-based GETs
For content distribution, enable Amazon CloudFront
• Caches objects at the edge
• 59 global edge locations
GET /example-object HTTP/1.1
Host: example-bucket.s3.amazonaws.com
x-amz-date: Fri, 28 Jan 2011 21:32:02 GMT
Range: bytes=0-9
Authorization: AWS AKIAIOSFODNN7EXAMPLE:Yxg83MZaEgh3OZ3l0rLo5RTX11o=
Q & A
Case Study
Ring Products: Practical Uses for the IoT
Ring Neighborhoods: Network Effects in Practice
Wilshire Park study with LAPD:
Ring installed on 10% of homes
Burglaries down 55% for the
entire community in 6 months
Burglars want an easy hit, and go
elsewhere if you’re home
Alarms are reactive, not proactive
Traditional systems don’t link up,
so protection ends at your door
Devices installed in nearly every country on Earth
Millions of connected apps and devices
Over 1 billion videos and rapidly increasing
High growth brings challenges, even month to month
Ring Urban Activity Index
2016-10-20, USA-only,
low-cut rural areas
Global Concerns
Intelligently Determining Class
Ring Requirements
• Live video is ingested from devices and apps via our application servers
• Videos are uploaded to our S3 buckets
• The videos are transcoded and make them available for customers to
stream
• Customers need low latency in delivering video streams around the world
• Customers get a 30-day free trial of video backups.
• If they decide to continue to store videos, they can store videos for up to 6
months after the activity.
• When users share videos, we expect them to be watched a lot, and
sometimes they go viral
Present Your Design
• How did you address the use case?
• What was your key naming scheme?
• How did you address scale?
• How did you manage object metadata?
• Did you minimize cost?
• How do you monitor your requests?
• How did you address security considerations?
Ring Video Pipeline
Raw
Buckets
Final
(Standard)
S3 Logs
Amazon
CloudFront
Ring App(s)
AWS
Lambda
Viewers
Amazon
SQS
Owner(s)
Visitor
Application
Servers
Ring Device
GPU
Farm
Final
(IA)
Lifecycle
Transitions
Event
Triggers
Live Video
Extreme Performance is Easy
S3 will automatically partition if you use good keys – or just add more buckets
CloudFront as a CDN for GET heavy loads and faster downloads
Faster uploads with Transfer Acceleration
TCP Window Scaling - without it, 64kB window kneecaps long fat networks
TCP SACK is good for fast but lossy connections like mobile connections
examplebucket/2134857/data/start.png
examplebucket/2134857/data/resource.rsrc
examplebucket/2134857/data/results.txt
examplebucket/2134858/data/start.png
examplebucket/2134858/data/resource.rsrc
examplebucket/2134858/data/results.txt
examplebucket/2134859/data/start.png
examplebucket/2134859/data/resource.rsrc
examplebucket/2134859/data/results.txt
examplebucket/7584312/data/start.png
examplebucket/7584312/data/resource.rsrc
examplebucket/7584312/data/results.txt
examplebucket/8584312/data/start.png
examplebucket/8584312/data/resource.rsrc
examplebucket/8584312/data/results.txt
examplebucket/9584312/data/start.png
examplebucket/9584312/data/resource.rsrc
examplebucket/9584312/data/results.txt
S3 Scaling on H-Day
Thank you!Thank you!
Remember to complete
your evaluations!
Remember to complete
your evaluations!

Weitere ähnliche Inhalte

Was ist angesagt?

10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
Amazon Web Services Korea
 

Was ist angesagt? (20)

AWS KMS 에서 제공하는 봉투암호화 방식의 암호화 및 사이닝 기능에 대한 소개와 실습 - 신은수, AWS 솔루션즈 아키텍트 :: AWS...
AWS KMS 에서 제공하는 봉투암호화 방식의 암호화 및 사이닝 기능에 대한 소개와 실습 - 신은수, AWS 솔루션즈 아키텍트 :: AWS...AWS KMS 에서 제공하는 봉투암호화 방식의 암호화 및 사이닝 기능에 대한 소개와 실습 - 신은수, AWS 솔루션즈 아키텍트 :: AWS...
AWS KMS 에서 제공하는 봉투암호화 방식의 암호화 및 사이닝 기능에 대한 소개와 실습 - 신은수, AWS 솔루션즈 아키텍트 :: AWS...
 
Introduction of AWS KMS
Introduction of AWS KMSIntroduction of AWS KMS
Introduction of AWS KMS
 
An introduction to AWS CloudFormation - Pop-up Loft Tel Aviv
An introduction to AWS CloudFormation - Pop-up Loft Tel AvivAn introduction to AWS CloudFormation - Pop-up Loft Tel Aviv
An introduction to AWS CloudFormation - Pop-up Loft Tel Aviv
 
AWS S3 and GLACIER
AWS S3 and GLACIERAWS S3 and GLACIER
AWS S3 and GLACIER
 
Deep Dive on Amazon S3 - AWS Online Tech Talks
Deep Dive on Amazon S3 - AWS Online Tech TalksDeep Dive on Amazon S3 - AWS Online Tech Talks
Deep Dive on Amazon S3 - AWS Online Tech Talks
 
Introduction to Amazon EC2
Introduction to Amazon EC2Introduction to Amazon EC2
Introduction to Amazon EC2
 
Access Control for the Cloud: AWS Identity and Access Management (IAM) (SEC20...
Access Control for the Cloud: AWS Identity and Access Management (IAM) (SEC20...Access Control for the Cloud: AWS Identity and Access Management (IAM) (SEC20...
Access Control for the Cloud: AWS Identity and Access Management (IAM) (SEC20...
 
Amazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
Amazon CloudWatch Logs and AWS Lambda: A Match Made in HeavenAmazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
Amazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
 
Cloud Security (AWS)
Cloud Security (AWS)Cloud Security (AWS)
Cloud Security (AWS)
 
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
 
(STG401) Amazon S3 Deep Dive & Best Practices
(STG401) Amazon S3 Deep Dive & Best Practices(STG401) Amazon S3 Deep Dive & Best Practices
(STG401) Amazon S3 Deep Dive & Best Practices
 
글로벌 기업들의 효과적인 데이터 분석을 위한 Data Lake 구축 및 분석 사례 - 김준형 (AWS 솔루션즈 아키텍트)
글로벌 기업들의 효과적인 데이터 분석을 위한 Data Lake 구축 및 분석 사례 - 김준형 (AWS 솔루션즈 아키텍트)글로벌 기업들의 효과적인 데이터 분석을 위한 Data Lake 구축 및 분석 사례 - 김준형 (AWS 솔루션즈 아키텍트)
글로벌 기업들의 효과적인 데이터 분석을 위한 Data Lake 구축 및 분석 사례 - 김준형 (AWS 솔루션즈 아키텍트)
 
(SEC318) AWS CloudTrail Deep Dive
(SEC318) AWS CloudTrail Deep Dive(SEC318) AWS CloudTrail Deep Dive
(SEC318) AWS CloudTrail Deep Dive
 
Encryption and Key Management in AWS
Encryption and Key Management in AWSEncryption and Key Management in AWS
Encryption and Key Management in AWS
 
Getting Started with AWS Lambda and Serverless
Getting Started with AWS Lambda and ServerlessGetting Started with AWS Lambda and Serverless
Getting Started with AWS Lambda and Serverless
 
AWS Security Best Practices
AWS Security Best PracticesAWS Security Best Practices
AWS Security Best Practices
 
ABCs of AWS: S3
ABCs of AWS: S3ABCs of AWS: S3
ABCs of AWS: S3
 
10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
10월 웨비나 - AWS에서 Active Directory 구축 및 연동 옵션 살펴보기 (김용우 솔루션즈 아키텍트)
 
Amazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
Amazon CloudWatch Logs and AWS Lambda: A Match Made in HeavenAmazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
Amazon CloudWatch Logs and AWS Lambda: A Match Made in Heaven
 
AWS Builders - Industry Edition: DevSecOps on AWS - 시작은 IAM 부터
AWS Builders - Industry Edition: DevSecOps on AWS - 시작은 IAM 부터AWS Builders - Industry Edition: DevSecOps on AWS - 시작은 IAM 부터
AWS Builders - Industry Edition: DevSecOps on AWS - 시작은 IAM 부터
 

Andere mochten auch

Andere mochten auch (20)

AWS re:Invent 2016: Deep Dive on Amazon Elastic File System (STG202)
AWS re:Invent 2016: Deep Dive on Amazon Elastic File System (STG202)AWS re:Invent 2016: Deep Dive on Amazon Elastic File System (STG202)
AWS re:Invent 2016: Deep Dive on Amazon Elastic File System (STG202)
 
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
 
AWS re:Invent 2016: Tableau Rules of Engagement in the Cloud (STG306)
AWS re:Invent 2016: Tableau Rules of Engagement in the Cloud (STG306)AWS re:Invent 2016: Tableau Rules of Engagement in the Cloud (STG306)
AWS re:Invent 2016: Tableau Rules of Engagement in the Cloud (STG306)
 
AWS re:Invent 2016: Migrating Your Data Warehouse to Amazon Redshift (DAT202)
AWS re:Invent 2016: Migrating Your Data Warehouse to Amazon Redshift (DAT202)AWS re:Invent 2016: Migrating Your Data Warehouse to Amazon Redshift (DAT202)
AWS re:Invent 2016: Migrating Your Data Warehouse to Amazon Redshift (DAT202)
 
Intro to Amazon S3
Intro to Amazon S3Intro to Amazon S3
Intro to Amazon S3
 
AWS re:Invent 2016: [JK REPEAT] Deep Dive on Amazon EC2 Instances, Featuring ...
AWS re:Invent 2016: [JK REPEAT] Deep Dive on Amazon EC2 Instances, Featuring ...AWS re:Invent 2016: [JK REPEAT] Deep Dive on Amazon EC2 Instances, Featuring ...
AWS re:Invent 2016: [JK REPEAT] Deep Dive on Amazon EC2 Instances, Featuring ...
 
AWS re:Invent 2016: Deep Dive on Amazon Relational Database Service (DAT305)
AWS re:Invent 2016: Deep Dive on Amazon Relational Database Service (DAT305)AWS re:Invent 2016: Deep Dive on Amazon Relational Database Service (DAT305)
AWS re:Invent 2016: Deep Dive on Amazon Relational Database Service (DAT305)
 
Amazon EC2 Masterclass
Amazon EC2 MasterclassAmazon EC2 Masterclass
Amazon EC2 Masterclass
 
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
 
AWS re:Invent 2016: Storage State of the Union (STG201)
AWS re:Invent 2016: Storage State of the Union (STG201)AWS re:Invent 2016: Storage State of the Union (STG201)
AWS re:Invent 2016: Storage State of the Union (STG201)
 
AWS re:Invent 2016: Deep Dive on Amazon DynamoDB (DAT304)
AWS re:Invent 2016: Deep Dive on Amazon DynamoDB (DAT304)AWS re:Invent 2016: Deep Dive on Amazon DynamoDB (DAT304)
AWS re:Invent 2016: Deep Dive on Amazon DynamoDB (DAT304)
 
AWS re:Invent 2016: AWS Database State of the Union (DAT320)
AWS re:Invent 2016: AWS Database State of the Union (DAT320)AWS re:Invent 2016: AWS Database State of the Union (DAT320)
AWS re:Invent 2016: AWS Database State of the Union (DAT320)
 
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
 
AWS re:Invent 2016: Deep Dive: AWS Direct Connect and VPNs (NET402)
AWS re:Invent 2016: Deep Dive: AWS Direct Connect and VPNs (NET402)AWS re:Invent 2016: Deep Dive: AWS Direct Connect and VPNs (NET402)
AWS re:Invent 2016: Deep Dive: AWS Direct Connect and VPNs (NET402)
 
AWS re:Invent 2016: Introduction to Managed Database Services on AWS (DAT307)
AWS re:Invent 2016: Introduction to Managed Database Services on AWS (DAT307)AWS re:Invent 2016: Introduction to Managed Database Services on AWS (DAT307)
AWS re:Invent 2016: Introduction to Managed Database Services on AWS (DAT307)
 
Analyzing data, Optimizing Service Experience for Retailers
Analyzing data, Optimizing Service Experience for Retailers Analyzing data, Optimizing Service Experience for Retailers
Analyzing data, Optimizing Service Experience for Retailers
 
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
 
Building AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and TableauBuilding AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and Tableau
 

Ähnlich wie AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying and Managing a Global, Petabyte Scale Storage Infrastructure (STG313)

Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
Amazon Web Services
 

Ähnlich wie AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying and Managing a Global, Petabyte Scale Storage Infrastructure (STG313) (20)

Supercharging the Value of Your Data with Amazon S3
Supercharging the Value of Your Data with Amazon S3Supercharging the Value of Your Data with Amazon S3
Supercharging the Value of Your Data with Amazon S3
 
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
 
Deep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
Deep Dive on Amazon S3 - March 2017 AWS Online Tech TalksDeep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
Deep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
 
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon GlacierSRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
 
Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
Deep Dive on S3 Storage Management Covering New Feature Announcements - Decem...
 
Builders' Day - Best Practises for S3 - BL
Builders' Day - Best Practises for S3 - BLBuilders' Day - Best Practises for S3 - BL
Builders' Day - Best Practises for S3 - BL
 
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
 
Building a Data Lake on AWS
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWS
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
AWS Summit Auckland - Building a Server-less Data Lake on AWS
AWS Summit Auckland - Building a Server-less Data Lake on AWSAWS Summit Auckland - Building a Server-less Data Lake on AWS
AWS Summit Auckland - Building a Server-less Data Lake on AWS
 
2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개
2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개 2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개
2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개
 
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
 
Using AWS CloudTrail to Enhance Governance and Compliance of Amazon S3 - DEV3...
Using AWS CloudTrail to Enhance Governance and Compliance of Amazon S3 - DEV3...Using AWS CloudTrail to Enhance Governance and Compliance of Amazon S3 - DEV3...
Using AWS CloudTrail to Enhance Governance and Compliance of Amazon S3 - DEV3...
 
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
Deep Dive on Object Storage: Amazon S3 and Amazon GlacierDeep Dive on Object Storage: Amazon S3 and Amazon Glacier
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
 
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
Deep Dive on Object Storage: Amazon S3 and Amazon GlacierDeep Dive on Object Storage: Amazon S3 and Amazon Glacier
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
 
(SEC402) Intrusion Detection in the Cloud | AWS re:Invent 2014
(SEC402) Intrusion Detection in the Cloud | AWS re:Invent 2014(SEC402) Intrusion Detection in the Cloud | AWS re:Invent 2014
(SEC402) Intrusion Detection in the Cloud | AWS re:Invent 2014
 
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
 
AWS July Webinar Series - Troubleshooting Operational and Security Issues in ...
AWS July Webinar Series - Troubleshooting Operational and Security Issues in ...AWS July Webinar Series - Troubleshooting Operational and Security Issues in ...
AWS July Webinar Series - Troubleshooting Operational and Security Issues in ...
 
Policy Ninja
Policy NinjaPolicy Ninja
Policy Ninja
 

Mehr von Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying and Managing a Global, Petabyte Scale Storage Infrastructure (STG313)

  • 1. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Omair Gillani, S3 Product Manager, AWS Lee Kear, Storage Specialist Solutions Architect, AWS Jason Gluckman, Lead Software Engineer, Ring Peter Levett, Storage Specialist Solutions Architect, AWS Angela Wang, Solutions Architect, AWS November 30, 2016 STG313 Amazon S3 Deep-Dive Hands-On Workshop: Deploying and Managing a Global, Petabyte Scale Storage Infrastructure
  • 2. What to Expect from the SessionWhat to Expect from the Session • How does a workshop differ from other sessions? • S3 new features • How we think about storage management for S3 • Storage Management Portfolio for S3 • Understand your data • Discover your data • Manage your data • Pulling it all together • Key naming schemes • Group activity
  • 3. How does a workshop differ from other sessions Learn from AWS 45 minutes of lecture Learn from each other Group learning activity
  • 5. 2012 2013 2014 Amazon storage usage Trillions of objects Millions of requests per second
  • 6. Choice of storage classes on S3 Standard Active data Archive dataInfrequently accessed data Standard - Infrequent Access Amazon Glacier
  • 7. File sync and share + consumer file storage Backup and archive + disaster recovery Long-retained data Use cases for Standard-Infrequent Access
  • 8. Designed for 11 9s of durability Standard - Infrequent Access storage Designed for 99.9% availability Durable Available Same as Standard storage High performance • Bucket policies • AWS Identity and Access Management (IAM) policies • Many encryption options Secure • Lifecycle management • Versioning • Event notifications • Metrics Integrated • No impact on user experience • Simple REST API Easy to use
  • 9. - Directly PUT to Standard - IA - Transition Standard to Standard - IA - Transition Standard - IA to Amazon Glacier storage - Expiration lifecycle policy - Versioning support Standard - Infrequent Access storage Integrated: Lifecycle management Standard - Infrequent Access
  • 10. A comprehensive storage management portfolio for S3
  • 11. Storage Management for S3 Cross-Region Replication Lifecycle Policy Data Classification & Management Event Notifications S3 CloudWatch Metrics S3 Inventory Audit with object level AWS CloudTrail Data Events S3 Analytics Standard Standard - Infrequent Access Amazon Glacier
  • 12. Understand your storage usage S3 InventoryAnalyze Logs with Amazon EMR S3 Analytics
  • 13. S3 Inventory Use case: trigger business workflows and applications such as secondary index garbage collection, data auditing, and offline analytics • More information about your objects than provided by LIST API such as replication status, multipart upload flag, and delete marker Save time Daily or Weekly delivery Delivery to S3 bucketCSV File Output
  • 14. S3 Inventory Eventually consistent rolling snapshot • New objects may not be listed • Removed objects may still be included Name Value Type Description Bucket String Bucket name. UTF-8 encoded. Key String Object key name. UTF-8 encoded. Version Id String Version Id of the object Is Latest boolean true if object is the latest version (current version) of a versioned object, otherwise false Delete Marker boolean true if object is a delete marker of a versioned object, otherwise false Size long Object size in bytes Last Modified String Last modified timestamp. Format in ISO: YYYY-MM-DDTHH:mm:ss.SSSZ ETag String eTag in HEX encoded format StorageClass String Valid values: STANDARD, REDUCED_REDUNDANCY, GLACIER, STANDARD_IA. UTF-8 encoded. Multipart Uploaded boolean true if object is uploaded by using multipart, otherwise false Replication Status String Valid values: REPLICA, COMPLETED, PENDING, FAILED. UTF-8 encoded. Validate before you act! • Use HEAD OBJECT
  • 15. S3 Analytics – Storage Class Analysis Analyze buckets, prefixes, or tags $0.10 per million objects analyzed Storage Class Analysis & lifecycle recommendation Data-driven storage management for S3 Export Analysis to your S3 bucket
  • 16. S3 Analytics – Storage Class Analysis Demo
  • 17. S3 Analytics – Storage Class Analysis
  • 18. S3 Analytics – Storage Class Analysis
  • 19. Monitor your storage Monitor and Alert with Amazon CloudWatch Audit your storage with AWS CloudTrail Server Access Logs
  • 20. CloudWatch metrics for S3 Operational & Performance monitoring • Generate metrics for data of your choice • Entire bucket, prefixes, and tags • Up to 1,000 object groups • 1-minute CloudWatch metrics • Alert and alarm on metrics • Pay for what you use
  • 21. CloudWatch metrics for S3 Price per metric • $0.30 per metric per month Metric Name Metric value AllRequests Count PutRequests Count GetRequests Count ListRequests Count DeleteRequests Count HeadRequests Count PostRequests Count BytesDownloaded MB BytesUploaded MB 4xxErrors Count 5xxErrors Count FirstByteLatency ms TotalRequestLatency ms
  • 22. CloudTrail data events for S3 Use case: Perform security analysis, meet your IT auditing and compliance needs API logs for bucket and object-level requests • Creation/deletion of buckets • Changes to bucket configuration (bucket policy, lifecycle policies, replication policies, etc.) • SNS notification for log file delivery (optional)
  • 23. Manage your data Cross-Region Replication Lifecycle Policies Event Notifications S3 Object Tags
  • 24. Manage your data S3 Object Tags Manage storage based on object tags • Classify your data • Tag your objects with key-value pairs • Write policies once based on the type of data AnalyzeLifecycle PolicyAccess Control
  • 25. Deep dive on tags • Tags are key value pairs • Maximum 10 tags per object • Maximum key length—127 Unicode characters • Maximum value length—255 Unicode characters • Tag keys and values are case sensitive 2 ways to put tags via API • Put objects with tag parameter, or • Add tag API after object is created
  • 26. What can I do with tags? { "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Action": [ "s3:GetObject" ], "Resource": "arn:aws:s3:::EXAMPLE-BUCKET-NAME/*" "Condition": {"StringEquals": {"S3:ResourceTag/HIPAA":"True"}} } ] } Manage permissions with tags
  • 27. Lifecycle policies based on tags <LifecycleConfiguration> <Rule> <ID>sample-rule</ID> <Filter> <And> <Prefix>documents/</Prefix> <Tag> <Key>Project</Key> <Value>Delta</Value> </Tag> <Tag> <Key>Data type</Key> <Value>HPI</Value> </Tag> </And> </Filter> <Status>Enabled</Status> <Transition> <Days>365</Days> <StorageClass>GLACIER</StorageClass> </Transition> <Expiration> <Days>3650</Days> </Expiration> </Rule> </LifecycleConfiguration>
  • 28. Putting it all together
  • 29. Storage Management for S3 Cross-Region Replication Lifecycle Policy S3 Object TagsEvent Notifications Monitor and Alert with CloudWatch S3 Inventory Audit with CloudTrail Data Events S3 Analytics
  • 31. Getting high throughput performance with S3 • S3 can scale to many thousands of requests per second • Need a good key naming scheme • Only at scale do you need to consider your key naming scheme • What are Partitions? • Why? • Spread keys lexigraphically • Goal of partitioning is too spread the heat • Prevent HotSpots
  • 32. my-bucket/2013_11_13-164533125.jpg my-bucket/2013_11_13-164533126.jpg my-bucket/2013_11_13-164533127.jpg my-bucket/2013_11_13-164533128.jpg my-bucket/2013_11_12-164533129.jpg my-bucket/2013_11_12-164533130.jpg my-bucket/2013_11_12-164533131.jpg my-bucket/2013_11_12-164533132.jpg my-bucket/2013_11_11-164533133.jpg my-bucket/2013_11_11-164533134.jpg my-bucket/2013_11_11-164533135.jpg my-bucket/2013_11_11-164533136.jpg Use a key-naming scheme with randomness at the beginning for high TPS • Most important if you will regularly exceed 100 TPS on a bucket • Avoid starting with a date or monotonically increasing numbers • Consider adding a hash or reversed timestamp (ssmmhhddmmyy) Don’t do this… How to design for high Request Rates to S3 Partition: my-bucket/2013_11_1
  • 33. Distributing key names Add randomness to the beginning of the key name… my-bucket/6213-2013_11_13.jpg my-bucket/4653-2013_11_13.jpg my-bucket/9873-2013_11_13.jpg my-bucket/4657-2013_11_13.jpg my-bucket/1256-2013_11_13.jpg my-bucket/8345-2013_11_13.jpg my-bucket/0321-2013_11_13.jpg my-bucket/5654-2013_11_13.jpg my-bucket/2345-2013_11_13.jpg my-bucket/7567-2013_11_13.jpg my-bucket/3455-2013_11_13.jpg my-bucket/4313-2013_11_13.jpg Partitions: my-bucket/0 my-bucket/1 my-bucket/2 my-bucket/3 my-bucket/4 my-bucket/5 my-bucket/6 my-bucket/7 my-bucket/8 my-bucket/9
  • 34. Monotonically Increasing Customer ID mycustdata/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user mycustdata/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv mycustdata/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user mycustdata/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv mycustdata/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt mycustdata/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user mycustdata/7584312/app_data_1/2016-11-30-02:01:01:23/product_usage.csv mycustdata/7584312/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/7584312/app_data_1/2016-11-30-02:01:01:32/wrk_user mycustdata/8584312/app_data_1/2016-11-30-02:01:01:29/product_usage.csv mycustdata/8584312/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/8584312/app_data_1/2016-11-30-02:01:01:14/wrk_user mycustdata/9584312/app_data_1/2016-11-30-02:01:01:28/product_usage.csv mycustdata/9584312/app_data_1/2016-11-30-02:01:01:45/log.txt mycustdata/9584312/app_data_1/2016-11-30-02:01:01:34/wrk_user Partition: mycustdata/213485 Partitions: mycustdata/7 mycustdata/8 mycustdata/9 Reverse Monotonically Increase prefix If a single customer can push a higher workload, they can cause a Hotspot.
  • 35. Add A Hash to Beginning of Key – Best mycustdata/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user mycustdata/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv mycustdata/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user mycustdata/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv mycustdata/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt mycustdata/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user mycustdata/1a/2134857/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/34/2134857/app_data_1/2016-11-30-02:01:01:32/wrk_user mycustdata/a7/2134858/app_data_1/2016-11-30-02:01:01:29/product_usage.csv mycustdata/58/2134858/app_data_1/2016-11-30-02:01:01:24/log.txt mycustdata/70/2134858/app_data_1/2016-11-30-02:01:01:14/wrk_user mycustdata/02/2134859/app_data_1/2016-11-30-02:01:01:28/product_usage.csv mycustdata/2b/2134859/app_data_1/2016-11-30-02:01:01:45/log.txt mycustdata/63/2134859/app_data_1/2016-11-30-02:01:01:34/wrk_user Partition: mycustdata/213485 Partitions: mycustdata/0 mycustdata/1 mycustdata/2 mycustdata/3 mycustdata/4 mycustdata/5 mycustdata/6 mycustdata/7 Add a hash to evenly distribute the keys for all requests mycustdata/8 mycustdata/9 mycustdata/a mycustdata/b mycustdata/c mycustdata/d mycustdata/e mycustdata/f
  • 36. Challenges of using a hash to create entropy • Listing challenges/opportunities: • A Secondary Index can be used to avoid listing • Can be accomplished with Event Notification to AWS Lambda and Amazon DynamoDB • Blog Post - Building and Maintaining an Amazon S3 Metadata Index without Servers • Hash can be used to split work of LISTing objects • Lifecycle constraints • Max number of lifecycle rules – 1000 • Tagging can make this easier
  • 37. Faster upload of large objects Parallelize PUTs with Multipart Uploads • Increase aggregate throughput by parallelizing PUTs on high-bandwidth networks • Move the bottleneck to the network, where it belongs • Increase resiliency to network errors; fewer large restarts on error-prone networks Best Practice
  • 38. Faster download You can parallelize GETs too For large objects, use range-based GETs For content distribution, enable Amazon CloudFront • Caches objects at the edge • 59 global edge locations GET /example-object HTTP/1.1 Host: example-bucket.s3.amazonaws.com x-amz-date: Fri, 28 Jan 2011 21:32:02 GMT Range: bytes=0-9 Authorization: AWS AKIAIOSFODNN7EXAMPLE:Yxg83MZaEgh3OZ3l0rLo5RTX11o=
  • 39. Q & A
  • 41.
  • 42. Ring Products: Practical Uses for the IoT
  • 43. Ring Neighborhoods: Network Effects in Practice Wilshire Park study with LAPD: Ring installed on 10% of homes Burglaries down 55% for the entire community in 6 months Burglars want an easy hit, and go elsewhere if you’re home Alarms are reactive, not proactive Traditional systems don’t link up, so protection ends at your door
  • 44. Devices installed in nearly every country on Earth Millions of connected apps and devices Over 1 billion videos and rapidly increasing High growth brings challenges, even month to month Ring Urban Activity Index 2016-10-20, USA-only, low-cut rural areas
  • 47. Ring Requirements • Live video is ingested from devices and apps via our application servers • Videos are uploaded to our S3 buckets • The videos are transcoded and make them available for customers to stream • Customers need low latency in delivering video streams around the world • Customers get a 30-day free trial of video backups. • If they decide to continue to store videos, they can store videos for up to 6 months after the activity. • When users share videos, we expect them to be watched a lot, and sometimes they go viral
  • 48. Present Your Design • How did you address the use case? • What was your key naming scheme? • How did you address scale? • How did you manage object metadata? • Did you minimize cost? • How do you monitor your requests? • How did you address security considerations?
  • 49. Ring Video Pipeline Raw Buckets Final (Standard) S3 Logs Amazon CloudFront Ring App(s) AWS Lambda Viewers Amazon SQS Owner(s) Visitor Application Servers Ring Device GPU Farm Final (IA) Lifecycle Transitions Event Triggers Live Video
  • 50. Extreme Performance is Easy S3 will automatically partition if you use good keys – or just add more buckets CloudFront as a CDN for GET heavy loads and faster downloads Faster uploads with Transfer Acceleration TCP Window Scaling - without it, 64kB window kneecaps long fat networks TCP SACK is good for fast but lossy connections like mobile connections examplebucket/2134857/data/start.png examplebucket/2134857/data/resource.rsrc examplebucket/2134857/data/results.txt examplebucket/2134858/data/start.png examplebucket/2134858/data/resource.rsrc examplebucket/2134858/data/results.txt examplebucket/2134859/data/start.png examplebucket/2134859/data/resource.rsrc examplebucket/2134859/data/results.txt examplebucket/7584312/data/start.png examplebucket/7584312/data/resource.rsrc examplebucket/7584312/data/results.txt examplebucket/8584312/data/start.png examplebucket/8584312/data/resource.rsrc examplebucket/8584312/data/results.txt examplebucket/9584312/data/start.png examplebucket/9584312/data/resource.rsrc examplebucket/9584312/data/results.txt
  • 51. S3 Scaling on H-Day
  • 53. Remember to complete your evaluations! Remember to complete your evaluations!