© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Mv – Marcus Vinicius Ferreira / Bruno Silveira
Public Sector Team
Jun/2019
Hands-on: Storage in the Cloud
EBS, EFS, S3
Mv – Marcus Vinicius Ferreira
mvferr@amazon.com
Solutions Architect
BR, Public Sector, Education
Mv
Bruno Silveira
brunorms@amazon.com
Solutions Architect
BR, Public Sector, Gov
Bruno
AWS Topics
• Why: Storage Motivation
- Overview
• What: Storage Services
- The Block Storage "Family": EBS, Snapshots
- The Object Storage "Family": S3, S3-IA, Glacier
- The Transfer Storage "Family": Storage Gateway, Snowball, Direct Connect, DMS
• How: Scenarios and Architectures
- Databases
- Web Applications
- Analytics, Big Data
- Backup and Recovery
- Legacy Systems
Why: Storage Motivation
GB, TB, PB, EB, ZB
Big Data: Unconstrained Growth
• Unstructured data growth is explosive
• 95% of the 1.2 zettabytes of data in the digital universe is unstructured
• Logs, machine data, and IoT will only steepen the curve
• 70% of this data is user-generated content
• Video resolution is always increasing: 1080p, 4K, 8K
Source: IDC, The Internet of Things: Getting Ready to Embrace Its Impact on the Digital Economy, March 2016.
Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011
IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares
[Chart, 1990 to 2020: generated data vs. data available for analysis]
Key Insight: Most Data Falls on the Floor
90% of the data in a company is never analyzed
High costs and complexity of traditional DW systems make it hard to justify the capital expense
Data is a strategic asset for every organization
“The world’s most valuable resource is no longer oil, but data.”*
*Copyright: The Economist, 2017, David Parkins
Two Facts of Life
AWS Storage is a Platform
File: Amazon EFS
Block: Amazon EBS, Amazon EC2 Instance Store
Object: Amazon S3, Amazon Glacier
Data Transfer: AWS Direct Connect, AWS Snowball, ISV Connectors, Amazon Kinesis Firehose, Storage Gateway, S3 Transfer Acceleration, Amazon CloudFront, Internet/VPN
What: Storage Services
Example Data Center: Where Do We Put All of This on AWS?
[Diagram of an on-premises data center: web servers, app servers, DB (master) and DB (slave), SAN, NAS file server, file system disks, LDAP server, and back-ups on tapes]
Example Data Center: Where Do We Put All of This on AWS?
[Diagram of the same workload on AWS: web servers and app servers with Elastic Load Balancing, Amazon Elastic Block Store, Amazon Elastic File System, Amazon RDS (master) and Amazon RDS (standby), AWS Directory Service, and backups to Amazon S3 or Glacier]
Block vs File vs Object
Block Storage
Raw Storage
Data organized as an array of unrelated blocks
Host File System places data on disk
e.g.: Microsoft NTFS, Unix ZFS
File Storage
Unrelated data blocks managed by a file (serving) system
Native file system places data on disk
Object Storage
Stores Virtual containers that encapsulate the data, data attributes, metadata and Object IDs
API Access to data
Metadata Driven, Policy-based, etc
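The three models above can be sketched as a toy, in-memory illustration. Every class name and the 512-byte block size below are assumptions made for this sketch; none of it is an AWS API. It shows raw numbered blocks, a minimal file layer that decides where data lands on those blocks, and an object store that keeps data and metadata together under a key.

```python
BLOCK_SIZE = 512  # bytes per block (illustrative value, not an AWS figure)


class BlockDevice:
    """Block storage: a flat array of fixed-size blocks addressed by number."""

    def __init__(self, block_count):
        self.blocks = [bytes(BLOCK_SIZE)] * block_count

    def write_block(self, index, data):
        if len(data) > BLOCK_SIZE:
            raise ValueError("data does not fit in one block")
        # Pad short writes with zero bytes so every block has the same size.
        self.blocks[index] = data.ljust(BLOCK_SIZE, b"\0")

    def read_block(self, index):
        return self.blocks[index]


class TinyFileSystem:
    """File storage: a file system maps names to blocks on a device."""

    def __init__(self, device):
        self.device = device
        self.table = {}      # path -> (block numbers, file length)
        self.next_free = 0   # naive allocator: blocks are never reused

    def write_file(self, path, data):
        block_numbers = []
        for offset in range(0, len(data), BLOCK_SIZE):
            self.device.write_block(self.next_free, data[offset:offset + BLOCK_SIZE])
            block_numbers.append(self.next_free)
            self.next_free += 1
        self.table[path] = (block_numbers, len(data))

    def read_file(self, path):
        block_numbers, length = self.table[path]
        raw = b"".join(self.device.read_block(n) for n in block_numbers)
        return raw[:length]  # drop the zero padding of the last block


class ObjectStore:
    """Object storage: data plus metadata, addressed by key through an API."""

    def __init__(self):
        self.objects = {}

    def put(self, key, data, **metadata):
        self.objects[key] = {"data": data, "metadata": metadata}

    def get(self, key):
        return self.objects[key]


device = BlockDevice(block_count=8)
fs = TinyFileSystem(device)
fs.write_file("/logs/app.log", b"x" * 700)   # spans two blocks
store = ObjectStore()
store.put("videos/Preview2.mp4", b"...", content_type="video/mp4")
```

The caller of `BlockDevice` must track block numbers itself, the file layer hides them behind a path, and the object store exposes only keys and metadata.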
Three types of storage
File: Amazon EFS, Amazon FSx
Block: Amazon EBS
Object: Amazon S3, Amazon Glacier
Performance comparison of storage types
[Chart: file, object, and block storage compared by latency and throughput]
Object storage: S3 Standard, S3 Standard-IA, S3 One Zone-IA, S3 Intelligent-Tiering, S3 Glacier, S3 Glacier Deep Archive
Block storage: Provisioned IOPS SSD, Throughput-Optimized HDD, Cold HDD
File storage: EFS Standard, EFS Infrequent Access
Services: Amazon S3, Amazon EBS, Amazon EC2, Amazon EFS, Amazon FSx for Lustre (NEW!), Amazon FSx for Windows File Server (NEW!), AWS Storage Gateway Family
[Slide badges: NEW!, COMING SOON!]
EBS, Snapshots, EFS, Ephemeral
AWS block storage offerings
• EC2 instance store
• EBS SSD-backed volumes: io1, gp2
• EBS HDD-backed volumes: st1, sc1
Block Storage: EBS Volume Types
SSD
- gp2: General Purpose SSD (Generic)
- io1: Provisioned IOPS SSD (Fast, Random I/O)
HDD
- st1: Throughput Optimized HDD (Fast, Sequential I/O)
- sc1: Cold HDD (Slow, Large, Cheap)
What is EBS?
• Block storage as a service
• Create, attach volumes through an API
• Service accessed over the network
• Volume and instance must be in the same Availability Zone (AZ)
• Volumes persist independently of EC2 instances
• Detach and attach between instances
• Volumes attach to one instance at a time
• Many volumes can attach to an instance
• Maximum volume size is 16 TiB
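The EBS rules on these slides can be expressed as a small toy model. This is a sketch only: the class and function names are hypothetical, and it does not call the AWS API. It enforces the same-AZ rule, one attachment per volume, many volumes per instance, and the 16 TB size limit stated on the slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional

MAX_VOLUME_SIZE_TB = 16  # maximum volume size as stated on the slide


@dataclass(eq=False)
class Instance:
    instance_id: str
    az: str
    volumes: List["Volume"] = field(default_factory=list)


@dataclass(eq=False)
class Volume:
    volume_id: str
    az: str
    size_tb: int
    attached_to: Optional[Instance] = None

    def __post_init__(self):
        if not 0 < self.size_tb <= MAX_VOLUME_SIZE_TB:
            raise ValueError("volume size must be between 1 and 16 TB")


def attach(volume, instance):
    """Attach a volume, enforcing the same-AZ and single-attachment rules."""
    if volume.az != instance.az:
        raise ValueError("volume and instance must be in the same AZ")
    if volume.attached_to is not None:
        raise ValueError("volume is already attached to an instance")
    volume.attached_to = instance
    instance.volumes.append(volume)


def detach(volume):
    """Detach a volume; the volume itself (and its data) keeps existing."""
    if volume.attached_to is not None:
        volume.attached_to.volumes.remove(volume)
        volume.attached_to = None


def rejected(action):
    """Return True when the given action violates one of the rules."""
    try:
        action()
    except ValueError:
        return True
    return False


web = Instance("i-web", "sa-east-1a")
db = Instance("i-db", "sa-east-1a")
other_az = Instance("i-other", "sa-east-1b")
data = Volume("vol-data", "sa-east-1a", size_tb=8)

attach(data, web)
second_attach_rejected = rejected(lambda: attach(data, db))
detach(data)
attach(data, db)          # moving a volume between instances in the same AZ
detach(data)
cross_az_rejected = rejected(lambda: attach(data, other_az))
```

The instance and volume identifiers are made up; only the AZ names come from the deck.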
What is EBS?
EBS is designed for:
• 99.999% service availability
• 0.1% to 0.2% annual failure rate (AFR)
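To make the two design targets concrete, here is a back-of-the-envelope sketch. It assumes that availability is measured over a 365-day year and that the AFR applies independently to each volume; both are simplifications for illustration, and the function names are hypothetical.

```python
MINUTES_PER_YEAR = 365 * 24 * 60


def allowed_downtime_minutes(availability_percent):
    """Minutes per year a service may be down at the given availability."""
    return (100 - availability_percent) / 100 * MINUTES_PER_YEAR


def expected_volume_failures(volume_count, afr_percent):
    """Expected number of volume failures per year for a fleet of volumes."""
    return volume_count * afr_percent / 100


downtime = allowed_downtime_minutes(99.999)          # a little over 5 minutes per year
low = expected_volume_failures(1000, 0.1)            # about 1 failure per year
high = expected_volume_failures(1000, 0.2)           # about 2 failures per year
```

In other words, a fleet of 1,000 volumes should expect roughly one or two volume failures per year, which is why snapshots still matter.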
What is an EBS snapshot?
[Diagram: an EBS volume and its replica in one Availability Zone; the EBS snapshot is stored in Amazon S3 within the AWS Region]
What can you do with a snapshot?
[Diagram: an EBS snapshot in Amazon S3 used to create a new EBS volume in another Availability Zone of the same AWS Region]
[Diagram: an EBS snapshot copied to a second AWS Region and used to create an EBS volume there]
EBS restrictions are...
• Single-AZ
• Attach to just one EC2 instance at a time
Do it yourself – NFS architecture
[Diagram: three NFS servers, each with two volumes, serving three groups of NFS clients]
http://bit.ly/amazonefstutorial
Amazon EFS architecture
[Diagram: three groups of NFS clients, each connecting through its own mount target to a single namespace]
http://bit.ly/amazonefstutorial
S3, S3-IA, Glacier
Traditional Storage System
1 PB raw storage
800 TB usable storage
600 TB allocated storage
400 TB application data
S3 unlimited capacity -- pay only for what you use!
Amazon S3: ~$0.021/GB/mo
Amazon S3: HTTP access
1. HTTP/HTTPS access
2. Unlimited number of files
3. Unlimited growth...
4. Any type of data: backups, photos, videos, documents, logs
5. Cheap, unlimited storage
Bucket: [bucket name], in the Tokyo Region (ap-northeast-1)
Object: Preview2.mp4
Bucket URL: https://s3-ap-northeast-1.amazonaws.com/[bucket name]/
Object URL: https://s3-ap-northeast-1.amazonaws.com/[bucket name]/Preview2.mp4
URL parts: region code (ap-northeast-1), bucket name, key (Preview2.mp4)
Object Keys
An object key is the unique identifier for an object in a bucket.
Example: http://doc.s3.amazonaws.com/2006-03-01/AmazonS3.html
Bucket: doc
Object/Key: 2006-03-01/AmazonS3.html
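The two URL layouts shown on these slides can be sketched with the Python standard library alone. The helper names are hypothetical and `my-bucket` is a placeholder; the sketch only mirrors the slide's path-style layout (`s3-<region>.amazonaws.com/<bucket>/<key>`) and the `<bucket>.s3.amazonaws.com/<key>` example, and does not cover every endpoint format S3 accepts.

```python
from urllib.parse import quote, urlsplit


def path_style_url(region, bucket, key):
    """Build the path-style URL layout shown on the Tokyo Region slide."""
    # quote() percent-encodes unsafe characters but keeps "/" inside the key.
    return f"https://s3-{region}.amazonaws.com/{bucket}/{quote(key)}"


def split_virtual_hosted_url(url):
    """Split a <bucket>.s3.amazonaws.com/<key> URL into (bucket, key)."""
    parts = urlsplit(url)
    bucket = parts.hostname.split(".s3.")[0]
    key = parts.path.lstrip("/")
    return bucket, key


video_url = path_style_url("ap-northeast-1", "my-bucket", "Preview2.mp4")
bucket, key = split_virtual_hosted_url(
    "http://doc.s3.amazonaws.com/2006-03-01/AmazonS3.html"
)
```

Note that the key may contain slashes: S3 has no real directories, so `2006-03-01/AmazonS3.html` is one key.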
A Closer Look: S3 Durability
4 9s durability vs. 5 9s durability vs. 11 9s durability (99.999999999%) for S3, S3-IA, Glacier
Understanding Durability
• Two copies on one site: designed for 99.99% durability
• Copies on two sites: designed for 99.999% durability
• Standard, IA, Glacier: designed for 99.999999999% durability
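A rough way to read these percentages is as expected object losses per year. The sketch below assumes each figure is an independent, annual, per-object probability of survival, which is a simplification for illustration; the function name is hypothetical. `Decimal` is used because eleven nines do not survive ordinary floating-point subtraction cleanly.

```python
from decimal import Decimal


def expected_annual_losses(object_count, durability_percent):
    """Expected number of objects lost per year at the given durability."""
    loss_probability = (Decimal(100) - Decimal(durability_percent)) / Decimal(100)
    return Decimal(object_count) * loss_probability


objects = 10_000_000
one_site = expected_annual_losses(objects, "99.99")
two_sites = expected_annual_losses(objects, "99.999")
s3 = expected_annual_losses(objects, "99.999999999")
```

Under these assumptions, ten million objects lose about 1,000 objects a year at four nines, about 100 at five nines, and 0.0001 at eleven nines, that is, one object every 10,000 years.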
Understanding Durability
[Diagram: S3 Standard, S3 Standard-IA, and Glacier span three Availability Zones in an AWS Region; S3 One Zone-IA uses a single Availability Zone]
Choice of storage classes on Amazon S3
• S3 Standard: active data, access in milliseconds, $0.021/GB/mo
• S3 Standard-IA (Infrequent Access): infrequently accessed data, access in milliseconds, $0.0125/GB/mo
• Amazon Glacier: archive data, access in minutes to hours, $0.004/GB/mo
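The per-GB prices above translate into monthly bills as follows. This sketch uses only the storage prices quoted on the slide and deliberately ignores request, retrieval, and data transfer charges, so it is a lower bound rather than a quote; the names are hypothetical. The 400 TB figure is the application data from the traditional storage slide, taken as 400,000 GB.

```python
from decimal import Decimal

# Storage prices in USD per GB per month, as quoted on the slide.
PRICE_PER_GB_MONTH = {
    "S3 Standard": Decimal("0.021"),
    "S3 Standard-IA": Decimal("0.0125"),
    "Amazon Glacier": Decimal("0.004"),
}


def monthly_storage_cost(size_gb, storage_class):
    """Storage-only monthly cost in USD for the given class."""
    return Decimal(size_gb) * PRICE_PER_GB_MONTH[storage_class]


application_data_gb = 400_000  # 400 TB of application data
costs = {
    name: monthly_storage_cost(application_data_gb, name)
    for name in PRICE_PER_GB_MONTH
}
```

With these prices, 400 TB costs 8,400 USD per month in S3 Standard, 5,000 USD in S3 Standard-IA, and 1,600 USD in Glacier, before any access charges.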
Storage Gateway, Snowball, Direct Connect
AWS offers the most ways to move data to the cloud
• AWS Direct Connect: a private connection between your data center, office, or colocation environment and AWS
• AWS Snow family (Snowball, Snowball Edge, Snowmobile): secure, physical transport appliances that move up to exabytes of data into and out of AWS
• AWS Storage Gateway: hybrid storage that seamlessly connects on-premises applications to AWS storage. Ideal for backup, DR, bursting, tiering, or migration
• Amazon Kinesis Firehose: capture, transform, and load streaming data into S3 for use with Amazon business intelligence and analytics tools
• Amazon EFS File Sync: up to 5x faster file transfers than open source tools. Ideal for migrating data into EFS or moving between cloud file systems
• Amazon S3 Transfer Acceleration: up to 300% faster transfers into and out of S3. Ideal when working across long geographic distances
• APN competency partners: integrations between third-party vendors and AWS services. Ideal for leveraging existing software licenses and skills
Categories: Networks, Shipping, Hybrid
Storage Gateway: Enterprise Backup
[Diagram: two on-premises backup paths to Amazon S3, Amazon S3-IA, and Amazon Glacier over the Internet or a VPN]
• AWS Storage Gateway VTL: application servers, media server, local disk, Storage Gateway
• Native S3 integration: application servers, media server with cloud connector, local disk
Which On-Premises Backup Software? All of them!
Enterprise Backup: Direct Connect
[Diagram: two on-premises backup paths to Amazon S3, Amazon S3-IA, and Amazon Glacier over AWS Direct Connect or a VPN]
• Storage Gateway: application servers, media server, local disk, Storage Gateway
• Cloud connector/native integration: application servers, media server with cloud connector, local disk
1 Gbps or 10 Gbps dedicated link
Amazon S3 Transfer Acceleration
500 GB upload from clients in these locations to a bucket in Singapore
[Chart: upload time in hours, public Internet vs. accelerated transfer, for clients in Rio de Janeiro, Warsaw, New York, Atlanta, Madrid, Virginia, Melbourne, Paris, Los Angeles, Seattle, Tokyo, and Singapore]
Up to 300% faster, 171% on average
What is Snowball? Petabyte scale data transport
• E-ink shipping label
• Ruggedized case, “8.5G Impact”
• All data encrypted end-to-end
• 50TB or 80TB
• 10G network
• Rain & dust resistant
• Tamper-resistant case & electronics
How fast is Snowball?
• Less than 1 day to transfer 50TB via a 10G connection with Snowball, less than 1 week including shipping
• Number of days to transfer 50TB via the internet at typical utilizations:
Internet Connection Speed
Utilization   1Gbps   500Mbps   300Mbps   150Mbps
25%           19      38        63        126
50%           9       19        32        63
75%           6       13        21        42
How fast is Snowball?
• Less than 1 day to transfer 250TB via 5x10G connections with 5 Snowballs, less than 1 week including shipping
• Number of days to transfer 250TB via the Internet at typical utilizations:
Internet Connection Speed
Utilization   1Gbps   500Mbps   300Mbps   150Mbps
25%           95      190       316       632
50%           47      95        158       316
75%           32      63        105       211
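The day counts in both tables can be reproduced with a short calculation. One assumption is needed to match the slide's rounding: 1 TB is taken as 1024 GB with 1 GB = 10^9 bytes. That convention is inferred from the numbers, not stated in the deck, and the function names are hypothetical.

```python
SECONDS_PER_DAY = 86_400
# Assumption inferred from the slide's figures: 1 TB = 1024 GB, 1 GB = 10^9 bytes.
BITS_PER_TB = 1024 * 10**9 * 8


def transfer_days(size_tb, link_mbps, utilization):
    """Days needed to move size_tb over a link used at the given fraction."""
    effective_bits_per_second = link_mbps * 10**6 * utilization
    return size_tb * BITS_PER_TB / effective_bits_per_second / SECONDS_PER_DAY


def transfer_table(size_tb,
                   links_mbps=(1000, 500, 300, 150),
                   utilizations=(0.25, 0.50, 0.75)):
    """Rounded day counts per utilization, one column per link speed."""
    return {
        u: [round(transfer_days(size_tb, mbps, u)) for mbps in links_mbps]
        for u in utilizations
    }


table_50tb = transfer_table(50)
table_250tb = transfer_table(250)
snowball_days = transfer_days(50, 10_000, 1.0)   # 10G link at full speed
```

At a full 10 Gbps the same 50 TB takes under half a day, which is the basis for the "less than 1 day" claim before shipping time is added.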
AWS Snow* Family
• Snowball: petabyte-scale data migration
• Snowball Edge: Snowball with Lambda inside
• Snowmobile: exabyte-scale data migration
How: Scenarios and Architectures
EBS for Databases: SQL, NoSQL, Big Data
[Diagram: one EC2 server with six attached EBS volumes]
S3: Sharing web files
172.31.0.0/16
sa-east-1a sa-east-1b sa-east-1c
S3: Sharing web files: because of AutoScaling
172.31.0.0/16
sa-east-1a sa-east-1b sa-east-1c
EFS: Legacy Systems
172.31.0.0/16
How: Big Data and Analytics
S3 for Big Data
• Scalability & Elasticity
- Resize a running cluster based on how much work needs to be done
• Durability and Availability
- Fault tolerant for slave node (HDFS)
- Backup to S3 for resilience against master node failures
• Standard Interfaces
- Hive, Pig, Spark, HBase, Impala, Hunk, Presto, other popular tools
Amazon EMR Cluster
Big Data is about large numbers of files
Stored logs structure
(in Amazon S3)
Raw log data
(sample)
Order_ID Customer_ID Order_date Total
AWS EMR Environment: Hadoop, Spark, et al.
• Master instance group
• Core instance group (HDFS)
- Core instances manage data and tasks
- Can be added and removed
• Task instance group
- Task instances (optional) are added or subtracted in response to work
• Amazon S3 as primary storage
- Terabytes of files
Netflix Uses S3 to Back its Various Clusters
S3
Fraud Detection
FINRA uses Amazon EMR and Amazon S3 to process up to 75 billion trading events per day and securely store over 5 petabytes of data, attaining savings of $10-20 million per year.
• Nasdaq lists 3,600 global companies worth $9.6 trillion in market cap, representing diverse industries and many of the world’s most well-known and innovative brands
• More than U.S. $1 trillion in notional value is tied to our library of more than 41,000 global indexes
• Nasdaq technology is used to power more than 100 marketplaces in 50 countries
• Our global platform can handle more than 1 million messages/second at sub-40 microsecond average speeds
• We own and operate 26 markets, including 1 clearinghouse and 5 central securities depositories, across asset classes and geographies
High Level Architecture Overview
Labs
https://www.qwiklabs.com/
https://bit.ly/ps-hands-on-efs
https://bit.ly/ps-hands-on-ebs
https://bit.ly/ps-hands-on-s3
Questions?
Summary
• What: Storage Services
- The Block Storage "Family": EBS, Snapshots
- The Object Storage "Family": S3, S3-IA, Glacier
- The Transfer Storage "Family": Storage Gateway, Snowball, Direct Connect
• How: Scenarios and Architectures
- Databases: EBS
- Web Applications: S3
- Analytics, Big Data: S3
- Backup and Recovery: S3, Storage Gateway, Direct Connect, Snowball
- Legacy Systems: EBS, EFS, Storage Gateway
Thank You!
https://aws.amazon.com/ebs/
https://aws.amazon.com/efs/
https://aws.amazon.com/s3/
Feedback
http://bit.ly/PesquisaBSB6
AWS Summit São Paulo
https://www.cvent.com/events/aws-summit-sao-paulo/registration-89802b17e4ab403db6baeed7ba5917cc.aspx?lang=pt-BR&fqp=true&refid=sp_summit_2019

Advantages of Hiring UIUX Design Service Providers for Your Business
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Storage on Amazon Web Services for the Public Sector

  • 1. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Mv – Marcus Vinicius Ferreira / Bruno Silveira, Public Sector Team, Jun/2019. Hands-on: Storage in the Cloud (EBS, EFS, S3)
  • 2. Mv – Marcus Vinicius Ferreira, mvferr@amazon.com, Solutions Architect, BR, Public Sector, Education
  • 4. AWS Topics • Why: Storage Motivation - Overview • What: Storage Services - The Block Storage "Family": EBS, Snapshots - The Object Storage "Family": S3, S3-IA, Glacier - The Transfer Storage "Family": Storage Gateway, Snowball, Direct Connect, DMS • How: Scenarios and Architectures - Databases - Web Applications - Analytics, Big Data - Backup and Recovery - Legacy Systems
  • 6. Big Data: Unconstrained Growth (chart scale: GB, TB, PB, EB, ZB) • Unstructured data growth is explosive • 95% of the 1.2 zettabytes of data in the digital universe is unstructured • Logs, machine data and IoT will only steepen the curve • 70% of this data is user-generated content • Video resolution is always increasing: 1080p, 4K, 8K. Source: IDC, The Internet of Things: Getting Ready to Embrace Its Impact on the Digital Economy, March 2016.
  • 7. Key Insight: Most Data Falls on the Floor (chart: generated data vs. data available for analysis, 1990 to 2020) • 90% of the data in a company is never analyzed • High costs and complexity of traditional DW systems make it hard to justify the capital expense. Sources: Gartner, User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011; IDC, Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares.
  • 8. Data is a strategic asset for every organization. “The world’s most valuable resource is no longer oil, but data.”* *Copyright: The Economist, 2017, David Parkins
  • 10. Two Facts of Life
  • 11. AWS Storage is a Platform. File: Amazon EFS. Block: Amazon EBS, Amazon EC2 Instance Store. Object: Amazon S3, Amazon Glacier. Data Transfer: AWS Direct Connect, ISV Connectors, Amazon Kinesis Firehose, Storage Gateway, S3 Transfer Acceleration, AWS Snowball, Amazon CloudFront, Internet/VPN
  • 13. Example Data Center: Where Do We Put All of This on AWS? Web servers; app servers; DB (Master) and DB (Slave); back-ups on tapes; SAN; NAS file server; file system disks; LDAP server
  • 14. Example Data Center: Where Do We Put All of This on AWS? Web servers and app servers; Elastic Load Balancing; Amazon Elastic File System; Amazon Elastic Block Store; Amazon RDS (Master) and Amazon RDS (Standby); backups to Amazon S3 or Glacier; AWS Directory Service
  • 15. Block vs File vs Object. Block Storage: raw storage; data organized as an array of unrelated blocks; the host file system places data on disk (e.g., Microsoft NTFS, Unix ZFS). File Storage: unrelated data blocks managed by a file (serving) system; the native file system places data on disk. Object Storage: stores virtual containers that encapsulate the data, data attributes, metadata and object IDs; API access to data; metadata-driven, policy-based, etc.
  • 16. Three types of storage. File: Amazon EFS, Amazon FSx. Block: Amazon EBS. Object: Amazon S3, Amazon Glacier
  • 17. Performance comparison of storage types (chart: File, Object and Block compared by latency and throughput)
  • 18. Object storage (Amazon S3): S3 Standard, S3 Intelligent-Tiering, S3 Standard-IA, S3 One Zone-IA, S3 Glacier, S3 Glacier Deep Archive. Block storage (Amazon EBS, Amazon EC2): Provisioned IOPS SSD, Throughput-Optimized HDD, Cold HDD. File storage: Amazon EFS (EFS Standard, EFS Infrequent Access), Amazon FSx for Lustre, Amazon FSx for Windows File Server. AWS Storage Gateway Family. Several items carry "NEW!" or "COMING SOON!" badges on the slide.
  • 19. EBS, Snapshots, EFS, Ephemeral
  • 20. AWS block storage offerings. EC2 instance store: SSD, HDD. EBS SSD-backed volumes: gp2, io1. EBS HDD-backed volumes: st1, sc1
  • 22. Block Storage: EBS Volume Types General Purpose SSD (Generic) gp2 Provisioned IOPS SSD (Fast, Random I/O) io1 Throughput Optimized HDD (Fast, Sequential I/O) st1 Cold HDD (Slow, Large, Cheap) sc1 SSD HDD
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved What is EBS? EBS volume EC2 instance • Block storage as a service • Create, attach volumes through an API • Service accessed over the network
  • 24. What is EBS? • Volume and instance must be in the same AZ (diagram: EBS volume and EC2 instance inside one Availability Zone of an AWS Region)
  • 25. What is EBS? • Volumes persist independent of EC2 • Detach and attach between instances • Volume and instance must be in the same AZ
  • 29. What is EBS? • Volumes attach to one instance at a time • Many volumes can attach to an instance • Maximum volume size is 16 TB
  • 30. What is EBS? Diagram: one EC2 instance with three EBS data volumes of 16 TB, 16 TB and 8 TB
  • 31. What is EBS? • Volumes attach to one instance at a time
  • 32. What is EBS? EBS is designed for: 99.999% service availability; 0.1% to 0.2% annual failure rate (AFR)
  • 33. What is an EBS snapshot? Diagram: an EBS volume and its replica in an Availability Zone; the EBS snapshot is stored in Amazon S3 within the AWS Region
  • 34. What can you do with a snapshot? Diagram: the EBS snapshot in Amazon S3 is restored to a new EBS volume in another Availability Zone
  • 35. What can you do with a snapshot? Diagram: the EBS snapshot is copied to another AWS Region and restored to an EBS volume there
  • 36. EBS restrictions are... • Single-AZ • Attach to just one EC2 at a time
  • 37. Do it yourself – NFS architecture. Diagram: three NFS servers, each with two volumes, serving groups of NFS clients. http://bit.ly/amazonefstutorial
  • 38. Do it yourself. http://bit.ly/amazonefstutorial
  • 39. Amazon EFS architecture. Diagram: groups of NFS clients reach a single namespace through three mount targets. http://bit.ly/amazonefstutorial
  • 41. 1 PB raw storage 800 TB usable storage 600 TB allocated storage 400 TB application data Traditional Storage System
  • 42. 1 PB raw storage 800 TB usable storage 600 TB allocated storage 400 TB application data S3 unlimited capacity -- pay only for what you use! Amazon S3 ~ $0.021 / GB
  • 43. Amazon S3: HTTP access 1. HTTP/HTTPS access 2. Unlimited amount of files 3. Unlimited growth... 4. Any type of data: backups, photos, videos, documents, logs 5. Cheap, unlimited storage. Example in the Tokyo Region (ap-northeast-1): bucket URL https://s3-ap-northeast-1.amazonaws.com/[bucket name]/ and object URL https://s3-ap-northeast-1.amazonaws.com/[bucket name]/Preview2.mp4, where ap-northeast-1 is the region code, [bucket name] is the bucket name and Preview2.mp4 is the key
  • 44. Object Keys: An object key is the unique identifier for an object in a bucket. Example: http://doc.s3.amazonaws.com/2006-03-01/AmazonS3.html (bucket, then object/key)
  • 45. A Closer Look: S3 Durability. 4 9s to 5 9s durability vs. S3, S3-IA and Glacier: 11 9s durability (99.999999999%)
  • 46. Understanding Durability. Two copies on one site: designed for 99.99% durability. Copies on two sites: designed for 99.999% durability. vs. Standard, IA and Glacier in an AWS Region: designed for 99.999999999% durability
  • 47. Understanding Durability. S3 Standard, S3 Standard-IA and Glacier: multiple Availability Zones in an AWS Region. S3 One Zone-IA: a single Availability Zone in an AWS Region
  • 48. Choice of storage classes on Amazon S3. S3 Standard: active data, milliseconds, $0.021/GB/mo. S3 Standard – Infrequent Access: infrequently accessed data, milliseconds, $0.0125/GB/mo. Amazon Glacier: archive data, minutes to hours, $0.004/GB/mo
  • 50. AWS offers the most ways to move data to the cloud (Networks, Shipping, Hybrid). AWS Direct Connect: a private connection between your data center, office, or colocation environment and AWS. AWS Snow family (Snowball, Snowball Edge, Snowmobile): secure, physical transport appliances that move up to exabytes of data into and out of AWS. AWS Storage Gateways: hybrid storage that seamlessly connects on-premises applications to AWS storage; ideal for backup, DR, bursting, tiering or migration. Amazon Kinesis Firehose: capture, transform, and load streaming data into S3 for use with Amazon business intelligence and analytics tools. Amazon EFS File Sync: up to 5x faster file transfers than open source tools; ideal for migrating data into EFS or moving between cloud file systems. Amazon S3 Transfer Acceleration: up to 300% faster transfers into and out of S3; ideal when working with long geographic distances. APN competency partners: integrations between 3rd party vendors and AWS services; ideal for leveraging existing software licenses and skills
  • 51. Storage Gateway: Enterprise Backup. Two paths to Amazon S3, Amazon S3-IA and Amazon Glacier over the Internet or a VPN. Storage Gateway: application servers, local disk, media server, gateway. Cloud Connector/Native Integration: application servers, local disk, media server with cloud connector
  • 52. Which On-Premises Backup Software? All of them! AWS Storage Gateway VTL; Native S3 Integration
  • 53. Enterprise Backup: Direct Connect. The same two paths to Amazon S3, Amazon S3-IA and Amazon Glacier, over AWS Direct Connect (1 Gbps or 10 Gbps dedicated link) or a VPN. Storage Gateway: application servers, local disk, media server, gateway. Cloud Connector/Native Integration: application servers, local disk, media server with cloud connector
  • 54. Amazon S3 Transfer Acceleration. Chart: time in hours for a 500 GB upload to a bucket in Singapore from clients in Rio de Janeiro, Warsaw, New York, Atlanta, Madrid, Virginia, Melbourne, Paris, Los Angeles, Seattle, Tokyo and Singapore, comparing the public Internet with accelerated transfer. Up to 300% faster, 171% on average
  • 56. What is Snowball? Petabyte scale data transport E-ink shipping label Ruggedized case “8.5G Impact” All data encrypted end-to-end 50TB or 80TB 10G network Rain & dust resistant Tamper-resistant case & electronics
  • 57. How fast is Snowball? • Less than 1 day to transfer 50 TB via a 10G connection with Snowball, less than 1 week including shipping • Number of days to transfer 50 TB via the Internet at typical utilizations, by connection speed (1 Gbps / 500 Mbps / 300 Mbps / 150 Mbps): 25% utilization: 19 / 38 / 63 / 126; 50%: 9 / 19 / 32 / 63; 75%: 6 / 13 / 21 / 42
  • 58. How fast is Snowball? • Less than 1 day to transfer 250 TB via 5x10G connections with 5 Snowballs, less than 1 week including shipping • Number of days to transfer 250 TB via the Internet at typical utilizations, by connection speed (1 Gbps / 500 Mbps / 300 Mbps / 150 Mbps): 25% utilization: 95 / 190 / 316 / 632; 50%: 47 / 95 / 158 / 316; 75%: 32 / 63 / 105 / 211
  • 59. AWS Snow* Family. Snowball: petabyte-scale data migration. Snowball Edge: Snowball with Lambda inside. Snowmobile: exabyte-scale data migration
  • 60. How: Scenarios and Architectures
  • 61. EBS for Databases: SQL, NoSQL, BigData. Diagram: one EC2 server with six EBS volumes
  • 62. S3: Sharing web files 172.31.0.0/16 sa-east-1a sa-east-1b sa-east-1c
  • 63. S3: Sharing web files: because of AutoScaling 172.31.0.0/16 sa-east-1a sa-east-1b sa-east-1c
  • 65. How: Big Data and Analytics
  • 66. S3 for Big Data (Amazon EMR clusters) • Scalability & Elasticity: resize a running cluster based on how much work needs to be done • Durability and Availability: fault tolerant for slave node (HDFS); backup to S3 for resilience against master node failures • Standard Interfaces: Hive, Pig, Spark, HBase, Impala, Hunk, Presto, other popular tools
  • 67. Big Data is about large amounts of files. Stored logs structure (in Amazon S3); raw log data (sample) with columns Order_ID, Customer_ID, Order_date, Total
  • 68. AWS EMR Environment: Hadoop, Spark, et al. Master instance group; core instance group (HDFS); task instance group. Core instances: manage data and tasks; can be added and removed. Task instances (optional) are added or subtracted in response to work. Amazon S3 as primary storage: terabytes of files
  • 69. Netflix Uses S3 to Back its Various Clusters S3
  • 70. Fraud Detection FINRA uses Amazon EMR and Amazon S3 to process up to 75 billion trading events per day and securely store over 5 petabytes of data, attaining savings of $10-20mm per year.
  • 71. NASDAQ lists 3,600 global companies worth $9.6 trillion in market cap, representing diverse industries and many of the world’s most well-known and innovative brands. More than U.S. $1 trillion in notional value is tied to our library of more than 41,000 global indexes. NASDAQ technology is used to power more than 100 marketplaces in 50 countries. Our global platform can handle more than 1 million messages/second at sub-40 microsecond average speeds. We own and operate 26 markets, including 1 clearing house and 5 central securities depositories, across asset classes & geographies
  • 75. Summary • What: Storage Services - The Block Storage "Family": EBS, Snapshots - The Object Storage "Family": S3, S3-IA, Glacier - The Transfer Storage "Family": Storage Gateway, Snowball, Direct Connect • How: Scenarios and Architectures - Databases: EBS - Web Applications: S3 - Analytics, Big Data: S3 - Backup and Recovery: S3, Storage Gateway, Direct Connect, Snowball - Legacy Systems: EBS, EFS, Storage Gateway
  • 78. SUMMIT São Paulo https://www.cvent.com/events/aws-summit-sao-paulo/registration-89802b17e4ab403db6baeed7ba5917cc.aspx?lang=pt-BR&fqp=true&refid=sp_summit_2019
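
Slides 43 and 44 describe how an S3 object is addressed by endpoint, bucket name and key. As a rough illustration, the Python sketch below splits the two URL layouts shown there into a bucket and a key. The function name `split_s3_url` and the bucket name `example-bucket` are hypothetical, and the host-name check is a simplifying assumption for these two layouts, not the rule S3 itself applies.

```python
from urllib.parse import urlparse, unquote


def split_s3_url(url):
    """Return (bucket, key) for the two URL layouts shown on the slides.

    Assumption: a host whose first label is "s3" or starts with "s3-"
    is a regional endpoint (bucket in the path); anything else is
    treated as "<bucket>.s3..." (bucket in the host name).
    """
    parsed = urlparse(url)
    host = parsed.netloc.lower()
    path = unquote(parsed.path.lstrip("/"))
    first_label = host.split(".")[0]

    if first_label == "s3" or first_label.startswith("s3-"):
        # Layout of slide 43: https://s3-<region>.amazonaws.com/<bucket>/<key>
        bucket, _, key = path.partition("/")
        return bucket, key

    # Layout of slide 44: http://<bucket>.s3.amazonaws.com/<key>
    bucket = host.split(".s3", 1)[0]
    return bucket, path


if __name__ == "__main__":
    # URL taken from slide 44.
    print(split_s3_url("http://doc.s3.amazonaws.com/2006-03-01/AmazonS3.html"))
    # Slide 43 layout, with a hypothetical bucket name filled in.
    print(split_s3_url(
        "https://s3-ap-northeast-1.amazonaws.com/example-bucket/Preview2.mp4"
    ))
```

A bucket whose own name begins with "s3-" would be misread by this check, so treat the sketch as a way to see the bucket/key split rather than as a general-purpose parser.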

Editor's notes

  1. Here’s what we do know about all Big Data. Due to the convergence of cloud, mobile, and social technologies, and advancements in many fields such as genomics, life sciences, and space, the size of the digital universe is growing at an ever-increasing rate. Customers have also found tremendous value in being able to mine this data to make better medicine, tailor purchasing recommendations, detect fraudulent financial transactions in real time, provide on-demand digital content such as movies and songs, and predict the weather; the list goes on and on.
  2. For on-premises storage solutions, a lot of information is on tape and not easily available at scale.
  3. Timing: 10 seconds. So 7 years later, the world agrees that data matters. In fact, it’s the most important asset for a company. This thought has gone mainstream, with The Economist saying it too.
  4. Most organizations tell us they’ve concluded that it costs more to delete things than to store them (we are always accumulating things): it is too hard to separate the good from the bad; they might lose something important; there are no tools to do this easily (but there are lots and lots of tools). They also tell us that cloud storage is intriguing because it offers ways to make their stored data more useful: it is easy to scale and usually simpler than building it on your own; it is easy to apply as the foundation of new development; it is sometimes tricky to apply.
  5. Cloud storage is a solution: unlimited storage in a very cost-effective way.
  6. In order to meet the requirements of the wide variety of these use cases and others, AWS offers a storage platform with different types of storage suited for different needs. These include… However, whether you are building a new application in the cloud or moving an existing workload, how do you get that data into AWS? Today we will discuss eight options for data migration to the cloud, starting with network-based services such as Internet/VPN, S3 Transfer Acceleration, Amazon CloudFront and AWS Direct Connect. We will then review additional data migration options including Amazon Kinesis Firehose, Storage Gateway, AWS Snowball, and solutions provided by AWS technology partners.
  7. A traditional on-premises or data center–based infrastructure might include a setup like this. Here we'll walk you through just one example of how an arrangement like this could be set up and run on AWS instead.
  8. What happens when you turn this data center infrastructure into an AWS infrastructure? Servers, such as these web servers and app servers, are replaced with Amazon EC2 instances that run all of the same software. Because Amazon EC2 instances can run a variety of Windows Server, Red Hat, SUSE, Ubuntu, or our own Amazon Linux operating systems, virtually all server applications can be run on Amazon EC2 instances. The LDAP server is replaced with AWS Directory Service, which supports LDAP authentication and allows you to easily set up and run Microsoft Active Directory in the cloud or connect your AWS resources with an existing on-premises Microsoft Active Directory. Software-based load balancers are replaced with Elastic Load Balancing load balancers. Elastic Load Balancing is a fully managed load balancing solution that scales automatically as needed and can perform health checks on attached resources, redistributing load away from unhealthy resources as necessary. SAN solutions can be replaced with Amazon Elastic Block Store (EBS) volumes. These volumes can be attached to the application servers to store data long-term, and a volume can be detached from one instance and attached to another. Amazon Elastic File System (EFS) could be used to replace your NAS file server. Amazon EFS is a file storage service for Amazon EC2 instances with a simple interface that allows you to create and configure file systems. It also grows and shrinks your storage automatically as you add and remove files, so you are always using exactly the amount of storage you need. Another solution could be to run a NAS solution on an Amazon EC2 instance; many NAS solutions are available via the AWS Marketplace at https://aws.amazon.com/marketplace/. Databases can be replaced with Amazon Relational Database Service (RDS), which lets you run Amazon Aurora, PostgreSQL, MySQL, MariaDB, Oracle, and Microsoft SQL Server on a managed AWS-based platform. Amazon RDS offers master, read replica, and standby instances. Finally, Amazon RDS instances can be automatically backed up to Amazon S3, replacing the need for on-premises database backup hardware.
  9. Each storage option has a unique combination of performance, durability, cost, and interface
  10. Block storage volumes are the volumes you can attach to an instance and place a file system on. Think about disks running EXT3 or EXT4 on Linux, or NTFS on Windows. SSD-backed volumes are for transactional workloads; their performance is measured in input/output operations per second (IOPS). HDD-backed volumes are for throughput workloads; their performance is measured in megabytes per second. They excel at what HDDs do best: high-throughput sequential workloads.
  11. What is EBS? Create a volume, attach it to an EC2 instance. That’s it.
  12. EBS volumes are bound to an AZ. Restriction number 1.
  13. If I lose a server, the EBS volume can remain. Each volume has a property that decides whether it remains or is deleted when the EC2 instance goes away.
  15. To access that EBS data again, you can create another EC2 instance and attach that EBS volume.
  16. One EC2 instance can have many EBS volumes. Max size of an EBS volume? 16 TB.
  17. What if you need a 40 TB volume? Just combine as many EBS volumes as needed.
  18. EBS volumes are not shareable between different EC2 instances.
  19. Reliable!
  20. EBS snapshots are copies of the blocks of an EBS volume, stored in S3. Snapshots are incremental: each one copies only the blocks that were modified since the last snapshot, so successive snapshots are faster, depending on the amount of modification. Because snapshots are stored in S3, their cost is the S3 cost of the stored data.
  21. Being stored in S3, snapshots can be restored in a different AZ.....
  22. Being stored in S3, snapshots can be restored in a different AZ..... Or even a different region.
  23. EBS limitations...... :(
  24. In on-premises environments it is taken for granted that I can use NFS to share volumes between different servers. A do-it-yourself NFS architecture in the cloud can be elaborate and demand very heavy daily administration.
  25. So..... A do-it-yourself NFS server solution in the cloud is hard....... What if there was something better????
  26. EFS....! EFS is a shared volume service, based on the NFS protocol.
  27. Traditional on-premises storage solutions are expensive, and there is the administration overhead of keeping space available. Storage solutions will get full from time to time.
  28. S3 is great, but how much does it cost? Let’s first discuss what we’re going to charge you for. With traditional storage you pay for raw capacity, but after accounting for protection schemes such as RAID, file system overhead and the need to keep a free storage reserve, you’re left with much less capacity actually used by data. With S3 you pay only for used capacity, when you use it. So in this example, for 400 TB of data you’re really paying for 400 TB, and this is not accounting for DR copies. This drastic difference affects both CAPEX and OPEX costs.
  29. S3 is great. Unlimited storage, at a very low cost. But it is accessed mainly via HTTP/HTTPS.
  30. Because the combination of a bucket, key, and version ID uniquely identifies each object, Amazon S3 can be thought of as a basic data map between "bucket + key + version" and the object itself. Every object in Amazon S3 can be uniquely addressed through the combination of the web service endpoint, bucket name, key, and optionally, a version. For example, in the URL http://doc.s3.amazonaws.com/2006-03-01/AmazonS3.html, "doc" is the name of the bucket and "2006-03-01/AmazonS3.html" is the key.
  31. Across the board, S3, S3-IA and Glacier all offer the same 11 9’s durability: AWS stores data redundantly across multiple facilities and storage devices, and the services automatically perform data integrity checks in the background to guard against potential data corruption. I work with many customers who archive data by storing two copies of tape, either in the same building or with one copy on-site and one remote. When we discuss durability, which is a big deal for many archive customers, many are accustomed to thinking in number of “copies” and find the 11 9’s a bit non-intuitive. To bridge that, we did a thought experiment with a large studio where, at a high level, we walked them through how we derived the 11 9’s using a Markov chain model in which we modeled failures of storage devices, servers, networks, Availability Zones, etc. We asked them to estimate their two-copy tape durability using a similar concept, and they estimated ~4 9’s for two copies in a single building or ~5 9’s for two copies in separate locations. This helped them realize that Glacier’s 11 9’s durability can be thought of as 6 to 7 orders of magnitude more durable than two copies of tape, and helped us bridge the conversation.
  34. When you view our object storage as a portfolio of storage classes, we provide 3 storage options with different performance characteristics and price points. S3 Standard which is our high performance object storage - very active, hot workloads. available in milliseconds starts at 2.1 cents/GB/month depending on the region S3 Standard - Infrequent Access shares the same millisecond access times as S3 Standard, but designed for data you plan to access maybe a few times a year or what we think of as “active archive”. S3-IA costs $0.0125/GB/mo, and then you pay a nominal fee for requests. Glacier is that cold archival tier access latency from minutes to hours, depending on the retrieval option you choose, storage costs $0.004/GB/month.
  35. The first challenge for many organizations is the physics of moving data. Customers have asked us for help moving their data for things like datacenter shutdowns, remote sites, migrating existing enterprise applications, and building hybrid workflows that can still accommodate on-premises data.
  36. S3 is an industry standard for backup solutions. But even legacy backup systems can use S3 via a deployed AWS Storage Gateway.
  37. S3 is an industry standard for backup solutions. Yes, it is.
  38. For more bandwidth and throughput, Direct Connect is the way to go.
  39. What is AWS Import/Export Snowball? Snowball is a new AWS Import/Export offering that provides a petabyte-scale data transfer service using Amazon-provided storage devices for transport. Previously, customers purchased their own portable storage devices and used them to ship their data. With the launch of Snowball, customers can now use highly secure, rugged, Amazon-owned Network Attached Storage (NAS) devices, called Snowballs, to ship their data. Once a device is received and set up, customers can copy up to 50TB of data from their on-premises file system to the Snowball using the Snowball client software over a 10Gbps network interface. Before transfer to the Snowball, all data is encrypted by the client with 256-bit GCM encryption. When customers finish transferring data to the device, they simply ship it back to an AWS facility, where the data is ingested at high speed into Amazon S3.
  40. Compare and contrast Internet vs 1x Snowball.
  41. Compare and contrast Internet vs 5x Snowball.
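The Internet-versus-Snowball comparison on these two slides comes down to simple arithmetic. The sketch below estimates how long a network transfer would take and how many 50TB Snowballs the same data would need. The 80% link utilization, the decimal definition of a terabyte, and the example link speeds are assumptions for illustration; shipping and ingest time for the devices are not modeled.

```python
import math


def network_transfer_days(terabytes: float, link_mbps: float,
                          utilization: float = 0.8) -> float:
    """Days needed to push the data over a link at the given utilization."""
    bits_to_move = terabytes * 1e12 * 8            # decimal TB -> bits
    usable_bits_per_second = link_mbps * 1e6 * utilization
    return bits_to_move / usable_bits_per_second / 86_400


def snowballs_needed(terabytes: float, device_capacity_tb: float = 50) -> int:
    """Number of devices required, rounding up to whole devices."""
    return math.ceil(terabytes / device_capacity_tb)


if __name__ == "__main__":
    data_tb = 250  # illustrative migration size
    for mbps in (100, 1_000, 10_000):
        days = network_transfer_days(data_tb, mbps)
        print(f"{mbps} Mbps link: {days:.1f} days")
    print(f"Snowballs (50TB each): {snowballs_needed(data_tb)}")
```

Plugging in your own link speed and data volume shows quickly where the crossover sits between sending data over the wire and shipping devices.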
  42. In the fullness of time we see hybrid cloud storage addressing needs at the edge of your networks. Customers asked for a way to incorporate a simple, detached cloud storage platform with some computing capability at the edge of their networks, for applications like wind farms, medical devices, shipboard scientific computing and manufacturing shop floors. AWS Snowball Edge is a petabyte-scale data transfer solution with temporary on-premises storage and compute capabilities. It transports up to 100TB of data with the same embedded cryptography and security as the original Snowball, and can also integrate with existing workflows, scale local capacity, and process stored data. Snowball Edge hosts a file server and an S3-compatible endpoint that allow you to use the NFS protocol, the S3 SDK or the S3 CLI to transfer data directly to the device without specialized client software. Multiple units may be clustered together, forming a temporary data collection storage tier in your datacenter so you can work as data is generated without managing copies. As storage needs scale up and down, devices can be easily added to or removed from the local cluster and returned to AWS.
Snowball Edge also comes with embedded computing power (equivalent to an EC2 m4.4xlarge instance) that hosts a platform for general compute tasks. AWS Lambda functions can run on the device to do things like examine a data stream collected from an IoT sensor, search for anomalies, create aggregated metrics, or send alarms or control signals. Environments with unstable connectivity but high operational demands can run data processes redundantly on Snowball Edge devices, protecting against connectivity issues and eventually returning the captured and processed results to AWS. Snowball Edge is designed to keep data and applications secure while on site or in transit to AWS, making it appropriate for even the most sensitive customer data.
The hardware and software are cryptographically signed, and all stored data is automatically encrypted using 256-bit encryption keys, owned by the customer and managed by AWS Key Management Service (KMS). Customer data stays encrypted in the appliance and is decrypted only when it is copied from the appliance to AWS. Encryption is now performed on the device instead of on the client, producing higher data throughput rates and reducing overall processing time.
Snowball Edge devices are Amazon-owned and eliminate the need for customers to invest in new hardware. Customers pay $300 plus shipping per device and a $30 per day usage fee, applied after the initial 10 days on site. If more capacity is needed at the edge, multiple devices can be requested and used together in a cluster. Amazon monitors the health and utilization of Snowballs and provides replacement devices when needed. Current Snowball data transport appliances in 50TB and 80TB sizes will continue to be available in addition to the new Snowball Edge. Availability varies by region; please check the Snowball product page for additional information.
Philips Healthcare develops technology solutions for consumers, patients, providers and caregivers across the health continuum, from supporting healthy living and prevention to diagnosis, treatment and home care. They embedded Snowball v2 devices in their hospital networks to collect data and initiate real-time analytics. Now the hospital staff no longer waits for answers, and they have a local dataset to run on in case of any connectivity issues.
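The Snowball Edge fee structure quoted above ($300 per device, then $30 per day after the first 10 days on site) is easy to turn into a quick estimate. This is a minimal sketch using only those figures; shipping is excluded because the notes give no amount for it, and current pricing may differ.

```python
def snowball_edge_usage_cost(days_on_site: int, devices: int = 1,
                             service_fee: float = 300.0,
                             daily_fee: float = 30.0,
                             free_days: int = 10) -> float:
    """Per-job cost in USD, excluding shipping.

    The daily fee applies only to days beyond the initial free period.
    """
    billable_days = max(0, days_on_site - free_days)
    return devices * (service_fee + billable_days * daily_fee)


if __name__ == "__main__":
    for days in (7, 10, 15, 30):
        cost = snowball_edge_usage_cost(days)
        print(f"{days} days on site, 1 device: ${cost:,.2f}")
```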
  43. Databases, of any type, want big and fast disks. EBS fits that need. Note: EFS is not a database-friendly solution.
  44. EBS disks can be attached to RDS DB servers or EC2 servers. Keep in mind that EC2 auto-scaling machines will die and their disks will be lost. Any information that must be saved is better placed in a database table or S3 bucket. EBS disks attached to a database server are permanent. But even so, a database backup is always needed for production systems.
  46. For those legacy systems that are not auto-scaling friendly, and must write information to local disks without losing it, EFS is the solution. Any generated information can be saved to a database table, an S3 bucket, or an EFS share.
  47. Because of the unlimited storage space available on S3, S3 is a natural component of Big Data solutions in AWS. Information stored in S3 can be accessed by multiple Big Data solutions simultaneously.
  48. ... Big data is about a variety of information, stored in a huge number of files that need to be processed ...
  49. ... And EMR clusters, running Hadoop, Spark, or other frameworks, can process all the information stored in an S3-based data lake.
  50. Netflix runs large EMR clusters for
  51. Here is a high-level overview of the solution we came up with. These diagrams can all generally be read left-to-right, top-to-bottom. Anything contained within the dashed blue lines is a system running in Nasdaq datacenters; everything else is assumed to be in AWS. So, we have systems inside Nasdaq which write data into a temporary bucket in S3, which is then loaded into Redshift using COPY SQL commands. For some data, we perform transformations and aggregations inside Redshift, then unload those results back to the temporary S3 bucket. In all cases, original data or aggregates, we process the CSV data in the temporary bucket to produce Parquet files, which are stored in a separate S3 bucket for long-term storage. Presto, running in an EMR cluster, is then used to query data stored in those files in S3. This transformation into Parquet is currently performed in Nasdaq datacenters; however, this is a stop-gap measure until we move our data ingest system into EC2, which we are planning to complete in early 2016. In both cases, SQL clients access the databases directly through JDBC.
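For readers unfamiliar with the load step, here is a minimal sketch of what a Redshift COPY statement for CSV files in a temporary S3 bucket might look like. The talk does not show the actual statements, so the table, bucket, prefix and IAM role names are hypothetical, and authenticating through an IAM role is an assumption. The helper only builds the SQL text and does no input escaping; running it would need a database connection, which is out of scope here.

```python
def build_copy_statement(table: str, bucket: str, prefix: str,
                         iam_role_arn: str) -> str:
    """Build a Redshift COPY statement that loads CSV files from S3.

    All argument values are placeholders chosen by the caller; nothing
    here reflects the real Nasdaq setup.
    """
    return (
        f"COPY {table}\n"
        f"FROM 's3://{bucket}/{prefix}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        "FORMAT AS CSV;"
    )


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    print(build_copy_statement(
        table="trades_staging",
        bucket="example-temp-bucket",
        prefix="ingest/2015-10-01/",
        iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftLoadRole",
    ))
```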
  52. Summary. Of course, for more specific details let’s talk to the SA about each use case.