Seagate Kinetic Open Storage Platform
James Hughes
…and many others
Storage is a Price Elastic Market
http://en.wikipedia.org/wiki/Alfred_Marshall

Price elasticity of demand
• Alfred Marshall (1890)

As the price of storage approaches $0
• Demand for storage will approach infinity

If the price of a Cisco router approaches $0
• Demand for routers will not approach infinity; storage is different

"3
Areal Density Growth
Seagate Confidential: Subject to NDA No. 77103, effective Jan. 18, 2009, and all applicable supplements
[Figure: areal density (gigabit/in2, log scale from 0.1 to 100,000) versus year, 1989 to 2019.]
• Recording technologies labeled on the curve: Inductive Writing & Reading; Inductive Writing/MR reading; Inductive Writing/GMR reading; Perpendicular Writing & GMR; HAMR; HAMR+BPM
• Growth-rate labels on the curve: 29%, 100%, 40%
• Late 1990s: superparamagnetic limit demonstrated through modeling
• Charap's limit (broken)
• Single particle superparamagnetic limit (estimated)
• Perpendicular expected to extend to 0.5-1 Tb/in2
• Additional innovations required at that point: heat-assisted recording, bit patterned media recording
• Areal Density CAGR 40%
• Transfer Rate CAGR 20%
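The two growth rates quoted for this chart (areal density at 40% CAGR, transfer rate at 20% CAGR) compound very differently. The sketch below works through that arithmetic. It assumes both rates stay constant and that drive capacity tracks areal density; both are simplifying assumptions for illustration, not claims made on the slide.

```python
import math

def doubling_time_years(cagr: float) -> float:
    """Years needed to double at a constant compound annual growth rate."""
    return math.log(2) / math.log(1 + cagr)

def growth_factor(cagr: float, years: float) -> float:
    """Total growth multiple after `years` at a constant CAGR."""
    return (1 + cagr) ** years

AREAL_DENSITY_CAGR = 0.40  # from the slide
TRANSFER_RATE_CAGR = 0.20  # from the slide

if __name__ == "__main__":
    print(f"density doubles every {doubling_time_years(AREAL_DENSITY_CAGR):.2f} years")
    print(f"transfer rate doubles every {doubling_time_years(TRANSFER_RATE_CAGR):.2f} years")
    # Assumption: capacity tracks areal density, so the time needed to read a
    # whole drive grows by the ratio of the two growth factors.
    ratio = growth_factor(AREAL_DENSITY_CAGR, 10) / growth_factor(TRANSFER_RATE_CAGR, 10)
    print(f"full-drive read time grows about {ratio:.1f}x over 10 years")
```

Under these assumptions capacity outruns bandwidth, which is one reason a drive interface that moves less data per operation becomes more attractive over time.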

"4
Cloud Computing will increase this trend
• Jevons Paradox
• Cloud Computing increases the efficiency of computing...

• Improved technology doubles the amount of information produced with a given amount of storage
• Demand for storage rises

http://en.wikipedia.org/wiki/Jevons_paradox

"5
Technology Trends

"6
Shingled Disks
• Write head larger than read head
• Turns the disk into sequentially written media
• All updates to data and metadata are written sequentially to a continuous stream, called a log
• Disk API of sectors is no longer “natural”

http://www.ssrc.ucsc.edu/Papers/amer-ieeetm11.pdf

"7
Log Structured Storage
How much is erased on a reposition?
• Tape - the remainder of the tape
• Shingled disk - the remainder of the track group
• Flash - the entire page

All persistent Storage systems do/will implement log structure
• e.g. “NoSQL Database of sectors”

Does it make sense to layer a database on top of a database?
• Could we use the log structure of the media to provide a more natural storage system, instead of mimicking an antique paradigm?
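The log-structured idea on this slide can be illustrated with a toy append-only key/value store: every put or delete is appended to the end of a log, and an in-memory index points at the newest record for each key. This is a minimal sketch under stated assumptions. The class name `LogStore` and its record layout are hypothetical and do not describe any drive firmware.

```python
from typing import Dict, List, Optional, Tuple

class LogStore:
    """Toy log-structured key/value store (hypothetical, in memory only)."""

    def __init__(self) -> None:
        # The log is only ever appended to; old records are never overwritten.
        self._log: List[Tuple[bytes, Optional[bytes]]] = []
        # Index: key -> position of the newest record for that key.
        self._index: Dict[bytes, int] = {}

    def put(self, key: bytes, value: bytes) -> None:
        self._log.append((key, value))
        self._index[key] = len(self._log) - 1

    def delete(self, key: bytes) -> None:
        # A delete is itself a log record (a "tombstone" with no value).
        self._log.append((key, None))
        self._index.pop(key, None)

    def get(self, key: bytes) -> Optional[bytes]:
        pos = self._index.get(key)
        return None if pos is None else self._log[pos][1]

    def compact(self) -> None:
        """Rewrite the log sequentially, keeping only live records."""
        ordered = sorted(self._index.items(), key=lambda kp: kp[1])
        live = [(k, self._log[p][1]) for k, p in ordered]
        self._log = live
        self._index = {k: i for i, (k, _) in enumerate(live)}

    def log_length(self) -> int:
        return len(self._log)
```

Because writes only append, the same structure fits tape, shingled tracks, and flash; the cost is a periodic compaction pass to reclaim space held by stale records.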

"8
Single System Performance Trend
Leading to disaggregation of servers

http://web.eecs.umich.edu/~twenisch/papers/isca09-disaggregate.pdf

"9
Scaling Storage
Distributed Hash Table
• Key/Value Store

RAM: Memcached
Flash: FAWN
Disk: Riak

http://en.wikipedia.org/wiki/Distributed_hash_table
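One common way to build a distributed hash table is a consistent-hash ring, where each key is owned by the first node point at or after the key's hash. The sketch below is illustrative only. The node names and the `HashRing` class are hypothetical, and the slides do not say which placement scheme any particular system uses.

```python
import bisect
import hashlib
from typing import List, Tuple

def _hash(data: bytes) -> int:
    """Map bytes to a point on the ring using SHA-1 (any stable hash works)."""
    return int.from_bytes(hashlib.sha1(data).digest(), "big")

class HashRing:
    """Minimal consistent-hash ring; node names are hypothetical."""

    def __init__(self, nodes: List[str], vnodes: int = 64) -> None:
        # Each node gets several points on the ring to even out the load.
        self._ring: List[Tuple[int, str]] = sorted(
            (_hash(f"{node}#{i}".encode()), node)
            for node in nodes
            for i in range(vnodes)
        )
        self._points = [p for p, _ in self._ring]

    def node_for(self, key: bytes) -> str:
        """Return the node owning the first ring point at or after hash(key)."""
        i = bisect.bisect_left(self._points, _hash(key)) % len(self._ring)
        return self._ring[i][1]
```

A useful property of this design is that adding a node only moves the keys that land on the new node; every other key keeps its owner, so no central lookup table is needed.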

"10
Metadata and Metadata Servers are Evil
Required by traditional file systems (POSIX) to translate
names to sectors
• Hard to scale, heavy HA requirements, expensive

Can we use a name as a key?
• Place the data into a scaled key value store?
• Eliminate costly metadata servers?
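To make the "name as a key" question concrete, the sketch below stores objects under their full names in an ordered key/value store and emulates a directory listing with a key-range scan over a prefix. The `FlatNamespace` class and the example paths are hypothetical; this is one possible approach, not a description of a specific product.

```python
import bisect
from typing import Dict, List

class FlatNamespace:
    """Toy ordered key/value store keyed by full object name (hypothetical)."""

    def __init__(self) -> None:
        self._keys: List[str] = []          # kept sorted for range scans
        self._values: Dict[str, bytes] = {}

    def put(self, name: str, data: bytes) -> None:
        if name not in self._values:
            bisect.insort(self._keys, name)
        self._values[name] = data

    def get(self, name: str) -> bytes:
        return self._values[name]

    def list_prefix(self, prefix: str) -> List[str]:
        """Emulate a directory listing with a key-range scan over the prefix."""
        start = bisect.bisect_left(self._keys, prefix)
        names = []
        for name in self._keys[start:]:
            if not name.startswith(prefix):
                break
            names.append(name)
        return names
```

With this layout there is no separate name-to-sector table to keep highly available: the name is the lookup key, and listing is a range operation on the same store.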

"11
Cumulative operations ordered by length
[Figure: cumulative percentage (0% to 100%) of operations and of data versus operation length (KB, log scale from 1 to 100,000). Annotations: 32KB; 92% of the operations; 0.5% of the data; 512KB.]
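The two curves in this chart answer the same question for two weights: what share of operations, and what share of bytes, comes from operations at or below a given length. A minimal sketch of that calculation follows. The sample workload in it is made up for illustration and is not the trace behind the chart.

```python
from typing import List, Tuple

def cumulative_shares(lengths_kb: List[float], cutoff_kb: float) -> Tuple[float, float]:
    """Return (share of operations, share of data) for operations <= cutoff_kb."""
    total_ops = len(lengths_kb)
    total_data = sum(lengths_kb)
    small = [n for n in lengths_kb if n <= cutoff_kb]
    return len(small) / total_ops, sum(small) / total_data

if __name__ == "__main__":
    # Made-up workload for illustration only: many small operations plus a
    # few very large ones.
    sample = [4.0] * 90 + [16.0] * 5 + [100_000.0] * 5
    ops_share, data_share = cumulative_shares(sample, cutoff_kb=32.0)
    print(f"{ops_share:.0%} of operations, {data_share:.2%} of data at <= 32KB")
```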

Map of Operations
[Figure: 3-D plot of operations; axes: Time (minutes), Location (TB), and Length.]

"13
Seagate Kinetic Open Storage Platform

"14
Seagate Kinetic Open Storage Platform
Disintermediates the path from application to drive
–Goes around file systems, volume managers, drivers

Enables an ecosystem of value-added software
–Partners (like Basho) can create their own system value

Lower TCO
–Eliminates complexity

"15
"16
"16
D

A
S

"17
[Figure: layered architecture, built up from one application and one drive to many applications sharing many drives.]
• Application, Clustering, Management: App (proprietary to system vendor)
• LibKinetic: C++, Java, Python, Erlang, DIY (GPL)
• Interconnect: ProtoBuf, TCP/IP/GbE (standard)
• Storage (proprietary to Seagate)

System Hardware
Typical JBOD architecture
• Does not require a server, just JBODs connected to the ToR switch
• 10 JBODs × 60 drives × 4TB = 2.4PB/rack
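The per-rack figure is a straight multiplication, sketched below so other configurations can be tried. It assumes decimal units (1 PB = 1000 TB) and reports raw capacity before any replication or erasure-coding overhead; the function name is hypothetical.

```python
def rack_capacity_pb(jbods: int, drives_per_jbod: int, drive_tb: float) -> float:
    """Raw capacity of one rack in PB, using decimal units (1 PB = 1000 TB)."""
    return jbods * drives_per_jbod * drive_tb / 1000

# The configuration from the slide: 10 JBODs of 60 drives at 4 TB each.
print(rack_capacity_pb(jbods=10, drives_per_jbod=60, drive_tb=4))  # 2.4
```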

"18
Features
Provides RPC to Key/Value database
• Data is pre-indexed
• Compression and other value-added functions are easy and transparent

P2P (Drive to Drive) copy of key ranges
Communicate using existing Data Center Plumbing (TCP/IP)
Multiple masters - Data sharing between machines
Configurable caching per command
• Async, Sync, Flush

Local space management

"19
Kinetic Systems
Clustering (performance, reliability, management)
Compatibility with large scale applications (S3, etc.)
Centralized Management
• Reliability, availability, durability

"20
Lower TCO
Elimination of server layers
Fewer human requirements
Fewer mistakes
Disaggregate storage from servers
Power management

"21
Goals of API
Data movement
• Get/put/delete/getnext/getprevious
• Versioned (the supplied version must equal the stored version for success), with options

Range operations
Multiple masters
• Authentication/Integrity/Authorization

Cluster-able
• Simple cluster configuration version enforcement

3rd party copy
Management
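The data-movement goals above (get, put, delete, getnext, getprevious, with version checks) can be modeled in a few lines. The sketch below is an in-memory stand-in written for illustration. `ToyDrive`, `VersionMismatch`, and every method signature are hypothetical and are not the LibKinetic API.

```python
import bisect
from typing import Dict, List, Optional, Tuple

class VersionMismatch(Exception):
    """Raised when the caller's expected version differs from the stored one."""

class ToyDrive:
    """In-memory model of a versioned, ordered key/value drive (hypothetical)."""

    def __init__(self) -> None:
        self._keys: List[bytes] = []                       # kept sorted
        self._data: Dict[bytes, Tuple[bytes, bytes]] = {}  # key -> (value, version)

    def _check(self, key: bytes, expected: Optional[bytes]) -> None:
        current = self._data[key][1] if key in self._data else None
        if current != expected:
            raise VersionMismatch(f"stored {current!r}, expected {expected!r}")

    def put(self, key: bytes, value: bytes, new_version: bytes,
            expected_version: Optional[bytes] = None) -> None:
        # The write succeeds only if the stored version equals the expected one.
        self._check(key, expected_version)
        if key not in self._data:
            bisect.insort(self._keys, key)
        self._data[key] = (value, new_version)

    def get(self, key: bytes) -> Optional[Tuple[bytes, bytes]]:
        return self._data.get(key)

    def delete(self, key: bytes, expected_version: bytes) -> None:
        self._check(key, expected_version)
        del self._data[key]
        self._keys.remove(key)

    def get_next(self, key: bytes) -> Optional[bytes]:
        """Smallest stored key strictly greater than `key`."""
        i = bisect.bisect_right(self._keys, key)
        return self._keys[i] if i < len(self._keys) else None

    def get_previous(self, key: bytes) -> Optional[bytes]:
        """Largest stored key strictly less than `key`."""
        i = bisect.bisect_left(self._keys, key)
        return self._keys[i - 1] if i > 0 else None
```

The version check is what lets multiple masters share a drive without a lock service: a writer holding a stale version is rejected and must re-read before retrying.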
"22
Management (System Vendor)
Configures the drive
• Network
• Authorized clients

Monitors
• Health
• Statistics
• Logs

Initiates recovery
• Change cluster version
• 3rdPartyCopy
"23
Data formats
Key Structure
• Variable number of octets (0-4KB)

Data Structure (Serialized to a byte stream)
• KeyOf
• Version
• E2E Data Integrity
–Algorithm name

• Data: variable length (0-xMB)
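A record with these fields (key, version, integrity algorithm name, integrity tag, data) can be serialized to a byte stream in many ways. The sketch below uses simple length-prefixed fields purely for illustration; that framing is an assumption and is not the drive's wire format. The 4096-byte key limit is an assumed reading of "4KB", and the function names are hypothetical.

```python
import hashlib
import struct
from typing import List, Tuple

MAX_KEY_BYTES = 4096  # assumption: "4KB" read as 4096 octets

def _pack(field: bytes) -> bytes:
    """Length-prefix one field with a 4-byte big-endian length."""
    return struct.pack(">I", len(field)) + field

def encode_record(key: bytes, version: bytes, data: bytes,
                  algorithm: str = "sha1") -> bytes:
    """Serialize key, version, integrity tag, and data to a byte stream."""
    if len(key) > MAX_KEY_BYTES:
        raise ValueError("key too long")
    tag = hashlib.new(algorithm, data).digest()
    fields = (key, version, algorithm.encode(), tag, data)
    return b"".join(_pack(f) for f in fields)

def decode_record(blob: bytes) -> Tuple[bytes, bytes, bytes]:
    """Parse a record and verify the end-to-end integrity tag."""
    fields: List[bytes] = []
    offset = 0
    while offset < len(blob):
        (length,) = struct.unpack_from(">I", blob, offset)
        offset += 4
        fields.append(blob[offset:offset + length])
        offset += length
    key, version, algorithm, tag, data = fields
    if hashlib.new(algorithm.decode(), data).digest() != tag:
        raise ValueError("integrity check failed")
    return key, version, data
```

Carrying the algorithm name next to the tag lets the reader verify the data without knowing in advance which hash the writer chose.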

"24
Performance Metrics
Same normal performance expectations
• Sequential Write
• Random Write
• Sequential Read
• Sequential Write

Iometer for key/value

"25
Demo Time!

Performance Results
[Figure: put-rate plots titled "1MB values put rate (MB/s)" and "1KB values put rate"; vertical axes MB/s (0 to 120) and Puts/s (0 to 1000), horizontal axes 0 to 8.]

Conclusion
Deliver more value to Seagate, partners, and customers
• Disintermediates the path from cloud applications to the drive
• Enables innovation in the hardware and software ecosystem
• Lower TCO

Open source software
–Basho Riak, Swift, HDFS

More information
• http://seagate.com/www/kinetic
• https://developers.seagate.com
• http://github.com/Seagate

Kinetic basho public

  • 1. Seagate Kinetic Open Storage Platform James Hughes …and many others
  • 2. Storage is a Price Elastic Market. Price elasticity of demand (Alfred Marshall, 1890; http://en.wikipedia.org/wiki/Alfred_Marshall): as the price of storage approaches $0, demand for storage will approach infinity. If the price of a Cisco router approaches $0, demand for routers will not approach infinity. Storage is different.
  • 3. Areal Density Growth. [Chart: areal density (gigabit/in2, log scale from 0.1 to 100000) by year, 1989 to 2019, with percentage labels 29%, 100%, and 40%. Technologies marked: inductive writing and reading; inductive writing with MR reading; inductive writing with GMR reading; perpendicular writing and GMR; HAMR; HAMR + BPM. Limits marked: Charap's limit (broken); single-particle superparamagnetic limit (estimated).] Late 1990s: superparamagnetic limit demonstrated through modeling. Perpendicular recording expected to extend to 0.5-1 Tb/in2; additional innovations required at that point: heat-assisted recording, bit patterned media recording. Areal density CAGR 40%; transfer rate CAGR 20%.
  • 4. Cloud Computing will increase this trend. Jevons Paradox (http://en.wikipedia.org/wiki/Jevons_paradox): cloud computing increases the efficiency of computing....
  • 5. Cloud Computing will increase this trend. Jevons Paradox (http://en.wikipedia.org/wiki/Jevons_paradox): cloud computing increases the efficiency of computing.... Improved technology doubles the amount of information produced with a given amount of storage; demand for storage rises.
  • 6. Technology Trends
  • 7. Shingled Disks. The write head is larger than the read head, which turns the disk into a sequentially written medium. All updates to data and metadata are written sequentially to a continuous stream, called a log. The disk API of sectors is no longer "natural". http://www.ssrc.ucsc.edu/Papers/amer-ieeetm11.pdf
  • 8. Log Structured Storage. How much is erased on a reposition? Tape: the remainder of the tape. Shingled disk: the remainder of the track group. Flash: the entire page. All persistent storage systems do/will implement log structure, e.g. a "NoSQL Database of sectors". Does it make sense to layer a database on top of a database? Could we use the log structure of the media to provide a more natural storage system, rather than mimicking an antique paradigm?
  • 9. Single System Performance Trend. Leading to disaggregation of servers. http://web.eecs.umich.edu/~twenisch/papers/isca09-disaggregate.pdf
  • 10. Scaling Storage. Distributed Hash Table (http://en.wikipedia.org/wiki/Distributed_hash_table). Key/value stores by medium: RAM (Memcached), Flash (FAWN), Disk (Riak).
  • 11. Metadata and Metadata Servers are Evil. Required by traditional file systems (POSIX) to translate names to sectors: hard to scale, heavy HA requirements, expensive. Can we use a name as a key? Place the data into a scaled key/value store? Eliminate costly metadata servers?
  • 12. Cumulative operations ordered by length. [Chart: cumulative percentage of operations and of data (0% to 100%) versus length (KB, log scale from 1 to 100000). Annotations at 32KB: 92% of the operations, 0.5% of the data.]
  • 14. Seagate Kinetic Open Storage Platform
  • 15. Seagate Kinetic Open Storage Platform. Disintermediates the application-to-drive path: goes around file systems, volume managers, drivers. Enables an ecosystem of value-added software: partners (like Basho) can create their own system value. Lower TCO: eliminates complexity.
  • 19. [Diagram: software stack.] App (proprietary to system vendor): application, clustering, management. LibKinetic (GPL): C++, Java, Python, Erlang, DIY. Interconnect (standard): ProtoBuf, TCP/IP/GbE. Storage (proprietary to Seagate).
  • 20. [Diagram: the same stack with multiple App and LibKinetic instances.] Apps (proprietary to system vendor): application, clustering, management. LibKinetic (GPL). Interconnect (standard): ProtoBuf, TCP/IP/GbE. Storage (proprietary to Seagate).
  • 22. System Hardware. Typical JBOD architecture: does not require a server, just JBODs connected to the ToR switch. 10 JBODs × 60 drives × 4TB = 2.4PB/rack.
  • 23. Features. Provides RPC to a key/value database: data is pre-indexed; compression and other added value are easy and transparent. P2P (drive to drive) copy of key ranges. Communicates using existing data center plumbing (TCP/IP). Multiple masters: data sharing between machines. Configurable caching per command: async, sync, flush. Local space management.
  • 24. Kinetic Systems. Clustering (performance, reliability, management). Compatibility with large scale applications (S3, etc.). Centralized management: reliability, availability, durability.
  • 25. Lower TCO. Elimination of server layers. Fewer human requirements. Reduced mistakes. Disaggregation of storage from servers. Power management.
  • 27. Goals of API. Data movement: get/put/delete/getnext/getprevious; versioned (== for success), options. Range operations. Multiple masters: authentication/integrity/authorization. Cluster-able: simple cluster configuration version enforcement. 3rd party copy. Management.
  • 28. Management (System Vendor). Configures the drive: network, authorized clients. Monitors: health, statistics, logs. Initiates recovery: change cluster version, 3rdPartyCopy.
  • 29. Data formats. Key structure: variable number of octets (0-4KB). Data structure (serialized to a byte stream): KeyOf; Version; E2E data integrity (algorithm name); Data, variable length (0-xMB).
  • 30. Performance Metrics. Same normal performance expectations: Sequential Write, Random Write, Sequential Read, Sequential Write. Iometer for key/value.
  • 31. Demo Time!
  • 33. Conclusion. Deliver more value to Seagate, partners and customers: disintermediates the cloud-application-to-drive path; enables innovation in the hardware and software ecosystem; lowers TCO. Open source software: Basho Riak, Swift, HDFS. More information: http://seagate.com/www/kinetic, https://developers.seagate.com, http://github.com/Seagate
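
The data-movement goals on the "Goals of API" slide (get/put/delete/getnext/getprevious, with a version that must compare equal for a write to succeed) can be illustrated with a small in-memory toy. This is only a sketch under stated assumptions: the class name `ToyKeyValueDrive`, the method signatures, and the `VersionMismatch` exception are hypothetical and are not taken from LibKinetic or the Kinetic wire protocol. The 4KB key limit comes from the "Data formats" slide.

```python
import bisect


class VersionMismatch(Exception):
    """The caller's expected version does not equal the stored version."""


class ToyKeyValueDrive:
    """Hypothetical in-memory stand-in for one key/value drive (not LibKinetic)."""

    MAX_KEY_BYTES = 4096  # "Data formats" slide: keys are 0-4KB of octets

    def __init__(self):
        self._keys = []     # kept sorted so getnext/getprevious can walk key order
        self._entries = {}  # key -> (version, value)

    def _require_version(self, key, expected_version):
        # "Versioned (== for success)": the write proceeds only on equality.
        stored = self._entries.get(key)
        stored_version = stored[0] if stored is not None else None
        if stored_version != expected_version:
            raise VersionMismatch(
                f"{key!r}: expected {expected_version!r}, found {stored_version!r}"
            )

    def _with_key(self, key):
        version, value = self._entries[key]
        return key, version, value

    def put(self, key, value, new_version, expected_version=None):
        """Store value; expected_version=None means the key must not exist yet."""
        if len(key) > self.MAX_KEY_BYTES:
            raise ValueError("key longer than 4KB")
        self._require_version(key, expected_version)
        if key not in self._entries:
            bisect.insort(self._keys, key)
        self._entries[key] = (new_version, value)

    def get(self, key):
        """Return (version, value), or None if the key is absent."""
        return self._entries.get(key)

    def delete(self, key, expected_version):
        """Remove the key if the version matches; return False if it is absent."""
        if key not in self._entries:
            return False
        self._require_version(key, expected_version)
        del self._keys[bisect.bisect_left(self._keys, key)]
        del self._entries[key]
        return True

    def getnext(self, key):
        """Return (key, version, value) for the smallest key greater than key."""
        i = bisect.bisect_right(self._keys, key)
        return self._with_key(self._keys[i]) if i < len(self._keys) else None

    def getprevious(self, key):
        """Return (key, version, value) for the largest key smaller than key."""
        i = bisect.bisect_left(self._keys, key)
        return self._with_key(self._keys[i - 1]) if i > 0 else None


drive = ToyKeyValueDrive()
drive.put(b"a", b"1", new_version=b"v1")
drive.put(b"c", b"3", new_version=b"v1")
# A second writer must present the current version to overwrite "a".
drive.put(b"a", b"2", new_version=b"v2", expected_version=b"v1")
print(drive.get(b"a"), drive.getnext(b"a"))
```

The equality check on the version is what lets multiple masters share a drive without a lock server: a writer holding a stale version simply fails and must re-read. Keeping keys sorted is an assumption made here so that getnext/getprevious and range operations have a natural meaning.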