SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Store your trillions of bytes using
commodity hardware and open source
(GlusterFS)
Theophanis K. Kontogiannis
RHC{SA,E,EV,ESM,I,X}
tkonto@gmail.com
@tkonto
The problem
● Data growth beyond manageable sizes
● Data growth beyond cost effective sizes
How much would it cost to store 100PB of
non structured data in a storage???
The idea
Create a scalable data storing infrastructure
uniformly presented to clients using:
● Commodity (even off the Self) Hardware
● Open standards
The concept
The vision
GlusterFS:
Open – Unified - Extensible
Scalable – Manageable - Reliable
Scale-out Network Attached Storage (NAS) Software Solution
for
On Premise - Virtualized - Cloud Environments
The implementation
●
Open source, distributed file system capable
of scaling to thousands petabytes (actually,
72 brontobytes!)
and handling thousands of clients.
Processing:
1024 Terabytes = 1 Petabyte
1024 Petabytes = 1 Exabyte
1024 Exabytes = 1 Zettabyte
1024 Zettabytes = 1 Yottabyte
1024 Yottabytes = 1 Brontobyte
● Clusters together storage building blocks over Infiniband
RDMA or TCP/IP interconnect, aggregating disk and memory
resources and managing data in a single global namespace.
● Based on a stackable user space design and can deliver
exceptional performance for diverse workloads.
● Self Healing
● Not tied to I/O profiles or hardware or OS
-The question is how much is a BrontoByte?
-The question is WHO CARES?
Really it can support that much?
Yes it can!
2^32 (max subvolumes of distribute translator)
X
18 exabytes (max xfs volume size)
=
72 brontobytes
(or 89,131,682,828,547,379,792,736,944,128bytes)
GlusterFS is supporting 2^128 (uuid) inodes
And this is how it goes
A bit of (business as usual) history
● Gluster Inc. was founded in 2005
● Focused in Public & Private Cloud Storage
● Main product GlusterFS was written by
Anand Babu Periasamy, Gluster’s founder
and CTO
● Received $8.5M in 2010 via VC funding
● Acquired for $136M by Red Hat in 2011
GlusterFS <--> Red Hat Storage
● Gluster.com redirects to RHS pages
● Gluster.org actively supported by RedHat
What is important is the integration of
technologies in ways that demonstrably
benefit the customers
Components
●
brick
The brick is the storage filesystem that has been assigned to a volume.
●
client
The machine which mounts the volume (this may also be a server).
●
server
The machine (virtual or bare metal) which hosts the actual filesystem in which
data will be stored.
●
subvolume
A brick after being processed by at least one translator.
●
volume
The final share after it passes through all the translators
●
Translator
Code that interprets the actual files geometry/location/distribution on disks
comprising a volume and is responsible for the perceived performance
The Outer Atmosphere View
The 100.000ft view
Storage Node
The 50.000ft View
The 10.000ft View
The ground level view
...and the programmers view
if (!(xl->fops = dlsym (handle, "fops"))) {
gf_log ("xlator", GF_LOG_WARNING, "dlsym(fops) on %s",
dlerror ());
goto out;
}
if (!(xl->cbks = dlsym (handle, "cbks"))) {
gf_log ("xlator", GF_LOG_WARNING, "dlsym(cbks) on %s",
dlerror ());
goto out;
}
if (!(xl->init = dlsym (handle, "init"))) {
gf_log ("xlator", GF_LOG_WARNING, "dlsym(init) on %s",
dlerror ());
goto out;
}
if (!(xl->fini = dlsym (handle, "fini"))) {
gf_log ("xlator", GF_LOG_WARNING, "dlsym(fini) on %s",
dlerror ());
goto out;
}
Course of action
● Partition, Format and mount the bricks
● Format the partition
● Mount the partition as a Gluster "brick"
● Add an entry to /etc/fstab
● Install Gluster packages on nodes
● Run the gluster peer probe command
● Configure your Gluster volume (and the translators)
● Test using the volume
Translators?
Translator Type Functional Purpose
Storage Lowest level translator, stores and accesses data from local file system.
Debug Provide interface and statistics for errors and debugging.
Cluster Handle distribution and replication of data as it relates to writing to and
reading from bricks & nodes.
Encryption Extension translators for on-the-fly encryption/decryption of stored data.
Protocol Interface translators for client / server authentication and communications.
Performance Tuning translators to adjust for workload and I/O profiles.
Bindings Add extensibility, e.g. The Python interface written by Jeff Darcy to extend API
interaction with GlusterFS.
System System access translators, e.g. Interfacing with file system access control.
Scheduler I/O schedulers that determine how to distribute new write operations across
clustered systems.
Features Add additional features such as Quotas, Filters, Locks, etc.
Not flexible with command line?
Benchmarks?
Method and platforms pretty much standard:
● Multiple 'dd' of varying blocks are read and written from
multiple clients simultaneously.
●
GlusterFS Brick Configuration (16 bricks)
Processor - Dual Intel(R) Xeon(R) CPU 5160 @ 3.00GHz
RAM - 8GB FB-DIMM
Disk - SATA-II 500GB
HCA - Mellanox MHGS18-XT/S InfiniBand HCA
● Client Configuration (64 clients)
RAM - 4GB DDR2 (533 Mhz)
Processor - Single Intel(R) Pentium(R) D CPU 3.40GHz
Disk - SATA-II 500GB
HCA - Mellanox MHGS18-XT/S InfiniBand HCA
●
Interconnect Switch: Voltaire port InfiniBand Switch (14U)
Size does not matter....
...number of participants does
Suck the throughput. You can!
And you can GeoDistribute it :)
Multi-site cascading
Enough with food for thoughts...
● www.redhat.com/products/storage-server/
● www.gluster.org
Now back to your consoles!!!!
Thank you...

Weitere ähnliche Inhalte

Was ist angesagt?

Improving Presto performance with Alluxio at TikTok
Improving Presto performance with Alluxio at TikTokImproving Presto performance with Alluxio at TikTok
Improving Presto performance with Alluxio at TikTokAlluxio, Inc.
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAlluxio, Inc.
 
Update on Crimson - the Seastarized Ceph - Seastar Summit
Update on Crimson  - the Seastarized Ceph - Seastar SummitUpdate on Crimson  - the Seastarized Ceph - Seastar Summit
Update on Crimson - the Seastarized Ceph - Seastar SummitScyllaDB
 
Improve Presto Architectural Decisions with Shadow Cache
 Improve Presto Architectural Decisions with Shadow Cache Improve Presto Architectural Decisions with Shadow Cache
Improve Presto Architectural Decisions with Shadow CacheAlluxio, Inc.
 
An intro to Ceph and big data - CERN Big Data Workshop
An intro to Ceph and big data - CERN Big Data WorkshopAn intro to Ceph and big data - CERN Big Data Workshop
An intro to Ceph and big data - CERN Big Data WorkshopPatrick McGarry
 
Ceph at salesforce ceph day external presentation
Ceph at salesforce   ceph day external presentationCeph at salesforce   ceph day external presentation
Ceph at salesforce ceph day external presentationSameer Tiwari
 
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
CEPH DAY BERLIN - WHAT'S NEW IN CEPH CEPH DAY BERLIN - WHAT'S NEW IN CEPH
CEPH DAY BERLIN - WHAT'S NEW IN CEPH Ceph Community
 
New Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesNew Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesKamesh Pemmaraju
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Alluxio, Inc.
 
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Alluxio, Inc.
 
State of the_gluster_-_lceu
State of the_gluster_-_lceuState of the_gluster_-_lceu
State of the_gluster_-_lceuGluster.org
 
Using Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataUsing Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataRob Gardner
 
Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red_Hat_Storage
 
Alluxio Data Orchestration Platform for the Cloud
Alluxio Data Orchestration Platform for the CloudAlluxio Data Orchestration Platform for the Cloud
Alluxio Data Orchestration Platform for the CloudShubham Tagra
 
Redis e Memcached - Daniel Naves - Omnilogic
Redis e Memcached - Daniel Naves - OmnilogicRedis e Memcached - Daniel Naves - Omnilogic
Redis e Memcached - Daniel Naves - OmnilogicFelipe Guimarães
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemNETWAYS
 

Was ist angesagt? (20)

YDAL Barcelona
YDAL BarcelonaYDAL Barcelona
YDAL Barcelona
 
Improving Presto performance with Alluxio at TikTok
Improving Presto performance with Alluxio at TikTokImproving Presto performance with Alluxio at TikTok
Improving Presto performance with Alluxio at TikTok
 
Ceph Research at UCSC
Ceph Research at UCSCCeph Research at UCSC
Ceph Research at UCSC
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud World
 
Update on Crimson - the Seastarized Ceph - Seastar Summit
Update on Crimson  - the Seastarized Ceph - Seastar SummitUpdate on Crimson  - the Seastarized Ceph - Seastar Summit
Update on Crimson - the Seastarized Ceph - Seastar Summit
 
Improve Presto Architectural Decisions with Shadow Cache
 Improve Presto Architectural Decisions with Shadow Cache Improve Presto Architectural Decisions with Shadow Cache
Improve Presto Architectural Decisions with Shadow Cache
 
An intro to Ceph and big data - CERN Big Data Workshop
An intro to Ceph and big data - CERN Big Data WorkshopAn intro to Ceph and big data - CERN Big Data Workshop
An intro to Ceph and big data - CERN Big Data Workshop
 
Ceph at salesforce ceph day external presentation
Ceph at salesforce   ceph day external presentationCeph at salesforce   ceph day external presentation
Ceph at salesforce ceph day external presentation
 
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
CEPH DAY BERLIN - WHAT'S NEW IN CEPH CEPH DAY BERLIN - WHAT'S NEW IN CEPH
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
 
New Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesNew Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference Architectures
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...
 
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
 
State of the_gluster_-_lceu
State of the_gluster_-_lceuState of the_gluster_-_lceu
State of the_gluster_-_lceu
 
Using Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataUsing Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider Data
 
RubiX
RubiXRubiX
RubiX
 
Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance
 
Alluxio Data Orchestration Platform for the Cloud
Alluxio Data Orchestration Platform for the CloudAlluxio Data Orchestration Platform for the Cloud
Alluxio Data Orchestration Platform for the Cloud
 
Introduction to Redis
Introduction to RedisIntroduction to Redis
Introduction to Redis
 
Redis e Memcached - Daniel Naves - Omnilogic
Redis e Memcached - Daniel Naves - OmnilogicRedis e Memcached - Daniel Naves - Omnilogic
Redis e Memcached - Daniel Naves - Omnilogic
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage System
 

Andere mochten auch

Innovaphone produktkatalog 2015_2016_en
Innovaphone produktkatalog 2015_2016_enInnovaphone produktkatalog 2015_2016_en
Innovaphone produktkatalog 2015_2016_enSander Van Gysegem
 
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)iTQi 2013 - Simón Martin , Guijuelo (Salamanca)
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)SIMÓN MARTÍN GUIJUELO
 
Teaching strategies
Teaching strategiesTeaching strategies
Teaching strategiesPrincess Rao
 

Andere mochten auch (6)

Assessment
AssessmentAssessment
Assessment
 
Innovaphone produktkatalog 2015_2016_en
Innovaphone produktkatalog 2015_2016_enInnovaphone produktkatalog 2015_2016_en
Innovaphone produktkatalog 2015_2016_en
 
Health hazards
Health hazardsHealth hazards
Health hazards
 
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)iTQi 2013 - Simón Martin , Guijuelo (Salamanca)
iTQi 2013 - Simón Martin , Guijuelo (Salamanca)
 
Human rights.
Human rights.Human rights.
Human rights.
 
Teaching strategies
Teaching strategiesTeaching strategies
Teaching strategies
 

Ähnlich wie GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR

GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstackopenstackindia
 
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld
 
GlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big dataGlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big dataRoberto Franchini
 
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...Codemotion
 
Distributed Filesystems Review
Distributed Filesystems ReviewDistributed Filesystems Review
Distributed Filesystems ReviewSchubert Zhang
 
Red Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewRed Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewMarcel Hergaarden
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudRed_Hat_Storage
 
TechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebulaTechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebulaOpenNebula Project
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFSUSE Italy
 
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageWebinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageMayaData Inc
 
Big Data Glossary of terms
Big Data Glossary of termsBig Data Glossary of terms
Big Data Glossary of termsKognitio
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed_Hat_Storage
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Andrew Underwood
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...confluent
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin Kuberton
 
Microsoft azure platforms
Microsoft azure platformsMicrosoft azure platforms
Microsoft azure platformsMotty Ben Atia
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY
 

Ähnlich wie GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR (20)

GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstack
 
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
 
GlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big dataGlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big data
 
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
 
Distributed Filesystems Review
Distributed Filesystems ReviewDistributed Filesystems Review
Distributed Filesystems Review
 
Red Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewRed Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) Overview
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the Cloud
 
Open ebs 101
Open ebs 101Open ebs 101
Open ebs 101
 
TechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebulaTechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebula
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
 
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageWebinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
 
Big Data Glossary of terms
Big Data Glossary of termsBig Data Glossary of terms
Big Data Glossary of terms
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016
 
Ceph as software define storage
Ceph as software define storageCeph as software define storage
Ceph as software define storage
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin
 
Microsoft azure platforms
Microsoft azure platformsMicrosoft azure platforms
Microsoft azure platforms
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
 

GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR

  • 1. Store your trillions of bytes using commodity hardware and open source (GlusterFS) Theophanis K. Kontogiannis RHC{SA,E,EV,ESM,I,X} tkonto@gmail.com @tkonto
  • 2. The problem ● Data growth beyond manageable sizes ● Data growth beyond cost effective sizes How much would it cost to store 100PB of non structured data in a storage???
  • 3. The idea Create a scalable data storing infrastructure uniformly presented to clients using: ● Commodity (even off the Self) Hardware ● Open standards
  • 5. The vision GlusterFS: Open – Unified - Extensible Scalable – Manageable - Reliable Scale-out Network Attached Storage (NAS) Software Solution for On Premise - Virtualized - Cloud Environments
  • 6. The implementation ● Open source, distributed file system capable of scaling to thousands petabytes (actually, 72 brontobytes!) and handling thousands of clients. Processing: 1024 Terabytes = 1 Petabyte 1024 Petabytes = 1 Exabyte 1024 Exabytes = 1 Zettabyte 1024 Zettabytes = 1 Yottabyte 1024 Yottabytes = 1 Brontobyte ● Clusters together storage building blocks over Infiniband RDMA or TCP/IP interconnect, aggregating disk and memory resources and managing data in a single global namespace. ● Based on a stackable user space design and can deliver exceptional performance for diverse workloads. ● Self Healing ● Not tied to I/O profiles or hardware or OS -The question is how much is a BrontoByte? -The question is WHO CARES?
  • 7. Really it can support that much? Yes it can! 2^32 (max subvolumes of distribute translator) X 18 exabytes (max xfs volume size) = 72 brontobytes (or 89,131,682,828,547,379,792,736,944,128bytes) GlusterFS is supporting 2^128 (uuid) inodes
  • 8. And this is how it goes
  • 9. A bit of (business as usual) history ● Gluster Inc. was founded in 2005 ● Focused in Public & Private Cloud Storage ● Main product GlusterFS was written by Anand Babu Periasamy, Gluster’s founder and CTO ● Received $8.5M in 2010 via VC funding ● Acquired for $136M by Red Hat in 2011
  • 10. GlusterFS <--> Red Hat Storage ● Gluster.com redirects to RHS pages ● Gluster.org actively supported by RedHat What is important is the integration of technologies in ways that demonstrably benefit the customers
  • 11. Components ● brick The brick is the storage filesystem that has been assigned to a volume. ● client The machine which mounts the volume (this may also be a server). ● server The machine (virtual or bare metal) which hosts the actual filesystem in which data will be stored. ● subvolume A brick after being processed by at least one translator. ● volume The final share after it passes through all the translators ● Translator Code that interprets the actual files geometry/location/distribution on disks comprising a volume and is responsible for the perceived performance
  • 17. ...and the programmers view if (!(xl->fops = dlsym (handle, "fops"))) { gf_log ("xlator", GF_LOG_WARNING, "dlsym(fops) on %s", dlerror ()); goto out; } if (!(xl->cbks = dlsym (handle, "cbks"))) { gf_log ("xlator", GF_LOG_WARNING, "dlsym(cbks) on %s", dlerror ()); goto out; } if (!(xl->init = dlsym (handle, "init"))) { gf_log ("xlator", GF_LOG_WARNING, "dlsym(init) on %s", dlerror ()); goto out; } if (!(xl->fini = dlsym (handle, "fini"))) { gf_log ("xlator", GF_LOG_WARNING, "dlsym(fini) on %s", dlerror ()); goto out; }
  • 18. Course of action ● Partition, Format and mount the bricks ● Format the partition ● Mount the partition as a Gluster "brick" ● Add an entry to /etc/fstab ● Install Gluster packages on nodes ● Run the gluster peer probe command ● Configure your Gluster volume (and the translators) ● Test using the volume
  • 19. Translators? Translator Type Functional Purpose Storage Lowest level translator, stores and accesses data from local file system. Debug Provide interface and statistics for errors and debugging. Cluster Handle distribution and replication of data as it relates to writing to and reading from bricks & nodes. Encryption Extension translators for on-the-fly encryption/decryption of stored data. Protocol Interface translators for client / server authentication and communications. Performance Tuning translators to adjust for workload and I/O profiles. Bindings Add extensibility, e.g. The Python interface written by Jeff Darcy to extend API interaction with GlusterFS. System System access translators, e.g. Interfacing with file system access control. Scheduler I/O schedulers that determine how to distribute new write operations across clustered systems. Features Add additional features such as Quotas, Filters, Locks, etc.
  • 20. Not flexible with command line?
  • 21. Benchmarks? Method and platforms pretty much standard: ● Multiple 'dd' of varying blocks are read and written from multiple clients simultaneously. ● GlusterFS Brick Configuration (16 bricks) Processor - Dual Intel(R) Xeon(R) CPU 5160 @ 3.00GHz RAM - 8GB FB-DIMM Disk - SATA-II 500GB HCA - Mellanox MHGS18-XT/S InfiniBand HCA ● Client Configuration (64 clients) RAM - 4GB DDR2 (533 Mhz) Processor - Single Intel(R) Pentium(R) D CPU 3.40GHz Disk - SATA-II 500GB HCA - Mellanox MHGS18-XT/S InfiniBand HCA ● Interconnect Switch: Voltaire port InfiniBand Switch (14U)
  • 22. Size does not matter....
  • 25. And you can GeoDistribute it :) Multi-site cascading
  • 26. Enough with food for thoughts... ● www.redhat.com/products/storage-server/ ● www.gluster.org Now back to your consoles!!!! Thank you...