GlusterFS is an open source, scale-out network filesystem. It runs on commodity hardware and allows indefinite growth in capacity and performance by simply adding server nodes. Key benefits include flexibility to deploy on any hardware, linearly scalable performance, and superior storage economics compared to traditional storage solutions. GlusterFS uses a distributed hashing technique instead of a metadata server to provide high availability and reliability.
1. Introduction To GlusterFS
Audio is available via VoIP or Landline.
For VoIP
You will be connected to audio using your
computer‟s speakers or headset.
For Landline:
Please select Use Telephone after joining the
Webinar.
US/Canada attendees dial (805) 309-0021
Access Code 232-976-193
2. Today’s Speakers
John Kreisa Tom Trainer Jacob Shucart
Vice President Director Senior Systems Engineer
Marketing Product Marketing Gluster, Inc.
Gluster, Inc. Gluster, Inc.
A Better Way To Do Storage 2
3. Poll Question
Are you using GlusterFS today?
– Yes, in a test environment
– Yes, it‟s deployed in a production environment
– No, however we are considering it
– Just researching
A Better Way To Do Storage 3
4. History of Gluster
How it all started
– Backgrounds in high performance, clustered computing
– Working at Lawrence Livermore National Labs
• AB Periasamy & Hitesh Chellani design “Thunder”
• One of the worlds fastest super computers
• On Intel commodity hardware
• Solved filesystem scalability and performance limitations
– Large customer in oil & gas persuaded them to focus on storage
– Gluster founded by Hitesh & AB to bring technology to market
Thunder
Result: award winning technology
A Better Way To Do Storage 4
5. What is the Gluster File System?
A scale-out file system for Network Attached Storage (NAS)
– Highly available
– Linearly scalable performance
GlusterFS provides
– Flexibility to deploy in ANY
environment
– Scalability to Petabytes & beyond
– Superior storage economics
A Better Way To Do Storage 5
6. GlusterFS Architecture Design Goals
Elasticity
– Flexibility adapt to growth/reduction
– Add, delete volumes & users
– Without disruption
Scale linearly
– Multiple dimensions
• Performance
• Capacity
– Aggregated resources
Eliminate metadata
Performance
– Improve file access speed
Simplicity
Capacity – Ease of management
– No complex Kernel patches
– Run in user space
A Better Way To Do Storage 6
7. Key Differentiators
Filesystem runs in user space
Software only
Open source
Modular, stackable storage OS architecture
Data stored in native formats
No metadata – Elastic hashing
A Better Way To Do Storage 7
8. Filesystem Runs in User Space
User Space Not tied to kernel
GlusterFS
No reassemblies
Server
(CPU/Mem) Independence
Kernel
1 TB 1 TB
1 TB 1 TB
1 TB 1 TB
1 TB 1 TB
1 TB 1 TB
1 TB 1 TB
1 TB 1 TB
A Better Way To Do Storage 8
9. Software Only
Hardware agnostic
Superior storage economics & flexibility
– Data center / private cloud use commodity hardware
– Public cloud – i.e. AWS, RackSpace, GoGrid – pay for only what you need
No lock-in
– Hardware vendors-at purchase time or in the future
– Public or private cloud
– Performance, capacity, or availability levels
– GlusterFS – not proprietary, files are stored in native formats (i.e. EXT4)
A Better Way To Do Storage 9
10. Open Source
200,000+ downloads
Global Adoption – ~12,000 /month
500+ registered deployments
– 45 countries
2,500+ registered users
– Mailing lists, Forums, etc.
Active community
– Diverse testing environments
– Bugs identification and fixes
– Code contributions
Member of broader ecosystem
– OpenStack, Linux Foundation, Open
Virtualization Alliance
A Better Way To Do Storage 10
11. Modular, Stackable Storage OS Architecture
Services to the public network
NAS protocols
Replication
Global namespace NFS, CIFS, GlusterFS, HTTP(s), WebDAV, (S)FTP
Interface – Protocols - Replication
Clustered & replicated
Built-in failover Gluster Server
Self healing Public Network | Storage Server Network
1Gb, 10Gb, Infiniband
POSIX compliant
64-bit Linux distributions
No metadata server RHEL, Fedora, Debian, CentOS, Ubuntu, SUSE
Elastic hash Unlimited number of files
Elastic hash function
Native format ext3 or ext4 recommended / POSIX compliant
Thin provisioning
Disk storage
Local to host(Gluster Server), SAS, SATA, SCSI attached JBODs, Fibre
Channel supported
Flexible storage configurations ISCSI supported – for low I/O Apps
A Better Way To Do Storage 11
12. Elastic Hashing
No metadata server
An algorithmic approach
– Unique hash tag for each file stored
– Tags stored within the file system
– Rapid file read – low latency
Figure 1
Figure 2
Figure 3
A Better Way To Do Storage *Figures 1,2,3 are examples only 12
13. A Standard Gluster Deployment
Clients/Apps Clients/Apps Clients/Apps Standard clients
running standard apps
IP Network Over any standard IP
network
Access application
data, as files & folders,
Gluster Global Namespace (NFS, CIFS, Gluster Native)
in a global namespace,
Application data VMs VMDK VMDK using a variety of
standard protocols
Stored in a
commoditized,
virtual storage pool virtualized, scale-out,
centrally managed pool
DAS, SAN, NAS
A Better Way To Do Storage 13
14. Unifying Public and Private Cloud Storage
Client/Apps Client/Apps Client/Apps
Client/Apps
Client/Apps Client/Apps
Client/Apps Client/Apps
Client/Apps
IP Network
Gluster Global Namespace
Private Cloud Public Cloud
Replication
A Better Way To Do Storage 14
15. Deployment Scenarios
Common Solutions Built on GlusterFS
Media serving (CDN)
Large scale file storage
Tier 2 & 3 archive
File sharing
Multi-tenant file systems
Storage for High Performance
Computing (HPC)
IaaS storage layer
Disaster recovery
Backup & restore
Private cloud
A Better Way To Do Storage 15
16. Pandora Internet Radio
Problem
• Explosive user & title growth
• As many as 12 file formats for each song
• „Hot‟ content and long tail
Solution
• Three data centers, each with a six-node
GlusterFS cluster
• 1.2 PB of audio served • Replication for high availability
per week • 250+ TB total capacity
• 13 million files Benefits
• Over 50 GB/sec peak • Easily scale capacity
traffic • Centralized management; one administrator
to manage day-to-day operations
• No changes to application
• Higher reliability
A Better Way To Do Storage 16
17. Brightcove
Problem
• Cloud-based online video platform
• Explosive customer & title growth
• Massive video in multiple locations
• Costs rising, esp. with HD formats
Solution
• Complete scale-out based on commodity
DAS/JBOD
• Replication for high availability
• Over 1 PB currently in
• 1PB total capacity
Gluster
• Separate 4 PB project Benefits
in the works • Easily scale capacity
• Centralized management; one administrator
to manage day-to-day operations
• Higher reliability
• Path to multi-site
A Better Way To Do Storage 17
18. Cincinnati Bell Technology Solutions
Problem
• Host a dedicated enterprise cloud solution
• Large scale VMware environment
• Need high availability
Solution
• Large scale VM
• Gluster for VM storage, NFS to clients
storage
• SAS drives on back-end
• Low cost service • Replication for high availability
delivery for enterprise
customer Benefits
• Drastic reduction in • Storage provisioning from 6 wks to 15 min.
provisioning time • Vendor agnostic storage
• Low cost of service delivery
• Elastic growth
A Better Way To Do Storage 18
19. Partners Healthcare
Private Cloud: Centralized Storage as a Service
Problem
• Capacity growth from 144TB to 1+PB
• Multiple distributed users/departments
• Multi OS access - Windows, Linux and Unix
Solution
• Over 500 TB • GlusterFS Cluster
• 9 Sun “Thumper” • Solaris/ZFS/x4500 w/ InfiniBand
systems in cluster • Native CIFS/ NFS access
Benefits
• Capacity on demand / pay as you grow
• Centralized management
• Higher reliability
• OPEX decreased by 10X
A Better Way To Do Storage 19
20. 4 Supported Ways to Consume GlusterFS
Virtual Machines
– GlusterFS deployable on the leading virtual machines
Amazon Web Services (AWS)
– GlusterFS deployed within Amazon Machine Image (AMI)
RightScale Cloud Management
– GlusterFS is available within a RightScale ServerTemplate
– Deployable via the RightScale Cloud Management Dashboard
Storage software appliance
– Deployable on bare metal and supports any hardware on the Red Hat Hardware
Compatibility List (HCL) of certified servers and storage
A Better Way To Do Storage 20
22. Summary
GlusterFS – A scale-out NAS file system
Flexibility, scalability, superior economics
Innovative architecture provides a better way to do
storage
A Better Way To Do Storage 22
23. Questions and Answers
Your turn - ask our experts
Try Gluster for free here: http://www.gluster.com/trybuy/
Additional resources here: http://www.gluster.com/products/resources/
Join the community: http://www.gluster.org/
Follow on twitter: @gluster.
Read our blog: http://blog.gluster.com/
Contact us at: info@gluster.com or 1-800-805-5215
A Better Way To Do Storage 23