SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Downloaden Sie, um offline zu lesen
The State of the Gluster Community
John Mark Walker
Gluster Community Guy
November 8, 2012
11/08/12
Topics
● What is GlusterFS
● GlusterFS 3.3
● Ways to Access GlusterFS Volumes
● GlusterFS client (FUSE)
● Libgfapi – QEMU block device driver
● Translators
● Swift API – Unified File and Object
● The Roadmap
11/08/12
Simple Economics
● Simplicity, scalability, less cost
Multi-TenantVirtualized Automated Commoditized
Scale on
Demand
In the Cloud Scale Out Open Source
Commercial Product
What is Open Source
Open Source
Code
“Open Core”
● All engineering controlled by
project/product sponsor
● No innovation outside of
core engineering team
● All open source features
also in commercial product
● Many features in
Commercial product not in
open source code
Open Source Code
What is Open Source
Commercial Products
“Real” Open Source
● Many points of collaboration
and innovation in open
source project
● Engineering team from
multiple sources
● Project and product do not
completely overlap
● Commercial products are
hardened, more secure and
thoroughly tested
GlusterFS
What is Open Source
Red Hat Storage
“Real” Open Source
● Enables more innovation on
the fringes
● Engineering team from
multiple sources
● Open source project is
“upstream” from commercial
product
● “Downstream” products are
hardened, more secure and
thoroughly tested
11/08/12
Simplicity Bias
● FC, FCoE, iSCSI → HTTP, Sockets
● Modified BSD OS → Linux / User Space /
C, Python & Java
● Appliance based → Application based
11/08/12
Scale-out Open Source
is the winner
11/08/12
Bengaluru Office
Conference Room US Head Office
Bengaluru Office
Community Deployments
11/08/12
Not a Storage Company
● At first a cluster-building company
● Engineering team excelled at
building open source HPC systems
11/08/12
Necessity:
The Mother of
Invention
11/08/12
The big idea:
Storage should be
simple
11/08/12
What is Simple
Storage?
● Low-risk, easy to deploy and administer, data
consistency, open source, software-only,
user space
11/08/12
What is GlusterFS,
Really?
Gluster is a unified, distributed
storage system
● User space, global namespace, stackable,
POSIX-y, scale-out NAS platform, inspired by
GNU Hurd
11/08/12
Some Features
● No single point of failure
● DHT
● Synchronous and asynchronous
replication
● Proactive self-healing
11/08/12
What Can You Do With
It?
● Media – Docs, Photos, Video
● Shared storage – multi-tenant
environments
● Big Data – Log Files, RFID Data
● Objects – Long Tail Data
11/08/12
Standard Deployment
● Distributed
over multiple
servers
● Replicate
volumes
● On top of disk
FS (XFS,
Ext4, ie.
Xattrs)
● Multi-protocol
access
Red Hat Proprietary
19
Storage for Any Environment
Scale-out NAS for On-premises and Public Clouds
●
Standardized NAS infrastructure
●
On-premise and public cloud
●
POSIX-ish
●
Apps move easily between environments
●
Replicate between both
Public CloudOn-premises
11/08/12
First Versions
● Toolkit for building storage systems
● Very hacker-friendly
● Community integral part of development
– Drove feature development
– Repeatable use cases
11/08/12
Mid-2011 Snapshot
● Scale-out NAS
● Distributed and replicated
● NFS, CIFS and native GlusterFS
● User-space, stackable architecture
● Lots of users, not many devs
→ A good platform to build on
11/08/12
GlusterFS 3.3: Building
on the Foundation
● Granular locking
● Proactive self-healing
● Improved rebalancing
● More access methods
11/08/12
Granular Locking
– Server fails, comes back
– Files evaluated
– Block-by-block until healed
Server 1 Server 2
GlusterFS GlusterFS
Virtual
Disk 1-1
Virtual
Disk 1-2
Virtual
Disk 2-1
Virtual
Disk 2-2
Blocks compared
11/08/12
Proactive Self-healing
– Performed server-to-server
– Recovered node queries peers
/ Symlink 1
Hidden | Symlink 2
 Symlink 3
File 1
File 2
File 3
Server 1 - good
Server 2 - recovered
Replicated
Server 3 - good
File 1
File 2
File 3
Server 4 - good
Distributed
Self-healing
11/08/12
Easier Rebalancing
– Now faster
● Previously, created entire new hash
set, moving data unnecessarily
● Now recreates hash map and
compares to old
– Easier to decommission server nodes
– Proof point for synchronous translator
API
11/08/12
Unified File and Object
(UFO)
– S3, Swift-style object storage
– Access via UFO or Gluster mount
Client
Proxy Account
Container
Object
HTTP Request
ID=/dir/sub/sub2/file
Directory
Volume
FileClient
NFS or
GlusterFS Mount
11/08/12
Unified File and Object
(UFO)
– Your gateway to the cloud
– Your data, accessed your way
11/08/12
HDFS Compatibility
– Run MapReduce jobs on GlusterFS
– Add unstructured data to Hadoop
Hadoop Server
GlusterFS
GlusterFS
GlusterFS
Local Disk
GlusterFS
HDFS
Connector
(Jar file)
11/08/12
4. Coming
Attractions
11/08/12
API Check
● Ways to interface with GlusterFS
– Translators
● Stackable, async and sync
– FUSE mount
● GlusterFS client
– Libgfapi
● FUSE bypass
11/08/12
API Check
● Ways to interface with GlusterFS
– Marker framework
● Geo-replication, quickly ID changes
– UFO RESTful API
– HDFS library
– Management API
● oVirt 3.1
11/08/12
Better VM Image
Handling
– Better responsiveness for random I/o
use cases
– Contribution: Block Device Translator
11/08/12
Enabling GlusterFS for
Virtualization use
● QEMU-GlusterFS integration
● Native integration, no FUSE mount
● Gluster as QEMU block back end
● QEMU talks to gluster and gluster hides different
image formats and storage types underneath
● Block device support in GlusterFS via Block Device
translator
● Logical volumes as VM images
11/08/12
GlusterFS & QEMU
11/08/12
Libglusterfs Client API
– Previously abandoned
– Brought back to life
● In part because of QEMU Fuse
bypass contributions
11/08/12
Split Brain
– Nodes cannot see each other, but can
all still write
– Often due to network outages
– Sometimes results in conflicts
– Up to 3.2, GlusterFS had no concept
of “quorum”
11/08/12
Quorum Enforcement
– Which node has valid data?
– If quorum, keep writing, else stop
● Configurable option
-No quorum
-Stops writing
-Quorum
-Keeps writing
Server 1 Server 2
Broken
Connection
-Quorum
-Keeps writing
Server 3
11/08/12
Quorum Enforcement
– After connection restored, self-heal kicks off
-No quorum
-Stops writing
-Quorum
-Keeps writing
Replica 1 Replica 2
-Quorum
-Keeps writing
Replica 3
-No quorum
-Stops writing
-Quorum
-Keeps writing
-Quorum
-Keeps writing
Self-heal
11/08/12
Enhanced Quorum
– Quorum tracking on the servers
– Need quorum for any management
changes
– 3rd party arbiters / observers so never
N=2
11/08/12
Management UI & REST
API
– Collaboration with oVirt project
– Management GUI for admins
– RESTful gateway for devs
– First community release... ?
11/08/12
11/08/12
Multi-tenancy &
Encryption
– HekaFS created this for cloud
deployments
– In-flight data encryption
11/08/12
Down the Road
– Multi-master Geo-rep
– Snapshots
– Versioning
– GeoRep Sparse Replicas
– File compression & de-dupe
11/08/12
Server-side Processing
– Implementing gfind, glocate
– Fast traversal of metadata in xattrs
● Find and locate responsive
– Inotify-esque behavior: triggers based
on i/o activity, ie. file close
● Why rely on Hadoop batch-
processing?
11/08/12
Goal: Intelligent
Storage
● Just storing and retrieving data is not
enough
● Should be able to store, analyze,
transform, mutilate, and retrieve
● Intelligent storage gives sysadmins and
developers the ultimate data swiss army
knife
Thank you!
John Mark Walker
Gluster Community Guy
johnmark@redhat.com

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Dedupe nmamit
Dedupe nmamitDedupe nmamit
Dedupe nmamit
 
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
 
Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016
 
Sdc challenges-2012
Sdc challenges-2012Sdc challenges-2012
Sdc challenges-2012
 
YDAL Barcelona
YDAL BarcelonaYDAL Barcelona
YDAL Barcelona
 
Sdc 2012-challenges
Sdc 2012-challengesSdc 2012-challenges
Sdc 2012-challenges
 
GlusterFS Containers
GlusterFS ContainersGlusterFS Containers
GlusterFS Containers
 
Gluster d2
Gluster d2Gluster d2
Gluster d2
 
Kkeithley ufonfs-gluster summit
Kkeithley ufonfs-gluster summitKkeithley ufonfs-gluster summit
Kkeithley ufonfs-gluster summit
 
Lisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionLisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introduction
 
Gluster for sysadmins
Gluster for sysadminsGluster for sysadmins
Gluster for sysadmins
 
20160401 Gluster-roadmap
20160401 Gluster-roadmap20160401 Gluster-roadmap
20160401 Gluster-roadmap
 
Lcna example-2012
Lcna example-2012Lcna example-2012
Lcna example-2012
 
Accessing gluster ufo_-_eco_willson
Accessing gluster ufo_-_eco_willsonAccessing gluster ufo_-_eco_willson
Accessing gluster ufo_-_eco_willson
 
Gdeploy 2.0
Gdeploy 2.0Gdeploy 2.0
Gdeploy 2.0
 
Gluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmap
 
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
 
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
 
20160401 guster-roadmap
20160401 guster-roadmap20160401 guster-roadmap
20160401 guster-roadmap
 
Gluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephantGluster fs hadoop_fifth-elephant
Gluster fs hadoop_fifth-elephant
 

Andere mochten auch

Andere mochten auch (15)

Gluster wireshark niels_de_vos
Gluster wireshark niels_de_vosGluster wireshark niels_de_vos
Gluster wireshark niels_de_vos
 
On demand file-caching_-_gustavo_brand
On demand file-caching_-_gustavo_brandOn demand file-caching_-_gustavo_brand
On demand file-caching_-_gustavo_brand
 
Gsummit apis-2013
Gsummit apis-2013Gsummit apis-2013
Gsummit apis-2013
 
Lcna tutorial-2012
Lcna tutorial-2012Lcna tutorial-2012
Lcna tutorial-2012
 
Qemu gluster fs
Qemu gluster fsQemu gluster fs
Qemu gluster fs
 
Disperse xlator ramon_datalab
Disperse xlator ramon_datalabDisperse xlator ramon_datalab
Disperse xlator ramon_datalab
 
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff DarcyHands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
 
Gsummit apis-2012
Gsummit apis-2012Gsummit apis-2012
Gsummit apis-2012
 
Lisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-onLisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-on
 
Gluster as Block Store in Containers
Gluster as Block Store in ContainersGluster as Block Store in Containers
Gluster as Block Store in Containers
 
Leases and-caching final
Leases and-caching finalLeases and-caching final
Leases and-caching final
 
Bug triage in_gluster
Bug triage in_glusterBug triage in_gluster
Bug triage in_gluster
 
Debugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vosDebugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vos
 
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
 
Introduction to Open Source
Introduction to Open SourceIntroduction to Open Source
Introduction to Open Source
 

Ähnlich wie State of the_gluster_-_lceu

Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
sprdd
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
sprdd
 
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
CloudStack - Open Source Cloud Computing Project
 

Ähnlich wie State of the_gluster_-_lceu (20)

Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013
Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013
Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013
 
GlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo BangaloreGlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo Bangalore
 
GlusterFS And Big Data
GlusterFS And Big DataGlusterFS And Big Data
GlusterFS And Big Data
 
GlusterFS and Openstack Storage
GlusterFS and Openstack StorageGlusterFS and Openstack Storage
GlusterFS and Openstack Storage
 
Cephfsglusterfs.talk
Cephfsglusterfs.talkCephfsglusterfs.talk
Cephfsglusterfs.talk
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
Lt2013 glusterfs.talk
Lt2013 glusterfs.talkLt2013 glusterfs.talk
Lt2013 glusterfs.talk
 
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
 
Linux High Availability Overview - openSUSE.Asia Summit 2015
Linux High Availability Overview - openSUSE.Asia Summit 2015 Linux High Availability Overview - openSUSE.Asia Summit 2015
Linux High Availability Overview - openSUSE.Asia Summit 2015
 
The Future of GlusterFS and Gluster.org
The Future of GlusterFS and Gluster.orgThe Future of GlusterFS and Gluster.org
The Future of GlusterFS and Gluster.org
 
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo SeidelOSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
 
Red Hat Storage Server Roadmap & Integration With Open Stack
Red Hat Storage Server Roadmap & Integration With Open StackRed Hat Storage Server Roadmap & Integration With Open Stack
Red Hat Storage Server Roadmap & Integration With Open Stack
 
HDFS- What is New and Future
HDFS- What is New and FutureHDFS- What is New and Future
HDFS- What is New and Future
 
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelOSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
 
HDFCloud Workshop: HDF5 in the Cloud
HDFCloud Workshop: HDF5 in the CloudHDFCloud Workshop: HDF5 in the Cloud
HDFCloud Workshop: HDF5 in the Cloud
 
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
 

Mehr von Gluster.org

nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravaranfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
Gluster.org
 
Facebook’s upstream approach to GlusterFS - David Hasson
Facebook’s upstream approach to GlusterFS  - David HassonFacebook’s upstream approach to GlusterFS  - David Hasson
Facebook’s upstream approach to GlusterFS - David Hasson
Gluster.org
 

Mehr von Gluster.org (20)

Automating Gluster @ Facebook - Shreyas Siravara
Automating Gluster @ Facebook - Shreyas SiravaraAutomating Gluster @ Facebook - Shreyas Siravara
Automating Gluster @ Facebook - Shreyas Siravara
 
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravaranfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
 
Facebook’s upstream approach to GlusterFS - David Hasson
Facebook’s upstream approach to GlusterFS  - David HassonFacebook’s upstream approach to GlusterFS  - David Hasson
Facebook’s upstream approach to GlusterFS - David Hasson
 
Throttling Traffic at Facebook Scale
Throttling Traffic at Facebook ScaleThrottling Traffic at Facebook Scale
Throttling Traffic at Facebook Scale
 
GlusterFS w/ Tiered XFS
GlusterFS w/ Tiered XFS  GlusterFS w/ Tiered XFS
GlusterFS w/ Tiered XFS
 
Gluster Metrics: why they are crucial for running stable deployments of all s...
Gluster Metrics: why they are crucial for running stable deployments of all s...Gluster Metrics: why they are crucial for running stable deployments of all s...
Gluster Metrics: why they are crucial for running stable deployments of all s...
 
Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)
Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)
Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)
 
Data Reduction for Gluster with VDO
Data Reduction for Gluster with VDOData Reduction for Gluster with VDO
Data Reduction for Gluster with VDO
 
Releases: What are contributors responsible for
Releases: What are contributors responsible forReleases: What are contributors responsible for
Releases: What are contributors responsible for
 
RIO Distribution: Reconstructing the onion - Shyamsundar Ranganathan
RIO Distribution: Reconstructing the onion - Shyamsundar RanganathanRIO Distribution: Reconstructing the onion - Shyamsundar Ranganathan
RIO Distribution: Reconstructing the onion - Shyamsundar Ranganathan
 
Gluster and Kubernetes
Gluster and KubernetesGluster and Kubernetes
Gluster and Kubernetes
 
Native Clients, more the merrier with GFProxy!
Native Clients, more the merrier with GFProxy!Native Clients, more the merrier with GFProxy!
Native Clients, more the merrier with GFProxy!
 
Gluster: a SWOT Analysis
Gluster: a SWOT Analysis Gluster: a SWOT Analysis
Gluster: a SWOT Analysis
 
GlusterD-2.0: What's Happening? - Kaushal Madappa
GlusterD-2.0: What's Happening? - Kaushal MadappaGlusterD-2.0: What's Happening? - Kaushal Madappa
GlusterD-2.0: What's Happening? - Kaushal Madappa
 
Scalability and Performance of CNS 3.6
Scalability and Performance of CNS 3.6Scalability and Performance of CNS 3.6
Scalability and Performance of CNS 3.6
 
What Makes Us Fail
What Makes Us FailWhat Makes Us Fail
What Makes Us Fail
 
Gluster as Native Storage for Containers - past, present and future
Gluster as Native Storage for Containers - past, present and futureGluster as Native Storage for Containers - past, present and future
Gluster as Native Storage for Containers - past, present and future
 
Heketi Functionality into Glusterd2
Heketi Functionality into Glusterd2Heketi Functionality into Glusterd2
Heketi Functionality into Glusterd2
 
Architecture of the High Availability Solution for Ganesha and Samba with Kal...
Architecture of the High Availability Solution for Ganesha and Samba with Kal...Architecture of the High Availability Solution for Ganesha and Samba with Kal...
Architecture of the High Availability Solution for Ganesha and Samba with Kal...
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightChallenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan Lambright
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

State of the_gluster_-_lceu

  • 1. The State of the Gluster Community John Mark Walker Gluster Community Guy November 8, 2012
  • 2. 11/08/12 Topics ● What is GlusterFS ● GlusterFS 3.3 ● Ways to Access GlusterFS Volumes ● GlusterFS client (FUSE) ● Libgfapi – QEMU block device driver ● Translators ● Swift API – Unified File and Object ● The Roadmap
  • 3. 11/08/12 Simple Economics ● Simplicity, scalability, less cost Multi-TenantVirtualized Automated Commoditized Scale on Demand In the Cloud Scale Out Open Source
  • 4. Commercial Product What is Open Source Open Source Code “Open Core” ● All engineering controlled by project/product sponsor ● No innovation outside of core engineering team ● All open source features also in commercial product ● Many features in Commercial product not in open source code
  • 5. Open Source Code What is Open Source Commercial Products “Real” Open Source ● Many points of collaboration and innovation in open source project ● Engineering team from multiple sources ● Project and product do not completely overlap ● Commercial products are hardened, more secure and thoroughly tested
  • 6. GlusterFS What is Open Source Red Hat Storage “Real” Open Source ● Enables more innovation on the fringes ● Engineering team from multiple sources ● Open source project is “upstream” from commercial product ● “Downstream” products are hardened, more secure and thoroughly tested
  • 7. 11/08/12 Simplicity Bias ● FC, FCoE, iSCSI → HTTP, Sockets ● Modified BSD OS → Linux / User Space / C, Python & Java ● Appliance based → Application based
  • 9. 11/08/12 Bengaluru Office Conference Room US Head Office Bengaluru Office
  • 11. 11/08/12 Not a Storage Company ● At first a cluster-building company ● Engineering team excelled at building open source HPC systems
  • 13. 11/08/12 The big idea: Storage should be simple
  • 14. 11/08/12 What is Simple Storage? ● Low-risk, easy to deploy and administer, data consistency, open source, software-only, user space
  • 15. 11/08/12 What is GlusterFS, Really? Gluster is a unified, distributed storage system ● User space, global namespace, stackable, POSIX-y, scale-out NAS platform, inspired by GNU Hurd
  • 16. 11/08/12 Some Features ● No single point of failure ● DHT ● Synchronous and asynchronous replication ● Proactive self-healing
  • 17. 11/08/12 What Can You Do With It? ● Media – Docs, Photos, Video ● Shared storage – multi-tenant environments ● Big Data – Log Files, RFID Data ● Objects – Long Tail Data
  • 18. 11/08/12 Standard Deployment ● Distributed over multiple servers ● Replicate volumes ● On top of disk FS (XFS, Ext4, ie. Xattrs) ● Multi-protocol access
  • 19. Red Hat Proprietary 19 Storage for Any Environment Scale-out NAS for On-premises and Public Clouds ● Standardized NAS infrastructure ● On-premise and public cloud ● POSIX-ish ● Apps move easily between environments ● Replicate between both Public CloudOn-premises
  • 20. 11/08/12 First Versions ● Toolkit for building storage systems ● Very hacker-friendly ● Community integral part of development – Drove feature development – Repeatable use cases
  • 21. 11/08/12 Mid-2011 Snapshot ● Scale-out NAS ● Distributed and replicated ● NFS, CIFS and native GlusterFS ● User-space, stackable architecture ● Lots of users, not many devs → A good platform to build on
  • 22. 11/08/12 GlusterFS 3.3: Building on the Foundation ● Granular locking ● Proactive self-healing ● Improved rebalancing ● More access methods
  • 23. 11/08/12 Granular Locking – Server fails, comes back – Files evaluated – Block-by-block until healed Server 1 Server 2 GlusterFS GlusterFS Virtual Disk 1-1 Virtual Disk 1-2 Virtual Disk 2-1 Virtual Disk 2-2 Blocks compared
  • 24. 11/08/12 Proactive Self-healing – Performed server-to-server – Recovered node queries peers / Symlink 1 Hidden | Symlink 2 Symlink 3 File 1 File 2 File 3 Server 1 - good Server 2 - recovered Replicated Server 3 - good File 1 File 2 File 3 Server 4 - good Distributed Self-healing
  • 25. 11/08/12 Easier Rebalancing – Now faster ● Previously, created entire new hash set, moving data unnecessarily ● Now recreates hash map and compares to old – Easier to decommission server nodes – Proof point for synchronous translator API
  • 26. 11/08/12 Unified File and Object (UFO) – S3, Swift-style object storage – Access via UFO or Gluster mount Client Proxy Account Container Object HTTP Request ID=/dir/sub/sub2/file Directory Volume FileClient NFS or GlusterFS Mount
  • 27. 11/08/12 Unified File and Object (UFO) – Your gateway to the cloud – Your data, accessed your way
  • 28. 11/08/12 HDFS Compatibility – Run MapReduce jobs on GlusterFS – Add unstructured data to Hadoop Hadoop Server GlusterFS GlusterFS GlusterFS Local Disk GlusterFS HDFS Connector (Jar file)
  • 30. 11/08/12 API Check ● Ways to interface with GlusterFS – Translators ● Stackable, async and sync – FUSE mount ● GlusterFS client – Libgfapi ● FUSE bypass
  • 31. 11/08/12 API Check ● Ways to interface with GlusterFS – Marker framework ● Geo-replication, quickly ID changes – UFO RESTful API – HDFS library – Management API ● oVirt 3.1
  • 32. 11/08/12 Better VM Image Handling – Better responsiveness for random I/o use cases – Contribution: Block Device Translator
  • 33. 11/08/12 Enabling GlusterFS for Virtualization use ● QEMU-GlusterFS integration ● Native integration, no FUSE mount ● Gluster as QEMU block back end ● QEMU talks to gluster and gluster hides different image formats and storage types underneath ● Block device support in GlusterFS via Block Device translator ● Logical volumes as VM images
  • 35. 11/08/12 Libglusterfs Client API – Previously abandoned – Brought back to life ● In part because of QEMU Fuse bypass contributions
  • 36. 11/08/12 Split Brain – Nodes cannot see each other, but can all still write – Often due to network outages – Sometimes results in conflicts – Up to 3.2, GlusterFS had no concept of “quorum”
  • 37. 11/08/12 Quorum Enforcement – Which node has valid data? – If quorum, keep writing, else stop ● Configurable option -No quorum -Stops writing -Quorum -Keeps writing Server 1 Server 2 Broken Connection -Quorum -Keeps writing Server 3
  • 38. 11/08/12 Quorum Enforcement – After connection restored, self-heal kicks off -No quorum -Stops writing -Quorum -Keeps writing Replica 1 Replica 2 -Quorum -Keeps writing Replica 3 -No quorum -Stops writing -Quorum -Keeps writing -Quorum -Keeps writing Self-heal
  • 39. 11/08/12 Enhanced Quorum – Quorum tracking on the servers – Need quorum for any management changes – 3rd party arbiters / observers so never N=2
  • 40. 11/08/12 Management UI & REST API – Collaboration with oVirt project – Management GUI for admins – RESTful gateway for devs – First community release... ?
  • 42. 11/08/12 Multi-tenancy & Encryption – HekaFS created this for cloud deployments – In-flight data encryption
  • 43. 11/08/12 Down the Road – Multi-master Geo-rep – Snapshots – Versioning – GeoRep Sparse Replicas – File compression & de-dupe
  • 44. 11/08/12 Server-side Processing – Implementing gfind, glocate – Fast traversal of metadata in xattrs ● Find and locate responsive – Inotify-esque behavior: triggers based on i/o activity, ie. file close ● Why rely on Hadoop batch- processing?
  • 45. 11/08/12 Goal: Intelligent Storage ● Just storing and retrieving data is not enough ● Should be able to store, analyze, transform, mutilate, and retrieve ● Intelligent storage gives sysadmins and developers the ultimate data swiss army knife
  • 46. Thank you! John Mark Walker Gluster Community Guy johnmark@redhat.com