SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
GlusterFS
                             Distributed Replicated Parallel File System



                                                Text
                                                 Text




                             Martin Alfke <martin.alfke@buero20.org>




Thursday, November 4, 2010
Agenda

                    • General Information on GlusterFS
                    • Architecture Overview
                    • GlusterFS Translators
                    • GlusterFS Configuration



Thursday, November 4, 2010
General Information




Thursday, November 4, 2010
General Information I

                                                  File System Solutions

                        Shared Disk File System
                           •San or Block Access
                           •Mostly used in HA setup (e.g. DRBD)

                        Distributed File System
                           •Network File System
                               •NFS
                               •SMB/CIFS
                               •9P

                        Distributed replicated File System
                           •Replication
                           •HA and offline operation
                               •Coda
                               •MS DFS
                               •MooseFS




                                                             4
Thursday, November 4, 2010
General Information II

                                                  File System Solutions

                         Distributed parallel File System
                            •Setup across multiple servers
                            •HPC

                         Distributed replicated parallel File System
                            •HPC and HA
                                •Cosmos
                                •MogileFS
                                •GPFS (IBM)
                                •GFS (Google)
                                •Hadoop
                                •GlusterFS




                                                              5
Thursday, November 4, 2010
Customer Platform

                                                 Shared Storage Issues

                         Bad performance of NFS kernel stack
                         Limitations of concurrent NFS accesses
                         Customer data already on NFS system




                                           Environment and Requirements

                         Debian GNU/Linux Version 4 (etch) – 3 years old
                         Most D f-t p FS need complex data migration
                         Solution has to be expandable




                                                            6
Thursday, November 4, 2010
Why GlusterFS ?
                                                     Decision basics

                        Possibility to run NFS and GlusterFS in parallel

                        No data migration necessary

                        Easy setup

                        Extendable (e.g. new storage nodes)

                        min. Kernel 2.6.3x --> optimization for FUSE context switches !




                                                              7
Thursday, November 4, 2010
Architecture Overview




Thursday, November 4, 2010
GlusterFS Basics

                                                       Hardware
                        Any x86 Hardware
                        Direct attached storage, RAID
                        FC, Inniband or iSCSI SAN
                        Gigabit or 10 Gigabit network or Inniband

                                                            OS
                        Linux, Solaris, OpenSolaris, OS X, FreeBSD
                        ext3 or ext4 Filesystem (tested)
                        Other POSIX compliant lesystems should also work

                                                      Architecture
                        No Meta-Data Server (fully distributed architecture - Elastic Hash)
                        Replication (RAID 1)
                        Distribution (RAID 0)
                        FUSE (Standard)
                        NFS (unfs3 - depreciated)
                        SMB/CIFS
                        DAV




                                                            9
Thursday, November 4, 2010
GlusterFS Architecture Overview I

                         Distribution                                         Replication




                                           Images Copyright by Gluster.Inc.


                                                         10
Thursday, November 4, 2010
GlusterFS Architecture Overview II




                                            Image Copyright by Gluster.Inc.


                                                          11
Thursday, November 4, 2010
GlusterFS Architecture Overview III

                        Recommended Server Setup

                        GlusterFS daemons

                        GlusterFS Server

                        Network

                        *nix distribution

                        FileSystem (ext3/ext4 or POSIX)

                        Volume Manager (LVM2 only)

                        HW RAID

                        Disks




                                                               Image Copyright by Gluster.Inc.



                                                          12
Thursday, November 4, 2010
GlusterFS Translators




Thursday, November 4, 2010
GlusterFS Translators


                                            GlusterFS modular extension

                        storage
                           •posix – for underlying filesystem
                           •bdb – database storage system
                        protocol
                           •server – required for server config (e.g. Inifiniband or TCP)
                           •client – required for client config (e.g. Infiniband or TCP)
                        cluster
                           •distribute – storage spread to multiple servers
                           •replicate – mirror content between servers
                           •stripe – striping between multiple servers
                           •unify – obsolete, use cluster/distribute
                           •nufa – HPC - higher preference for a local volume
                        encryption
                           •rot-13 – sample code for encryption




                                                            14
Thursday, November 4, 2010
GlusterFS Translators




                        performance

                             •readahead – for sequential read performance
                             •writebehind – aggregate written blocks
                             •io-threads – number of threads
                             •io-cache – read cache block size
                             •quickread – fetch small files in a single call
                             •stat-prefetch – prefetch stats in readdir
                             •symlink-cache – undocumented




                                                             15
Thursday, November 4, 2010
GlusterFS Translators



                        features
                           •locks – POSIX locking
                           •filter – filtering on user id or group id
                           •access-control – undocumented
                           •path-convertor – internal path converter
                           •quota – don’t grow beyond disk space
                           •read-only – included in feature/filter
                           •trash – recycle bin (use on server)
                        debug
                           •trace – trace glusterfs functions and system calls
                           •io-stats – collect performance data
                           •error-gen – undocumented




                                                            16
Thursday, November 4, 2010
GlusterFS
                             Conguration



Thursday, November 4, 2010
GlusterFS Conguration Location

                                          Conguration les (path and names)

                        Server
                           •/etc/glusterfs/glusterfsd.vol

                        Client
                           •/etc/glusterfs/glusterfs.vol




                                        Possible location of cong les for clients

                        Local on disk

                        Remote on glusterfs-Server




                                                            18
Thursday, November 4, 2010
GlusterFS Conguration Example
                                                2 Servers – n Clients – replication

              Server 1 + 2

                         volume posix
                                        type storage/posix
                                        option directory /data/export
                         end-volume

                         volume locks
                                        type features/locks
                                        subvolumes posix
                         end-volume
                                                                           volume server
                         volume brick                                                      type protocol/server
                                        type performance/io-threads                        option transport-type tcp
                                        option thread-count 8                              option auth.addr.brick.allow 192.168.0.102
                                        subvolumes locks                                   subvolumes brick
                         end-volume                                        end-volume




                                                                      19
Thursday, November 4, 2010
GlusterFS Conguration Example
                                             2 Servers – n Clients – replication

             Clients

                        volume remote1
                                    type protocol/client
                                    option transport-type tcp                volume replicate
                                    option remote-host server1.example.com                 type cluster/replicate
                                    option remote-subvolume brick                          subvolumes remote1 remote2
                        end-volume                                           end-volume

                        volume remote2                                       volume writebehind
                                    type protocol/client                                  type performance/write-behind
                                    option transport-type tcp                             option window-size 1MB
                                    option remote-host server2.example.com                subvolumes replicate
                                    option remote-subvolume brick            end-volume
                        end-volume
                                                                             volume cache
                                                                                            type performance/io-cache
                                                                                            option cache-size 512MB
                                                                                            subvolumes writebehind
                                                                             end-volume




                                                               20
Thursday, November 4, 2010
GlusterFS Conguration @Customer
                                                                Servers - both

                              volume posix
                                             type storage/posix
                                             option directory /data
                              End-volume

                              volume iostats
                                                type debug/io-stats
                                                subvolumes posix            volume iothreads
                              end-volume                                                  type performance/io-threads
                                                                                          option thread-count 16
                              volume locks                                                subvolumes locks
                                             type features/locks            end-volume
                                             subvolumes iostats
                              end-volume                                    volume writebehind
                                                                                         type performance/write-behind
                                                                                         option cache-size 64MB
                                                                                         option flush-behind off
                                                                                         subvolumes iothreads
                                                                            end-volume




                                                                      21
Thursday, November 4, 2010
GlusterFS Conguration @Customer

                                      Servers - both - continued

                                  volume brick
                                                 type performance/io-cache
                                                 option cache-size 2048MB
                                                 option cache-timeout 5
                                                 subvolumes writebehind
                                  End-volume

                                  volume server-tcp
                                                 type protocol/server
                                                 option transport-type tcp
                                                 option auth.addr.brick.allow *
                                                 option transport.socket.listen-port 6996
                                                 option transport.socket.nodelay on
                                                 subvolumes brick
                                  end-volume




                                                               22
Thursday, November 4, 2010
GlusterFS Conguration @Customer
                                                             Clients
                volume remote1
                            type protocol/client
                            option transport-type tcp
                            option remote-host <IP server1>
                            option transport.socket.nodelay on
                            option remote-port 6996
                            option remote-subvolume brick              volume iothreads
                end-volume                                                           type performance/io-threads
                                                                                     option thread-count 16
                volume remote2                                                       subvolumes distribute
                           type protocol/client                        end-volume
                           option transport-type tcp
                           option remote-host <IP server2>             volume writebehind
                           option remote-subvolume brick                            type performance/write-behind
                end-volume                                                          option cache-size 32MB
                                                                                    subvolumes iothreads
                volume distribute                                      end-volume
                             type cluster/distribute
                             subvolumes remote1 remote2                volume cache
                end-volume                                                            type performance/io-cache
                                                                                      option cache-size 256MB
                                                                                       option cache-timeout 10
                                                                                      subvolumes writebehind
                                                                       end-volume




                                                                 23
Thursday, November 4, 2010
GlusterFS Usage




Thursday, November 4, 2010
GlusterFS Usage

                                               Mounting GlusterFS on Clients


                        manual mount
                           server-side conguration:
                         mount -t glusterfs server_IP /mnt/glusterfs
                               local (client) conguration
                             mount -t glusterfs /etc/glusterfs/glusterfs.vol /mnt/glusterfs

                        automatic mount (fstab)
                            server-side conguration:
                         server_IP /mnt/glusterfs glusterfs defaults,_netdev 0 0

                                 local(client) conguration:
                             /etc/glusterfs/glusterfs.vol /mnt/glusterfs glusterfs defaults,_netdev 0 0




                                                                25
Thursday, November 4, 2010
GlusterFS understood


                        Links:

                             •http://www.gluster.org/
                             •http://www.howtoforge.com/trip_search
                             •http://www.gluster.org/docs/index.php/GlusterFS
                             •http://www.gluster.org/docs/index.php/GlusterFS_Volume_Specification
                             •http://www.gluster.org/docs/index.php/GlusterFS_Translators
                             •http://www.gluster.com/community/documentation/index.php/Translators_options
                             •http://www.gluster.com/community/documentation/index.php/GlusterFS_Technical_FAQ




                        Further information and examples:

                             •apt-get install glusterfs-examples




                                                                   26
Thursday, November 4, 2010
GlusterFS


                                                       Credits:

                             Gluster Inc.

                             Christian Gischewski <info@cgihome.de>

                             Tobias Geiger <tobias.geiger@1und1.de>




                                                      Questions ?




                                                          27
Thursday, November 4, 2010

Weitere ähnliche Inhalte

Was ist angesagt?

Btrfs by Chris Mason
Btrfs by Chris MasonBtrfs by Chris Mason
Btrfs by Chris Mason
Terry Wang
 
Zettabyte File Storage System
Zettabyte File Storage SystemZettabyte File Storage System
Zettabyte File Storage System
Amdocs
 
Wafl overview
Wafl overviewWafl overview
Wafl overview
taotao1240
 
Introduction to Btrfs - FLOSS UK Spring Conference York 2015
Introduction to Btrfs - FLOSS UK Spring Conference York 2015Introduction to Btrfs - FLOSS UK Spring Conference York 2015
Introduction to Btrfs - FLOSS UK Spring Conference York 2015
Richard Melville
 
B tree file system
B tree file systemB tree file system
B tree file system
Dinesh Gupta
 
Server quickstart47 linux
Server quickstart47 linuxServer quickstart47 linux
Server quickstart47 linux
kb_exchange_hk
 
OSS Presentation by Kevin Halgren
OSS Presentation by Kevin HalgrenOSS Presentation by Kevin Halgren
OSS Presentation by Kevin Halgren
OpenStorageSummit
 

Was ist angesagt? (19)

ZFS Tutorial USENIX June 2009
ZFS  Tutorial  USENIX June 2009ZFS  Tutorial  USENIX June 2009
ZFS Tutorial USENIX June 2009
 
Red Hat System Administration
Red Hat System AdministrationRed Hat System Administration
Red Hat System Administration
 
SmartOS ZFS Architecture
SmartOS ZFS ArchitectureSmartOS ZFS Architecture
SmartOS ZFS Architecture
 
Scale2014
Scale2014Scale2014
Scale2014
 
Btrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current StatusBtrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current Status
 
ZFS Workshop
ZFS WorkshopZFS Workshop
ZFS Workshop
 
Btrfs by Chris Mason
Btrfs by Chris MasonBtrfs by Chris Mason
Btrfs by Chris Mason
 
Zettabyte File Storage System
Zettabyte File Storage SystemZettabyte File Storage System
Zettabyte File Storage System
 
Wafl overview
Wafl overviewWafl overview
Wafl overview
 
IBM SONAS and the Cloud Storage Taxonomy
IBM SONAS and the Cloud Storage TaxonomyIBM SONAS and the Cloud Storage Taxonomy
IBM SONAS and the Cloud Storage Taxonomy
 
Introduction to Btrfs - FLOSS UK Spring Conference York 2015
Introduction to Btrfs - FLOSS UK Spring Conference York 2015Introduction to Btrfs - FLOSS UK Spring Conference York 2015
Introduction to Btrfs - FLOSS UK Spring Conference York 2015
 
B tree file system
B tree file systemB tree file system
B tree file system
 
Btrfs current status and_future_prospects
Btrfs current status and_future_prospectsBtrfs current status and_future_prospects
Btrfs current status and_future_prospects
 
4. linux file systems
4. linux file systems4. linux file systems
4. linux file systems
 
Server quickstart47 linux
Server quickstart47 linuxServer quickstart47 linux
Server quickstart47 linux
 
OSS Presentation by Kevin Halgren
OSS Presentation by Kevin HalgrenOSS Presentation by Kevin Halgren
OSS Presentation by Kevin Halgren
 
Olf2013
Olf2013Olf2013
Olf2013
 
The Evolution of Storage on Linux - FrOSCon - 2015-08-22
The Evolution of Storage on Linux - FrOSCon - 2015-08-22The Evolution of Storage on Linux - FrOSCon - 2015-08-22
The Evolution of Storage on Linux - FrOSCon - 2015-08-22
 
ZFS Talk Part 1
ZFS Talk Part 1ZFS Talk Part 1
ZFS Talk Part 1
 

Ähnlich wie Gluster fs buero20_presentation

Distributed file systems
Distributed file systemsDistributed file systems
Distributed file systems
Sri Prasanna
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
Milad Sobhkhiz
 

Ähnlich wie Gluster fs buero20_presentation (20)

GlusterFS as a DFS
GlusterFS as a DFSGlusterFS as a DFS
GlusterFS as a DFS
 
Gluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFSGluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFS
 
Gpfs introandsetup
Gpfs introandsetupGpfs introandsetup
Gpfs introandsetup
 
Posscon2013
Posscon2013Posscon2013
Posscon2013
 
GlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big dataGlusterFs: a scalable file system for today's and tomorrow's big data
GlusterFs: a scalable file system for today's and tomorrow's big data
 
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
 
Lisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionLisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introduction
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
 
Pnfs
PnfsPnfs
Pnfs
 
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File SystemArchitecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
 
Xfs file system for linux
Xfs file system for linuxXfs file system for linux
Xfs file system for linux
 
Distributed file systems
Distributed file systemsDistributed file systems
Distributed file systems
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 
Adding Extended Attribute Support to NFS
Adding Extended Attribute Support to NFSAdding Extended Attribute Support to NFS
Adding Extended Attribute Support to NFS
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
 
The evolution of linux file system
The evolution of linux file systemThe evolution of linux file system
The evolution of linux file system
 
White Paper: Using Perforce 'Attributes' for Managing Game Asset Metadata
White Paper: Using Perforce 'Attributes' for Managing Game Asset MetadataWhite Paper: Using Perforce 'Attributes' for Managing Game Asset Metadata
White Paper: Using Perforce 'Attributes' for Managing Game Asset Metadata
 

Mehr von Martin Alfke

Puppet buero20 presentation
Puppet buero20 presentationPuppet buero20 presentation
Puppet buero20 presentation
Martin Alfke
 

Mehr von Martin Alfke (18)

CfgMgmtCamp 2023 - Puppet is YAML.pdf
CfgMgmtCamp 2023 - Puppet is YAML.pdfCfgMgmtCamp 2023 - Puppet is YAML.pdf
CfgMgmtCamp 2023 - Puppet is YAML.pdf
 
HashiTalksDACH-Terraform-Managing training instances in the Cloud
HashiTalksDACH-Terraform-Managing training instances in the CloudHashiTalksDACH-Terraform-Managing training instances in the Cloud
HashiTalksDACH-Terraform-Managing training instances in the Cloud
 
PuppetCamp2021-Testing Modules and ControlRepo.pdf
PuppetCamp2021-Testing Modules and ControlRepo.pdfPuppetCamp2021-Testing Modules and ControlRepo.pdf
PuppetCamp2021-Testing Modules and ControlRepo.pdf
 
Puppet Camp Germany 2020 - Puppet Control Repo and GIT
Puppet Camp Germany 2020 - Puppet Control Repo and GITPuppet Camp Germany 2020 - Puppet Control Repo and GIT
Puppet Camp Germany 2020 - Puppet Control Repo and GIT
 
DevOps - How to get technical buy in
DevOps - How to get technical buy inDevOps - How to get technical buy in
DevOps - How to get technical buy in
 
ADDO 2019 DevOps in a containerized world
ADDO 2019 DevOps in a containerized worldADDO 2019 DevOps in a containerized world
ADDO 2019 DevOps in a containerized world
 
OpenRheinRuhr 2018 - Ops hates containers! Why?
OpenRheinRuhr 2018 - Ops hates containers! Why?OpenRheinRuhr 2018 - Ops hates containers! Why?
OpenRheinRuhr 2018 - Ops hates containers! Why?
 
PuppetConf 2016 Moving from Exec to Types and Provides
PuppetConf 2016 Moving from Exec to Types and ProvidesPuppetConf 2016 Moving from Exec to Types and Provides
PuppetConf 2016 Moving from Exec to Types and Provides
 
Puppet Camp Paris 2016 Data in Modules
Puppet Camp Paris 2016 Data in ModulesPuppet Camp Paris 2016 Data in Modules
Puppet Camp Paris 2016 Data in Modules
 
Power of Puppet 4
Power of Puppet 4Power of Puppet 4
Power of Puppet 4
 
Can you upgrade to Puppet 4.x?
Can you upgrade to Puppet 4.x?Can you upgrade to Puppet 4.x?
Can you upgrade to Puppet 4.x?
 
Upgrading Puppet CommitterConf Essen 2014
Upgrading Puppet CommitterConf Essen 2014Upgrading Puppet CommitterConf Essen 2014
Upgrading Puppet CommitterConf Essen 2014
 
GUUG Hamburg OpenNebula
GUUG Hamburg OpenNebulaGUUG Hamburg OpenNebula
GUUG Hamburg OpenNebula
 
Puppet camp london-modulerewritingsmartway
Puppet camp london-modulerewritingsmartwayPuppet camp london-modulerewritingsmartway
Puppet camp london-modulerewritingsmartway
 
One
OneOne
One
 
Puppet future parser
Puppet future parserPuppet future parser
Puppet future parser
 
developing sysadmin, sysadmining developersGuug devops puppet
developing sysadmin, sysadmining developersGuug devops puppetdeveloping sysadmin, sysadmining developersGuug devops puppet
developing sysadmin, sysadmining developersGuug devops puppet
 
Puppet buero20 presentation
Puppet buero20 presentationPuppet buero20 presentation
Puppet buero20 presentation
 

KĂźrzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

KĂźrzlich hochgeladen (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Gluster fs buero20_presentation

  • 1. GlusterFS Distributed Replicated Parallel File System Text Text Martin Alfke <martin.alfke@buero20.org> Thursday, November 4, 2010
  • 2. Agenda • General Information on GlusterFS • Architecture Overview • GlusterFS Translators • GlusterFS Conguration Thursday, November 4, 2010
  • 4. General Information I File System Solutions Shared Disk File System •San or Block Access •Mostly used in HA setup (e.g. DRBD) Distributed File System •Network File System •NFS •SMB/CIFS •9P Distributed replicated File System •Replication •HA and offline operation •Coda •MS DFS •MooseFS 4 Thursday, November 4, 2010
  • 5. General Information II File System Solutions Distributed parallel File System •Setup across multiple servers •HPC Distributed replicated parallel File System •HPC and HA •Cosmos •MogileFS •GPFS (IBM) •GFS (Google) •Hadoop •GlusterFS 5 Thursday, November 4, 2010
  • 6. Customer Platform Shared Storage Issues Bad performance of NFS kernel stack Limitations of concurrent NFS accesses Customer data already on NFS system Environment and Requirements Debian GNU/Linux Version 4 (etch) – 3 years old Most D f-t p FS need complex data migration Solution has to be expandable 6 Thursday, November 4, 2010
  • 7. Why GlusterFS ? Decision basics Possibility to run NFS and GlusterFS in parallel No data migration necessary Easy setup Extendable (e.g. new storage nodes) min. Kernel 2.6.3x --> optimization for FUSE context switches ! 7 Thursday, November 4, 2010
  • 9. GlusterFS Basics Hardware Any x86 Hardware Direct attached storage, RAID FC, Inniband or iSCSI SAN Gigabit or 10 Gigabit network or Inniband OS Linux, Solaris, OpenSolaris, OS X, FreeBSD ext3 or ext4 Filesystem (tested) Other POSIX compliant lesystems should also work Architecture No Meta-Data Server (fully distributed architecture - Elastic Hash) Replication (RAID 1) Distribution (RAID 0) FUSE (Standard) NFS (unfs3 - depreciated) SMB/CIFS DAV 9 Thursday, November 4, 2010
  • 10. GlusterFS Architecture Overview I Distribution Replication Images Copyright by Gluster.Inc. 10 Thursday, November 4, 2010
  • 11. GlusterFS Architecture Overview II Image Copyright by Gluster.Inc. 11 Thursday, November 4, 2010
  • 12. GlusterFS Architecture Overview III Recommended Server Setup GlusterFS daemons GlusterFS Server Network *nix distribution FileSystem (ext3/ext4 or POSIX) Volume Manager (LVM2 only) HW RAID Disks Image Copyright by Gluster.Inc. 12 Thursday, November 4, 2010
  • 14. GlusterFS Translators GlusterFS modular extension storage •posix – for underlying lesystem •bdb – database storage system protocol •server – required for server cong (e.g. Ininiband or TCP) •client – required for client cong (e.g. Inniband or TCP) cluster •distribute – storage spread to multiple servers •replicate – mirror content between servers •stripe – striping between multiple servers •unify – obsolete, use cluster/distribute •nufa – HPC - higher preference for a local volume encryption •rot-13 – sample code for encryption 14 Thursday, November 4, 2010
  • 15. GlusterFS Translators performance •readahead – for sequential read performance •writebehind – aggregate written blocks •io-threads – number of threads •io-cache – read cache block size •quickread – fetch small les in a single call •stat-prefetch – prefetch stats in readdir •symlink-cache – undocumented 15 Thursday, November 4, 2010
  • 16. GlusterFS Translators features •locks – POSIX locking •filter – ltering on user id or group id •access-control – undocumented •path-convertor – internal path converter •quota – don’t grow beyond disk space •read-only – included in feature/lter •trash – recycle bin (use on server) debug •trace – trace glusterfs functions and system calls •io-stats – collect performance data •error-gen – undocumented 16 Thursday, November 4, 2010
  • 17. GlusterFS Conguration Thursday, November 4, 2010
  • 18. GlusterFS Conguration Location Conguration les (path and names) Server •/etc/glusterfs/glusterfsd.vol Client •/etc/glusterfs/glusterfs.vol Possible location of cong les for clients Local on disk Remote on glusterfs-Server 18 Thursday, November 4, 2010
  • 19. GlusterFS Conguration Example 2 Servers – n Clients – replication Server 1 + 2 volume posix type storage/posix option directory /data/export end-volume volume locks type features/locks subvolumes posix end-volume volume server volume brick type protocol/server type performance/io-threads option transport-type tcp option thread-count 8 option auth.addr.brick.allow 192.168.0.102 subvolumes locks subvolumes brick end-volume end-volume 19 Thursday, November 4, 2010
  • 20. GlusterFS Conguration Example 2 Servers – n Clients – replication Clients volume remote1 type protocol/client option transport-type tcp volume replicate option remote-host server1.example.com type cluster/replicate option remote-subvolume brick subvolumes remote1 remote2 end-volume end-volume volume remote2 volume writebehind type protocol/client type performance/write-behind option transport-type tcp option window-size 1MB option remote-host server2.example.com subvolumes replicate option remote-subvolume brick end-volume end-volume volume cache type performance/io-cache option cache-size 512MB subvolumes writebehind end-volume 20 Thursday, November 4, 2010
  • 21. GlusterFS Conguration @Customer Servers - both volume posix type storage/posix option directory /data End-volume volume iostats type debug/io-stats subvolumes posix volume iothreads end-volume type performance/io-threads option thread-count 16 volume locks subvolumes locks type features/locks end-volume subvolumes iostats end-volume volume writebehind type performance/write-behind option cache-size 64MB option flush-behind off subvolumes iothreads end-volume 21 Thursday, November 4, 2010
  • 22. GlusterFS Conguration @Customer Servers - both - continued volume brick type performance/io-cache option cache-size 2048MB option cache-timeout 5 subvolumes writebehind End-volume volume server-tcp type protocol/server option transport-type tcp option auth.addr.brick.allow * option transport.socket.listen-port 6996 option transport.socket.nodelay on subvolumes brick end-volume 22 Thursday, November 4, 2010
  • 23. GlusterFS Conguration @Customer Clients volume remote1 type protocol/client option transport-type tcp option remote-host <IP server1> option transport.socket.nodelay on option remote-port 6996 option remote-subvolume brick volume iothreads end-volume type performance/io-threads option thread-count 16 volume remote2 subvolumes distribute type protocol/client end-volume option transport-type tcp option remote-host <IP server2> volume writebehind option remote-subvolume brick type performance/write-behind end-volume option cache-size 32MB subvolumes iothreads volume distribute end-volume type cluster/distribute subvolumes remote1 remote2 volume cache end-volume type performance/io-cache option cache-size 256MB option cache-timeout 10 subvolumes writebehind end-volume 23 Thursday, November 4, 2010
  • 25. GlusterFS Usage Mounting GlusterFS on Clients manual mount server-side conguration: mount -t glusterfs server_IP /mnt/glusterfs local (client) conguration mount -t glusterfs /etc/glusterfs/glusterfs.vol /mnt/glusterfs automatic mount (fstab) server-side conguration: server_IP /mnt/glusterfs glusterfs defaults,_netdev 0 0 local(client) conguration: /etc/glusterfs/glusterfs.vol /mnt/glusterfs glusterfs defaults,_netdev 0 0 25 Thursday, November 4, 2010
  • 26. GlusterFS understood Links: •http://www.gluster.org/ •http://www.howtoforge.com/trip_search •http://www.gluster.org/docs/index.php/GlusterFS •http://www.gluster.org/docs/index.php/GlusterFS_Volume_Specication •http://www.gluster.org/docs/index.php/GlusterFS_Translators •http://www.gluster.com/community/documentation/index.php/Translators_options •http://www.gluster.com/community/documentation/index.php/GlusterFS_Technical_FAQ Further information and examples: •apt-get install glusterfs-examples 26 Thursday, November 4, 2010
  • 27. GlusterFS Credits: Gluster Inc. Christian Gischewski <info@cgihome.de> Tobias Geiger <tobias.geiger@1und1.de> Questions ? 27 Thursday, November 4, 2010