Gluster: Where We've Been
                               AB Periasamy
                 Office of the CTO, Red Hat

                         John Mark Walker
                   Gluster Community Guy
Topics
 The Big Idea
 Humble beginnings
      From Bangalore to Milpitas
 Scale-out + Open source == WINNING
      User-space, no metadata server, stackable
 Cloud and commoditization



06/13/12
A Data Explosion!
 74% == Unstructured data annual growth
 63,000 PB == Scale-out storage in 2015
 40% == storage-related expense for cloud
 44x == Unstructured data volume growth by
  2020

[Photos: Conference Room; US Head Office; Bengaluru Office]
Gluster Community Deployments
Gluster Production Deployments
What Can You Store?
      Media – Docs, Photos, Video
      VM Filesystem – VM Disk Images
      Big Data – Log Files, RFID Data
      Objects – Long Tail Data



The big idea:
   Storage should be
               simple
 Simple, scalable, low-cost
What is GlusterFS,
 Really?
           Gluster is a unified, distributed
            storage system
             DHT, stackable, POSIX, Swift, HDFS
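
The "DHT" and "no metadata server" points come down to one idea: any client can work out where a file lives from its name alone, with no central lookup. A minimal sketch of that idea follows, assuming hypothetical brick names and using CRC32 as a stand-in hash. GlusterFS's real hash function and per-directory layouts differ, so treat this as an illustration, not the actual algorithm.

```python
import zlib

# Hypothetical bricks in "host:/path" notation (not from the slides).
BRICKS = ["server1:/export/brick", "server2:/export/brick", "server3:/export/brick"]
HASH_SPACE = 2 ** 32


def layout(bricks):
    """Split the 32-bit hash space into one contiguous range per brick."""
    step = HASH_SPACE // len(bricks)
    ranges = []
    for i, brick in enumerate(bricks):
        start = i * step
        # The last brick absorbs the remainder so the ranges cover everything.
        end = HASH_SPACE - 1 if i == len(bricks) - 1 else start + step - 1
        ranges.append((start, end, brick))
    return ranges


def locate(filename, ranges):
    """Return the brick whose range contains the hash of the file name."""
    h = zlib.crc32(filename.encode("utf-8"))  # stand-in hash, not Gluster's
    for start, end, brick in ranges:
        if start <= h <= end:
            return brick
    raise AssertionError("ranges must cover the whole hash space")


if __name__ == "__main__":
    ranges = layout(BRICKS)
    for name in ["photo.jpg", "vm-disk.img", "access.log"]:
        print(name, "->", locate(name, ranges))
```

Every client that holds the same layout computes the same answer, which is why no metadata server has to be asked.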




Phase 1: Lego Kit for
  Storage
      “People who think that userspace filesystems
        are realistic for anything but toys are just
        misguided” – Linus Torvalds
      Goal: create a global namespace



volume testvol-posix
    type storage/posix
    option directory /media/datastore
    option volume-id 329e31c1-04cc-4386-8bb8-xxxx
end-volume

volume testvol-access-control
    type features/access-control
    subvolumes testvol-posix
end-volume

volume testvol-locks
    type features/locks
    subvolumes testvol-access-control
end-volume

volume testvol-io-threads
    type performance/io-threads
    subvolumes testvol-locks
end-volume
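
Each `subvolumes` line in the volume file names the translator underneath, so the four blocks form a single stack with `storage/posix` at the bottom. The sketch below makes that chain explicit. `parse_volfile` and `stack_order` are hypothetical helpers written for this illustration, not part of GlusterFS, and they only handle a linear stack like this one.

```python
VOLFILE = """
volume testvol-posix
    type storage/posix
    option directory /media/datastore
    option volume-id 329e31c1-04cc-4386-8bb8-xxxx
end-volume

volume testvol-access-control
    type features/access-control
    subvolumes testvol-posix
end-volume

volume testvol-locks
    type features/locks
    subvolumes testvol-access-control
end-volume

volume testvol-io-threads
    type performance/io-threads
    subvolumes testvol-locks
end-volume
"""


def parse_volfile(text):
    """Return {volume name: {"type": str, "subvolumes": [names]}}."""
    volumes, current = {}, None
    for raw in text.splitlines():
        words = raw.split()
        if not words:
            continue
        if words[0] == "volume":
            current = volumes.setdefault(words[1], {"type": None, "subvolumes": []})
        elif words[0] == "end-volume":
            current = None
        elif current is not None and words[0] == "type":
            current["type"] = words[1]
        elif current is not None and words[0] == "subvolumes":
            current["subvolumes"] = words[1:]
        # "option" lines are ignored; they do not affect the stacking order.
    return volumes


def stack_order(volumes):
    """List translator types from the top of the stack down to storage."""
    referenced = {s for v in volumes.values() for s in v["subvolumes"]}
    tops = [name for name in volumes if name not in referenced]
    if len(tops) != 1:
        raise ValueError("expected exactly one top-level volume")
    order, name = [], tops[0]
    while name is not None:
        order.append(volumes[name]["type"])
        subs = volumes[name]["subvolumes"]
        name = subs[0] if subs else None  # assumes one child per translator
    return order


if __name__ == "__main__":
    for depth, xlator in enumerate(stack_order(parse_volfile(VOLFILE))):
        print("  " * depth + xlator)
```

A request enters at the top (`performance/io-threads`) and passes through locks and access control before it reaches the POSIX storage translator.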
Versions 1.x – 2.x
      Hand-crafted volume definition files
        See examples
      Simple configuration files
      Faster than tape? It's good!


Phase 2: Repeatability
  of Use Cases




Community-led
      Learned from community
         Desired features
         Usage profiles
      All about scalable storage of unstructured
       data

GlusterFS 3.0: Putting it
  all together
      Adding, removing features
      Templates – recipes for common use
       cases


GlusterFS 3.1 - 2010
      Elasticity: add and remove volumes w/
       glusterd
      Automation: CLI, scriptable




CLI Magic
 $ gluster peer probe HOSTNAME
 $ gluster volume info
 $ gluster volume create VOLNAME [stripe COUNT] 
           [replica COUNT] [transport tcp | rdma] BRICK
 $ gluster volume delete VOLNAME
 $ gluster volume add-brick VOLNAME NEW-BRICK ...
 $ gluster volume rebalance VOLNAME start
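
Because the CLI is scriptable, the synopsis above can be filled in from code. A minimal sketch, assuming hypothetical host names and brick paths: it only builds and prints the command lines, following the argument order shown on the slide, and does not run anything.

```python
import shlex


def peer_probe_cmd(hostname):
    """argv for adding a server to the trusted pool."""
    return ["gluster", "peer", "probe", hostname]


def volume_create_cmd(volname, bricks, replica=None, transport="tcp"):
    """argv for `gluster volume create`, in the order given in the synopsis."""
    if not bricks:
        raise ValueError("at least one brick is required")
    if replica is not None and len(bricks) % replica != 0:
        raise ValueError("brick count must be a multiple of the replica count")
    cmd = ["gluster", "volume", "create", volname]
    if replica is not None:
        cmd += ["replica", str(replica)]
    cmd += ["transport", transport]
    return cmd + list(bricks)


if __name__ == "__main__":
    # Hypothetical two-server setup; names and paths are placeholders.
    bricks = ["server1:/export/brick1", "server2:/export/brick1"]
    for cmd in (
        peer_probe_cmd("server2"),
        volume_create_cmd("testvol", bricks, replica=2),
    ):
        print("$", shlex.join(cmd))
```

Returning argument lists instead of strings keeps quoting out of the picture if the commands are later passed to `subprocess.run`.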

GlusterFS 3.2 - 2011
      Native NFS server
      Marker framework
      Geo-replication
        Asynchronous


And now for something
  completely different
           Commoditization and the changing
            economics of storage
           Why we're winning



Simple Economics
        Simplicity, scalability, less cost


  Virtualized     Multi-Tenant   Automated   Commoditized


Scale on Demand   In the Cloud   Scale Out   Open Source




Simplicity Bias
      FC, FCoE, iSCSI → HTTP, Sockets
      Modified BSD OS → Linux / User Space /
                        C, Python & Java
      Appliance based → Application based




Scale-out Open Source
  is the winner




Thank you!
                     AB Periasamy
       Office of the CTO, Red Hat
                   ab@redhat.com

               John Mark Walker
         Gluster Community Guy
          johnmark@redhat.com

Editor's notes

  1. Add examples where complexity has been bad: EMC, Cisco, Brocade et al. Certification made a business out of complexity. If it's too complicated, it doesn't scale.
  2. Discuss the approach: how GlusterFS is unique and different from other approaches. Lessons from GNU Hurd. A user-space distributed storage operating system. Overcame some parts of the OS: implemented scheduler, POSIX locking, RDMA, MM (cf. JVM, Python, etc.). No metadata separation.
  3. If you have a bunch of files, it should be as simple as an FTP server. In user space, this required FUSE, a POSIX translator, a NAS protocol, and a cluster translator.
  4. Learned about missing features. Found the largest problem and wanted to solve it. Patterns emerged: scalable unstructured data storage was the #1 problem people wanted to solve. Had a clearer idea of where we wanted to go: clear direction.
  5. Standalone NFS replacement. Active-active replicated storage. Scalable, distributed storage. And then scalable, replicated distributed storage, plus other combos.
  6. Elastic features driven by cloud and virt usage: shared storage for virtual guests; flexible, self-service storage; elastic volume management became a requirement; automated provisioning of storage w/ CLI. (Native NFS server? Or 3.2?)
  7. Marker framework: the story of why it's necessary. Backup of data in other locales. Don't need an entire snapshot. Users wanted continuous, unlimited replication. Don't want sysadmin intervention: on-demand. Queries the FS to find which files have changed. Manages the queue, telling rsync exactly which files to change. Inotify doesn't scale; if the daemon crashes, it stops tracking changes, and we would have to write a journaling feature to maintain the change queue. Geo-replication can work on high-latency, flaky networks.
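
The queue-and-rsync idea in note 7 can be sketched as follows. This is a loose illustration and not how the marker framework is implemented: `ChangeQueue` and `rsync_argv` are hypothetical names, the paths are placeholders, and the queue lives in memory here, which is exactly the crash weakness the note attributes to an inotify-style daemon.

```python
class ChangeQueue:
    """Collects paths that changed since the last sync, without duplicates."""

    def __init__(self):
        self._paths = {}  # dict preserves insertion order and drops repeats

    def record(self, relative_path):
        self._paths[relative_path] = None

    def drain(self):
        """Return the pending paths in first-seen order and empty the queue."""
        paths = list(self._paths)
        self._paths.clear()
        return paths


def rsync_argv(list_file, source_dir, destination):
    """argv telling rsync to transfer only the paths listed in list_file."""
    return ["rsync", "-a", f"--files-from={list_file}", source_dir, destination]


if __name__ == "__main__":
    queue = ChangeQueue()
    for path in ["logs/a.log", "img/b.png", "logs/a.log"]:
        queue.record(path)
    print(queue.drain())
    # Placeholder list file, source directory, and remote target.
    print(rsync_argv("/tmp/changed.txt", "/export/brick1", "backup:/export/brick1"))
```

Handing rsync an explicit file list avoids a full tree walk on each run, which is the point of tracking changes inside the filesystem.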