Introduction To GlusterFS
 Audio is available via VoIP or landline.

 For VoIP:
  – You will be connected to audio using your computer's speakers or headset.

 For landline:
  – Please select Use Telephone after joining the webinar.
  – US/Canada attendees dial (805) 309-0021
  – Access code 232-976-193
Today’s Speakers




 John Kreisa – Vice President, Marketing, Gluster, Inc.
 Tom Trainer – Director, Product Marketing, Gluster, Inc.
 Jacob Shucart – Senior Systems Engineer, Gluster, Inc.




            A Better Way To Do Storage                                            2
Poll Question

Are you using GlusterFS today?
 –   Yes, in a test environment
 –   Yes, it's deployed in a production environment
 –   No, however we are considering it
 –   Just researching




History of Gluster

How it all started
 – Backgrounds in high performance, clustered computing
 – Working at Lawrence Livermore National Labs
     • AB Periasamy & Hitesh Chellani design “Thunder”
     • One of the world's fastest supercomputers
     • On Intel commodity hardware
     • Solved filesystem scalability and performance limitations
 – Large customer in oil & gas persuaded them to focus on storage
 – Gluster founded by Hitesh & AB to bring technology to market
Result: award-winning technology




What is the Gluster File System?

 A scale-out file system for Network Attached Storage (NAS)
  – Highly available
  – Linearly scalable performance



 GlusterFS provides

  – Flexibility to deploy in ANY
     environment
  – Scalability to Petabytes & beyond
  – Superior storage economics




GlusterFS Architecture Design Goals

 Elasticity
  – Flexibly adapt to growth/reduction
  – Add, delete volumes & users
  – Without disruption

 Scale linearly
  – Multiple dimensions
     • Performance
     • Capacity
  – Aggregated resources

 Eliminate metadata
  – Improve file access speed

 Simplicity
  – Ease of management
  – No complex kernel patches
  – Run in user space

 [Figure: axes labeled Performance and Capacity]

Key Differentiators

  Filesystem runs in user space
  Software only
  Open source
  Modular, stackable storage OS architecture
  Data stored in native formats
  No metadata – Elastic hashing




Filesystem Runs in User Space



  Not tied to kernel
  No reassemblies
  Independence

 [Figure: GlusterFS in user space on a server (CPU/Mem), above the kernel and a set of 1 TB disks]




Software Only

 Hardware agnostic
 Superior storage economics & flexibility
  – Data center / private cloud: use commodity hardware
  – Public cloud (e.g. AWS, RackSpace, GoGrid): pay for only what you need

 No lock-in
  –   Hardware vendors, at purchase time or in the future
  –   Public or private cloud
  –   Performance, capacity, or availability levels
  –   GlusterFS is not proprietary; files are stored in native formats (e.g. ext4)




Open Source

 Global adoption

 200,000+ downloads
  – ~12,000/month

 500+ registered deployments
  – 45 countries

 2,500+ registered users
  – Mailing lists, forums, etc.

 Active community
  – Diverse testing environments
  – Bug identification and fixes
  – Code contributions

 Member of broader ecosystem
  – OpenStack, Linux Foundation, Open Virtualization Alliance




Modular, Stackable Storage OS Architecture
 Features
  – NAS protocols
  – Replication
  – Global namespace
  – Clustered & replicated
  – Built-in failover
  – Self healing
  – POSIX compliant
  – No metadata server
  – Elastic hash
  – Thin provisioning
  – Flexible storage configurations

 Architecture stack (layers as listed on the slide)
  – Services to the public network: NFS, CIFS, GlusterFS, HTTP(S), WebDAV, (S)FTP
     • Interface, protocols, replication
  – Gluster Server: public network | storage server network
     • 1Gb, 10Gb, InfiniBand
  – 64-bit Linux distributions: RHEL, Fedora, Debian, CentOS, Ubuntu, SUSE
  – Unlimited number of files
     • Elastic hash function
     • Native format: ext3 or ext4 recommended / POSIX compliant
  – Disk storage
     • Local to host (Gluster Server); SAS, SATA, SCSI-attached JBODs; Fibre Channel supported
     • iSCSI supported, for low I/O apps


Elastic Hashing
 No metadata server
 An algorithmic approach
  – Unique hash tag for each file stored
  – Tags stored within the file system
  – Rapid file read – low latency
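 The algorithmic approach can be illustrated with a minimal sketch. This is not GlusterFS's actual hashing code: the brick names, the use of MD5, and the equal-sized split of the hash space are all assumptions chosen for illustration. The point it shows is that any client can compute where a file lives from its name alone, so no metadata server has to be consulted.

```python
import hashlib

# Hypothetical brick (storage export) names, for illustration only.
BRICKS = [
    "server1:/export/brick1",
    "server2:/export/brick1",
    "server3:/export/brick1",
    "server4:/export/brick1",
]


def file_hash(name: str) -> int:
    """Map a file name to a 32-bit integer.

    MD5 is an arbitrary stand-in here, not the hash GlusterFS uses.
    """
    digest = hashlib.md5(name.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def locate(name: str, bricks=BRICKS) -> str:
    """Return the brick whose slice of the 32-bit hash space holds the name.

    The hash space is split into equal slices, one per brick (an assumption).
    The min() clamp keeps the top of the range inside the last slice when
    2**32 is not evenly divisible by the number of bricks.
    """
    slice_size = 2**32 // len(bricks)
    index = min(file_hash(name) // slice_size, len(bricks) - 1)
    return bricks[index]


if __name__ == "__main__":
    for name in ["song-0001.mp3", "video.mp4", "report.pdf"]:
        print(name, "->", locate(name))
```

 Because the lookup is a pure function of the file name and the brick list, every client arrives at the same answer independently. In this simplified sketch, changing the number of bricks moves slice boundaries and therefore relocates some files; a real system needs a rebalancing step to handle that.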



 [Figures 1, 2, and 3 are examples only]
A Standard Gluster Deployment

 Standard clients running standard apps
 Over any standard IP network
 Access application data, as files & folders, in a global namespace, using a variety of standard protocols
 Stored in a commoditized, virtualized, scale-out, centrally managed pool (DAS, SAN, NAS)

 [Figure: Clients/Apps, IP Network, Gluster Global Namespace (NFS, CIFS, Gluster Native), application data (VMs, VMDK), virtual storage pool]



Unifying Public and Private Cloud Storage

 [Figure: Client/Apps connect over an IP network to a Gluster Global Namespace spanning a private cloud and a public cloud, with replication]



Deployment Scenarios
Common Solutions Built on GlusterFS

                                     Media serving (CDN)
                                     Large scale file storage
                                     Tier 2 & 3 archive
                                     File sharing
                                     Multi-tenant file systems
                                     Storage for High Performance Computing (HPC)
                                     IaaS storage layer
                                     Disaster recovery
                                     Backup & restore
                                     Private cloud

Pandora Internet Radio
 • 1.2 PB of audio served per week
 • 13 million files
 • Over 50 GB/sec peak traffic

Problem
 • Explosive user & title growth
 • As many as 12 file formats for each song
 • "Hot" content and long tail
Solution
 • Three data centers, each with a six-node GlusterFS cluster
 • Replication for high availability
 • 250+ TB total capacity
Benefits
 • Easily scale capacity
 • Centralized management; one administrator to manage day-to-day operations
 • No changes to application
 • Higher reliability

Brightcove
 • Over 1 PB currently in Gluster
 • Separate 4 PB project in the works

Problem
 • Cloud-based online video platform
 • Explosive customer & title growth
 • Massive video in multiple locations
 • Costs rising, esp. with HD formats
Solution
 • Complete scale-out based on commodity DAS/JBOD
 • Replication for high availability
 • 1 PB total capacity
Benefits
 • Easily scale capacity
 • Centralized management; one administrator to manage day-to-day operations
 • Higher reliability
 • Path to multi-site
Cincinnati Bell Technology Solutions

 • Large scale VM storage
 • Low cost service delivery for enterprise customer
 • Drastic reduction in provisioning time

Problem
 • Host a dedicated enterprise cloud solution
 • Large scale VMware environment
 • Need high availability
Solution
 • Gluster for VM storage, NFS to clients
 • SAS drives on back-end
 • Replication for high availability
Benefits
 • Storage provisioning from 6 weeks to 15 minutes
 • Vendor agnostic storage
 • Low cost of service delivery
 • Elastic growth


Partners Healthcare
Private Cloud: Centralized Storage as a Service

 • Over 500 TB
 • 9 Sun “Thumper” systems in cluster

Problem
 • Capacity growth from 144 TB to 1+ PB
 • Multiple distributed users/departments
 • Multi-OS access: Windows, Linux and Unix
Solution
 • GlusterFS cluster
 • Solaris/ZFS/x4500 w/ InfiniBand
 • Native CIFS/NFS access
Benefits
 • Capacity on demand / pay as you grow
 • Centralized management
 • Higher reliability
 • OPEX decreased by 10X
4 Supported Ways to Consume GlusterFS

 Virtual Machines
  – GlusterFS deployable on the leading virtual machines



 Amazon Web Services (AWS)
  – GlusterFS deployed within an Amazon Machine Image (AMI)



 RightScale Cloud Management
  – GlusterFS is available within a RightScale ServerTemplate
  – Deployable via the RightScale Cloud Management Dashboard



 Storage software appliance
  – Deployable on bare metal and supports any hardware on the Red Hat Hardware
    Compatibility List (HCL) of certified servers and storage


Select Gluster Customers




Summary

 GlusterFS – A scale-out NAS file system
 Flexibility, scalability, superior economics
 Innovative architecture provides a better way to do
  storage




Questions and Answers

                        Your turn - ask our experts



 Try Gluster for free here: http://www.gluster.com/trybuy/
 Additional resources here: http://www.gluster.com/products/resources/
 Join the community: http://www.gluster.org/
 Follow on Twitter: @gluster
 Read our blog: http://blog.gluster.com/


              Contact us at: info@gluster.com or 1-800-805-5215




