This document provides an overview of tools for monitoring and managing Ceph clusters in production environments. It discusses several GUI tools for visualizing cluster status, including Calamari, InkScope, and Ceph Dash. It also covers using the Ceph command line for troubleshooting, and provides tips on monitoring placement groups, OSDs, and monitors to ensure cluster health. The key aspects of running Ceph in production are summarized as having happy placement groups, monitors, and OSDs.
Presentation on how to chat with PDF using ChatGPT code interpreter
Living with a Cephalopod: Daily Care & Feeding of Ceph Storage
1. SYSADMIN’S TOOLBOX
TOOLS FOR RUNNING CEPH IN
PRODUCTION
Paul Evans
principal architect
daystrom technology group
Paul at Daystrom dot com
san francisco
ceph day
March 12, 2015
2. WHAT’S IN THIS TALK
• Tools to help Understand Ceph Status
• Tips forTroubleshooting Faster
• Won’t Cover it All
• Maybe some Fun
4. Is there a Simple way to run Ceph that
isn’t Rocket Science?
5. WHAT COULD BE SIMPLER
THAN THE….CLI ?
Ceph’s CLI is Great, but…
REALITY: many Operations Teams juggle
too many technologies already…
Do they need to learn another CLI?
7. GUI TOOL OPTIONS
Calamari VSM InkScope ceph-dash
Backing From Red Hat Intel Orange Labs
Christian
Eichelmann
Lastest
Version 1.2.3 2014.12-0.9.1 1.1 1.0
Release
Date Sep 2014 Dec 2014 Jan 2015 Feb 2015
Capabilities
Monitor +
Light Config
Monitor +
Config
Monitor +
Light Config
Monitor Only
Compatability Wide Limited Wide Wide
8. MONITORING
Calamari VSM InkScope ceph-dash
Mon Status Y Y Y Y
OSD Status Y Y Y Y
OSD-Host Mapping Y Y Y Y
PG Status Y Y Y Y
PG-OSD Mapping N N Y N
MDS Status N Y Y N
Host Status Y Y Y Y
Capacity Utilization Y via Groups Y Y
Throughput (Cluster) N Y Y Y
IOPS (Cluster) Y Y Y Y
Errors/Warnings Y Y Y Y
View Logs Y N N N
Send Alerts (email) N N N via nagios plug-in
Charts/Graphs Y N N via nagios plug-in
9. MANAGEMENT
Calamari VSM InkScope ceph-dash
Deploy a Cluster N Y N N
Deploy Hosts (add/remove) N Y N N
Deploy Storage Groups
(create)
N Y N N
Cluster Services (daemons) OSD only Y N(?) N
Cluster Settings (ops flags) Y N Y N
Cluster Settings (parameters) Y N View N
Cluster Settings (CRUSH
map/rrules)
N Partial View N
Cluster Settings (EC Profiles) N Y Y N
OSD (start/stop/in/out) Partial Y Y N
Pools (Replicated) Y (limited) Y Y N
Pools (EC &Tiering) N Y Partial N
RBDs N Partial N N
S3/Swift Users/Buckets N N Y N
Link to OpenStack Nova N Y N N
22. When it’s time to go deep…
/var/log/ceph/ceph.log
/var/log/ceph/ceph-mon.[host].log
/var/log/ceph/ceph-osd.[xx].log
ceph tell osd.[xx] injectargs --debug-osd 0/5
23. REMINDER ABOUT CLUSTERS
Clusters rarely do things instantly.
Clusters can be like a Flock of Sheep - it
starts to move in the right directly
slowly and then picks up speed
(don’t run it off a cliff)