
======================
 Monitoring a Cluster
======================

Once you have a running cluster, you may use the ``ceph`` tool to monitor your
cluster. Monitoring a cluster typically involves checking OSD status, monitor 
status, placement group status and metadata server status.

Using the command line
======================

Interactive mode
----------------

To run the ``ceph`` tool in interactive mode, type ``ceph`` at the command line
with no arguments.  For example:: 

	ceph
	ceph> health
	ceph> status
	ceph> quorum_status
	ceph> mon stat

Non-default paths
-----------------

If you specified non-default locations for your configuration or keyring,
you may specify their locations::

   ceph -c /path/to/conf -k /path/to/keyring health

Checking a Cluster's Status
===========================

After you start your cluster, and before you start reading and/or
writing data, check your cluster's status.

To check a cluster's status, execute the following:: 

	ceph status
	
Or:: 

	ceph -s

In interactive mode, type ``status`` and press **Enter**. ::

	ceph> status

Ceph will print the cluster status. For example, a small Ceph demonstration
cluster may print the following:

::

  cluster:
    id:     477e46f1-ae41-4e43-9c8f-72c918ab0a20
    health: HEALTH_OK
   
  services:
    mon: 3 daemons, quorum a,b,c
    mgr: x(active)
    mds: cephfs_a-1/1/1 up  {0=a=up:active}, 2 up:standby
    osd: 3 osds: 3 up, 3 in
  
  data:
    pools:   2 pools, 16 pgs
    objects: 21 objects, 2.19K
    usage:   546 GB used, 384 GB / 931 GB avail
    pgs:     16 active+clean


.. topic:: How Ceph Calculates Data Usage

   The ``usage`` value reflects the *actual* amount of raw storage used. The
   ``xxx GB / xxx GB`` value shows the amount available (the lesser number)
   out of the overall storage capacity of the cluster (the greater number).
   The notional number reflects the size of the stored data before it is
   replicated, cloned or snapshotted.
   Therefore, the amount of data actually stored typically exceeds the notional
   amount stored, because Ceph creates replicas of the data and may also use 
   storage capacity for cloning and snapshotting.
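
As a rough illustration of the ``usage`` line described above, the following
Python sketch extracts the three numbers from the sample output and derives
the percentage of raw storage used. The function name ``parse_usage`` is
hypothetical, and the pattern assumes whole-number ``GB`` values exactly as in
the sample; other releases may print different units.

```python
import re


def parse_usage(line):
    """Return (used, avail, total) in GB from a ``usage:`` status line."""
    match = re.search(r"(\d+) GB used, (\d+) GB / (\d+) GB avail", line)
    if match is None:
        raise ValueError("unrecognized usage line: %r" % line)
    used, avail, total = (int(group) for group in match.groups())
    return used, avail, total


# Sample line copied from the status output above.
used, avail, total = parse_usage("usage:   546 GB used, 384 GB / 931 GB avail")
print("raw used: %.1f%% of %d GB" % (100.0 * used / total, total))
```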


Watching a Cluster
==================

In addition to local logging by each daemon, Ceph clusters maintain
a *cluster log* that records high-level events about the whole system.
This is logged to disk on monitor servers (as ``/var/log/ceph/ceph.log`` by
default), but can also be monitored via the command line.

To follow the cluster log, use the following command:

:: 

	ceph -w

Ceph will print the status of the system, followed by each log message as it
is emitted.  For example:

:: 

  cluster:
    id:     477e46f1-ae41-4e43-9c8f-72c918ab0a20
    health: HEALTH_OK
  
  services:
    mon: 3 daemons, quorum a,b,c
    mgr: x(active)
    mds: cephfs_a-1/1/1 up  {0=a=up:active}, 2 up:standby
    osd: 3 osds: 3 up, 3 in
  
  data:
    pools:   2 pools, 16 pgs
    objects: 21 objects, 2.19K
    usage:   546 GB used, 384 GB / 931 GB avail
    pgs:     16 active+clean
  
  
  2017-07-24 08:15:11.329298 mon.a mon.0 172.21.9.34:6789/0 23 : cluster [INF] osd.0 172.21.9.34:6806/20527 boot
  2017-07-24 08:15:14.258143 mon.a mon.0 172.21.9.34:6789/0 39 : cluster [INF] Activating manager daemon x
  2017-07-24 08:15:15.446025 mon.a mon.0 172.21.9.34:6789/0 47 : cluster [INF] Manager daemon x is now available


In addition to using ``ceph -w`` to print log lines as they are emitted,
use ``ceph log last [n]`` to see the most recent ``n`` lines from the cluster
log.
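
If you want to post-process cluster log lines, a small parser can split each
line into fields. The sketch below is only an illustration: the field names
(``stamp``, ``name``, ``rank``, ``addr``, ``seq``, ``channel``, ``level``,
``message``) are labels chosen here, and the pattern is derived solely from
the sample lines above, so it may need adjusting for other releases.

```python
import re

# Pattern inferred from the sample cluster log lines in this document.
LOG_LINE = re.compile(
    r"^(?P<stamp>\S+ \S+) (?P<name>\S+) (?P<rank>\S+) (?P<addr>\S+) "
    r"(?P<seq>\d+) : (?P<channel>\S+) \[(?P<level>\w+)\] (?P<message>.*)$"
)


def parse_log_line(line):
    """Return a dict of fields, or None if the line does not match."""
    match = LOG_LINE.match(line.strip())
    return match.groupdict() if match else None


sample = ("2017-07-24 08:15:11.329298 mon.a mon.0 172.21.9.34:6789/0 23 : "
          "cluster [INF] osd.0 172.21.9.34:6806/20527 boot")
fields = parse_log_line(sample)
print(fields["level"], fields["message"])
```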

Monitoring Health Checks
========================

Ceph continuously runs various *health checks* against its own status.  When
a health check fails, this is reflected in the output of ``ceph status`` (or
``ceph health``).  In addition, messages are sent to the cluster log to
indicate when a check fails, and when the cluster recovers.

For example, when an OSD goes down, the ``health`` section of the status
output may be updated as follows:

::

    health: HEALTH_WARN
            1 osds down
            Degraded data redundancy: 21/63 objects degraded (33.333%), 16 pgs unclean, 16 pgs degraded

At the same time, cluster log messages are emitted to record the failure of the
health checks:

::

    2017-07-25 10:08:58.265945 mon.a mon.0 172.21.9.34:6789/0 91 : cluster [WRN] Health check failed: 1 osds down (OSD_DOWN)
    2017-07-25 10:09:01.302624 mon.a mon.0 172.21.9.34:6789/0 94 : cluster [WRN] Health check failed: Degraded data redundancy: 21/63 objects degraded (33.333%), 16 pgs unclean, 16 pgs degraded (PG_DEGRADED)

When the OSD comes back online, the cluster log records the cluster's return
to a healthy state:

::

    2017-07-25 10:11:11.526841 mon.a mon.0 172.21.9.34:6789/0 109 : cluster [WRN] Health check update: Degraded data redundancy: 2 pgs unclean, 2 pgs degraded, 2 pgs undersized (PG_DEGRADED)
    2017-07-25 10:11:13.535493 mon.a mon.0 172.21.9.34:6789/0 110 : cluster [INF] Health check cleared: PG_DEGRADED (was: Degraded data redundancy: 2 pgs unclean, 2 pgs degraded, 2 pgs undersized)
    2017-07-25 10:11:13.535577 mon.a mon.0 172.21.9.34:6789/0 111 : cluster [INF] Cluster is now healthy
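
The health check messages above can be tracked programmatically. The following
sketch keeps a set of currently raised health check codes based on the
``Health check failed``, ``Health check update`` and ``Health check cleared``
message texts. The function name ``active_checks`` is hypothetical and the
patterns are inferred only from the sample messages shown here. Note that the
excerpt above does not include the message that clears ``OSD_DOWN``.

```python
import re

# "failed" and "update" messages end with the check code in parentheses.
RAISED = re.compile(r"Health check (?:failed|update): .* \((?P<code>[A-Z_]+)\)$")
# "cleared" messages name the check code right after the colon.
CLEARED = re.compile(r"Health check cleared: (?P<code>[A-Z_]+)\b")


def active_checks(messages):
    """Return the set of health check codes still raised after all messages."""
    active = set()
    for message in messages:
        raised = RAISED.search(message)
        if raised:
            active.add(raised.group("code"))
            continue
        cleared = CLEARED.search(message)
        if cleared:
            active.discard(cleared.group("code"))
    return active


print(active_checks([
    "Health check failed: 1 osds down (OSD_DOWN)",
    "Health check cleared: OSD_DOWN (was: 1 osds down)",
]))
```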

Network Performance Checks
--------------------------

Ceph OSDs send heartbeat ping messages to each other to monitor daemon
availability.  Ceph also uses the response times to monitor network performance.
A busy OSD could delay a single ping response, but if a network switch fails,
delays will be detected between multiple distinct pairs of OSDs.

By default, Ceph warns about ping times that exceed 1 second (1000 milliseconds).

::

    HEALTH_WARN Slow OSD heartbeats on back (longest 1118.001ms)

The health detail lists which combinations of OSDs are seeing the delays and by
how much.  At most 10 detail line items are shown.

::

    [WRN] OSD_SLOW_PING_TIME_BACK: Slow OSD heartbeats on back (longest 1118.001ms)
        Slow OSD heartbeats on back from osd.0 [dc1,rack1] to osd.1 [dc1,rack1] 1118.001 msec possibly improving
        Slow OSD heartbeats on back from osd.0 [dc1,rack1] to osd.2 [dc1,rack2] 1030.123 msec
        Slow OSD heartbeats on back from osd.2 [dc1,rack2] to osd.1 [dc1,rack1] 1015.321 msec
        Slow OSD heartbeats on back from osd.1 [dc1,rack1] to osd.0 [dc1,rack1] 1010.456 msec

To see more detail and a complete dump of network performance information, use
the ``dump_osd_network`` command.  Typically, this is sent to a mgr, but it can
be limited to a particular OSD's interactions by issuing it to that OSD.  The
threshold, which defaults to 1 second (1000 milliseconds), can be overridden by
passing a value in milliseconds as an argument.

The following command shows all gathered network performance data by specifying
a threshold of 0 and sending the command to the mgr.

::

    $ ceph daemon /var/run/ceph/ceph-mgr.x.asok dump_osd_network 0
    {
        "threshold": 0,
        "entries": [
            {
                "last update": "Wed Sep  4 17:04:49 2019",
                "stale": false,
                "from osd": 2,
                "to osd": 0,
                "interface": "front",
                "average": {
                    "1min": 1.023,
                    "5min": 0.860,
                    "15min": 0.883
                },
                "min": {
                    "1min": 0.818,
                    "5min": 0.607,
                    "15min": 0.607
                },
                "max": {
                    "1min": 1.164,
                    "5min": 1.173,
                    "15min": 1.544
                },
                "last": 0.924
            },
            {
                "last update": "Wed Sep  4 17:04:49 2019",
                "stale": false,
                "from osd": 2,
                "to osd": 0,
                "interface": "back",
                "average": {
                    "1min": 0.968,
                    "5min": 0.897,
                    "15min": 0.830
                },
                "min": {
                    "1min": 0.860,
                    "5min": 0.563,
                    "15min": 0.502
                },
                "max": {
                    "1min": 1.171,
                    "5min": 1.216,
                    "15min": 1.456
                },
                "last": 0.845
            },
            {
                "last update": "Wed Sep  4 17:04:48 2019",
                "stale": false,
                "from osd": 0,
                "to osd": 1,
                "interface": "front",
                "average": {
                    "1min": 0.965,
                    "5min": 0.811,
                    "15min": 0.850
                },
                "min": {
                    "1min": 0.650,
                    "5min": 0.488,
                    "15min": 0.466
                },
                "max": {
                    "1min": 1.252,
                    "5min": 1.252,
                    "15min": 1.362
                },
                "last": 0.791
            },
        ...
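
Once the JSON has been captured, it can be filtered outside of Ceph. The sketch
below picks out the entries whose 1-minute average exceeds a chosen threshold.
The function name ``slow_entries`` is hypothetical, the key names are taken
from the sample output above, and the values are assumed to be in milliseconds
like the threshold.

```python
def slow_entries(dump, threshold_ms):
    """Return (from_osd, to_osd, interface, avg_1min) tuples above threshold.

    ``dump`` is the parsed JSON output of ``dump_osd_network``.
    Results are sorted with the slowest 1-minute average first.
    """
    slow = []
    for entry in dump.get("entries", []):
        average = entry["average"]["1min"]
        if average > threshold_ms:
            slow.append(
                (entry["from osd"], entry["to osd"], entry["interface"], average)
            )
    return sorted(slow, key=lambda item: item[3], reverse=True)


# Trimmed copy of the first two entries from the sample output above.
dump = {
    "threshold": 0,
    "entries": [
        {"from osd": 2, "to osd": 0, "interface": "front",
         "average": {"1min": 1.023, "5min": 0.860, "15min": 0.883}},
        {"from osd": 2, "to osd": 0, "interface": "back",
         "average": {"1min": 0.968, "5min": 0.897, "15min": 0.830}},
    ],
}
for row in slow_entries(dump, threshold_ms=1.0):
    print(row)
```

In practice the dictionary would come from ``json.loads`` applied to the
command's output rather than being written by hand.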


Muting health checks
--------------------

Health checks can be muted so that they do not affect the overall
reported status of the cluster.  Alerts are specified using the health
check code (see :ref:`health-checks`)::

  ceph health mute <code>

For example, if there is a health warning, muting it will make the
cluster report an overall status of ``HEALTH_OK``.  To mute an
``OSD_DOWN`` alert::

  ceph health mute OSD_DOWN

Mutes are reported as part of the short and long form of the ``ceph health`` command.
For example, in the above scenario, the cluster would report::

  $ ceph health
  HEALTH_OK (muted: OSD_DOWN)
  $ ceph health detail
  HEALTH_OK (muted: OSD_DOWN)
  (MUTED) OSD_DOWN 1 osds down
      osd.1 is down
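
A monitoring script may want to distinguish a cluster that is genuinely healthy
from one that only reports ``HEALTH_OK`` because of mutes. The sketch below
parses the short ``ceph health`` line shown above. The function name
``parse_health_summary`` is hypothetical, and the handling of several muted
codes (split on whitespace or commas) is an assumption, since the sample shows
only one.

```python
import re


def parse_health_summary(line):
    """Return (status, muted_codes) from the first line of ``ceph health``."""
    match = re.match(r"^(HEALTH_\w+)(?: \(muted: ([^)]*)\))?", line.strip())
    if match is None:
        raise ValueError("unrecognized health line: %r" % line)
    status = match.group(1)
    muted_text = match.group(2)
    # Assumption: multiple codes are separated by spaces and/or commas.
    muted = muted_text.replace(",", " ").split() if muted_text else []
    return status, muted


status, muted = parse_health_summary("HEALTH_OK (muted: OSD_DOWN)")
if status == "HEALTH_OK" and muted:
    print("healthy only because of mutes:", ", ".join(muted))
```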

A mute can be explicitly removed with::

  ceph health unmute <code>

For example::

  ceph health unmute OSD_DOWN

A health check mute may optionally have a TTL (time to live)
associated with it, such that the mute will automatically expire
after the specified period of time has elapsed.  The TTL is specified as an optional
duration argument, e.g.::

  ceph health mute OSD_DOWN 4h    # mute for 4 hours
  ceph health mute MON_DOWN 15m   # mute for 15 minutes

Normally, if a muted health alert is resolved (e.g., in the example
above, the OSD comes back up), the mute goes away.  If the alert comes
back later, it will be reported in the usual way.

It is possible to make a mute "sticky" such that the mute will remain even if the
alert clears.  For example::

  ceph health mute OSD_DOWN 1h --sticky   # ignore any/all down OSDs for next hour

Most health mutes also disappear if the extent of an alert gets worse.  For example,
if there is one OSD down, and the alert is muted, the mute will disappear if one
or more additional OSDs go down.  This is true for any health alert that involves
a count indicating how much or how many of something is triggering the warning or
error.
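
The rules above can be summarized as a small decision function. This is a toy
model written for illustration, not Ceph's implementation: the ``Mute`` class
and ``mute_survives`` function are hypothetical, and treating a sticky mute as
also surviving an increase in the alert count is an assumption based on the
"ignore any/all down OSDs" comment in the example above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Mute:
    code: str                          # health check code, e.g. "OSD_DOWN"
    count: int                         # alert count when the mute was set
    expires_at: Optional[float] = None  # absolute time in seconds; None = no TTL
    sticky: bool = False


def mute_survives(mute, now, alert_count):
    """Decide whether a mute still applies.

    ``alert_count`` is the current count for the alert, or 0 if it has cleared.
    """
    if mute.expires_at is not None and now >= mute.expires_at:
        return False   # the TTL has elapsed
    if mute.sticky:
        return True    # assumption: sticky ignores clears and count increases
    if alert_count == 0:
        return False   # alert resolved, so a non-sticky mute goes away
    if alert_count > mute.count:
        return False   # the alert got worse
    return True


mute = Mute(code="OSD_DOWN", count=1)
print(mute_survives(mute, now=0.0, alert_count=1))  # still one OSD down
print(mute_survives(mute, now=0.0, alert_count=2))  # a second OSD went down
```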


Detecting configuration issues
==============================

In addition to the health checks that Ceph continuously runs on its
own status, there are some configuration issues that may only be detected
by an external tool.

Use the `ceph-medic`_ tool to run these additional checks on your Ceph
cluster's configuration.

Checking a Cluster's Usage Stats
================================

To check a cluster's data usage and data distribution among pools, you can
use the ``ceph df`` command. It is similar to the Linux ``df`` command. Execute
the following::

	ceph df

The **RAW STORAGE** section of the output provides an overview of the
amount of storage that is managed by your cluster.

- **CLASS:** The class of OSD device (or the total for the cluster)
- **SIZE:** The amount of storage capacity managed by the cluster.
- **AVAIL:** The amount of free space available in the cluster.
- **USED:** The amount of raw storage consumed by user data.
- **RAW USED:** The amount of raw storage consumed by user data, internal overhead, or reserved capacity.
- **%RAW USED:** The percentage of raw storage used. Use this number in
  conjunction with the ``full ratio`` and ``near full ratio`` to ensure that 
  you are not reaching your cluster's capacity. See `Storage Capacity`_ for 
  additional details.
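
As a sketch of how **%RAW USED** might be compared against the two ratios,
consider the following function. The name ``capacity_state`` is hypothetical,
and the default values of 0.85 and 0.95 are assumptions used for illustration;
check the values configured on your own cluster. Individual OSDs can reach a
ratio before the cluster-wide percentage does, so treat this only as a coarse
early warning.

```python
def capacity_state(raw_used_percent, nearfull_ratio=0.85, full_ratio=0.95):
    """Classify cluster-wide raw usage against the (assumed) ratios."""
    used_fraction = raw_used_percent / 100.0
    if used_fraction >= full_ratio:
        return "full"
    if used_fraction >= nearfull_ratio:
        return "nearfull"
    return "ok"


for percent in (58.6, 87.0, 96.5):
    print(percent, capacity_state(percent))
```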

The **POOLS** section of the output provides a list of pools and the notional 
usage of each pool. The output from this section **DOES NOT** reflect replicas,
clones or snapshots. For example, if you store an object with 1MB of data, the 
notional usage will be 1MB, but the actual usage may be 2MB or more depending 
on the number of replicas, clones and snapshots.

- **NAME:** The name of the pool.
- **ID:** The pool ID.
- **USED:** The notional amount of data stored in kilobytes, unless the number 
  appends **M** for megabytes or **G** for gigabytes.
- **%USED:** The notional percentage of storage used per pool.
- **MAX AVAIL:** An estimate of the notional amount of data that can be written
  to this pool.
- **OBJECTS:** The notional number of objects stored per pool.

.. note:: The numbers in the **POOLS** section are notional. They are not
   inclusive of the number of replicas, snapshots or clones. As a result,
   the sum of the **USED** and **%USED** amounts will not add up to the
   **USED** and **%RAW USED** amounts in the **RAW STORAGE** section of the
   output.

.. note:: The **MAX AVAIL** value is a complicated function of the
   replication or erasure code used, the CRUSH rule that maps storage
   to devices, the utilization of those devices, and the configured
   mon_osd_full_ratio.
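
To make the difference between notional and raw usage concrete, the following
sketch estimates raw consumption for a replicated pool and for an erasure-coded
pool. Both function names are hypothetical, and the estimate deliberately
ignores clones, snapshots and internal overhead, so real figures will be
higher.

```python
MB = 1024 * 1024


def raw_bytes_replicated(notional_bytes, replicas):
    """Each object is stored once per replica."""
    return notional_bytes * replicas


def raw_bytes_erasure_coded(notional_bytes, k, m):
    """Data is split into k data chunks plus m coding chunks."""
    return notional_bytes * (k + m) / k


# A 1 MB object in a pool with 2 replicas consumes about 2 MB of raw storage.
print(raw_bytes_replicated(1 * MB, replicas=2) / MB)
# A 4 MB object in a k=4, m=2 erasure-coded pool consumes about 6 MB.
print(raw_bytes_erasure_coded(4 * MB, k=4, m=2) / MB)
```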


Checking OSD Status
===================

You can check OSDs to ensure they are ``up`` and ``in`` by executing:: 

	ceph osd stat
	
Or:: 

	ceph osd dump
	
You can also view OSDs according to their position in the CRUSH map. ::

	ceph osd tree

Ceph will print out a CRUSH tree showing each host, its OSDs, whether the OSDs
are ``up``, and their weights. ::

	#ID CLASS WEIGHT  TYPE NAME             STATUS REWEIGHT PRI-AFF
	 -1       3.00000 pool default
	 -3       3.00000 rack mainrack
	 -2       3.00000 host osd-host
	  0   ssd 1.00000         osd.0             up  1.00000 1.00000
	  1   ssd 1.00000         osd.1             up  1.00000 1.00000
	  2   ssd 1.00000         osd.2             up  1.00000 1.00000

For a detailed discussion, refer to `Monitoring OSDs and Placement Groups`_.
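
For quick scripting, the plain-text tree shown above can be scanned for OSD
status. The sketch below is a hypothetical helper (``osd_states``) that relies
only on the column layout of the sample output, where the status follows the
``osd.N`` name. Structured output is generally a more robust input for scripts
than a text table.

```python
def osd_states(tree_text):
    """Map each ``osd.N`` name in the tree text to the status that follows it."""
    states = {}
    for line in tree_text.splitlines():
        fields = line.split()
        # Look for the OSD name; the next column is its status.
        for index, field in enumerate(fields[:-1]):
            if field.startswith("osd."):
                states[field] = fields[index + 1]
                break
    return states


tree = """\
#ID CLASS WEIGHT  TYPE NAME             STATUS REWEIGHT PRI-AFF
 -1       3.00000 pool default
 -3       3.00000 rack mainrack
 -2       3.00000 host osd-host
  0   ssd 1.00000         osd.0             up  1.00000 1.00000
  1   ssd 1.00000         osd.1             up  1.00000 1.00000
  2   ssd 1.00000         osd.2             up  1.00000 1.00000
"""
not_up = [name for name, state in osd_states(tree).items() if state != "up"]
print("OSDs not up:", not_up)
```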

Checking Monitor Status
=======================

If your cluster has multiple monitors (which is likely), check the monitor
quorum status after you start the cluster and before reading and/or writing data. A
quorum must be present when multiple monitors are running. You should also check
monitor status periodically to ensure that the monitors are running.

To display the monitor map, execute the following::

	ceph mon stat
	
Or:: 

	ceph mon dump
	
To check the quorum status for the monitor cluster, execute the following:: 
	
	ceph quorum_status

Ceph will return the quorum status. For example, a Ceph  cluster consisting of
three monitors may return the following:

.. code-block:: javascript

	{ "election_epoch": 10,
	  "quorum": [
	        0,
	        1,
	        2],
	  "quorum_names": [
		"a",
		"b",
		"c"],
	  "quorum_leader_name": "a",
	  "monmap": { "epoch": 1,
	      "fsid": "444b489c-4f16-4b75-83f0-cb8097468898",
	      "modified": "2011-12-12 13:28:27.505520",
	      "created": "2011-12-12 13:28:27.505520",
	      "features": {"persistent": [
				"kraken",
				"luminous",
				"mimic"],
		"optional": []
	      },
	      "mons": [
	            { "rank": 0,
	              "name": "a",
	              "addr": "127.0.0.1:6789/0",
		      "public_addr": "127.0.0.1:6789/0"},
	            { "rank": 1,
	              "name": "b",
	              "addr": "127.0.0.1:6790/0",
		      "public_addr": "127.0.0.1:6790/0"},
	            { "rank": 2,
	              "name": "c",
	              "addr": "127.0.0.1:6791/0",
		      "public_addr": "127.0.0.1:6791/0"}
	           ]
	  }
	}
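
The quorum status can also be checked from a script. The following sketch
compares the ``quorum_names`` list against the monitors listed in ``monmap``
and reports any monitor that is not in quorum. The function name
``quorum_report`` is hypothetical, and the key names are taken from the sample
output above.

```python
def quorum_report(status):
    """Summarize parsed ``ceph quorum_status`` output."""
    in_quorum = set(status["quorum_names"])
    all_mons = [mon["name"] for mon in status["monmap"]["mons"]]
    missing = [name for name in all_mons if name not in in_quorum]
    return {"leader": status["quorum_leader_name"], "missing": missing}


# Trimmed version of the sample output, with monitor "c" out of quorum.
status = {
    "quorum_names": ["a", "b"],
    "quorum_leader_name": "a",
    "monmap": {"mons": [{"name": "a"}, {"name": "b"}, {"name": "c"}]},
}
report = quorum_report(status)
if report["missing"]:
    print("monitors out of quorum:", ", ".join(report["missing"]))
```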

Checking MDS Status
===================

Metadata servers provide metadata services for  CephFS. Metadata servers have
two sets of states: ``up | down`` and ``active | inactive``. To ensure your
metadata servers are ``up`` and ``active``,  execute the following:: 

	ceph mds stat
	
To display details of the metadata cluster, execute the following:: 

	ceph fs dump


Checking Placement Group States
===============================

Placement groups map objects to OSDs. When you monitor your
placement groups,  you will want them to be ``active`` and ``clean``. 
For a detailed discussion, refer to `Monitoring OSDs and Placement Groups`_.

.. _Monitoring OSDs and Placement Groups: ../monitoring-osd-pg

.. _rados-monitoring-using-admin-socket:

Using the Admin Socket
======================

The Ceph admin socket allows you to query a daemon via a socket interface.
By default, Ceph sockets reside under ``/var/run/ceph``. To access a daemon
via the admin socket, log in to the host running the daemon and use either of
the following forms::

	ceph daemon {daemon-name}
	ceph daemon {path-to-socket-file}

For example, the following are equivalent::

    ceph daemon osd.0 foo
    ceph daemon /var/run/ceph/ceph-osd.0.asok foo
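
The relationship between the two forms can be sketched as a path-building
helper. The function name ``admin_socket_path`` is hypothetical, and the
layout (cluster name ``ceph``, sockets under ``/var/run/ceph``) is assumed
from the examples in this document; a cluster with a customized socket
location will differ.

```python
import posixpath


def admin_socket_path(daemon_type, daemon_id, cluster="ceph",
                      run_dir="/var/run/ceph"):
    """Build the socket path for a daemon, assuming the default layout."""
    file_name = "%s-%s.%s.asok" % (cluster, daemon_type, daemon_id)
    return posixpath.join(run_dir, file_name)


print(admin_socket_path("osd", 0))
print(admin_socket_path("mgr", "x"))
```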

To view the available admin socket commands, execute the following command:: 

	ceph daemon {daemon-name} help

The admin socket command enables you to show and set your configuration at
runtime. See `Viewing a Configuration at Runtime`_ for details.

Additionally, you can set configuration values at runtime directly through the
admin socket. The admin socket bypasses the monitor, unlike ``ceph tell
{daemon-type}.{id} config set``, which relies on the monitor but does not
require you to log in directly to the host in question.

.. _Viewing a Configuration at Runtime: ../../configuration/ceph-conf#viewing-a-configuration-at-runtime
.. _Storage Capacity: ../../configuration/mon-config-ref#storage-capacity
.. _ceph-medic: http://docs.ceph.com/ceph-medic/master/
Commit	Line	Data
7c673cae FG	1	======================
	2	Monitoring a Cluster
	3	======================
	4
	5	Once you have a running cluster, you may use the ``ceph`` tool to monitor your
	6	cluster. Monitoring a cluster typically involves checking OSD status, monitor
	7	status, placement group status and metadata server status.
	8
c07f9fc5 FG	9	Using the command line
	10	======================
	11
	12	Interactive mode
	13	----------------
7c673cae FG	14
	15	To run the ``ceph`` tool in interactive mode, type ``ceph`` at the command line
	16	with no arguments. For example::
	17
	18	ceph
	19	ceph> health
	20	ceph> status
	21	ceph> quorum_status
9f95a23c	22	ceph> mon stat
7c673cae	23
c07f9fc5 FG	24	Non-default paths
c07f9fc5 FG	25	-----------------
7c673cae FG	26
	27	If you specified non-default locations for your configuration or keyring,
	28	you may specify their locations::
	29
	30	ceph -c /path/to/conf -k /path/to/keyring health
	31
c07f9fc5 FG	32	Checking a Cluster's Status
	33	===========================
	34
	35	After you start your cluster, and before you start reading and/or
	36	writing data, check your cluster's status first.
7c673cae	37
c07f9fc5	38	To check a cluster's status, execute the following::
7c673cae	39
c07f9fc5 FG	40	ceph status
	41
	42	Or::
7c673cae	43
c07f9fc5 FG	44	ceph -s
	45
	46	In interactive mode, type ``status`` and press Enter. ::
	47
	48	ceph> status
	49
	50	Ceph will print the cluster status. For example, a tiny Ceph demonstration
	51	cluster with one of each service may print the following:
	52
	53	::
	54
	55	cluster:
	56	id: 477e46f1-ae41-4e43-9c8f-72c918ab0a20
	57	health: HEALTH_OK
	58
	59	services:
11fdf7f2	60	mon: 3 daemons, quorum a,b,c
c07f9fc5	61	mgr: x(active)
11fdf7f2 TL	62	mds: cephfs_a-1/1/1 up {0=a=up:active}, 2 up:standby
11fdf7f2 TL	63	osd: 3 osds: 3 up, 3 in
c07f9fc5 FG	64
	65	data:
	66	pools: 2 pools, 16 pgs
11fdf7f2	67	objects: 21 objects, 2.19K
c07f9fc5 FG	68	usage: 546 GB used, 384 GB / 931 GB avail
c07f9fc5 FG	69	pgs: 16 active+clean
7c673cae	70
7c673cae FG	71
	72	.. topic:: How Ceph Calculates Data Usage
	73
c07f9fc5	74	The ``usage`` value reflects the actual amount of raw storage used. The
7c673cae FG	75	``xxx GB / xxx GB`` value means the amount available (the lesser number)
	76	of the overall storage capacity of the cluster. The notional number reflects
	77	the size of the stored data before it is replicated, cloned or snapshotted.
	78	Therefore, the amount of data actually stored typically exceeds the notional
	79	amount stored, because Ceph creates replicas of the data and may also use
	80	storage capacity for cloning and snapshotting.
	81
	82
c07f9fc5 FG	83	Watching a Cluster
	84	==================
	85
	86	In addition to local logging by each daemon, Ceph clusters maintain
	87	a cluster log that records high level events about the whole system.
	88	This is logged to disk on monitor servers (as ``/var/log/ceph/ceph.log`` by
	89	default), but can also be monitored via the command line.
	90
	91	To follow the cluster log, use the following command
	92
	93	::
	94
	95	ceph -w
	96
	97	Ceph will print the status of the system, followed by each log message as it
	98	is emitted. For example:
	99
	100	::
	101
	102	cluster:
	103	id: 477e46f1-ae41-4e43-9c8f-72c918ab0a20
	104	health: HEALTH_OK
	105
	106	services:
11fdf7f2	107	mon: 3 daemons, quorum a,b,c
c07f9fc5	108	mgr: x(active)
11fdf7f2 TL	109	mds: cephfs_a-1/1/1 up {0=a=up:active}, 2 up:standby
11fdf7f2 TL	110	osd: 3 osds: 3 up, 3 in
c07f9fc5 FG	111
	112	data:
	113	pools: 2 pools, 16 pgs
11fdf7f2	114	objects: 21 objects, 2.19K
c07f9fc5 FG	115	usage: 546 GB used, 384 GB / 931 GB avail
	116	pgs: 16 active+clean
	117
	118
	119	2017-07-24 08:15:11.329298 mon.a mon.0 172.21.9.34:6789/0 23 : cluster [INF] osd.0 172.21.9.34:6806/20527 boot
	120	2017-07-24 08:15:14.258143 mon.a mon.0 172.21.9.34:6789/0 39 : cluster [INF] Activating manager daemon x
	121	2017-07-24 08:15:15.446025 mon.a mon.0 172.21.9.34:6789/0 47 : cluster [INF] Manager daemon x is now available
	122
	123
	124	In addition to using ``ceph -w`` to print log lines as they are emitted,
	125	use ``ceph log last [n]`` to see the most recent ``n`` lines from the cluster
	126	log.
	127
	128	Monitoring Health Checks
	129	========================
	130
11fdf7f2	131	Ceph continuously runs various health checks against its own status. When
c07f9fc5 FG	132	a health check fails, this is reflected in the output of ``ceph status`` (or
	133	``ceph health``). In addition, messages are sent to the cluster log to
	134	indicate when a check fails, and when the cluster recovers.
	135
	136	For example, when an OSD goes down, the ``health`` section of the status
	137	output may be updated as follows:
	138
	139	::
	140
	141	health: HEALTH_WARN
	142	1 osds down
	143	Degraded data redundancy: 21/63 objects degraded (33.333%), 16 pgs unclean, 16 pgs degraded
	144
	145	At this time, cluster log messages are also emitted to record the failure of the
	146	health checks:
	147
	148	::
	149
	150	2017-07-25 10:08:58.265945 mon.a mon.0 172.21.9.34:6789/0 91 : cluster [WRN] Health check failed: 1 osds down (OSD_DOWN)
	151	2017-07-25 10:09:01.302624 mon.a mon.0 172.21.9.34:6789/0 94 : cluster [WRN] Health check failed: Degraded data redundancy: 21/63 objects degraded (33.333%), 16 pgs unclean, 16 pgs degraded (PG_DEGRADED)
	152
	153	When the OSD comes back online, the cluster log records the cluster's return
	154	to a health state:
	155
	156	::
	157
	158	2017-07-25 10:11:11.526841 mon.a mon.0 172.21.9.34:6789/0 109 : cluster [WRN] Health check update: Degraded data redundancy: 2 pgs unclean, 2 pgs degraded, 2 pgs undersized (PG_DEGRADED)
	159	2017-07-25 10:11:13.535493 mon.a mon.0 172.21.9.34:6789/0 110 : cluster [INF] Health check cleared: PG_DEGRADED (was: Degraded data redundancy: 2 pgs unclean, 2 pgs degraded, 2 pgs undersized)
	160	2017-07-25 10:11:13.535577 mon.a mon.0 172.21.9.34:6789/0 111 : cluster [INF] Cluster is now healthy
	161
eafe8130 TL	162	Network Performance Checks
	163	--------------------------
	164
	165	Ceph OSDs send heartbeat ping messages amongst themselves to monitor daemon availability. We
	166	also use the response times to monitor network performance.
	167	While it is possible that a busy OSD could delay a ping response, we can assume
9f95a23c	168	that if a network switch fails multiple delays will be detected between distinct pairs of OSDs.
eafe8130 TL	169
	170	By default we will warn about ping times which exceed 1 second (1000 milliseconds).
	171
	172	::
	173
9f95a23c	174	HEALTH_WARN Slow OSD heartbeats on back (longest 1118.001ms)
eafe8130 TL	175
	176	The health detail will add the combination of OSDs are seeing the delays and by how much. There is a limit of 10
	177	detail line items.
	178
	179	::
	180
9f95a23c TL	181	[WRN] OSD_SLOW_PING_TIME_BACK: Slow OSD heartbeats on back (longest 1118.001ms)
	182	Slow OSD heartbeats on back from osd.0 [dc1,rack1] to osd.1 [dc1,rack1] 1118.001 msec possibly improving
	183	Slow OSD heartbeats on back from osd.0 [dc1,rack1] to osd.2 [dc1,rack2] 1030.123 msec
	184	Slow OSD heartbeats on back from osd.2 [dc1,rack2] to osd.1 [dc1,rack1] 1015.321 msec
	185	Slow OSD heartbeats on back from osd.1 [dc1,rack1] to osd.0 [dc1,rack1] 1010.456 msec
eafe8130 TL	186
	187	To see even more detail and a complete dump of network performance information the ``dump_osd_network`` command can be used. Typically, this would be
	188	sent to a mgr, but it can be limited to a particular OSD's interactions by issuing it to any OSD. The current threshold which defaults to 1 second
	189	(1000 milliseconds) can be overridden as an argument in milliseconds.
	190
	191	The following command will show all gathered network performance data by specifying a threshold of 0 and sending to the mgr.
	192
	193	::
	194
	195	$ ceph daemon /var/run/ceph/ceph-mgr.x.asok dump_osd_network 0
	196	{
	197	"threshold": 0,
	198	"entries": [
	199	{
	200	"last update": "Wed Sep 4 17:04:49 2019",
	201	"stale": false,
	202	"from osd": 2,
	203	"to osd": 0,
	204	"interface": "front",
	205	"average": {
	206	"1min": 1.023,
	207	"5min": 0.860,
	208	"15min": 0.883
	209	},
	210	"min": {
	211	"1min": 0.818,
	212	"5min": 0.607,
	213	"15min": 0.607
	214	},
	215	"max": {
	216	"1min": 1.164,
	217	"5min": 1.173,
	218	"15min": 1.544
	219	},
	220	"last": 0.924
	221	},
	222	{
	223	"last update": "Wed Sep 4 17:04:49 2019",
	224	"stale": false,
	225	"from osd": 2,
	226	"to osd": 0,
	227	"interface": "back",
	228	"average": {
	229	"1min": 0.968,
	230	"5min": 0.897,
	231	"15min": 0.830
	232	},
	233	"min": {
	234	"1min": 0.860,
	235	"5min": 0.563,
	236	"15min": 0.502
	237	},
	238	"max": {
	239	"1min": 1.171,
	240	"5min": 1.216,
	241	"15min": 1.456
	242	},
	243	"last": 0.845
	244	},
	245	{
	246	"last update": "Wed Sep 4 17:04:48 2019",
	247	"stale": false,
	248	"from osd": 0,
	249	"to osd": 1,
250	"interface": "front",
251	"average": {
252	"1min": 0.965,
253	"5min": 0.811,
254	"15min": 0.850
255	},
256	"min": {
257	"1min": 0.650,
258	"5min": 0.488,
259	"15min": 0.466
260	},
261	"max": {
262	"1min": 1.252,
263	"5min": 1.252,
264	"15min": 1.362
265	},
266	"last": 0.791
267	},
268	...
269
c07f9fc5	270
9f95a23c TL	271
	272	Muting health checks
	273	--------------------
	274
	275	Health checks can be muted so that they do not affect the overall
	276	reported status of the cluster. Alerts are specified using the health
	277	check code (see :ref:`health-checks`)::
	278
	279	ceph health mute <code>
	280
	281	For example, if there is a health warning, muting it will make the
	282	cluster report an overall status of ``HEALTH_OK``. For example, to
	283	mute an ``OSD_DOWN`` alert,::
	284
	285	ceph health mute OSD_DOWN
	286
	287	Mutes are reported as part of the short and long form of the ``ceph health`` command.
	288	For example, in the above scenario, the cluster would report::
	289
	290	$ ceph health
	291	HEALTH_OK (muted: OSD_DOWN)
	292	$ ceph health detail
	293	HEALTH_OK (muted: OSD_DOWN)
	294	(MUTED) OSD_DOWN 1 osds down
	295	osd.1 is down
	296
	297	A mute can be explicitly removed with::
	298
	299	ceph health unmute <code>
	300
	301	For example,::
	302
	303	ceph health unmute OSD_DOWN
	304
	305	A health check mute may optionally have a TTL (time to live)
	306	associated with it, such that the mute will automatically expire
	307	after the specified period of time has elapsed. The TTL is specified as an optional
	308	duration argument, e.g.::
	309
	310	ceph health mute OSD_DOWN 4h # mute for 4 hours
	311	ceph health mute MON_DOWN 15m # mute for 15 minutes
	312
	313	Normally, if a muted health alert is resolved (e.g., in the example
	314	above, the OSD comes back up), the mute goes away. If the alert comes
	315	back later, it will be reported in the usual way.
	316
	317	It is possible to make a mute "sticky" such that the mute will remain even if the
	318	alert clears. For example,::
	319
	320	ceph health mute OSD_DOWN 1h --sticky # ignore any/all down OSDs for next hour
	321
	322	Most health mutes also disappear if the extent of an alert gets worse. For example,
	323	if there is one OSD down, and the alert is muted, the mute will disappear if one
	324	or more additional OSDs go down. This is true for any health alert that involves
	325	a count indicating how much or how many of something is triggering the warning or
	326	error.
	327
	328
c07f9fc5 FG	329	Detecting configuration issues
	330	==============================
	331
	332	In addition to the health checks that Ceph continuously runs on its
	333	own status, there are some configuration issues that may only be detected
	334	by an external tool.
	335
	336	Use the `ceph-medic`_ tool to run these additional checks on your Ceph
	337	cluster's configuration.
	338
7c673cae FG	339	Checking a Cluster's Usage Stats
	340	================================
	341
	342	To check a cluster's data usage and data distribution among pools, you can
	343	use the ``df`` option. It is similar to Linux ``df``. Execute
	344	the following::
	345
	346	ceph df
	347
11fdf7f2 TL	348	The RAW STORAGE section of the output provides an overview of the
11fdf7f2 TL	349	amount of storage that is managed by your cluster.
7c673cae	350
11fdf7f2 TL	351	- CLASS: The class of OSD device (or the total for the cluster)
11fdf7f2 TL	352	- SIZE: The amount of storage capacity managed by the cluster.
7c673cae	353	- AVAIL: The amount of free space available in the cluster.
11fdf7f2 TL	354	- USED: The amount of raw storage consumed by user data.
	355	- RAW USED: The amount of raw storage consumed by user data, internal overhead, or reserved capacity.
	356	- %RAW USED: The percentage of raw storage used. Use this number in
7c673cae FG	357	conjunction with the ``full ratio`` and ``near full ratio`` to ensure that
	358	you are not reaching your cluster's capacity. See `Storage Capacity`_ for
	359	additional details.
	360
	361	The POOLS section of the output provides a list of pools and the notional
	362	usage of each pool. The output from this section DOES NOT reflect replicas,
	363	clones or snapshots. For example, if you store an object with 1MB of data, the
	364	notional usage will be 1MB, but the actual usage may be 2MB or more depending
	365	on the number of replicas, clones and snapshots.
	366
	367	- NAME: The name of the pool.
	368	- ID: The pool ID.
	369	- USED: The notional amount of data stored in kilobytes, unless the number
	370	appends M for megabytes or G for gigabytes.
	371	- %USED: The notional percentage of storage used per pool.
	372	- MAX AVAIL: An estimate of the notional amount of data that can be written
	373	to this pool.
11fdf7f2	374	- OBJECTS: The notional number of objects stored per pool.
7c673cae FG	375
.. note:: The numbers in the POOLS section are notional. They are not
   inclusive of the number of replicas, snapshots or clones. As a result,
   the sum of the USED and %USED amounts will not add up to the
   USED and %USED amounts in the RAW STORAGE section of the
   output.

.. note:: The MAX AVAIL value is a complicated function of the
   replication or erasure code used, the CRUSH rule that maps storage
   to devices, the utilization of those devices, and the configured
   ``mon_osd_full_ratio``.


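The gap between notional and raw usage can be made concrete with a little
arithmetic. The sketch below assumes a replicated pool in which every object
is stored once per replica, and it ignores clones, snapshots and internal
overhead. ``raw_usage_mb`` is a hypothetical helper written for this
illustration; it is not part of Ceph.

.. code-block:: python

    def raw_usage_mb(notional_mb, replicas):
        """Estimate raw usage for a replicated pool: one full copy per replica."""
        if replicas < 1:
            raise ValueError("a pool needs at least one replica")
        return notional_mb * replicas


    if __name__ == "__main__":
        # A 1 MB object stored in pools with 1, 2 and 3 replicas.
        for replicas in (1, 2, 3):
            print(replicas, raw_usage_mb(1, replicas))

Under these assumptions, a 1MB object in a pool with two replicas occupies 2MB
of raw storage, which is one reason the POOLS figures do not add up to the
RAW STORAGE figures.
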
Checking OSD Status
===================

You can check OSDs to ensure they are ``up`` and ``in`` by executing::

    ceph osd stat

Or::

    ceph osd dump

You can also view OSDs according to their position in the CRUSH map. ::

    ceph osd tree

Ceph will print out a CRUSH tree showing each host, its OSDs, whether they
are up, and their weight. ::

    #ID CLASS WEIGHT  TYPE NAME             STATUS REWEIGHT PRI-AFF
     -1       3.00000 pool default
     -3       3.00000 rack mainrack
     -2       3.00000 host osd-host
      0   ssd 1.00000         osd.0             up  1.00000 1.00000
      1   ssd 1.00000         osd.1             up  1.00000 1.00000
      2   ssd 1.00000         osd.2             up  1.00000 1.00000

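If you want to scan such a listing from a script, the following Python sketch
extracts the status of each OSD from the sample output above. It assumes the
column layout shown in this document, where the status word follows the OSD
name. ``osd_statuses`` is a hypothetical helper, and the plain-text layout is
meant for people and may differ between releases, so treat this as an
illustration rather than a robust parser.

.. code-block:: python

    SAMPLE_TREE = """\
    #ID CLASS WEIGHT  TYPE NAME             STATUS REWEIGHT PRI-AFF
     -1       3.00000 pool default
     -3       3.00000 rack mainrack
     -2       3.00000 host osd-host
      0   ssd 1.00000         osd.0             up  1.00000 1.00000
      1   ssd 1.00000         osd.1             up  1.00000 1.00000
      2   ssd 1.00000         osd.2             up  1.00000 1.00000
    """


    def osd_statuses(tree_text):
        """Map OSD names (for example 'osd.0') to the status word after them."""
        statuses = {}
        for line in tree_text.splitlines():
            fields = line.split()
            # Look at every field except the last, so fields[index + 1] exists.
            for index, field in enumerate(fields[:-1]):
                if field.startswith("osd.") and field[4:].isdigit():
                    statuses[field] = fields[index + 1]
                    break
        return statuses


    if __name__ == "__main__":
        for name, status in sorted(osd_statuses(SAMPLE_TREE).items()):
            print(name, status)
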
For a detailed discussion, refer to `Monitoring OSDs and Placement Groups`_.

Checking Monitor Status
=======================

If your cluster has multiple monitors (likely), you should check the monitor
quorum status after you start the cluster and before reading and/or writing
data. A quorum must be present when multiple monitors are running. You should
also check monitor status periodically to ensure that they are running.

To display the monitor map, execute the following::

    ceph mon stat

Or::

    ceph mon dump

To check the quorum status for the monitor cluster, execute the following::

    ceph quorum_status

Ceph will return the quorum status. For example, a Ceph cluster consisting of
three monitors may return the following:

.. code-block:: javascript

    { "election_epoch": 10,
      "quorum": [
            0,
            1,
            2],
      "quorum_names": [
            "a",
            "b",
            "c"],
      "quorum_leader_name": "a",
      "monmap": { "epoch": 1,
          "fsid": "444b489c-4f16-4b75-83f0-cb8097468898",
          "modified": "2011-12-12 13:28:27.505520",
          "created": "2011-12-12 13:28:27.505520",
          "features": {"persistent": [
                "kraken",
                "luminous",
                "mimic"],
            "optional": []
          },
          "mons": [
                { "rank": 0,
                  "name": "a",
                  "addr": "127.0.0.1:6789/0",
                  "public_addr": "127.0.0.1:6789/0"},
                { "rank": 1,
                  "name": "b",
                  "addr": "127.0.0.1:6790/0",
                  "public_addr": "127.0.0.1:6790/0"},
                { "rank": 2,
                  "name": "c",
                  "addr": "127.0.0.1:6791/0",
                  "public_addr": "127.0.0.1:6791/0"}
               ]
        }
    }

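A script can read this output to confirm that every monitor in the monitor map
has joined the quorum. The Python sketch below works on a shortened copy of the
sample above and uses only the field names that appear in it.
``summarize_quorum`` is a hypothetical helper, and the majority test reflects
the usual rule that a quorum needs more than half of the monitors in the map.

.. code-block:: python

    import json

    # Shortened copy of the sample quorum status shown above.
    SAMPLE_STATUS = """
    { "election_epoch": 10,
      "quorum": [0, 1, 2],
      "quorum_names": ["a", "b", "c"],
      "quorum_leader_name": "a",
      "monmap": { "epoch": 1,
          "mons": [
              { "rank": 0, "name": "a" },
              { "rank": 1, "name": "b" },
              { "rank": 2, "name": "c" }
          ]
      }
    }
    """


    def summarize_quorum(status_json):
        """Report the leader, monitors missing from quorum, and majority state."""
        status = json.loads(status_json)
        known = [mon["name"] for mon in status["monmap"]["mons"]]
        in_quorum = set(status["quorum_names"])
        missing = [name for name in known if name not in in_quorum]
        return {
            "leader": status["quorum_leader_name"],
            "missing": missing,
            # More than half of the known monitors must be in the quorum.
            "has_majority": 2 * len(in_quorum) > len(known),
        }


    if __name__ == "__main__":
        print(summarize_quorum(SAMPLE_STATUS))
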
Checking MDS Status
===================

Metadata servers provide metadata services for CephFS. Metadata servers have
two sets of states: ``up | down`` and ``active | inactive``. To ensure your
metadata servers are ``up`` and ``active``, execute the following::

    ceph mds stat

To display details of the metadata cluster, execute the following::

    ceph fs dump


Checking Placement Group States
===============================

Placement groups map objects to OSDs. When you monitor your
placement groups, you will want them to be ``active`` and ``clean``.
For a detailed discussion, refer to `Monitoring OSDs and Placement Groups`_.

.. _Monitoring OSDs and Placement Groups: ../monitoring-osd-pg

.. _rados-monitoring-using-admin-socket:

Using the Admin Socket
======================

The Ceph admin socket allows you to query a daemon via a socket interface.
By default, Ceph sockets reside under ``/var/run/ceph``. To access a daemon
via the admin socket, log in to the host running the daemon and use one of the
following commands::

    ceph daemon {daemon-name}
    ceph daemon {path-to-socket-file}

For example, the following are equivalent::

    ceph daemon osd.0 foo
    ceph daemon /var/run/ceph/ceph-osd.0.asok foo

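The two forms above suggest how a daemon name corresponds to a socket file.
The Python sketch below builds the path from the name, assuming the naming
pattern seen in the example (``{cluster}-{daemon-name}.asok``), a cluster
named ``ceph`` and the default directory. ``admin_socket_path`` is a
hypothetical helper, and the socket location can be configured differently on
your hosts.

.. code-block:: python

    import posixpath


    def admin_socket_path(daemon_name, cluster="ceph", socket_dir="/var/run/ceph"):
        """Build the socket path for a daemon name such as 'osd.0'.

        The '{cluster}-{daemon-name}.asok' pattern is inferred from the example
        in this document and is an assumption, not a guarantee.
        """
        return posixpath.join(socket_dir, f"{cluster}-{daemon_name}.asok")


    if __name__ == "__main__":
        print(admin_socket_path("osd.0"))
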
To view the available admin socket commands, execute the following command::

    ceph daemon {daemon-name} help

The admin socket command enables you to show and set your configuration at
runtime. See `Viewing a Configuration at Runtime`_ for details.

Additionally, you can set configuration values at runtime directly (i.e., the
admin socket bypasses the monitor, unlike
``ceph tell {daemon-type}.{id} config set``, which relies on the monitor but
doesn't require you to log in directly to the host in question).

.. _Viewing a Configuration at Runtime: ../../configuration/ceph-conf#viewing-a-configuration-at-runtime
.. _Storage Capacity: ../../configuration/mon-config-ref#storage-capacity
.. _ceph-medic: http://docs.ceph.com/ceph-medic/master/