Import ceph 15.2.8

[ceph.git] / ceph / doc / architecture.rst
diff --git a/ceph/doc/architecture.rst b/ceph/doc/architecture.rst

index 6d24c5844c10de0780403e84b7e1b387f3058c14..c1f7f5ff9c4b7cd288c7076ea303e8cb878c7934 100644 (file)
--- a/ceph/doc/architecture.rst
+++ b/ceph/doc/architecture.rst
@@ -27,7 +27,9 @@ A Ceph Storage Cluster consists of two types of daemons:
  - :term:`Ceph Monitor`
  - :term:`Ceph OSD Daemon`
  
-.. ditaa::  +---------------+ +---------------+
+.. ditaa::
+
+            +---------------+ +---------------+
              |      OSDs     | |    Monitors   |
              +---------------+ +---------------+
  
@@ -51,12 +53,14 @@ Storing Data
  
  The Ceph Storage Cluster receives data from :term:`Ceph Clients`--whether it
  comes through a :term:`Ceph Block Device`, :term:`Ceph Object Storage`, the
-:term:`Ceph Filesystem` or a custom implementation you create using
+:term:`Ceph File System` or a custom implementation you create using
  ``librados``--and it stores the data as objects. Each object corresponds to a
  file in a filesystem, which is stored on an :term:`Object Storage Device`. Ceph
  OSD Daemons handle the read/write operations on the storage disks.
  
-.. ditaa:: /-----\       +-----+       +-----+
+.. ditaa::
+
+           /-----\       +-----+       +-----+
             | obj |------>| {d} |------>| {s} |
             \-----/       +-----+       +-----+
     
@@ -70,7 +74,9 @@ attributes such as the file owner, created date, last modified date, and so
  forth.
  
  
-.. ditaa:: /------+------------------------------+----------------\
+.. ditaa::
+
+           /------+------------------------------+----------------\
             | ID   | Binary Data                  | Metadata       |
             +------+------------------------------+----------------+
             | 1234 | 0101010101010100110101010010 | name1 = value1 | 
@@ -229,7 +235,9 @@ the client and the monitor share a secret key.
  .. note:: The ``client.admin`` user must provide the user ID and 
     secret key to the user in a secure manner. 
  
-.. ditaa:: +---------+     +---------+
+.. ditaa::
+
+           +---------+     +---------+
             | Client  |     | Monitor |
             +---------+     +---------+
                  |  request to   |
@@ -252,7 +260,9 @@ user's secret key and transmits it back to the client. The client decrypts the
  ticket and uses it to sign requests to OSDs and metadata servers throughout the
  cluster.
  
-.. ditaa:: +---------+     +---------+
+.. ditaa::
+
+           +---------+     +---------+
             | Client  |     | Monitor |
             +---------+     +---------+
                  |  authenticate |
@@ -283,7 +293,9 @@ machine and the Ceph servers. Each message sent between a client and server,
  subsequent to the initial authentication, is signed using a ticket that the
  monitors, OSDs and metadata servers can verify with their shared secret.
  
-.. ditaa:: +---------+     +---------+     +-------+     +-------+
+.. ditaa::
+
+           +---------+     +---------+     +-------+     +-------+
             |  Client |     | Monitor |     |  MDS  |     |  OSD  |
             +---------+     +---------+     +-------+     +-------+
                  |  request to   |              |             |
@@ -393,7 +405,8 @@ ability to leverage this computing power leads to several major benefits:
     and tertiary OSDs (as many OSDs as additional replicas), and responds to the
     client once it has confirmed the object was stored successfully.
  
-.. ditaa:: 
+.. ditaa::
+
               +----------+
               |  Client  |
               |          |
@@ -443,7 +456,8 @@ Ceph Clients retrieve a `Cluster Map`_ from a Ceph Monitor, and write objects to
  pools. The pool's ``size`` or number of replicas, the CRUSH rule and the
  number of placement groups determine how Ceph will place the data.
  
-.. ditaa:: 
+.. ditaa::
+
              +--------+  Retrieves  +---------------+
              | Client |------------>|  Cluster Map  |
              +--------+             +---------------+
@@ -488,7 +502,8 @@ rebalance dynamically when new Ceph OSD Daemons and the underlying OSD devices
  come online. The following diagram depicts how CRUSH maps objects to placement
  groups, and placement groups to OSDs.
  
-.. ditaa:: 
+.. ditaa::
+
             /-----\  /-----\  /-----\  /-----\  /-----\
             | obj |  | obj |  | obj |  | obj |  | obj |
             \-----/  \-----/  \-----/  \-----/  \-----/
@@ -614,7 +629,8 @@ and each OSD gets some added capacity, so there are no load spikes on the
  new OSD after rebalancing is complete.
  
  
-.. ditaa:: 
+.. ditaa::
+
             +--------+     +--------+
     Before  |  OSD 1 |     |  OSD 2 |
             +--------+     +--------+
@@ -685,6 +701,7 @@ name. Chunk 1 contains ``ABC`` and is stored on **OSD5** while chunk 4 contains
  
  
  .. ditaa::
+
                              +-------------------+
                         name |       NYAN        |
                              +-------------------+
@@ -739,6 +756,7 @@ three chunks are read: **OSD2** was the slowest and its chunk was not taken into
  account.
  
  .. ditaa::
+
                                  +-------------------+
                             name |       NYAN        |
                                  +-------------------+
@@ -794,7 +812,7 @@ sends them to the other OSDs. It is also responsible for maintaining an
  authoritative version of the placement group logs.
  
  In the following diagram, an erasure coded placement group has been created with
-``K = 2 + M = 1`` and is supported by three OSDs, two for ``K`` and one for
+``K = 2, M = 1`` and is supported by three OSDs, two for ``K`` and one for
  ``M``. The acting set of the placement group is made of **OSD 1**, **OSD 2** and
  **OSD 3**. An object has been encoded and stored in the OSDs : the chunk
  ``D1v1`` (i.e. Data chunk number 1, version 1) is on **OSD 1**, ``D2v1`` on
@@ -804,6 +822,7 @@ version 1).
  
  
  .. ditaa::
+
       Primary OSD
      
     +-------------+
@@ -934,6 +953,7 @@ object can be removed: ``D1v1`` on **OSD 1**, ``D2v1`` on **OSD 2** and ``C1v1``
  on **OSD 3**.
  
  .. ditaa::
+
       Primary OSD
      
     +-------------+
@@ -972,6 +992,7 @@ to be available on all OSDs in the previous acting set ) is ``1,1`` and that
  will be the head of the new authoritative log.
  
  .. ditaa::
+
     +-------------+
     |    OSD 1    |
     |   (down)    |
@@ -1017,6 +1038,7 @@ the erasure coding library during scrubbing and stored on the new primary
  
  
  .. ditaa::
+
       Primary OSD
      
     +-------------+
@@ -1068,7 +1090,8 @@ tier. So the cache tier and the backing storage tier are completely transparent
  to Ceph clients.
  
  
-.. ditaa:: 
+.. ditaa::
+
             +-------------+
             | Ceph Client |
             +------+------+
@@ -1150,7 +1173,8 @@ Cluster. Ceph packages this functionality into the ``librados`` library so that
  you can create your own custom Ceph Clients. The following diagram depicts the
  basic architecture.
  
-.. ditaa::  
+.. ditaa::
+
              +---------------------------------+
              |  Ceph Storage Cluster Protocol  |
              |           (librados)            |
@@ -1193,7 +1217,9 @@ notification. This enables a client to use any object as a
  synchronization/communication channel.
  
  
-.. ditaa:: +----------+     +----------+     +----------+     +---------------+
+.. ditaa::
+
+           +----------+     +----------+     +----------+     +---------------+
             | Client 1 |     | Client 2 |     | Client 3 |     | OSD:Object ID |
             +----------+     +----------+     +----------+     +---------------+
                   |                |                |                  |
@@ -1249,13 +1275,13 @@ The RAID type most similar to Ceph's striping is `RAID 0`_, or a 'striped
  volume'. Ceph's striping offers the throughput of RAID 0 striping, the
  reliability of n-way RAID mirroring and faster recovery.
  
-Ceph provides three types of clients: Ceph Block Device, Ceph Filesystem, and
+Ceph provides three types of clients: Ceph Block Device, Ceph File System, and
  Ceph Object Storage. A Ceph Client converts its data from the representation 
  format it provides to its users (a block device image, RESTful objects, CephFS
  filesystem directories) into objects for storage in the Ceph Storage Cluster. 
  
  .. tip:: The objects Ceph stores in the Ceph Storage Cluster are not striped. 
-   Ceph Object Storage, Ceph Block Device, and the Ceph Filesystem stripe their 
+   Ceph Object Storage, Ceph Block Device, and the Ceph File System stripe their 
     data over multiple Ceph Storage Cluster objects. Ceph Clients that write 
     directly to the Ceph Storage Cluster via ``librados`` must perform the
     striping (and parallel I/O) for themselves to obtain these benefits.
@@ -1269,7 +1295,8 @@ take maximum advantage of Ceph's ability to distribute data across placement
  groups, and consequently doesn't improve performance very much. The following
  diagram depicts the simplest form of striping:
  
-.. ditaa::              
+.. ditaa::
+
                          +---------------+
                          |  Client Data  |
                          |     Format    |
@@ -1327,7 +1354,8 @@ set (``object set 2`` in the following diagram), and begins writing to the first
  stripe (``stripe unit 16``) in the first object in the new object set (``object
  4`` in the diagram below).
  
-.. ditaa::                 
+.. ditaa::
+
                            +---------------+
                            |  Client Data  |
                            |     Format    |
@@ -1434,7 +1462,7 @@ Ceph Clients include a number of service interfaces. These include:
    provides RESTful APIs with interfaces that are compatible with Amazon S3
    and OpenStack Swift. 
    
-- **Filesystem**: The :term:`Ceph Filesystem` (CephFS) service provides 
+- **Filesystem**: The :term:`Ceph File System` (CephFS) service provides 
    a POSIX compliant filesystem usable with ``mount`` or as 
    a filesystem in user space (FUSE).
  
@@ -1443,6 +1471,7 @@ and high availability. The following diagram depicts the high-level
  architecture. 
  
  .. ditaa::
+
              +--------------+  +----------------+  +-------------+
              | Block Device |  | Object Storage |  |   CephFS    |
              +--------------+  +----------------+  +-------------+            
@@ -1513,20 +1542,21 @@ client. Other virtualization technologies such as Xen can access the Ceph Block
  Device kernel object(s). This is done with the  command-line tool ``rbd``.
  
  
-.. index:: CephFS; Ceph Filesystem; libcephfs; MDS; metadata server; ceph-mds
+.. index:: CephFS; Ceph File System; libcephfs; MDS; metadata server; ceph-mds
  
  .. _arch-cephfs:
  
-Ceph Filesystem
----------------
+Ceph File System
+----------------
  
-The Ceph Filesystem (CephFS) provides a POSIX-compliant filesystem as a
+The Ceph File System (CephFS) provides a POSIX-compliant filesystem as a
  service that is layered on top of the object-based Ceph Storage Cluster.
  CephFS files get mapped to objects that Ceph stores in the Ceph Storage
  Cluster. Ceph Clients mount a CephFS filesystem as a kernel object or as
  a Filesystem in User Space (FUSE).
  
  .. ditaa::
+
              +-----------------------+  +------------------------+
              | CephFS Kernel Object  |  |      CephFS FUSE       |
              +-----------------------+  +------------------------+            
@@ -1544,14 +1574,14 @@ a Filesystem in User Space (FUSE).
              +---------------+ +---------------+ +---------------+
  
  
-The Ceph Filesystem service includes the Ceph Metadata Server (MDS) deployed
+The Ceph File System service includes the Ceph Metadata Server (MDS) deployed
  with the Ceph Storage cluster. The purpose of the MDS is to store all the
  filesystem metadata (directories, file ownership, access modes, etc) in
  high-availability Ceph Metadata Servers where the metadata resides in memory.
  The reason for the MDS (a daemon called ``ceph-mds``) is that simple filesystem
  operations like listing a directory or changing a directory (``ls``, ``cd``)
  would tax the Ceph OSD Daemons unnecessarily. So separating the metadata from
-the data means that the Ceph Filesystem can provide high performance services
+the data means that the Ceph File System can provide high performance services
  without taxing the Ceph Storage Cluster.
  
  CephFS separates the metadata from the data, storing the metadata in the MDS,