bump version to 6.2-6

[pve-docs.git] / pvecm.adoc
diff --git a/pvecm.adoc b/pvecm.adoc

index 0c7e14c994287556e2383919a95b4bc8473d7159..3820c178862efacec241942b6fdbd66d102eddce 100644 (file)
--- a/pvecm.adoc
+++ b/pvecm.adoc
@@ -61,7 +61,7 @@ Requirements
  
  * Date and time have to be synchronized.
  
-* SSH tunnel on TCP port 22 between nodes is used. 
+* SSH tunnel on TCP port 22 between nodes is used.
  
  * If you are interested in High Availability, you need to have at
    least three nodes for reliable quorum. All nodes should have the
@@ -75,9 +75,9 @@ Requirements
  NOTE: It is not possible to mix {pve} 3.x and earlier with {pve} 4.X cluster
  nodes.
  
-NOTE: While it's possible for {pve} 4.4 and {pve} 5.0 this is not supported as
-production configuration and should only used temporarily during upgrading the
-whole cluster from one to another major version.
+NOTE: While it's possible to mix {pve} 4.4 and {pve} 5.0  nodes, doing so is
+not supported as production configuration and should only used temporarily
+during upgrading the whole cluster from one to another major version.
  
  NOTE: Running a cluster of {pve} 6.x with earlier versions is not possible. The
  cluster protocol (corosync) between {pve} 6.x and earlier versions changed
@@ -92,10 +92,6 @@ First, install {PVE} on all nodes. Make sure that each node is
  installed with the final hostname and IP configuration. Changing the
  hostname and IP is not possible after cluster creation.
  
-Currently the cluster creation can either be done on the console (login via
-`ssh`) or the API, which we have a GUI implementation for (__Datacenter ->
-Cluster__).
-
  While it's common to reference all nodenames and their IPs in `/etc/hosts` (or
  make their names resolvable through other means), this is not necessary for a
  cluster to work. It may be useful however, as you can then connect from one node
@@ -105,23 +101,47 @@ recommend to reference nodes by their IP addresses in the cluster configuration.
  
  
  [[pvecm_create_cluster]]
-Create the Cluster
-------------------
+Create a Cluster
+----------------
+
+You can either create a cluster on the console (login via `ssh`), or through
+the API using the {pve} Webinterface (__Datacenter -> Cluster__).
+
+NOTE: Use a unique name for your cluster. This name cannot be changed later.
+The cluster name follows the same rules as node names.
+
+[[pvecm_cluster_create_via_gui]]
+Create via Web GUI
+~~~~~~~~~~~~~~~~~~
+
+[thumbnail="screenshot/gui-cluster-create.png"]
+
+Under __Datacenter -> Cluster__, click on *Create Cluster*. Enter the cluster
+name and select a network connection from the dropdown to serve as the main
+cluster network (Link 0). It defaults to the IP resolved via the node's
+hostname.
  
-Login via `ssh` to the first {pve} node. Use a unique name for your cluster.
-This name cannot be changed later. The cluster name follows the same rules as
-node names.
+To add a second link as fallback, you can select the 'Advanced' checkbox and
+choose an additional network interface (Link 1, see also
+xref:pvecm_redundancy[Corosync Redundancy]).
+
+NOTE: Ensure the network selected for the cluster communication is not used for
+any high traffic loads like those of (network) storages or live-migration.
+While the cluster network itself produces small amounts of data, it is very
+sensitive to latency. Check out full
+xref:pvecm_cluster_network_requirements[cluster network requirements].
+
+[[pvecm_cluster_create_via_cli]]
+Create via Command Line
+~~~~~~~~~~~~~~~~~~~~~~~
+
+Login via `ssh` to the first {pve} node and run the following command:
  
  ----
   hp1# pvecm create CLUSTERNAME
  ----
  
-NOTE: It is possible to create multiple clusters in the same physical or logical
-network. Use unique cluster names if you do so. To avoid human confusion, it is
-also recommended to choose different names even if clusters do not share the
-cluster network.
-
-To check the state of your cluster use:
+To check the state of the new cluster use:
  
  ----
   hp1# pvecm status
@@ -131,9 +151,9 @@ Multiple Clusters In Same Network
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  
  It is possible to create multiple clusters in the same physical or logical
-network. Each such cluster must have a unique name, this does not only helps
-admins to distinguish on which cluster they currently operate, it is also
-required to avoid possible clashes in the cluster communication stack.
+network. Each such cluster must have a unique name to avoid possible clashes in
+the cluster communication stack. This also helps avoid human confusion by making
+clusters clearly distinguishable.
  
  While the bandwidth requirement of a corosync cluster is relatively low, the
  latency of packages and the package per second (PPS) rate is the limiting
@@ -145,7 +165,47 @@ infrastructure for bigger clusters.
  Adding Nodes to the Cluster
  ---------------------------
  
-Login via `ssh` to the node you want to add.
+CAUTION: A node that is about to be added to the cluster cannot hold any guests.
+All existing configuration in `/etc/pve` is overwritten when joining a cluster,
+since guest IDs could be conflicting. As a workaround create a backup of the
+guest (`vzdump`) and restore it as a different ID after the node has been added
+to the cluster.
+
+Join Node to Cluster via GUI
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+[thumbnail="screenshot/gui-cluster-join-information.png"]
+
+Login to the web interface on an existing cluster node. Under __Datacenter ->
+Cluster__, click the button *Join Information* at the top. Then, click on the
+button *Copy Information*. Alternatively, copy the string from the 'Information'
+field manually.
+
+[thumbnail="screenshot/gui-cluster-join.png"]
+
+Next, login to the web interface on the node you want to add.
+Under __Datacenter -> Cluster__, click on *Join Cluster*. Fill in the
+'Information' field with the 'Join Information' text you copied earlier.
+Most settings required for joining the cluster will be filled out
+automatically. For security reasons, the cluster password has to be entered
+manually.
+
+NOTE: To enter all required data manually, you can disable the 'Assisted Join'
+checkbox.
+
+After clicking the *Join* button, the cluster join process will start
+immediately. After the node joined the cluster its current node certificate
+will be replaced by one signed from the cluster certificate authority (CA),
+that means the current session will stop to work after a few seconds. You might
+then need to force-reload the webinterface and re-login with the cluster
+credentials.
+
+Now your node should be visible under __Datacenter -> Cluster__.
+
+Join Node to Cluster via Command Line
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Login via `ssh` to the node you want to join into an existing cluster.
  
  ----
   hp2# pvecm add IP-ADDRESS-CLUSTER
@@ -154,11 +214,6 @@ Login via `ssh` to the node you want to add.
  For `IP-ADDRESS-CLUSTER` use the IP or hostname of an existing cluster node.
  An IP address is recommended (see xref:pvecm_corosync_addresses[Link Address Types]).
  
-CAUTION: A new node cannot hold any VMs, because you would get
-conflicts about identical VM IDs. Also, all existing configuration in
-`/etc/pve` is overwritten when you join a new node to the cluster. To
-workaround, use `vzdump` to backup and restore to a different VMID after
-adding the node to the cluster.
  
  To check the state of the cluster use:
  
@@ -229,6 +284,8 @@ pvecm add IP-ADDRESS-CLUSTER -link0 LOCAL-IP-ADDRESS-LINK0
  If you want to use the built-in xref:pvecm_redundancy[redundancy] of the
  kronosnet transport layer, also use the 'link1' parameter.
  
+Using the GUI, you can select the correct interface from the corresponding 'Link 0'
+and 'Link 1' fields in the *Cluster Join* dialog.
  
  Remove a Cluster Node
  ---------------------
@@ -270,11 +327,11 @@ After powering off the node hp4, we can safely remove it from the cluster.
  
  ----
   hp1# pvecm delnode hp4
+ Killing node 4
  ----
  
-If the operation succeeds no output is returned, just check the node
-list again with `pvecm nodes` or `pvecm status`. You should see
-something like:
+Use `pvecm nodes` or `pvecm status` to check the node list again. It should
+look something like:
  
  ----
  hp1# pvecm status
@@ -340,7 +397,7 @@ node from the cluster.
  WARNING: Ensure all shared resources are cleanly separated! Otherwise you will
  run into conflicts and problems.
  
-First stop the corosync and the pve-cluster services on the node:
+First, stop the corosync and the pve-cluster services on the node:
  [source,bash]
  ----
  systemctl stop pve-cluster
@@ -357,7 +414,7 @@ Delete the corosync configuration files:
  [source,bash]
  ----
  rm /etc/pve/corosync.conf
-rm /etc/corosync/*
+rm -r /etc/corosync/*
  ----
  
  You can now start the filesystem again as normal service:
@@ -470,7 +527,7 @@ application.
  Setting Up A New Network
  ^^^^^^^^^^^^^^^^^^^^^^^^
  
-First you have to set up a new network interface. It should be on a physically
+First, you have to set up a new network interface. It should be on a physically
  separate network. Ensure that your network fulfills the
  xref:pvecm_cluster_network_requirements[cluster network requirements].
  
@@ -673,7 +730,7 @@ resolve to can be changed without touching corosync or the node it runs on -
  which may lead to a situation where an address is changed without thinking
  about implications for corosync.
  
-A seperate, static hostname specifically for corosync is recommended, if
+A separate, static hostname specifically for corosync is recommended, if
  hostnames are preferred. Also, make sure that every node in the cluster can
  resolve all hostnames correctly.
  
@@ -682,7 +739,7 @@ entry. Only the resolved IP is then saved to the configuration.
  
  Nodes that joined the cluster on earlier versions likely still use their
  unresolved hostname in `corosync.conf`. It might be a good idea to replace
-them with IPs or a seperate hostname, as mentioned above.
+them with IPs or a separate hostname, as mentioned above.
  
  
  [[pvecm_redundancy]]
@@ -692,22 +749,23 @@ Corosync Redundancy
  Corosync supports redundant networking via its integrated kronosnet layer by
  default (it is not supported on the legacy udp/udpu transports). It can be
  enabled by specifying more than one link address, either via the '--linkX'
-parameters of `pvecm` (while creating a cluster or adding a new node) or by
-specifying more than one 'ringX_addr' in `corosync.conf`.
+parameters of `pvecm`, in the GUI as **Link 1** (while creating a cluster or
+adding a new node) or by specifying more than one 'ringX_addr' in
+`corosync.conf`.
  
  NOTE: To provide useful failover, every link should be on its own
  physical network connection.
  
  Links are used according to a priority setting. You can configure this priority
  by setting 'knet_link_priority' in the corresponding interface section in
-`corosync.conf`, or, preferrably, using the 'priority' parameter when creating
+`corosync.conf`, or, preferably, using the 'priority' parameter when creating
  your cluster with `pvecm`:
  
  ----
- # pvecm create CLUSTERNAME --link0 10.10.10.1,priority=20 --link1 10.20.20.1,priority=15
+ # pvecm create CLUSTERNAME --link0 10.10.10.1,priority=15 --link1 10.20.20.1,priority=20
  ----
  
-This would cause 'link1' to be used first, since it has the lower priority.
+This would cause 'link1' to be used first, since it has the higher priority.
  
  If no priorities are configured manually (or two links have the same priority),
  links will be used in order of their number, with the lower number having higher
@@ -830,7 +888,7 @@ example 2+1 nodes).
  QDevice Technical Overview
  ~~~~~~~~~~~~~~~~~~~~~~~~~~
  
-The Corosync Quroum Device (QDevice) is a daemon which runs on each cluster
+The Corosync Quorum Device (QDevice) is a daemon which runs on each cluster
  node. It provides a configured number of votes to the clusters quorum
  subsystem based on an external running third-party arbitrator's decision.
  Its primary use is to allow a cluster to sustain more node failures than
@@ -894,13 +952,22 @@ QDevice-Net Setup
  ~~~~~~~~~~~~~~~~~
  
  We recommend to run any daemon which provides votes to corosync-qdevice as an
-unprivileged user. {pve} and Debian provides a package which is already
+unprivileged user. {pve} and Debian provide a package which is already
  configured to do so.
  The traffic between the daemon and the cluster must be encrypted to ensure a
  safe and secure QDevice integration in {pve}.
  
-First install the 'corosync-qnetd' package on your external server and
-the 'corosync-qdevice' package on all cluster nodes.
+First, install the 'corosync-qnetd' package on your external server
+
+----
+external# apt install corosync-qnetd
+----
+
+and the 'corosync-qdevice' package on all cluster nodes
+
+----
+pve# apt install corosync-qdevice
+----
  
  After that, ensure that all your nodes on the cluster are online.
  
@@ -911,8 +978,10 @@ of the {pve} nodes:
  pve# pvecm qdevice setup <QDEVICE-IP>
  ----
  
-The SSH key from the cluster will be automatically copied to the QDevice. You
-might need to enter an SSH password during this step.
+The SSH key from the cluster will be automatically copied to the QDevice.
+
+NOTE: Make sure that the SSH configuration on your external server allows root
+login via password, if you are asked for a password during this step.
  
  After you enter the password and all the steps are successfully completed, you
  will see "Done". You can check the status now:
@@ -1198,9 +1267,9 @@ iface vmbr0 inet static
      address 192.X.Y.57
      netmask 255.255.250.0
      gateway 192.X.Y.1
-    bridge_ports eno1
-    bridge_stp off
-    bridge_fd 0
+    bridge-ports eno1
+    bridge-stp off
+    bridge-fd 0
  
  # cluster network
  auto eno2