]> git.proxmox.com Git - pve-docs.git/blob - vzdump.adoc
vzdump: add backup job screenshots
[pve-docs.git] / vzdump.adoc
1 [[chapter_vzdump]]
2 ifdef::manvolnum[]
3 vzdump(1)
4 =========
5 :pve-toplevel:
6
7 NAME
8 ----
9
10 vzdump - Backup Utility for VMs and Containers
11
12
13 SYNOPSIS
14 --------
15
16 include::vzdump.1-synopsis.adoc[]
17
18
19 DESCRIPTION
20 -----------
21 endif::manvolnum[]
22 ifndef::manvolnum[]
23 Backup and Restore
24 ==================
25 :pve-toplevel:
26 endif::manvolnum[]
27
28 Backups are a requirement for any sensible IT deployment, and {pve}
29 provides a fully integrated solution, using the capabilities of each
30 storage and each guest system type. This allows the system
31 administrator to fine tune via the `mode` option between consistency
32 of the backups and downtime of the guest system.
33
34 {pve} backups are always full backups - containing the VM/CT
35 configuration and all data. Backups can be started via the GUI or via
36 the `vzdump` command-line tool.
37
38 .Backup Storage
39
40 Before a backup can run, a backup storage must be defined. Refer to the
41 xref:chapter_storage[storage documentation] on how to add a storage. It can
42 either be a Proxmox Backup Server storage, where backups are stored as
43 de-duplicated chunks and metadata, or a file-level storage, where backups are
44 stored as regular files. Using Proxmox Backup Server on a dedicated host is
45 recommended, because of its advanced features. Using an NFS server is a good
46 alternative. In both cases, you might want to save those backups later to a tape
47 drive, for off-site archiving.
48
49 .Scheduled Backup
50
51 Backup jobs can be scheduled so that they are executed automatically on specific
52 days and times, for selectable nodes and guest systems. See the
53 xref:vzdump_jobs[Backup Jobs] section for more.
54
55 Backup Modes
56 ------------
57
58 There are several ways to provide consistency (option `mode`),
59 depending on the guest type.
60
61 .Backup modes for VMs:
62
63 `stop` mode::
64
65 This mode provides the highest consistency of the backup, at the cost
66 of a short downtime in the VM operation. It works by executing an
67 orderly shutdown of the VM, and then runs a background QEMU process to
68 backup the VM data. After the backup is started, the VM goes to full
69 operation mode if it was previously running. Consistency is guaranteed
70 by using the live backup feature.
71
72 `suspend` mode::
73
74 This mode is provided for compatibility reason, and suspends the VM
75 before calling the `snapshot` mode. Since suspending the VM results in
76 a longer downtime and does not necessarily improve the data
77 consistency, the use of the `snapshot` mode is recommended instead.
78
79 `snapshot` mode::
80
81 This mode provides the lowest operation downtime, at the cost of a
82 small inconsistency risk. It works by performing a {pve} live
83 backup, in which data blocks are copied while the VM is running. If the
84 guest agent is enabled (`agent: 1`) and running, it calls
85 `guest-fsfreeze-freeze` and `guest-fsfreeze-thaw` to improve
86 consistency.
87
88 A technical overview of the {pve} live backup for QemuServer can
89 be found online
90 https://git.proxmox.com/?p=pve-qemu.git;a=blob_plain;f=backup.txt[here].
91
92 NOTE: {pve} live backup provides snapshot-like semantics on any
93 storage type. It does not require that the underlying storage supports
94 snapshots. Also please note that since the backups are done via
95 a background QEMU process, a stopped VM will appear as running for a
96 short amount of time while the VM disks are being read by QEMU.
97 However the VM itself is not booted, only its disk(s) are read.
98
99 .Backup modes for Containers:
100
101 `stop` mode::
102
103 Stop the container for the duration of the backup. This potentially
104 results in a very long downtime.
105
106 `suspend` mode::
107
108 This mode uses rsync to copy the container data to a temporary
109 location (see option `--tmpdir`). Then the container is suspended and
110 a second rsync copies changed files. After that, the container is
111 started (resumed) again. This results in minimal downtime, but needs
112 additional space to hold the container copy.
113 +
114 When the container is on a local file system and the target storage of
115 the backup is an NFS/CIFS server, you should set `--tmpdir` to reside on a
116 local file system too, as this will result in a many fold performance
117 improvement. Use of a local `tmpdir` is also required if you want to
118 backup a local container using ACLs in suspend mode if the backup
119 storage is an NFS server.
120
121 `snapshot` mode::
122
123 This mode uses the snapshotting facilities of the underlying
124 storage. First, the container will be suspended to ensure data consistency.
125 A temporary snapshot of the container's volumes will be made and the
126 snapshot content will be archived in a tar file. Finally, the temporary
127 snapshot is deleted again.
128
129 NOTE: `snapshot` mode requires that all backed up volumes are on a storage that
130 supports snapshots. Using the `backup=no` mount point option individual volumes
131 can be excluded from the backup (and thus this requirement).
132
133 // see PVE::VZDump::LXC::prepare()
134 NOTE: By default additional mount points besides the Root Disk mount point are
135 not included in backups. For volume mount points you can set the *Backup* option
136 to include the mount point in the backup. Device and bind mounts are never
137 backed up as their content is managed outside the {pve} storage library.
138
139 VM Backup Fleecing
140 ~~~~~~~~~~~~~~~~~~
141
142 WARNING: Backup fleecing is still being worked on (also in upstream QEMU) and is
143 currently only a technology preview.
144
145 When a backup for a VM is started, QEMU will install a "copy-before-write"
146 filter in its block layer. This filter ensures that upon new guest writes, old
147 data still needed for the backup is sent to the backup target first. The guest
148 write blocks until this operation is finished so guest IO to not-yet-backed-up
149 sectors will be limited by the speed of the backup target.
150
151 With backup fleecing, such old data is cached in a fleecing image rather than
152 sent directly to the backup target. This can help guest IO performance and even
153 prevent hangs in certain scenarios, at the cost of requiring more storage space.
154 Use e.g. `vzdump 123 --fleecing enabled=1,storage=local-lvm` to enable backup
155 fleecing, with fleecing images created on the storage `local-lvm`.
156
157 The fleecing storage should be a fast local storage, with thin provisioning and
158 discard support. Examples are LVM-thin, RBD, ZFS with `sparse 1` in the storage
159 configuration, many file-based storages. Ideally, the fleecing storage is a
160 dedicated storage, so it running full will not affect other guests and just fail
161 the backup. Parts of the fleecing image that have been backed up will be
162 discarded to try and keep the space usage low.
163
164 For file-based storages that do not support discard (e.g. NFS before version
165 4.2), you should set `preallocation off` in the storage configuration. In
166 combination with `qcow2` (used automatically as the format for the fleecing
167 image when the storage supports it), this has the advantage that already
168 allocated parts of the image can be re-used later, which can still help save
169 quite a bit of space.
170
171 WARNING: On a storage that's not thinly provisioned, e.g. LVM or ZFS without the
172 `sparse` option, the full size of the original disk needs to be reserved for the
173 fleecing image up-front. On a thinly provisioned storage, the fleecing image can
174 grow to the same size as the original image only if the guest re-writes a whole
175 disk while the backup is busy with another disk.
176
177 Backup File Names
178 -----------------
179
180 Newer versions of vzdump encode the guest type and the
181 backup time into the filename, for example
182
183 vzdump-lxc-105-2009_10_09-11_04_43.tar
184
185 That way it is possible to store several backup in the same directory. You can
186 limit the number of backups that are kept with various retention options, see
187 the xref:vzdump_retention[Backup Retention] section below.
188
189 Backup File Compression
190 -----------------------
191
192 The backup file can be compressed with one of the following algorithms: `lzo`
193 footnote:[Lempel–Ziv–Oberhumer a lossless data compression algorithm
194 https://en.wikipedia.org/wiki/Lempel-Ziv-Oberhumer], `gzip` footnote:[gzip -
195 based on the DEFLATE algorithm https://en.wikipedia.org/wiki/Gzip] or `zstd`
196 footnote:[Zstandard a lossless data compression algorithm
197 https://en.wikipedia.org/wiki/Zstandard].
198
199 Currently, Zstandard (zstd) is the fastest of these three algorithms.
200 Multi-threading is another advantage of zstd over lzo and gzip. Lzo and gzip
201 are more widely used and often installed by default.
202
203 You can install pigz footnote:[pigz - parallel implementation of gzip
204 https://zlib.net/pigz/] as a drop-in replacement for gzip to provide better
205 performance due to multi-threading. For pigz & zstd, the amount of
206 threads/cores can be adjusted. See the
207 xref:vzdump_configuration[configuration options] below.
208
209 The extension of the backup file name can usually be used to determine which
210 compression algorithm has been used to create the backup.
211
212 |===
213 |.zst | Zstandard (zstd) compression
214 |.gz or .tgz | gzip compression
215 |.lzo | lzo compression
216 |===
217
218 If the backup file name doesn't end with one of the above file extensions, then
219 it was not compressed by vzdump.
220
221 Backup Encryption
222 -----------------
223
224 For Proxmox Backup Server storages, you can optionally set up client-side
225 encryption of backups, see xref:storage_pbs_encryption[the corresponding section.]
226
227 [[vzdump_jobs]]
228 Backup Jobs
229 -----------
230
231 [thumbnail="screenshot/gui-cluster-backup-overview.png"]
232
233 Besides triggering a backup manually, you can also setup periodic jobs that
234 backup all, or a selection of virtual guest to a storage. You can manage the
235 jobs in the UI under 'Datacenter' -> 'Backup' or via the `/cluster/backup` API
236 endpoint. Both will generate job entries in `/etc/pve/jobs.cfg`, which are
237 parsed and executed by the `pvescheduler` daemon.
238
239 [thumbnail="screenshot/gui-cluster-backup-edit-01-general.png", float="left"]
240
241 A job is either configured for all cluster nodes or a specific node, and is
242 executed according to a given schedule. The format for the schedule is very
243 similar to `systemd` calendar events, see the
244 xref:chapter_calendar_events[calendar events] section for details. The
245 'Schedule' field in the UI can be freely edited, and it contains several
246 examples that can be used as a starting point in its drop-down list.
247
248 [thumbnail="screenshot/gui-cluster-backup-edit-02-retention.png"]
249
250 You can configure job-specific xref:vzdump_retention[retention options]
251 overriding those from the storage or node configuration, as well as a
252 xref:vzdump_notes[template for notes] for additional information to be saved
253 together with the backup.
254
255 [thumbnail="screenshot/gui-cluster-backup-edit-03-template.png", float="left"]
256 Since scheduled backups miss their execution when the host was offline or the
257 pvescheduler was disabled during the scheduled time, it is possible to configure
258 the behaviour for catching up. By enabling the `Repeat missed` option (in the
259 'Advanced' tab in the UI, `repeat-missed` in the config), you can tell the
260 scheduler that it should run missed jobs as soon as possible.
261
262 [thumbnail="screenshot/gui-cluster-backup-edit-04-advanced.png"]
263
264 There are a few settings for tuning backup performance (some of which are
265 exposed in the 'Advanced' tab in the UI). The most notable is `bwlimit` for
266 limiting IO bandwidth. The amount of threads used for the compressor can be
267 controlled with the `pigz` (replacing `gzip`), respectively, `zstd` setting.
268 Furthermore, there are `ionice` (when the BFQ scheduler is used) and, as part of
269 the `performance` setting, `max-workers` (affects VM backups only) and
270 `pbs-entries-max` (affects container backups only). See the
271 xref:vzdump_configuration[configuration options] for details.
272
273 [[vzdump_retention]]
274 Backup Retention
275 ----------------
276
277 With the `prune-backups` option you can specify which backups you want to keep
278 in a flexible manner. The following retention options are available:
279
280 `keep-all <boolean>` ::
281 Keep all backups. If this is `true`, no other options can be set.
282
283 `keep-last <N>` ::
284 Keep the last `<N>` backups.
285
286 `keep-hourly <N>` ::
287 Keep backups for the last `<N>` hours. If there is more than one
288 backup for a single hour, only the latest is kept.
289
290 `keep-daily <N>` ::
291 Keep backups for the last `<N>` days. If there is more than one
292 backup for a single day, only the latest is kept.
293
294 `keep-weekly <N>` ::
295 Keep backups for the last `<N>` weeks. If there is more than one
296 backup for a single week, only the latest is kept.
297
298 NOTE: Weeks start on Monday and end on Sunday. The software uses the
299 `ISO week date`-system and handles weeks at the end of the year correctly.
300
301 `keep-monthly <N>` ::
302 Keep backups for the last `<N>` months. If there is more than one
303 backup for a single month, only the latest is kept.
304
305 `keep-yearly <N>` ::
306 Keep backups for the last `<N>` years. If there is more than one
307 backup for a single year, only the latest is kept.
308
309 The retention options are processed in the order given above. Each option
310 only covers backups within its time period. The next option does not take care
311 of already covered backups. It will only consider older backups.
312
313 Specify the retention options you want to use as a
314 comma-separated list, for example:
315
316 # vzdump 777 --prune-backups keep-last=3,keep-daily=13,keep-yearly=9
317
318 While you can pass `prune-backups` directly to `vzdump`, it is often more
319 sensible to configure the setting on the storage level, which can be done via
320 the web interface.
321
322 NOTE: The old `maxfiles` option is deprecated and should be replaced either by
323 `keep-last` or, in case `maxfiles` was `0` for unlimited retention, by
324 `keep-all`.
325
326
327 Prune Simulator
328 ~~~~~~~~~~~~~~~
329
330 You can use the https://pbs.proxmox.com/docs/prune-simulator[prune simulator
331 of the Proxmox Backup Server documentation] to explore the effect of different
332 retention options with various backup schedules.
333
334 Retention Settings Example
335 ~~~~~~~~~~~~~~~~~~~~~~~~~~
336
337 The backup frequency and retention of old backups may depend on how often data
338 changes, and how important an older state may be, in a specific work load.
339 When backups act as a company's document archive, there may also be legal
340 requirements for how long backups must be kept.
341
342 For this example, we assume that you are doing daily backups, have a retention
343 period of 10 years, and the period between backups stored gradually grows.
344
345 `keep-last=3` - even if only daily backups are taken, an admin may want to
346 create an extra one just before or after a big upgrade. Setting keep-last
347 ensures this.
348
349 `keep-hourly` is not set - for daily backups this is not relevant. You cover
350 extra manual backups already, with keep-last.
351
352 `keep-daily=13` - together with keep-last, which covers at least one
353 day, this ensures that you have at least two weeks of backups.
354
355 `keep-weekly=8` - ensures that you have at least two full months of
356 weekly backups.
357
358 `keep-monthly=11` - together with the previous keep settings, this
359 ensures that you have at least a year of monthly backups.
360
361 `keep-yearly=9` - this is for the long term archive. As you covered the
362 current year with the previous options, you would set this to nine for the
363 remaining ones, giving you a total of at least 10 years of coverage.
364
365 We recommend that you use a higher retention period than is minimally required
366 by your environment; you can always reduce it if you find it is unnecessarily
367 high, but you cannot recreate backups once they have been removed.
368
369 [[vzdump_protection]]
370 Backup Protection
371 -----------------
372
373 You can mark a backup as `protected` to prevent its removal. Attempting to
374 remove a protected backup via {pve}'s UI, CLI or API will fail. However, this
375 is enforced by {pve} and not the file-system, that means that a manual removal
376 of a backup file itself is still possible for anyone with write access to the
377 underlying backup storage.
378
379 NOTE: Protected backups are ignored by pruning and do not count towards the
380 retention settings.
381
382 For filesystem-based storages, the protection is implemented via a sentinel file
383 `<backup-name>.protected`. For Proxmox Backup Server, it is handled on the
384 server side (available since Proxmox Backup Server version 2.1).
385
386 Use the storage option `max-protected-backups` to control how many protected
387 backups per guest are allowed on the storage. Use `-1` for unlimited. The
388 default is unlimited for users with `Datastore.Allocate` privilege and `5` for
389 other users.
390
391 [[vzdump_notes]]
392 Backup Notes
393 ------------
394
395 You can add notes to backups using the 'Edit Notes' button in the UI or via the
396 storage content API.
397
398 It is also possible to specify a template for generating notes dynamically for
399 a backup job and for manual backup. The template string can contain variables,
400 surrounded by two curly braces, which will be replaced by the corresponding
401 value when the backup is executed.
402
403 Currently supported are:
404
405 * `{{cluster}}` the cluster name, if any
406 * `{{guestname}}` the virtual guest's assigned name
407 * `{{node}}` the host name of the node the backup is being created
408 * `{{vmid}}` the numerical VMID of the guest
409
410 When specified via API or CLI, it needs to be a single line, where newline and
411 backslash need to be escaped as literal `\n` and `\\` respectively.
412
413 [[vzdump_restore]]
414 Restore
415 -------
416
417 A backup archive can be restored through the {pve} web GUI or through the
418 following CLI tools:
419
420
421 `pct restore`:: Container restore utility
422
423 `qmrestore`:: Virtual Machine restore utility
424
425 For details see the corresponding manual pages.
426
427 Bandwidth Limit
428 ~~~~~~~~~~~~~~~
429
430 Restoring one or more big backups may need a lot of resources, especially
431 storage bandwidth for both reading from the backup storage and writing to
432 the target storage. This can negatively affect other virtual guests as access
433 to storage can get congested.
434
435 To avoid this you can set bandwidth limits for a backup job. {pve}
436 implements two kinds of limits for restoring and archive:
437
438 * per-restore limit: denotes the maximal amount of bandwidth for
439 reading from a backup archive
440
441 * per-storage write limit: denotes the maximal amount of bandwidth used for
442 writing to a specific storage
443
444 The read limit indirectly affects the write limit, as we cannot write more
445 than we read. A smaller per-job limit will overwrite a bigger per-storage
446 limit. A bigger per-job limit will only overwrite the per-storage limit if
447 you have `Data.Allocate' permissions on the affected storage.
448
449 You can use the `--bwlimit <integer>` option from the restore CLI commands
450 to set up a restore job specific bandwidth limit. KiB/s is used as unit
451 for the limit, this means passing `10240' will limit the read speed of the
452 backup to 10 MiB/s, ensuring that the rest of the possible storage bandwidth
453 is available for the already running virtual guests, and thus the backup
454 does not impact their operations.
455
456 NOTE: You can use `0` for the `bwlimit` parameter to disable all limits for
457 a specific restore job. This can be helpful if you need to restore a very
458 important virtual guest as fast as possible. (Needs `Data.Allocate'
459 permissions on storage)
460
461 Most times your storage's generally available bandwidth stays the same over
462 time, thus we implemented the possibility to set a default bandwidth limit
463 per configured storage, this can be done with:
464
465 ----
466 # pvesm set STORAGEID --bwlimit restore=KIBs
467 ----
468
469 Live-Restore
470 ~~~~~~~~~~~~
471
472 Restoring a large backup can take a long time, in which a guest is still
473 unavailable. For VM backups stored on a Proxmox Backup Server, this wait
474 time can be mitigated using the live-restore option.
475
476 Enabling live-restore via either the checkbox in the GUI or the `--live-restore`
477 argument of `qmrestore` causes the VM to start as soon as the restore
478 begins. Data is copied in the background, prioritizing chunks that the VM is
479 actively accessing.
480
481 Note that this comes with two caveats:
482
483 * During live-restore, the VM will operate with limited disk read speeds, as
484 data has to be loaded from the backup server (once loaded, it is immediately
485 available on the destination storage however, so accessing data twice only
486 incurs the penalty the first time). Write speeds are largely unaffected.
487 * If the live-restore fails for any reason, the VM will be left in an
488 undefined state - that is, not all data might have been copied from the
489 backup, and it is _most likely_ not possible to keep any data that was written
490 during the failed restore operation.
491
492 This mode of operation is especially useful for large VMs, where only a small
493 amount of data is required for initial operation, e.g. web servers - once the OS
494 and necessary services have been started, the VM is operational, while the
495 background task continues copying seldom used data.
496
497 Single File Restore
498 ~~~~~~~~~~~~~~~~~~~
499
500 The 'File Restore' button in the 'Backups' tab of the storage GUI can be used to
501 open a file browser directly on the data contained in a backup. This feature
502 is only available for backups on a Proxmox Backup Server.
503
504 For containers, the first layer of the file tree shows all included 'pxar'
505 archives, which can be opened and browsed freely. For VMs, the first layer shows
506 contained drive images, which can be opened to reveal a list of supported
507 storage technologies found on the drive. In the most basic case, this will be an
508 entry called 'part', representing a partition table, which contains entries for
509 each partition found on the drive. Note that for VMs, not all data might be
510 accessible (unsupported guest file systems, storage technologies, etc...).
511
512 Files and directories can be downloaded using the 'Download' button, the latter
513 being compressed into a zip archive on the fly.
514
515 To enable secure access to VM images, which might contain untrusted data, a
516 temporary VM (not visible as a guest) is started. This does not mean that data
517 downloaded from such an archive is inherently safe, but it avoids exposing the
518 hypervisor system to danger. The VM will stop itself after a timeout. This
519 entire process happens transparently from a user's point of view.
520
521 NOTE: For troubleshooting purposes, each temporary VM instance generates a log
522 file in `/var/log/proxmox-backup/file-restore/`. The log file might contain
523 additional information in case an attempt to restore individual files or
524 accessing file systems contained in a backup archive fails.
525
526 [[vzdump_configuration]]
527 Configuration
528 -------------
529
530 Global configuration is stored in `/etc/vzdump.conf`. The file uses a
531 simple colon separated key/value format. Each line has the following
532 format:
533
534 OPTION: value
535
536 Blank lines in the file are ignored, and lines starting with a `#`
537 character are treated as comments and are also ignored. Values from
538 this file are used as default, and can be overwritten on the command
539 line.
540
541 We currently support the following options:
542
543 include::vzdump.conf.5-opts.adoc[]
544
545
546 .Example `vzdump.conf` Configuration
547 ----
548 tmpdir: /mnt/fast_local_disk
549 storage: my_backup_storage
550 mode: snapshot
551 bwlimit: 10000
552 ----
553
554 Hook Scripts
555 ------------
556
557 You can specify a hook script with option `--script`. This script is
558 called at various phases of the backup process, with parameters
559 accordingly set. You can find an example in the documentation
560 directory (`vzdump-hook-script.pl`).
561
562 File Exclusions
563 ---------------
564
565 NOTE: this option is only available for container backups.
566
567 `vzdump` skips the following files by default (disable with the option
568 `--stdexcludes 0`)
569
570 /tmp/?*
571 /var/tmp/?*
572 /var/run/?*pid
573
574 You can also manually specify (additional) exclude paths, for example:
575
576 # vzdump 777 --exclude-path /tmp/ --exclude-path '/var/foo*'
577
578 excludes the directory `/tmp/` and any file or directory named `/var/foo`,
579 `/var/foobar`, and so on.
580
581 Paths that do not start with a `/` are not anchored to the container's root,
582 but will match relative to any subdirectory. For example:
583
584 # vzdump 777 --exclude-path bar
585
586 excludes any file or directory named `/bar`, `/var/bar`, `/var/foo/bar`, and
587 so on, but not `/bar2`.
588
589 Configuration files are also stored inside the backup archive
590 (in `./etc/vzdump/`) and will be correctly restored.
591
592 Examples
593 --------
594
595 Simply dump guest 777 - no snapshot, just archive the guest private area and
596 configuration files to the default dump directory (usually
597 `/var/lib/vz/dump/`).
598
599 # vzdump 777
600
601 Use rsync and suspend/resume to create a snapshot (minimal downtime).
602
603 # vzdump 777 --mode suspend
604
605 Backup all guest systems and send notification mails to root and admin.
606 Due to `mailto` being set and `notification-mode` being set to `auto` by
607 default, the notification mails are sent via the system's `sendmail`
608 command instead of the notification system.
609
610 # vzdump --all --mode suspend --mailto root --mailto admin
611
612 Use snapshot mode (no downtime) and non-default dump directory.
613
614 # vzdump 777 --dumpdir /mnt/backup --mode snapshot
615
616 Backup more than one guest (selectively)
617
618 # vzdump 101 102 103 --mailto root
619
620 Backup all guests excluding 101 and 102
621
622 # vzdump --mode suspend --exclude 101,102
623
624 Restore a container to a new CT 600
625
626 # pct restore 600 /mnt/backup/vzdump-lxc-777.tar
627
628 Restore a QemuServer VM to VM 601
629
630 # qmrestore /mnt/backup/vzdump-qemu-888.vma 601
631
632 Clone an existing container 101 to a new container 300 with a 4GB root
633 file system, using pipes
634
635 # vzdump 101 --stdout | pct restore --rootfs 4 300 -
636
637
638 ifdef::manvolnum[]
639 include::pve-copyright.adoc[]
640 endif::manvolnum[]