]> git.proxmox.com Git - proxmox-backup.git/blame - docs/terminology.rst
cleanup schema function calls
[proxmox-backup.git] / docs / terminology.rst
CommitLineData
ec1ae7e6 1.. _terms:
1531185d 2
04e24b14
DW
3Terminology
4===========
5
6Backup Content
7--------------
8
9When doing deduplication, there are different strategies to get
10optimal results in terms of performance and/or deduplication rates.
11Depending on the type of data, it can be split into *fixed* or *variable*
12sized chunks.
13
14Fixed sized chunking requires minimal CPU power, and is used to
15backup virtual machine images.
16
17Variable sized chunking needs more CPU power, but is essential to get
18good deduplication rates for file archives.
19
20The Proxmox Backup Server supports both strategies.
21
22
23Image Archives: ``<name>.img``
24~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
25
26This is used for virtual machine images and other large binary
27data. Content is split into fixed-sized chunks.
28
29
30File Archives: ``<name>.pxar``
31~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
32
33.. see https://moinakg.wordpress.com/2013/06/22/high-performance-content-defined-chunking/
34
35A file archive stores a full directory tree. Content is stored using
36the :ref:`pxar-format`, split into variable-sized chunks. The format
37is optimized to achieve good deduplication rates.
38
39
40Binary Data (BLOBs)
41~~~~~~~~~~~~~~~~~~~
42
43This type is used to store smaller (< 16MB) binary data such as
717ce406 44configuration files. Larger files should be stored as image archives.
04e24b14
DW
45
46.. caution:: Please do not store all files as BLOBs. Instead, use the
717ce406 47 file archive to store entire directory trees.
04e24b14
DW
48
49
50Catalog File: ``catalog.pcat1``
51~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
52
53The catalog file is an index for file archives. It contains
717ce406 54the list of included files and is used to speed up search operations.
04e24b14
DW
55
56
57The Manifest: ``index.json``
58~~~~~~~~~~~~~~~~~~~~~~~~~~~~
59
717ce406 60The manifest contains a list of all backed up files, and their
04e24b14
DW
61sizes and checksums. It is used to verify the consistency of a
62backup.
63
64
65Backup Type
66-----------
67
68The backup server groups backups by *type*, where *type* is one of:
69
70``vm``
717ce406 71 This type is used for :term:`virtual machine`\ s. It typically
04e24b14
DW
72 consists of the virtual machine's configuration file and an image archive
73 for each disk.
74
75``ct``
717ce406
DW
76 This type is used for :term:`container`\ s. It consists of the container's
77 configuration and a single file archive for the filesystem's contents.
04e24b14
DW
78
79``host``
717ce406
DW
80 This type is used for file/directory backups created from within a machine.
81 Typically this would be a physical host, but could also be a virtual machine
82 or container. Such backups may contain file and image archives; there are no
83 restrictions in this regard.
04e24b14
DW
84
85
86Backup ID
87---------
88
89A unique ID. Usually the virtual machine or container ID. ``host``
90type backups normally use the hostname.
91
92
93Backup Time
94-----------
95
96The time when the backup was made.
97
98
99Backup Group
100------------
101
102The tuple ``<type>/<ID>`` is called a backup group. Such a group
103may contain one or more backup snapshots.
104
ec1ae7e6 105.. _term_backup_snapshot:
04e24b14
DW
106
107Backup Snapshot
108---------------
109
110The triplet ``<type>/<ID>/<time>`` is called a backup snapshot. It
111uniquely identifies a specific backup within a datastore.
112
113.. code-block:: console
114 :caption: Backup Snapshot Examples
115
116 vm/104/2019-10-09T08:01:06Z
117 host/elsa/2019-11-08T09:48:14Z
118
119As you can see, the time format is RFC3399_ with Coordinated
120Universal Time (UTC_, identified by the trailing *Z*).
121
122