]> git.proxmox.com Git - pve-docs.git/blame - pve-network.adoc
network: the netmask is depreacated
[pve-docs.git] / pve-network.adoc
CommitLineData
80c0adcb 1[[sysadmin_network_configuration]]
0bcd1f7f
DM
2Network Configuration
3---------------------
5f09af76
DM
4ifdef::wiki[]
5:pve-toplevel:
6endif::wiki[]
7
a22d7c24 8Network configuration can be done either via the GUI, or by manually
05213009
EK
9editing the file `/etc/network/interfaces`, which contains the
10whole network configuration. The `interfaces(5)` manual page contains the
11complete format description. All {pve} tools try hard to keep direct
52e23c35 12user modifications, but using the GUI is still preferable, because it
05213009 13protects you from errors.
0bcd1f7f 14
a22d7c24 15Once the network is configured, you can use the Debian traditional tools `ifup`
05213009 16and `ifdown` commands to bring interfaces up and down.
0bcd1f7f 17
52e23c35
TL
18Apply Network Changes
19~~~~~~~~~~~~~~~~~~~~~
20
21{pve} does not write changes directly to `/etc/network/interfaces`. Instead, we
22write into a temporary file called `/etc/network/interfaces.new`, this way you
23can do many related changes at once. This also allows to ensure your changes
24are correct before applying, as a wrong network configuration may render a node
25inaccessible.
26
27Reboot Node to apply
28^^^^^^^^^^^^^^^^^^^^
29
30With the default installed `ifupdown` network managing package you need to
31reboot to commit any pending network changes. Most of the time, the basic {pve}
32network setup is stable and does not change often, so rebooting should not be
33required often.
34
35Reload Network with ifupdown2
36^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
37
38With the optional `ifupdown2` network managing package you also can reload the
39network configuration live, without requiring a reboot.
40
52e23c35
TL
41Since {pve} 6.1 you can apply pending network changes over the web-interface,
42using the 'Apply Configuration' button in the 'Network' panel of a node.
43
44To install 'ifupdown2' ensure you have the latest {pve} updates installed, then
45
46WARNING: installing 'ifupdown2' will remove 'ifupdown', but as the removal
47scripts of 'ifupdown' before version '0.8.35+pve1' have a issue where network
48is fully stopped on removal footnote:[Introduced with Debian Buster:
49https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=945877] you *must* ensure
50that you have a up to date 'ifupdown' package version.
51
52For the installation itself you can then simply do:
53
54 apt install ifupdown2
55
56With that you're all set. You can also switch back to the 'ifupdown' variant at
57any time, if you run into issues.
0bcd1f7f 58
0bcd1f7f
DM
59Naming Conventions
60~~~~~~~~~~~~~~~~~~
61
62We currently use the following naming conventions for device names:
63
05213009
EK
64* Ethernet devices: en*, systemd network interface names. This naming scheme is
65 used for new {pve} installations since version 5.0.
7a0d4784 66
05213009
EK
67* Ethernet devices: eth[N], where 0 ≤ N (`eth0`, `eth1`, ...) This naming
68scheme is used for {pve} hosts which were installed before the 5.0
69release. When upgrading to 5.0, the names are kept as-is.
0bcd1f7f
DM
70
71* Bridge names: vmbr[N], where 0 ≤ N ≤ 4094 (`vmbr0` - `vmbr4094`)
72
73* Bonds: bond[N], where 0 ≤ N (`bond0`, `bond1`, ...)
74
75* VLANs: Simply add the VLAN number to the device name,
7a0d4784 76 separated by a period (`eno1.50`, `bond1.30`)
0bcd1f7f
DM
77
78This makes it easier to debug networks problems, because the device
05213009 79name implies the device type.
cc3cb912 80
7a0d4784
WL
81Systemd Network Interface Names
82^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
83
8116dea5
DM
84Systemd uses the two character prefix 'en' for Ethernet network
85devices. The next characters depends on the device driver and the fact
cc3cb912 86which schema matches first.
7a0d4784
WL
87
88* o<index>[n<phys_port_name>|d<dev_port>] — devices on board
89
90* s<slot>[f<function>][n<phys_port_name>|d<dev_port>] — device by hotplug id
91
92* [P<domain>]p<bus>s<slot>[f<function>][n<phys_port_name>|d<dev_port>] — devices by bus id
93
94* x<MAC> — device by MAC address
95
cc3cb912 96The most common patterns are:
7a0d4784
WL
97
98* eno1 — is the first on board NIC
99
100* enp3s0f1 — is the NIC on pcibus 3 slot 0 and use the NIC function 1.
101
cc3cb912
DM
102For more information see https://www.freedesktop.org/wiki/Software/systemd/PredictableNetworkInterfaceNames/[Predictable Network Interface Names].
103
05213009
EK
104Choosing a network configuration
105~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
106
a22d7c24 107Depending on your current network organization and your resources you can
05213009
EK
108choose either a bridged, routed, or masquerading networking setup.
109
110{pve} server in a private LAN, using an external gateway to reach the internet
111^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
112
a22d7c24 113The *Bridged* model makes the most sense in this case, and this is also
05213009 114the default mode on new {pve} installations.
a22d7c24
SR
115Each of your Guest system will have a virtual interface attached to the
116{pve} bridge. This is similar in effect to having the Guest network card
05213009
EK
117directly connected to a new switch on your LAN, the {pve} host playing the role
118of the switch.
119
120{pve} server at hosting provider, with public IP ranges for Guests
121^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
122
123For this setup, you can use either a *Bridged* or *Routed* model, depending on
124what your provider allows.
125
126{pve} server at hosting provider, with a single public IP address
127^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
128
129In that case the only way to get outgoing network accesses for your guest
a22d7c24 130systems is to use *Masquerading*. For incoming network access to your guests,
05213009
EK
131you will need to configure *Port Forwarding*.
132
133For further flexibility, you can configure
134VLANs (IEEE 802.1q) and network bonding, also known as "link
135aggregation". That way it is possible to build complex and flexible
136virtual networks.
7a0d4784 137
0bcd1f7f
DM
138Default Configuration using a Bridge
139~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
140
50f88938 141[thumbnail="default-network-setup-bridge.svg"]
05213009 142Bridges are like physical network switches implemented in software.
e2280bf7
TL
143All virtual guests can share a single bridge, or you can create multiple
144bridges to separate network domains. Each host can have up to 4094 bridges.
05213009 145
0bcd1f7f 146The installation program creates a single bridge named `vmbr0`, which
04e8476d
EK
147is connected to the first Ethernet card. The corresponding
148configuration in `/etc/network/interfaces` might look like this:
0bcd1f7f
DM
149
150----
151auto lo
152iface lo inet loopback
153
7a0d4784 154iface eno1 inet manual
0bcd1f7f
DM
155
156auto vmbr0
157iface vmbr0 inet static
33349b9f 158 address 192.168.10.2/24
0bcd1f7f 159 gateway 192.168.10.1
7a39aabd
AL
160 bridge-ports eno1
161 bridge-stp off
162 bridge-fd 0
0bcd1f7f
DM
163----
164
165Virtual machines behave as if they were directly connected to the
166physical network. The network, in turn, sees each virtual machine as
167having its own MAC, even though there is only one network cable
168connecting all of these VMs to the network.
169
0bcd1f7f
DM
170Routed Configuration
171~~~~~~~~~~~~~~~~~~~~
172
173Most hosting providers do not support the above setup. For security
174reasons, they disable networking as soon as they detect multiple MAC
175addresses on a single interface.
176
67c9747f
TL
177TIP: Some providers allow you to register additional MACs through their
178management interface. This avoids the problem, but can be clumsy to
0bcd1f7f
DM
179configure because you need to register a MAC for each of your VMs.
180
8c1189b6 181You can avoid the problem by ``routing'' all traffic via a single
0bcd1f7f
DM
182interface. This makes sure that all network packets use the same MAC
183address.
184
50f88938 185[thumbnail="default-network-setup-routed.svg"]
05213009 186A common scenario is that you have a public IP (assume `198.51.100.5`
0bcd1f7f 187for this example), and an additional IP block for your VMs
05213009 188(`203.0.113.16/29`). We recommend the following setup for such
0bcd1f7f
DM
189situations:
190
191----
192auto lo
193iface lo inet loopback
194
7a0d4784
WL
195auto eno1
196iface eno1 inet static
33349b9f 197 address 198.51.100.5/24
05213009 198 gateway 198.51.100.1
1ed90852 199 post-up echo 1 > /proc/sys/net/ipv4/ip_forward
7a0d4784 200 post-up echo 1 > /proc/sys/net/ipv4/conf/eno1/proxy_arp
0bcd1f7f
DM
201
202
203auto vmbr0
204iface vmbr0 inet static
33349b9f 205 address 203.0.113.17/29
7a39aabd
AL
206 bridge-ports none
207 bridge-stp off
208 bridge-fd 0
0bcd1f7f
DM
209----
210
211
8c1189b6
FG
212Masquerading (NAT) with `iptables`
213~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
0bcd1f7f 214
05213009
EK
215Masquerading allows guests having only a private IP address to access the
216network by using the host IP address for outgoing traffic. Each outgoing
217packet is rewritten by `iptables` to appear as originating from the host,
218and responses are rewritten accordingly to be routed to the original sender.
0bcd1f7f
DM
219
220----
221auto lo
222iface lo inet loopback
223
05213009 224auto eno1
470d4313 225#real IP address
7a0d4784 226iface eno1 inet static
33349b9f 227 address 198.51.100.5/24
05213009 228 gateway 198.51.100.1
0bcd1f7f
DM
229
230auto vmbr0
231#private sub network
232iface vmbr0 inet static
33349b9f 233 address 10.10.10.1/24
7a39aabd
AL
234 bridge-ports none
235 bridge-stp off
236 bridge-fd 0
0bcd1f7f 237
22d52440 238 post-up echo 1 > /proc/sys/net/ipv4/ip_forward
7a0d4784
WL
239 post-up iptables -t nat -A POSTROUTING -s '10.10.10.0/24' -o eno1 -j MASQUERADE
240 post-down iptables -t nat -D POSTROUTING -s '10.10.10.0/24' -o eno1 -j MASQUERADE
0bcd1f7f
DM
241----
242
22d52440
OB
243NOTE: In some masquerade setups with firewall enabled, conntrack zones might be
244needed for outgoing connections. Otherwise the firewall could block outgoing
245connections since they will prefer the `POSTROUTING` of the VM bridge (and not
246`MASQUERADE`).
247
248Adding these lines in the `/etc/network/interfaces` can fix this problem:
249
250----
251post-up iptables -t raw -I PREROUTING -i fwbr+ -j CT --zone 1
252post-down iptables -t raw -D PREROUTING -i fwbr+ -j CT --zone 1
253----
254
255For more information about this, refer to the following links:
217f7cd8 256
22d52440 257https://commons.wikimedia.org/wiki/File:Netfilter-packet-flow.svg[Netfilter Packet Flow]
217f7cd8 258
22d52440 259https://lwn.net/Articles/370152/[Patch on netdev-list introducing conntrack zones]
217f7cd8 260
22d52440
OB
261https://blog.lobraun.de/2019/05/19/prox/[Blog post with a good explanation by using TRACE in the raw table]
262
263
b4c06a93
WL
264
265Linux Bond
266~~~~~~~~~~
267
3eafe338
WL
268Bonding (also called NIC teaming or Link Aggregation) is a technique
269for binding multiple NIC's to a single network device. It is possible
270to achieve different goals, like make the network fault-tolerant,
271increase the performance or both together.
272
273High-speed hardware like Fibre Channel and the associated switching
274hardware can be quite expensive. By doing link aggregation, two NICs
275can appear as one logical interface, resulting in double speed. This
276is a native Linux kernel feature that is supported by most
277switches. If your nodes have multiple Ethernet ports, you can
278distribute your points of failure by running network cables to
279different switches and the bonded connection will failover to one
280cable or the other in case of network trouble.
281
282Aggregated links can improve live-migration delays and improve the
283speed of replication of data between Proxmox VE Cluster nodes.
b4c06a93
WL
284
285There are 7 modes for bonding:
286
287* *Round-robin (balance-rr):* Transmit network packets in sequential
288order from the first available network interface (NIC) slave through
289the last. This mode provides load balancing and fault tolerance.
290
291* *Active-backup (active-backup):* Only one NIC slave in the bond is
292active. A different slave becomes active if, and only if, the active
293slave fails. The single logical bonded interface's MAC address is
294externally visible on only one NIC (port) to avoid distortion in the
295network switch. This mode provides fault tolerance.
296
297* *XOR (balance-xor):* Transmit network packets based on [(source MAC
298address XOR'd with destination MAC address) modulo NIC slave
299count]. This selects the same NIC slave for each destination MAC
300address. This mode provides load balancing and fault tolerance.
301
302* *Broadcast (broadcast):* Transmit network packets on all slave
303network interfaces. This mode provides fault tolerance.
304
305* *IEEE 802.3ad Dynamic link aggregation (802.3ad)(LACP):* Creates
306aggregation groups that share the same speed and duplex
307settings. Utilizes all slave network interfaces in the active
308aggregator group according to the 802.3ad specification.
309
310* *Adaptive transmit load balancing (balance-tlb):* Linux bonding
311driver mode that does not require any special network-switch
312support. The outgoing network packet traffic is distributed according
313to the current load (computed relative to the speed) on each network
314interface slave. Incoming traffic is received by one currently
315designated slave network interface. If this receiving slave fails,
316another slave takes over the MAC address of the failed receiving
317slave.
318
e60ce90c 319* *Adaptive load balancing (balance-alb):* Includes balance-tlb plus receive
b4c06a93
WL
320load balancing (rlb) for IPV4 traffic, and does not require any
321special network switch support. The receive load balancing is achieved
322by ARP negotiation. The bonding driver intercepts the ARP Replies sent
323by the local system on their way out and overwrites the source
324hardware address with the unique hardware address of one of the NIC
325slaves in the single logical bonded interface such that different
326network-peers use different MAC addresses for their network packet
327traffic.
328
649098a6 329If your switch support the LACP (IEEE 802.3ad) protocol then we recommend using
a22d7c24 330the corresponding bonding mode (802.3ad). Otherwise you should generally use the
649098a6
EK
331active-backup mode. +
332// http://lists.linux-ha.org/pipermail/linux-ha/2013-January/046295.html
333If you intend to run your cluster network on the bonding interfaces, then you
334have to use active-passive mode on the bonding interfaces, other modes are
335unsupported.
b4c06a93 336
cd1de2c2
WL
337The following bond configuration can be used as distributed/shared
338storage network. The benefit would be that you get more speed and the
339network will be fault-tolerant.
340
b4c06a93
WL
341.Example: Use bond with fixed IP address
342----
343auto lo
344iface lo inet loopback
345
7a0d4784 346iface eno1 inet manual
b4c06a93 347
7a0d4784 348iface eno2 inet manual
b4c06a93 349
61b099f5
WL
350iface eno3 inet manual
351
b4c06a93
WL
352auto bond0
353iface bond0 inet static
5f05aad1 354 bond-slaves eno1 eno2
33349b9f 355 address 192.168.1.2/24
7a39aabd
AL
356 bond-miimon 100
357 bond-mode 802.3ad
358 bond-xmit-hash-policy layer2+3
b4c06a93
WL
359
360auto vmbr0
361iface vmbr0 inet static
33349b9f 362 address 10.10.10.2/24
7ea42266 363 gateway 10.10.10.1
61b099f5 364 bridge-ports eno3
7a39aabd
AL
365 bridge-stp off
366 bridge-fd 0
b4c06a93
WL
367
368----
369
cd1de2c2 370
50f88938 371[thumbnail="default-network-setup-bond.svg"]
cd1de2c2
WL
372Another possibility it to use the bond directly as bridge port.
373This can be used to make the guest network fault-tolerant.
374
375.Example: Use a bond as bridge port
b4c06a93
WL
376----
377auto lo
378iface lo inet loopback
379
7a0d4784 380iface eno1 inet manual
b4c06a93 381
7a0d4784 382iface eno2 inet manual
b4c06a93
WL
383
384auto bond0
470d4313 385iface bond0 inet manual
5f05aad1 386 bond-slaves eno1 eno2
7a39aabd
AL
387 bond-miimon 100
388 bond-mode 802.3ad
389 bond-xmit-hash-policy layer2+3
b4c06a93
WL
390
391auto vmbr0
392iface vmbr0 inet static
33349b9f 393 address 10.10.10.2/24
7ea42266 394 gateway 10.10.10.1
7a39aabd
AL
395 bridge-ports bond0
396 bridge-stp off
397 bridge-fd 0
b4c06a93
WL
398
399----
400
61105e42 401
94fd8ea5
WL
402VLAN 802.1Q
403~~~~~~~~~~~
404
4d8af129
DM
405A virtual LAN (VLAN) is a broadcast domain that is partitioned and
406isolated in the network at layer two. So it is possible to have
407multiple networks (4096) in a physical network, each independent of
408the other ones.
409
61105e42 410Each VLAN network is identified by a number often called 'tag'.
4d8af129
DM
411Network packages are then 'tagged' to identify which virtual network
412they belong to.
94fd8ea5 413
94fd8ea5 414
4d8af129
DM
415VLAN for Guest Networks
416^^^^^^^^^^^^^^^^^^^^^^^
94fd8ea5 417
4d8af129
DM
418{pve} supports this setup out of the box. You can specify the VLAN tag
419when you create a VM. The VLAN tag is part of the guest network
a22d7c24 420configuration. The networking layer supports different modes to
4d8af129 421implement VLANs, depending on the bridge configuration:
94fd8ea5 422
4d8af129 423* *VLAN awareness on the Linux bridge:*
94fd8ea5 424In this case, each guest's virtual network card is assigned to a VLAN tag,
4d8af129 425which is transparently supported by the Linux bridge.
a22d7c24 426Trunk mode is also possible, but that makes configuration
94fd8ea5
WL
427in the guest necessary.
428
429* *"traditional" VLAN on the Linux bridge:*
430In contrast to the VLAN awareness method, this method is not transparent
431and creates a VLAN device with associated bridge for each VLAN.
a22d7c24
SR
432That is, creating a guest on VLAN 5 for example, would create two
433interfaces eno1.5 and vmbr0v5, which would remain until a reboot occurs.
94fd8ea5 434
4d8af129
DM
435* *Open vSwitch VLAN:*
436This mode uses the OVS VLAN feature.
437
a22d7c24 438* *Guest configured VLAN:*
4d8af129
DM
439VLANs are assigned inside the guest. In this case, the setup is
440completely done inside the guest and can not be influenced from the
441outside. The benefit is that you can use more than one VLAN on a
442single virtual NIC.
443
444
445VLAN on the Host
446^^^^^^^^^^^^^^^^
94fd8ea5 447
4d8af129
DM
448To allow host communication with an isolated network. It is possible
449to apply VLAN tags to any network device (NIC, Bond, Bridge). In
450general, you should configure the VLAN on the interface with the least
94fd8ea5
WL
451abstraction layers between itself and the physical NIC.
452
453For example, in a default configuration where you want to place
454the host management address on a separate VLAN.
455
94fd8ea5 456
038dc7df 457.Example: Use VLAN 5 for the {pve} management IP with traditional Linux bridge
94fd8ea5
WL
458----
459auto lo
460iface lo inet loopback
461
462iface eno1 inet manual
463
464iface eno1.5 inet manual
465
466auto vmbr0v5
467iface vmbr0v5 inet static
33349b9f 468 address 10.10.10.2/24
94fd8ea5 469 gateway 10.10.10.1
7a39aabd
AL
470 bridge-ports eno1.5
471 bridge-stp off
472 bridge-fd 0
94fd8ea5
WL
473
474auto vmbr0
475iface vmbr0 inet manual
7a39aabd
AL
476 bridge-ports eno1
477 bridge-stp off
478 bridge-fd 0
94fd8ea5
WL
479
480----
481
038dc7df
AD
482.Example: Use VLAN 5 for the {pve} management IP with VLAN aware Linux bridge
483----
484auto lo
485iface lo inet loopback
486
487iface eno1 inet manual
488
489
490auto vmbr0.5
491iface vmbr0.5 inet static
33349b9f 492 address 10.10.10.2/24
038dc7df
AD
493 gateway 10.10.10.1
494
495auto vmbr0
496iface vmbr0 inet manual
7a39aabd
AL
497 bridge-ports eno1
498 bridge-stp off
499 bridge-fd 0
500 bridge-vlan-aware yes
038dc7df
AD
501----
502
94fd8ea5
WL
503The next example is the same setup but a bond is used to
504make this network fail-safe.
505
038dc7df 506.Example: Use VLAN 5 with bond0 for the {pve} management IP with traditional Linux bridge
94fd8ea5
WL
507----
508auto lo
509iface lo inet loopback
510
511iface eno1 inet manual
512
513iface eno2 inet manual
514
515auto bond0
516iface bond0 inet manual
5f05aad1 517 bond-slaves eno1 eno2
7a39aabd
AL
518 bond-miimon 100
519 bond-mode 802.3ad
520 bond-xmit-hash-policy layer2+3
94fd8ea5
WL
521
522iface bond0.5 inet manual
523
524auto vmbr0v5
525iface vmbr0v5 inet static
33349b9f 526 address 10.10.10.2/24
94fd8ea5 527 gateway 10.10.10.1
7a39aabd
AL
528 bridge-ports bond0.5
529 bridge-stp off
530 bridge-fd 0
94fd8ea5
WL
531
532auto vmbr0
533iface vmbr0 inet manual
7a39aabd
AL
534 bridge-ports bond0
535 bridge-stp off
536 bridge-fd 0
94fd8ea5
WL
537
538----
539
024d3706
SI
540Disabling IPv6 on the Node
541~~~~~~~~~~~~~~~~~~~~~~~~~~
542
543{pve} works correctly in all environments, irrespective of whether IPv6 is
544deployed or not. We recommend leaving all settings at the provided defaults.
545
546Should you still need to disable support for IPv6 on your node, do so by
547creating an appropriate `sysctl.conf (5)` snippet file and setting the proper
548https://www.kernel.org/doc/Documentation/networking/ip-sysctl.txt[sysctls],
549for example adding `/etc/sysctl.d/disable-ipv6.conf` with content:
550
551----
552net.ipv6.conf.all.disable_ipv6 = 1
553net.ipv6.conf.default.disable_ipv6 = 1
554----
555
556This method is preferred to disabling the loading of the IPv6 module on the
557https://www.kernel.org/doc/Documentation/networking/ipv6.rst[kernel commandline].
558
0bcd1f7f
DM
559////
560TODO: explain IPv6 support?
470d4313 561TODO: explain OVS
0bcd1f7f 562////