[[qm_pci_passthrough]]
PCI(e) Passthrough
------------------
ifdef::wiki[]
:pve-toplevel:
endif::wiki[]

PCI(e) passthrough is a mechanism to give a virtual machine control over
a PCI device from the host. This can have some advantages over using
virtualized hardware, for example lower latency, higher performance, or more
features (e.g., offloading).

However, if you pass through a device to a virtual machine, you cannot use
that device on the host or in any other VM anymore.

General Requirements
~~~~~~~~~~~~~~~~~~~~

Since passthrough is a feature which also needs hardware support, there are
some requirements to check and preparations to be done to make it work.


Hardware
^^^^^^^^
Your hardware needs to support `IOMMU` (*I*/*O* **M**emory **M**anagement
**U**nit) interrupt remapping; this includes the CPU and the mainboard.

Generally, Intel systems with VT-d and AMD systems with AMD-Vi support this.
But it is not guaranteed that everything will work out of the box, due
to bad hardware implementations and missing or low quality drivers.

Further, server grade hardware often has better support than consumer grade
hardware, but even then, many modern systems can support this.

Please refer to your hardware vendor to check if they support this feature
under Linux for your specific setup.


Configuration
^^^^^^^^^^^^^

Once you have ensured that your hardware supports passthrough, you will need
to do some configuration to enable PCI(e) passthrough.

.IOMMU

First, you have to enable IOMMU support in your BIOS/UEFI. Usually the
corresponding setting is called `IOMMU` or `VT-d`, but you should find the
exact option name in the manual of your motherboard.

For Intel CPUs, you may also need to enable the IOMMU on the
xref:sysboot_edit_kernel_cmdline[kernel command line] for older (pre-5.15)
kernels by adding:

----
 intel_iommu=on
----

For AMD CPUs it should be enabled automatically.

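For example, on a {pve} host that boots via GRUB, this boils down to extending
the parameter list in `/etc/default/grub` (a sketch; systemd-boot setups use
`/etc/kernel/cmdline` instead, see the linked section):

----
GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on"
----

followed by running `update-grub` and rebooting.
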
.IOMMU Passthrough Mode

If your hardware supports IOMMU passthrough mode, enabling this mode might
increase performance.
This is because VMs then bypass the (default) DMA translation normally
performed by the hypervisor and instead pass DMA requests directly to the
hardware IOMMU. To enable these options, add:

----
 iommu=pt
----

to the xref:sysboot_edit_kernel_cmdline[kernel command line].

.Kernel Modules

You have to make sure the following modules are loaded. This can be achieved by
adding them to `/etc/modules`:

----
 vfio
 vfio_iommu_type1
 vfio_pci
 vfio_virqfd
----

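As a sketch, the modules can be appended in one go (assuming none of them is
already listed in the file):

----
# printf '%s\n' vfio vfio_iommu_type1 vfio_pci vfio_virqfd >> /etc/modules
----
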
[[qm_pci_passthrough_update_initramfs]]
After changing anything module-related, you need to refresh your
`initramfs`. On {pve} this can be done by executing:

----
# update-initramfs -u -k all
----

.Finish Configuration

Finally, reboot to bring the changes into effect and check that it is indeed
enabled:

----
# dmesg | grep -e DMAR -e IOMMU -e AMD-Vi
----

The output should show that `IOMMU`, `Directed I/O` or `Interrupt Remapping`
is enabled; the exact message can vary depending on hardware and kernel.

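For example, on an Intel system, lines like the following (exact wording
varies) indicate success:

----
DMAR: IOMMU enabled
DMAR-IR: Enabled IRQ remapping in x2apic mode
----
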
It is also important that the device(s) you want to pass through
are in a *separate* `IOMMU` group. This can be checked with:

----
# find /sys/kernel/iommu_groups/ -type l
----

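The raw `find` output can be hard to read; a small shell loop like the
following sketch prints each group together with the devices in it:

----
# for dev in /sys/kernel/iommu_groups/*/devices/*; do
    group=${dev%/devices/*}; group=${group##*/}
    echo -n "IOMMU group ${group}: "; lspci -nns "${dev##*/}"
  done
----
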
It is okay if the device is in an `IOMMU` group together with its functions
(e.g. a GPU with the HDMI Audio device) or with its root port or PCI(e) bridge.

.PCI(e) slots
[NOTE]
====
Some platforms handle their physical PCI(e) slots differently. So, sometimes
it can help to put the card in another PCI(e) slot, if you do not get the
desired `IOMMU` group separation.
====

.Unsafe interrupts
[NOTE]
====
For some platforms, it may be necessary to allow unsafe interrupts.
For this, add the following line to a file ending with `.conf` in
*/etc/modprobe.d/*:

----
 options vfio_iommu_type1 allow_unsafe_interrupts=1
----

Please be aware that this option can make your system unstable.
====

GPU Passthrough Notes
^^^^^^^^^^^^^^^^^^^^^

It is not possible to display the frame buffer of the GPU via NoVNC or SPICE on
the {pve} web interface.

When passing through a whole GPU or a vGPU and graphic output is wanted, one
has to either physically connect a monitor to the card, or configure a remote
desktop software (for example, VNC or RDP) inside the guest.

If you want to use the GPU as a hardware accelerator, for example, for
programs using OpenCL or CUDA, this is not required.

Host Device Passthrough
~~~~~~~~~~~~~~~~~~~~~~~

The most used variant of PCI(e) passthrough is to pass through a whole
PCI(e) card, for example a GPU or a network card.


Host Configuration
^^^^^^^^^^^^^^^^^^

In this case, the host must not use the card. There are two methods to achieve
this:

* pass the device IDs to the options of the 'vfio-pci' module by adding
+
----
 options vfio-pci ids=1234:5678,4321:8765
----
+
to a .conf file in */etc/modprobe.d/*, where `1234:5678` and `4321:8765` are
the vendor and device IDs obtained by:
+
----
# lspci -nn
----
+
(a sample output line is shown after this list)

* blacklist the driver completely on the host, ensuring that it is free to bind
for passthrough, with
+
----
 blacklist DRIVERNAME
----
+
in a .conf file in */etc/modprobe.d/*.

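For illustration, an `lspci -nn` output line looks like the following; the
vendor and device ID pair is the last bracketed value (here `10de:1d01`):

----
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP108 [GeForce GT 1030] [10de:1d01] (rev a1)
----
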
For both methods you need to
xref:qm_pci_passthrough_update_initramfs[update the `initramfs`] again and
reboot after that.

.Verify Configuration

To check if your changes were successful, you can use

----
# lspci -nnk
----

and check your device entry. If it says

----
Kernel driver in use: vfio-pci
----

or the 'in use' line is missing entirely, the device is ready to be used for
passthrough.

[[qm_pci_passthrough_vm_config]]
VM Configuration
^^^^^^^^^^^^^^^^
To pass through the device, you need to set the *hostpciX* option in the VM
configuration, for example by executing:

----
# qm set VMID -hostpci0 00:02.0
----

If your device has multiple functions (e.g., `00:02.0` and `00:02.1`),
you can pass them all through together with the shortened syntax `00:02`.
This is equivalent to checking the ``All Functions'' checkbox in the
web interface.

There are some options which may be necessary, depending on the device
and guest OS:

* *x-vga=on|off* marks the PCI(e) device as the primary GPU of the VM.
With this enabled the *vga* configuration option will be ignored.

* *pcie=on|off* tells {pve} to use a PCIe or PCI port. Some guest/device
combinations require PCIe rather than PCI. PCIe is only available for 'q35'
machine types.

* *rombar=on|off* makes the firmware ROM visible to the guest. Default is on.
Some PCI(e) devices need this disabled.

* *romfile=<path>* is an optional path to a ROM file for the device to use.
This is a relative path under */usr/share/kvm/*.

.Example

An example of PCIe passthrough with a GPU set to primary:

----
# qm set VMID -hostpci0 02:00,pcie=on,x-vga=on
----

.PCI ID overrides

You can override the PCI vendor ID, device ID, and subsystem IDs that will be
seen by the guest. This is useful if your device is a variant with an ID that
your guest's drivers don't recognize, but you want to force those drivers to be
loaded anyway (e.g. if you know your device shares the same chipset as a
supported variant).

The available options are `vendor-id`, `device-id`, `sub-vendor-id`, and
`sub-device-id`. You can set any or all of these to override your device's
default IDs.

For example:

----
# qm set VMID -hostpci0 02:00,device-id=0x10f6,sub-vendor-id=0x0000
----


Other considerations
^^^^^^^^^^^^^^^^^^^^

When passing through a GPU, the best compatibility is reached when using
'q35' as machine type, 'OVMF' ('EFI' for VMs) instead of SeaBIOS, and PCIe
instead of PCI. Note that if you want to use 'OVMF' for GPU passthrough, the
GPU needs to have an EFI capable ROM; otherwise, use SeaBIOS instead.

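As a sketch, switching an existing VM to this combination could look like the
following (an EFI disk should be added as well, for example via the web
interface):

----
# qm set VMID -machine q35 -bios ovmf
----
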
SR-IOV
~~~~~~

Another variant for passing through PCI(e) devices is to use the hardware
virtualization features of your devices, if available.

'SR-IOV' (**S**ingle-**R**oot **I**nput/**O**utput **V**irtualization) enables
a single device to provide multiple 'VF' (**V**irtual **F**unctions) to the
system. Each of those VFs can be used in a different VM, with full hardware
features and also better performance and lower latency than software
virtualized devices.

Currently, the most common use case for this are NICs (**N**etwork
**I**nterface **C**ards) with SR-IOV support, which can provide multiple VFs
per physical port. This allows features such as checksum offloading to be used
inside a VM, reducing the (host) CPU overhead.


Host Configuration
^^^^^^^^^^^^^^^^^^

Generally, there are two methods for enabling virtual functions on a device.

* Sometimes there is an option for the driver module, e.g. for some
Intel drivers:
+
----
 max_vfs=4
----
+
which could be put in a file with a '.conf' ending under */etc/modprobe.d/*.
(Do not forget to update your initramfs after that.)
+
Please refer to your driver module documentation for the exact
parameters and options.

* The second, more generic, approach is using `sysfs`.
If the device and driver support this, you can change the number of VFs on
the fly. For example, to set up 4 VFs on device 0000:01:00.0 execute:
+
----
# echo 4 > /sys/bus/pci/devices/0000:01:00.0/sriov_numvfs
----
+
To make this change persistent, you can use the `sysfsutils` Debian package.
After installation, configure it via */etc/sysfs.conf* or a `FILE.conf` in
*/etc/sysfs.d/*, as sketched below.

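For instance, to persistently create 4 VFs on device 0000:01:00.0, an entry
like this sketch (the path is given relative to `/sys`) could be added there:

----
bus/pci/devices/0000:01:00.0/sriov_numvfs = 4
----
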
VM Configuration
^^^^^^^^^^^^^^^^

After creating VFs, you should see them as separate PCI(e) devices when
outputting them with `lspci`. Get their ID and pass them through like a
xref:qm_pci_passthrough_vm_config[normal PCI(e) device].

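As an illustration, VFs typically show up with 'Virtual Function' in their
name (the NIC model and addresses here are just examples):

----
# lspci | grep -i 'virtual function'
01:10.0 Ethernet controller: Intel Corporation 82576 Virtual Function (rev 01)
# qm set VMID -hostpci0 01:10.0
----
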
Other considerations
^^^^^^^^^^^^^^^^^^^^

For this feature, platform support is especially important. It may be necessary
to enable this feature in the BIOS/UEFI first, or to use a specific PCI(e) port
for it to work. If in doubt, consult the manual of the platform or contact its
vendor.

Mediated Devices (vGPU, GVT-g)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Mediated devices are another method to reuse features and performance from
physical hardware for virtualized hardware. These are most commonly found in
virtualized GPU setups, such as Intel's GVT-g and NVIDIA's vGPUs used in their
GRID technology.

With this, a physical card is able to create virtual cards, similar to SR-IOV.
The difference is that mediated devices do not appear as PCI(e) devices in the
host, and are thus only suited for use in virtual machines.


Host Configuration
^^^^^^^^^^^^^^^^^^

In general, your card's driver must support that feature, otherwise it will
not work. So please refer to your vendor for compatible drivers and how to
configure them.

Intel's drivers for GVT-g are integrated in the kernel and should work
with 5th, 6th and 7th generation Intel Core processors, as well as E3 v4, E3
v5 and E3 v6 Xeon processors.

To enable it for Intel Graphics, you have to make sure to load the module
'kvmgt' (for example via `/etc/modules`) and to enable it on the
xref:sysboot_edit_kernel_cmdline[kernel command line] by adding the following
parameter:

----
 i915.enable_gvt=1
----

After that remember to
xref:qm_pci_passthrough_update_initramfs[update the `initramfs`],
and reboot your host.

VM Configuration
^^^^^^^^^^^^^^^^

To use a mediated device, simply specify the `mdev` property on a `hostpciX`
VM configuration option.

You can get the supported devices via 'sysfs'. For example, to list the
supported types for the device '0000:00:02.0', you would simply execute:

----
# ls /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types
----

Each entry is a directory which contains the following important files:

* 'available_instances' contains the number of still available instances of
this type; each 'mdev' used in a VM reduces this.
* 'description' contains a short description of the capabilities of the type
* 'create' is the endpoint to create such a device. {pve} does this
automatically for you, if a 'hostpciX' option with `mdev` is configured.

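For instance, to inspect a listed type before using it (the type name here is
just an example):

----
# cat /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types/i915-GVTg_V5_4/description
----
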
Example configuration with an `Intel GVT-g vGPU` (`Intel Skylake 6700k`):

----
# qm set VMID -hostpci0 00:02.0,mdev=i915-GVTg_V5_4
----

With this set, {pve} automatically creates such a device on VM start, and
cleans it up again when the VM stops.

ifdef::wiki[]

See Also
~~~~~~~~

* link:/wiki/Pci_passthrough[PCI Passthrough Examples]

endif::wiki[]