]> git.proxmox.com Git - pve-docs.git/blame - qm-pci-passthrough.adoc
Refer to the bootloader chapter in remaining docs
[pve-docs.git] / qm-pci-passthrough.adoc
CommitLineData
6e4c46c4
DC
1[[qm_pci_passthrough]]
2PCI(e) Passthrough
3------------------
e582833b
DC
4ifdef::wiki[]
5:pve-toplevel:
6endif::wiki[]
6e4c46c4
DC
7
8PCI(e) passthrough is a mechanism to give a virtual machine control over
49f20f1b
TL
9a PCI device from the host. This can have some advantages over using
10virtualized hardware, for example lower latency, higher performance, or more
11features (e.g., offloading).
6e4c46c4 12
49f20f1b 13But, if you pass through a device to a virtual machine, you cannot use that
6e4c46c4
DC
14device anymore on the host or in any other VM.
15
16General Requirements
17~~~~~~~~~~~~~~~~~~~~
18
19Since passthrough is a feature which also needs hardware support, there are
49f20f1b
TL
20some requirements to check and preparations to be done to make it work.
21
6e4c46c4
DC
22
23Hardware
24^^^^^^^^
49f20f1b
TL
25Your hardware needs to support `IOMMU` (*I*/*O* **M**emory **M**anagement
26**U**nit) interrupt remapping, this includes the CPU and the mainboard.
6e4c46c4 27
49f20f1b
TL
28Generally, Intel systems with VT-d, and AMD systems with AMD-Vi support this.
29But it is not guaranteed that everything will work out of the box, due
30to bad hardware implementation and missing or low quality drivers.
6e4c46c4 31
49f20f1b 32Further, server grade hardware has often better support than consumer grade
6e4c46c4
DC
33hardware, but even then, many modern system can support this.
34
49f20f1b 35Please refer to your hardware vendor to check if they support this feature
a22d7c24 36under Linux for your specific setup.
49f20f1b 37
6e4c46c4
DC
38
39Configuration
40^^^^^^^^^^^^^
41
49f20f1b
TL
42Once you ensured that your hardware supports passthrough, you will need to do
43some configuration to enable PCI(e) passthrough.
6e4c46c4 44
6e4c46c4 45
39d84f28 46.IOMMU
6e4c46c4 47
1748211a
SI
48The IOMMU has to be activated on the
49xref:edit_kernel_cmdline[kernel commandline].
50
51The command line parameters are:
6e4c46c4 52
49f20f1b
TL
53* for Intel CPUs:
54+
55----
56 intel_iommu=on
57----
58* for AMD CPUs:
59+
60----
6e4c46c4 61 amd_iommu=on
49f20f1b 62----
6e4c46c4 63
6e4c46c4 64
39d84f28 65.Kernel Modules
6e4c46c4 66
49f20f1b
TL
67You have to make sure the following modules are loaded. This can be achieved by
68adding them to `'/etc/modules''
6e4c46c4 69
49f20f1b 70----
6e4c46c4
DC
71 vfio
72 vfio_iommu_type1
73 vfio_pci
74 vfio_virqfd
49f20f1b 75----
6e4c46c4 76
49f20f1b 77[[qm_pci_passthrough_update_initramfs]]
6e4c46c4 78After changing anything modules related, you need to refresh your
49f20f1b 79`initramfs`. On {pve} this can be done by executing:
6e4c46c4
DC
80
81----
49f20f1b 82# update-initramfs -u -k all
6e4c46c4
DC
83----
84
1748211a
SI
85If you are using `systemd-boot` make sure to
86xref:systemd-boot-refresh[sync the new initramfs to the bootable partitions].
87
39d84f28 88.Finish Configuration
49f20f1b
TL
89
90Finally reboot to bring the changes into effect and check that it is indeed
91enabled.
6e4c46c4
DC
92
93----
5e235b99 94# dmesg | grep -e DMAR -e IOMMU -e AMD-Vi
6e4c46c4
DC
95----
96
49f20f1b
TL
97should display that `IOMMU`, `Directed I/O` or `Interrupt Remapping` is
98enabled, depending on hardware and kernel the exact message can vary.
6e4c46c4
DC
99
100It is also important that the device(s) you want to pass through
49f20f1b 101are in a *separate* `IOMMU` group. This can be checked with:
6e4c46c4
DC
102
103----
49f20f1b 104# find /sys/kernel/iommu_groups/ -type l
6e4c46c4
DC
105----
106
49f20f1b 107It is okay if the device is in an `IOMMU` group together with its functions
6e4c46c4
DC
108(e.g. a GPU with the HDMI Audio device) or with its root port or PCI(e) bridge.
109
110.PCI(e) slots
111[NOTE]
112====
49f20f1b
TL
113Some platforms handle their physical PCI(e) slots differently. So, sometimes
114it can help to put the card in a another PCI(e) slot, if you do not get the
115desired `IOMMU` group separation.
6e4c46c4
DC
116====
117
118.Unsafe interrupts
119[NOTE]
120====
121For some platforms, it may be necessary to allow unsafe interrupts.
49f20f1b
TL
122For this add the following line in a file ending with `.conf' file in
123*/etc/modprobe.d/*:
6e4c46c4 124
49f20f1b 125----
6e4c46c4 126 options vfio_iommu_type1 allow_unsafe_interrupts=1
49f20f1b 127----
6e4c46c4
DC
128
129Please be aware that this option can make your system unstable.
130====
131
082b32fb
TL
132GPU Passthrough Notes
133^^^^^^^^^^^^^^^^^^^^^
13cae0c1 134
082b32fb
TL
135It is not possible to display the frame buffer of the GPU via NoVNC or SPICE on
136the {pve} web interface.
13cae0c1 137
082b32fb
TL
138When passing through a whole GPU or a vGPU and graphic output is wanted, one
139has to either physically connect a monitor to the card, or configure a remote
140desktop software (for example, VNC or RDP) inside the guest.
13cae0c1 141
082b32fb
TL
142If you want to use the GPU as a hardware accelerator, for example, for
143programs using OpenCL or CUDA, this is not required.
13cae0c1 144
49f20f1b 145Host Device Passthrough
6e4c46c4
DC
146~~~~~~~~~~~~~~~~~~~~~~~
147
148The most used variant of PCI(e) passthrough is to pass through a whole
49f20f1b
TL
149PCI(e) card, for example a GPU or a network card.
150
6e4c46c4
DC
151
152Host Configuration
153^^^^^^^^^^^^^^^^^^
154
49f20f1b
TL
155In this case, the host cannot use the card. There are two methods to achieve
156this:
6e4c46c4 157
49f20f1b
TL
158* pass the device IDs to the options of the 'vfio-pci' modules by adding
159+
160----
6e4c46c4 161 options vfio-pci ids=1234:5678,4321:8765
6e4c46c4 162----
49f20f1b
TL
163+
164to a .conf file in */etc/modprobe.d/* where `1234:5678` and `4321:8765` are
165the vendor and device IDs obtained by:
166+
167----
168# lcpci -nn
6e4c46c4
DC
169----
170
49f20f1b
TL
171* blacklist the driver completely on the host, ensuring that it is free to bind
172for passthrough, with
173+
174----
6e4c46c4 175 blacklist DRIVERNAME
49f20f1b
TL
176----
177+
178in a .conf file in */etc/modprobe.d/*.
6e4c46c4 179
49f20f1b
TL
180For both methods you need to
181xref:qm_pci_passthrough_update_initramfs[update the `initramfs`] again and
182reboot after that.
6e4c46c4 183
49f20f1b 184[[qm_pci_passthrough_vm_config]]
6e4c46c4
DC
185VM Configuration
186^^^^^^^^^^^^^^^^
49f20f1b
TL
187To pass through the device you need to set the *hostpciX* option in the VM
188configuration, for example by executing:
6e4c46c4
DC
189
190----
49f20f1b 191# qm set VMID -hostpci0 00:02.0
6e4c46c4
DC
192----
193
5ee3d3cd 194If your device has multiple functions (e.g., ``00:02.0`' and ``00:02.1`' ),
1c1241f2 195you can pass them through all together with the shortened syntax ``00:02`'
6e4c46c4
DC
196
197There are some options to which may be necessary, depending on the device
49f20f1b
TL
198and guest OS:
199
200* *x-vga=on|off* marks the PCI(e) device as the primary GPU of the VM.
201With this enabled the *vga* configuration option will be ignored.
6e4c46c4 202
6e4c46c4 203* *pcie=on|off* tells {pve} to use a PCIe or PCI port. Some guests/device
49f20f1b
TL
204combination require PCIe rather than PCI. PCIe is only available for 'q35'
205machine types.
206
6e4c46c4
DC
207* *rombar=on|off* makes the firmware ROM visible for the guest. Default is on.
208Some PCI(e) devices need this disabled.
49f20f1b 209
6e4c46c4 210* *romfile=<path>*, is an optional path to a ROM file for the device to use.
49f20f1b
TL
211This is a relative path under */usr/share/kvm/*.
212
39d84f28 213.Example
6e4c46c4
DC
214
215An example of PCIe passthrough with a GPU set to primary:
216
217----
49f20f1b 218# qm set VMID -hostpci0 02:00,pcie=on,x-vga=on
6e4c46c4
DC
219----
220
49f20f1b 221
6e4c46c4
DC
222Other considerations
223^^^^^^^^^^^^^^^^^^^^
224
225When passing through a GPU, the best compatibility is reached when using
49f20f1b
TL
226'q35' as machine type, 'OVMF' ('EFI' for VMs) instead of SeaBIOS and PCIe
227instead of PCI. Note that if you want to use 'OVMF' for GPU passthrough, the
228GPU needs to have an EFI capable ROM, otherwise use SeaBIOS instead.
6e4c46c4
DC
229
230SR-IOV
231~~~~~~
232
49f20f1b
TL
233Another variant for passing through PCI(e) devices, is to use the hardware
234virtualization features of your devices, if available.
235
236'SR-IOV' (**S**ingle-**R**oot **I**nput/**O**utput **V**irtualization) enables
237a single device to provide multiple 'VF' (**V**irtual **F**unctions) to the
238system. Each of those 'VF' can be used in a different VM, with full hardware
239features and also better performance and lower latency than software
240virtualized devices.
6e4c46c4 241
49f20f1b
TL
242Currently, the most common use case for this are NICs (**N**etwork
243**I**nterface **C**ard) with SR-IOV support, which can provide multiple VFs per
244physical port. This allows using features such as checksum offloading, etc. to
245be used inside a VM, reducing the (host) CPU overhead.
6e4c46c4 246
6e4c46c4
DC
247
248Host Configuration
249^^^^^^^^^^^^^^^^^^
250
49f20f1b 251Generally, there are two methods for enabling virtual functions on a device.
6e4c46c4 252
49f20f1b 253* sometimes there is an option for the driver module e.g. for some
6e4c46c4 254Intel drivers
49f20f1b
TL
255+
256----
6e4c46c4 257 max_vfs=4
49f20f1b
TL
258----
259+
260which could be put file with '.conf' ending under */etc/modprobe.d/*.
6e4c46c4 261(Do not forget to update your initramfs after that)
49f20f1b 262+
6e4c46c4
DC
263Please refer to your driver module documentation for the exact
264parameters and options.
265
49f20f1b
TL
266* The second, more generic, approach is using the `sysfs`.
267If a device and driver supports this you can change the number of VFs on
268the fly. For example, to setup 4 VFs on device 0000:01:00.0 execute:
269+
6e4c46c4 270----
49f20f1b 271# echo 4 > /sys/bus/pci/devices/0000:01:00.0/sriov_numvfs
6e4c46c4 272----
49f20f1b
TL
273+
274To make this change persistent you can use the `sysfsutils` Debian package.
39d84f28 275After installation configure it via */etc/sysfs.conf* or a `FILE.conf' in
49f20f1b 276*/etc/sysfs.d/*.
6e4c46c4
DC
277
278VM Configuration
279^^^^^^^^^^^^^^^^
280
49f20f1b
TL
281After creating VFs, you should see them as separate PCI(e) devices when
282outputting them with `lspci`. Get their ID and pass them through like a
283xref:qm_pci_passthrough_vm_config[normal PCI(e) device].
6e4c46c4
DC
284
285Other considerations
286^^^^^^^^^^^^^^^^^^^^
287
288For this feature, platform support is especially important. It may be necessary
49f20f1b
TL
289to enable this feature in the BIOS/EFI first, or to use a specific PCI(e) port
290for it to work. In doubt, consult the manual of the platform or contact its
291vendor.
050192c5 292
d25f097c
TL
293Mediated Devices (vGPU, GVT-g)
294~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
050192c5 295
a22d7c24 296Mediated devices are another method to reuse features and performance from
d25f097c
TL
297physical hardware for virtualized hardware. These are found most common in
298virtualized GPU setups such as Intels GVT-g and Nvidias vGPUs used in their
299GRID technology.
300
301With this, a physical Card is able to create virtual cards, similar to SR-IOV.
302The difference is that mediated devices do not appear as PCI(e) devices in the
303host, and are such only suited for using in virtual machines.
050192c5 304
050192c5
DC
305
306Host Configuration
307^^^^^^^^^^^^^^^^^^
308
d25f097c 309In general your card's driver must support that feature, otherwise it will
a22d7c24 310not work. So please refer to your vendor for compatible drivers and how to
050192c5
DC
311configure them.
312
a22d7c24
SR
313Intels drivers for GVT-g are integrated in the Kernel and should work
314with 5th, 6th and 7th generation Intel Core Processors, as well as E3 v4, E3
315v5 and E3 v6 Xeon Processors.
050192c5 316
1748211a
SI
317To enable it for Intel Graphics, you have to make sure to load the module
318'kvmgt' (for example via `/etc/modules`) and to enable it on the
319xref:edit_kernel_cmdline[Kernel commandline] and add the following parameter:
050192c5
DC
320
321----
322 i915.enable_gvt=1
323----
324
325After that remember to
326xref:qm_pci_passthrough_update_initramfs[update the `initramfs`],
1748211a 327and reboot your host.
050192c5
DC
328
329VM Configuration
330^^^^^^^^^^^^^^^^
331
d25f097c
TL
332To use a mediated device, simply specify the `mdev` property on a `hostpciX`
333VM configuration option.
050192c5 334
d25f097c
TL
335You can get the supported devices via the 'sysfs'. For example, to list the
336supported types for the device '0000:00:02.0' you would simply execute:
050192c5
DC
337
338----
339# ls /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types
340----
341
342Each entry is a directory which contains the following important files:
343
d25f097c
TL
344* 'available_instances' contains the amount of still available instances of
345this type, each 'mdev' use in a VM reduces this.
050192c5 346* 'description' contains a short description about the capabilities of the type
d25f097c
TL
347* 'create' is the endpoint to create such a device, {pve} does this
348automatically for you, if a 'hostpciX' option with `mdev` is configured.
050192c5 349
d25f097c 350Example configuration with an `Intel GVT-g vGPU` (`Intel Skylake 6700k`):
050192c5
DC
351
352----
353# qm set VMID -hostpci0 00:02.0,mdev=i915-GVTg_V5_4
354----
355
356With this set, {pve} automatically creates such a device on VM start, and
357cleans it up again when the VM stops.
e582833b
DC
358
359ifdef::wiki[]
360
361See Also
362~~~~~~~~
363
364* link:/wiki/Pci_passthrough[PCI Passthrough Examples]
365
366endif::wiki[]