]> git.proxmox.com Git - pve-docs.git/blame_incremental - qm-pci-passthrough.adoc
network: override device names: suggest running update-initramfs
[pve-docs.git] / qm-pci-passthrough.adoc
... / ...
CommitLineData
1[[qm_pci_passthrough]]
2PCI(e) Passthrough
3------------------
4ifdef::wiki[]
5:pve-toplevel:
6endif::wiki[]
7
8PCI(e) passthrough is a mechanism to give a virtual machine control over
9a PCI device from the host. This can have some advantages over using
10virtualized hardware, for example lower latency, higher performance, or more
11features (e.g., offloading).
12
13But, if you pass through a device to a virtual machine, you cannot use that
14device anymore on the host or in any other VM.
15
16General Requirements
17~~~~~~~~~~~~~~~~~~~~
18
19Since passthrough is a feature which also needs hardware support, there are
20some requirements to check and preparations to be done to make it work.
21
22
23Hardware
24^^^^^^^^
25Your hardware needs to support `IOMMU` (*I*/*O* **M**emory **M**anagement
26**U**nit) interrupt remapping, this includes the CPU and the mainboard.
27
28Generally, Intel systems with VT-d, and AMD systems with AMD-Vi support this.
29But it is not guaranteed that everything will work out of the box, due
30to bad hardware implementation and missing or low quality drivers.
31
32Further, server grade hardware has often better support than consumer grade
33hardware, but even then, many modern system can support this.
34
35Please refer to your hardware vendor to check if they support this feature
36under Linux for your specific setup.
37
38
39Configuration
40^^^^^^^^^^^^^
41
42Once you ensured that your hardware supports passthrough, you will need to do
43some configuration to enable PCI(e) passthrough.
44
45
46.IOMMU
47
48The IOMMU has to be activated on the
49xref:sysboot_edit_kernel_cmdline[kernel commandline].
50
51The command line parameters are:
52
53* for Intel CPUs:
54+
55----
56 intel_iommu=on
57----
58* for AMD CPUs:
59+
60----
61 amd_iommu=on
62----
63
64
65.Kernel Modules
66
67You have to make sure the following modules are loaded. This can be achieved by
68adding them to `'/etc/modules''
69
70----
71 vfio
72 vfio_iommu_type1
73 vfio_pci
74 vfio_virqfd
75----
76
77[[qm_pci_passthrough_update_initramfs]]
78After changing anything modules related, you need to refresh your
79`initramfs`. On {pve} this can be done by executing:
80
81----
82# update-initramfs -u -k all
83----
84
85.Finish Configuration
86
87Finally reboot to bring the changes into effect and check that it is indeed
88enabled.
89
90----
91# dmesg | grep -e DMAR -e IOMMU -e AMD-Vi
92----
93
94should display that `IOMMU`, `Directed I/O` or `Interrupt Remapping` is
95enabled, depending on hardware and kernel the exact message can vary.
96
97It is also important that the device(s) you want to pass through
98are in a *separate* `IOMMU` group. This can be checked with:
99
100----
101# find /sys/kernel/iommu_groups/ -type l
102----
103
104It is okay if the device is in an `IOMMU` group together with its functions
105(e.g. a GPU with the HDMI Audio device) or with its root port or PCI(e) bridge.
106
107.PCI(e) slots
108[NOTE]
109====
110Some platforms handle their physical PCI(e) slots differently. So, sometimes
111it can help to put the card in a another PCI(e) slot, if you do not get the
112desired `IOMMU` group separation.
113====
114
115.Unsafe interrupts
116[NOTE]
117====
118For some platforms, it may be necessary to allow unsafe interrupts.
119For this add the following line in a file ending with `.conf' file in
120*/etc/modprobe.d/*:
121
122----
123 options vfio_iommu_type1 allow_unsafe_interrupts=1
124----
125
126Please be aware that this option can make your system unstable.
127====
128
129GPU Passthrough Notes
130^^^^^^^^^^^^^^^^^^^^^
131
132It is not possible to display the frame buffer of the GPU via NoVNC or SPICE on
133the {pve} web interface.
134
135When passing through a whole GPU or a vGPU and graphic output is wanted, one
136has to either physically connect a monitor to the card, or configure a remote
137desktop software (for example, VNC or RDP) inside the guest.
138
139If you want to use the GPU as a hardware accelerator, for example, for
140programs using OpenCL or CUDA, this is not required.
141
142Host Device Passthrough
143~~~~~~~~~~~~~~~~~~~~~~~
144
145The most used variant of PCI(e) passthrough is to pass through a whole
146PCI(e) card, for example a GPU or a network card.
147
148
149Host Configuration
150^^^^^^^^^^^^^^^^^^
151
152In this case, the host must not use the card. There are two methods to achieve
153this:
154
155* pass the device IDs to the options of the 'vfio-pci' modules by adding
156+
157----
158 options vfio-pci ids=1234:5678,4321:8765
159----
160+
161to a .conf file in */etc/modprobe.d/* where `1234:5678` and `4321:8765` are
162the vendor and device IDs obtained by:
163+
164----
165# lspci -nn
166----
167
168* blacklist the driver completely on the host, ensuring that it is free to bind
169for passthrough, with
170+
171----
172 blacklist DRIVERNAME
173----
174+
175in a .conf file in */etc/modprobe.d/*.
176
177For both methods you need to
178xref:qm_pci_passthrough_update_initramfs[update the `initramfs`] again and
179reboot after that.
180
181.Verify Configuration
182
183To check if your changes were successful, you can use
184
185----
186# lspci -nnk
187----
188
189and check your device entry. If it says
190
191----
192Kernel driver in use: vfio-pci
193----
194
195or the 'in use' line is missing entirely, the device is ready to be used for
196passthrough.
197
198[[qm_pci_passthrough_vm_config]]
199VM Configuration
200^^^^^^^^^^^^^^^^
201To pass through the device you need to set the *hostpciX* option in the VM
202configuration, for example by executing:
203
204----
205# qm set VMID -hostpci0 00:02.0
206----
207
208If your device has multiple functions (e.g., ``00:02.0`' and ``00:02.1`' ),
209you can pass them through all together with the shortened syntax ``00:02`'.
210This is equivalent with checking the ``All Functions`' checkbox in the
211web-interface.
212
213There are some options to which may be necessary, depending on the device
214and guest OS:
215
216* *x-vga=on|off* marks the PCI(e) device as the primary GPU of the VM.
217With this enabled the *vga* configuration option will be ignored.
218
219* *pcie=on|off* tells {pve} to use a PCIe or PCI port. Some guests/device
220combination require PCIe rather than PCI. PCIe is only available for 'q35'
221machine types.
222
223* *rombar=on|off* makes the firmware ROM visible for the guest. Default is on.
224Some PCI(e) devices need this disabled.
225
226* *romfile=<path>*, is an optional path to a ROM file for the device to use.
227This is a relative path under */usr/share/kvm/*.
228
229.Example
230
231An example of PCIe passthrough with a GPU set to primary:
232
233----
234# qm set VMID -hostpci0 02:00,pcie=on,x-vga=on
235----
236
237
238Other considerations
239^^^^^^^^^^^^^^^^^^^^
240
241When passing through a GPU, the best compatibility is reached when using
242'q35' as machine type, 'OVMF' ('EFI' for VMs) instead of SeaBIOS and PCIe
243instead of PCI. Note that if you want to use 'OVMF' for GPU passthrough, the
244GPU needs to have an EFI capable ROM, otherwise use SeaBIOS instead.
245
246SR-IOV
247~~~~~~
248
249Another variant for passing through PCI(e) devices, is to use the hardware
250virtualization features of your devices, if available.
251
252'SR-IOV' (**S**ingle-**R**oot **I**nput/**O**utput **V**irtualization) enables
253a single device to provide multiple 'VF' (**V**irtual **F**unctions) to the
254system. Each of those 'VF' can be used in a different VM, with full hardware
255features and also better performance and lower latency than software
256virtualized devices.
257
258Currently, the most common use case for this are NICs (**N**etwork
259**I**nterface **C**ard) with SR-IOV support, which can provide multiple VFs per
260physical port. This allows using features such as checksum offloading, etc. to
261be used inside a VM, reducing the (host) CPU overhead.
262
263
264Host Configuration
265^^^^^^^^^^^^^^^^^^
266
267Generally, there are two methods for enabling virtual functions on a device.
268
269* sometimes there is an option for the driver module e.g. for some
270Intel drivers
271+
272----
273 max_vfs=4
274----
275+
276which could be put file with '.conf' ending under */etc/modprobe.d/*.
277(Do not forget to update your initramfs after that)
278+
279Please refer to your driver module documentation for the exact
280parameters and options.
281
282* The second, more generic, approach is using the `sysfs`.
283If a device and driver supports this you can change the number of VFs on
284the fly. For example, to setup 4 VFs on device 0000:01:00.0 execute:
285+
286----
287# echo 4 > /sys/bus/pci/devices/0000:01:00.0/sriov_numvfs
288----
289+
290To make this change persistent you can use the `sysfsutils` Debian package.
291After installation configure it via */etc/sysfs.conf* or a `FILE.conf' in
292*/etc/sysfs.d/*.
293
294VM Configuration
295^^^^^^^^^^^^^^^^
296
297After creating VFs, you should see them as separate PCI(e) devices when
298outputting them with `lspci`. Get their ID and pass them through like a
299xref:qm_pci_passthrough_vm_config[normal PCI(e) device].
300
301Other considerations
302^^^^^^^^^^^^^^^^^^^^
303
304For this feature, platform support is especially important. It may be necessary
305to enable this feature in the BIOS/EFI first, or to use a specific PCI(e) port
306for it to work. In doubt, consult the manual of the platform or contact its
307vendor.
308
309Mediated Devices (vGPU, GVT-g)
310~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
311
312Mediated devices are another method to reuse features and performance from
313physical hardware for virtualized hardware. These are found most common in
314virtualized GPU setups such as Intel's GVT-g and NVIDIA's vGPUs used in their
315GRID technology.
316
317With this, a physical Card is able to create virtual cards, similar to SR-IOV.
318The difference is that mediated devices do not appear as PCI(e) devices in the
319host, and are such only suited for using in virtual machines.
320
321
322Host Configuration
323^^^^^^^^^^^^^^^^^^
324
325In general your card's driver must support that feature, otherwise it will
326not work. So please refer to your vendor for compatible drivers and how to
327configure them.
328
329Intel's drivers for GVT-g are integrated in the Kernel and should work
330with 5th, 6th and 7th generation Intel Core Processors, as well as E3 v4, E3
331v5 and E3 v6 Xeon Processors.
332
333To enable it for Intel Graphics, you have to make sure to load the module
334'kvmgt' (for example via `/etc/modules`) and to enable it on the
335xref:sysboot_edit_kernel_cmdline[Kernel commandline] and add the following parameter:
336
337----
338 i915.enable_gvt=1
339----
340
341After that remember to
342xref:qm_pci_passthrough_update_initramfs[update the `initramfs`],
343and reboot your host.
344
345VM Configuration
346^^^^^^^^^^^^^^^^
347
348To use a mediated device, simply specify the `mdev` property on a `hostpciX`
349VM configuration option.
350
351You can get the supported devices via the 'sysfs'. For example, to list the
352supported types for the device '0000:00:02.0' you would simply execute:
353
354----
355# ls /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types
356----
357
358Each entry is a directory which contains the following important files:
359
360* 'available_instances' contains the amount of still available instances of
361this type, each 'mdev' use in a VM reduces this.
362* 'description' contains a short description about the capabilities of the type
363* 'create' is the endpoint to create such a device, {pve} does this
364automatically for you, if a 'hostpciX' option with `mdev` is configured.
365
366Example configuration with an `Intel GVT-g vGPU` (`Intel Skylake 6700k`):
367
368----
369# qm set VMID -hostpci0 00:02.0,mdev=i915-GVTg_V5_4
370----
371
372With this set, {pve} automatically creates such a device on VM start, and
373cleans it up again when the VM stops.
374
375ifdef::wiki[]
376
377See Also
378~~~~~~~~
379
380* link:/wiki/Pci_passthrough[PCI Passthrough Examples]
381
382endif::wiki[]