]> git.proxmox.com Git - mirror_ovs.git/blame - FAQ.rst
doc: Further populate the 'howto' section
[mirror_ovs.git] / FAQ.rst
CommitLineData
edde4861
SF
1..
2 Licensed under the Apache License, Version 2.0 (the "License"); you may
3 not use this file except in compliance with the License. You may obtain
4 a copy of the License at
5
6 http://www.apache.org/licenses/LICENSE-2.0
7
8 Unless required by applicable law or agreed to in writing, software
9 distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
10 WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
11 License for the specific language governing permissions and limitations
12 under the License.
13
14 Convention for heading levels in Open vSwitch documentation:
15
16 ======= Heading 0 (reserved for the title in a document)
17 ------- Heading 1
18 ~~~~~~~ Heading 2
19 +++++++ Heading 3
20 ''''''' Heading 4
21
22 Avoid deeper levels because they do not render well.
23
24==========================
25Frequently Asked Questions
26==========================
27
28General
29-------
30
31Q: What is Open vSwitch?
32
33 A: Open vSwitch is a production quality open source software switch
34 designed to be used as a vswitch in virtualized server environments. A
35 vswitch forwards traffic between different VMs on the same physical host
36 and also forwards traffic between VMs and the physical network. Open
37 vSwitch supports standard management interfaces (e.g. sFlow, NetFlow,
38 IPFIX, RSPAN, CLI), and is open to programmatic extension and control using
39 OpenFlow and the OVSDB management protocol.
40
41 Open vSwitch as designed to be compatible with modern switching chipsets.
42 This means that it can be ported to existing high-fanout switches allowing
43 the same flexible control of the physical infrastructure as the virtual
44 infrastructure. It also means that Open vSwitch will be able to take
45 advantage of on-NIC switching chipsets as their functionality matures.
46
47Q: What virtualization platforms can use Open vSwitch?
48
49 A: Open vSwitch can currently run on any Linux-based virtualization
50 platform (kernel 3.10 and newer), including: KVM, VirtualBox, Xen, Xen
51 Cloud Platform, XenServer. As of Linux 3.3 it is part of the mainline
52 kernel. The bulk of the code is written in platform- independent C and is
53 easily ported to other environments. We welcome inquires about integrating
54 Open vSwitch with other virtualization platforms.
55
56Q: How can I try Open vSwitch?
57
58 A: The Open vSwitch source code can be built on a Linux system. You can
59 build and experiment with Open vSwitch on any Linux machine. Packages for
60 various Linux distributions are available on many platforms, including:
61 Debian, Ubuntu, Fedora.
62
63 You may also download and run a virtualization platform that already has
64 Open vSwitch integrated. For example, download a recent ISO for XenServer
65 or Xen Cloud Platform. Be aware that the version integrated with a
66 particular platform may not be the most recent Open vSwitch release.
67
68Q: Does Open vSwitch only work on Linux?
69
70 A: No, Open vSwitch has been ported to a number of different operating
71 systems and hardware platforms. Most of the development work occurs on
72 Linux, but the code should be portable to any POSIX system. We've seen
73 Open vSwitch ported to a number of different platforms, including FreeBSD,
74 Windows, and even non-POSIX embedded systems.
75
76 By definition, the Open vSwitch Linux kernel module only works on Linux and
77 will provide the highest performance. However, a userspace datapath is
78 available that should be very portable.
79
80Q: What's involved with porting Open vSwitch to a new platform or switching ASIC?
81
82 A: The `porting document <PORTING.rst>`__ describes how one would go about
83 porting Open vSwitch to a new operating system or hardware platform.
84
85Q: Why would I use Open vSwitch instead of the Linux bridge?
86
87 A: Open vSwitch is specially designed to make it easier to manage VM
88 network configuration and monitor state spread across many physical hosts
89 in dynamic virtualized environments. Refer to `WHY-OVS <WHY-OVS.rst>`__
90 for a more detailed description of how Open vSwitch relates to the Linux
91 Bridge.
92
93Q: How is Open vSwitch related to distributed virtual switches like the VMware
94vNetwork distributed switch or the Cisco Nexus 1000V?
95
96 A: Distributed vswitch applications (e.g., VMware vNetwork distributed
97 switch, Cisco Nexus 1000V) provide a centralized way to configure and
98 monitor the network state of VMs that are spread across many physical
99 hosts. Open vSwitch is not a distributed vswitch itself, rather it runs on
100 each physical host and supports remote management in a way that makes it
101 easier for developers of virtualization/cloud management platforms to offer
102 distributed vswitch capabilities.
103
104 To aid in distribution, Open vSwitch provides two open protocols that are
105 specially designed for remote management in virtualized network
106 environments: OpenFlow, which exposes flow-based forwarding state, and the
107 OVSDB management protocol, which exposes switch port state. In addition to
108 the switch implementation itself, Open vSwitch includes tools (ovs-ofctl,
109 ovs-vsctl) that developers can script and extend to provide distributed
110 vswitch capabilities that are closely integrated with their virtualization
111 management platform.
112
113Q: Why doesn't Open vSwitch support distribution?
114
115 A: Open vSwitch is intended to be a useful component for building flexible
116 network infrastructure. There are many different approaches to distribution
117 which balance trade-offs between simplicity, scalability, hardware
118 compatibility, convergence times, logical forwarding model, etc. The goal
119 of Open vSwitch is to be able to support all as a primitive building block
120 rather than choose a particular point in the distributed design space.
121
122Q: How can I contribute to the Open vSwitch Community?
123
124 A: You can start by joining the mailing lists and helping to answer
125 questions. You can also suggest improvements to documentation. If you
126 have a feature or bug you would like to work on, send a mail to `one of the
8bb0434a 127 mailing lists <http://openvswitch.org/mlists/>`__.
edde4861
SF
128
129Q: Why can I no longer connect to my OpenFlow controller or OVSDB manager?
130
131 A: Starting in OVS 2.4, we switched the default ports to the IANA-specified
132 port numbers for OpenFlow (6633->6653) and OVSDB (6632->6640). We
133 recommend using these port numbers, but if you cannot, all the programs
134 allow overriding the default port. See the appropriate man page.
135
136Releases
137--------
138
139Q: What does it mean for an Open vSwitch release to be LTS (long-term support)?
140
141 A: All official releases have been through a comprehensive testing process
142 and are suitable for production use. Planned releases occur twice a year.
143 If a significant bug is identified in an LTS release, we will provide an
144 updated release that includes the fix. Releases that are not LTS may not
145 be fixed and may just be supplanted by the next major release. The current
7169d4fd 146 LTS release is 2.5.x.
edde4861 147
8bb0434a 148 For more information on the Open vSwitch release process, refer to `release
f8e5a539 149 process <Documentation/release-process.rst>`__.
edde4861
SF
150
151Q: What Linux kernel versions does each Open vSwitch release work with?
152
153 A: The following table lists the Linux kernel versions against which the
154 given versions of the Open vSwitch kernel module will successfully build.
155 The Linux kernel versions are upstream kernel versions, so Linux kernels
156 modified from the upstream sources may not build in some cases even if they
157 are based on a supported version. This is most notably true of Red Hat
158 Enterprise Linux (RHEL) kernels, which are extensively modified from
159 upstream.
160
161 ============ ==============
162 Open vSwitch Linux kernel
163 ============ ==============
164 1.4.x 2.6.18 to 3.2
165 1.5.x 2.6.18 to 3.2
166 1.6.x 2.6.18 to 3.2
167 1.7.x 2.6.18 to 3.3
168 1.8.x 2.6.18 to 3.4
169 1.9.x 2.6.18 to 3.8
170 1.10.x 2.6.18 to 3.8
171 1.11.x 2.6.18 to 3.8
172 2.0.x 2.6.32 to 3.10
173 2.1.x 2.6.32 to 3.11
174 2.3.x 2.6.32 to 3.14
175 2.4.x 2.6.32 to 4.0
176 2.5.x 2.6.32 to 4.3
177 2.6.x 3.10 to 4.7
178 ============ ==============
179
180 Open vSwitch userspace should also work with the Linux kernel module built
181 into Linux 3.3 and later.
182
183 Open vSwitch userspace is not sensitive to the Linux kernel version. It
184 should build against almost any kernel, certainly against 2.6.32 and later.
185
186Q: Are all features available with all datapaths?
187
188 A: Open vSwitch supports different datapaths on different platforms. Each
189 datapath has a different feature set: the following tables try to summarize
190 the status.
191
192 Supported datapaths:
193
194 Linux upstream
195 The datapath implemented by the kernel module shipped with Linux
196 upstream. Since features have been gradually introduced into the kernel,
197 the table mentions the first Linux release whose OVS module supports the
198 feature.
199
200 Linux OVS tree
201 The datapath implemented by the Linux kernel module distributed with the
202 OVS source tree.
203
204 Userspace
205 Also known as DPDK, dpif-netdev or dummy datapath. It is the only
206 datapath that works on NetBSD, FreeBSD and Mac OSX.
207
208 Hyper-V
209 Also known as the Windows datapath.
210
211 The following table lists the datapath supported features from an Open
212 vSwitch user's perspective.
213
214 ===================== ============== ============== ========= =======
215 Feature Linux upstream Linux OVS tree Userspace Hyper-V
216 ===================== ============== ============== ========= =======
217 NAT 4.6 YES NO NO
218 Connection tracking 4.3 YES PARTIAL PARTIAL
219 Tunnel - LISP NO YES NO NO
220 Tunnel - STT NO YES NO YES
221 Tunnel - GRE 3.11 YES YES YES
222 Tunnel - VXLAN 3.12 YES YES YES
223 Tunnel - Geneve 3.18 YES YES YES
224 Tunnel - GRE-IPv6 NO NO YES NO
225 Tunnel - VXLAN-IPv6 4.3 YES YES NO
226 Tunnel - Geneve-IPv6 4.4 YES YES NO
227 QoS - Policing YES YES YES NO
228 QoS - Shaping YES YES NO NO
229 sFlow YES YES YES NO
230 IPFIX 3.10 YES YES NO
231 Set action YES YES YES PARTIAL
232 NIC Bonding YES YES YES NO
233 Multiple VTEPs YES YES YES NO
234 ===================== ============== ============== ========= =======
235
236 Do note, however:
237
238 * Only a limited set of flow fields is modifiable via the set action by the
239 Hyper-V datapath.
240 * The Hyper-V datapath only supports one physical NIC per datapath. This is
241 why bonding is not supported.
242 * The Hyper-V datapath can have at most one IP address configured as a
243 tunnel endpoint.
244
245 The following table lists features that do not *directly* impact an Open
246 vSwitch user, e.g. because their absence can be hidden by the ofproto layer
247 (usually this comes with a performance penalty).
248
249 ===================== ============== ============== ========= =======
250 Feature Linux upstream Linux OVS tree Userspace Hyper-V
251 ===================== ============== ============== ========= =======
252 SCTP flows 3.12 YES YES YES
253 MPLS 3.19 YES YES YES
254 UFID 4.0 YES YES NO
255 Megaflows 3.12 YES YES NO
256 Masked set action 4.0 YES YES NO
257 Recirculation 3.19 YES YES YES
258 TCP flags matching 3.13 YES YES NO
259 Validate flow actions YES YES N/A NO
260 Multiple datapaths YES YES YES NO
261 Tunnel TSO - STT N/A YES NO YES
262 ===================== ============== ============== ========= =======
263
264Q: What DPDK version does each Open vSwitch release work with?
265
266 A: The following table lists the DPDK version against which the given
267 versions of Open vSwitch will successfully build.
268
269 ============ =====
270 Open vSwitch DPDK
271 ============ =====
272 2.2.x 1.6
273 2.3.x 1.6
274 2.4.x 2.0
275 2.5.x 2.2
276 2.6.x 16.07
277 ============ =====
278
dc76953f 279Q: I get an error like this when I configure Open vSwitch::
edde4861
SF
280
281 configure: error: Linux kernel in <dir> is version <x>, but
282 version newer than <y> is not supported (please refer to the
283 FAQ for advice)
284
285What should I do?
286
287 A: You have the following options:
288
289 - Use the Linux kernel module supplied with the kernel that you are using.
290 (See also the following FAQ.)
291
292 - If there is a newer released version of Open vSwitch, consider building
293 that one, because it may support the kernel that you are building
294 against. (To find out, consult the table in the previous FAQ.)
295
296 - The Open vSwitch "master" branch may support the kernel that you are
297 using, so consider building the kernel module from "master".
298
299 All versions of Open vSwitch userspace are compatible with all versions of
300 the Open vSwitch kernel module, so you do not have to use the kernel module
301 from one source along with the userspace programs from the same source.
302
303Q: What features are not available in the Open vSwitch kernel datapath that
304ships as part of the upstream Linux kernel?
305
306 A: The kernel module in upstream Linux does not include support for LISP.
307 Work is in progress to add support for LISP to the upstream Linux version
308 of the Open vSwitch kernel module. For now, if you need this feature, use
309 the kernel module from the Open vSwitch distribution instead of the
310 upstream Linux kernel module.
311
312 Certain features require kernel support to function or to have reasonable
313 performance. If the ovs-vswitchd log file indicates that a feature is not
314 supported, consider upgrading to a newer upstream Linux release or using
315 the kernel module paired with the userspace distribution.
316
317Q: Why do tunnels not work when using a kernel module other than the one
318packaged with Open vSwitch?
319
320 A: Support for tunnels was added to the upstream Linux kernel module after
321 the rest of Open vSwitch. As a result, some kernels may contain support for
322 Open vSwitch but not tunnels. The minimum kernel version that supports each
323 tunnel protocol is:
324
325 ======== ============
326 Protocol Linux Kernel
327 ======== ============
328 GRE 3.11
329 VXLAN 3.12
330 Geneve 3.18
331 LISP not upstream
332 STT not upstream
333 ======== ============
334
335 If you are using a version of the kernel that is older than the one listed
336 above, it is still possible to use that tunnel protocol. However, you must
337 compile and install the kernel module included with the Open vSwitch
338 distribution rather than the one on your machine. If problems persist after
339 doing this, check to make sure that the module that is loaded is the one
340 you expect.
341
342Q: Why are UDP tunnel checksums not computed for VXLAN or Geneve?
343
344 A: Generating outer UDP checksums requires kernel support that was not part
345 of the initial implementation of these protocols. If using the upstream
346 Linux Open vSwitch module, you must use kernel 4.0 or newer. The
347 out-of-tree modules from Open vSwitch release 2.4 and later support UDP
348 checksums.
349
350Q: What features are not available when using the userspace datapath?
351
352 A: Tunnel virtual ports are not supported, as described in the previous
353 answer. It is also not possible to use queue-related actions. On Linux
354 kernels before 2.6.39, maximum-sized VLAN packets may not be transmitted.
355
356Q: Should userspace or kernel be upgraded first to minimize downtime?
357
358 A. In general, the Open vSwitch userspace should be used with the kernel
359 version included in the same release or with the version from upstream
360 Linux. However, when upgrading between two releases of Open vSwitch it is
361 best to migrate userspace first to reduce the possibility of
362 incompatibilities.
363
364Q: What happened to the bridge compatibility feature?
365
366 A: Bridge compatibility was a feature of Open vSwitch 1.9 and earlier.
367 When it was enabled, Open vSwitch imitated the interface of the Linux
368 kernel "bridge" module. This allowed users to drop Open vSwitch into
369 environments designed to use the Linux kernel bridge module without
370 adapting the environment to use Open vSwitch.
371
372 Open vSwitch 1.10 and later do not support bridge compatibility. The
373 feature was dropped because version 1.10 adopted a new internal
374 architecture that made bridge compatibility difficult to maintain. Now
375 that many environments use OVS directly, it would be rarely useful in any
376 case.
377
378 To use bridge compatibility, install OVS 1.9 or earlier, including the
379 accompanying kernel modules (both the main and bridge compatibility
380 modules), following the instructions that come with the release. Be sure
381 to start the ovs-brcompatd daemon.
382
383Terminology
384-----------
385
386Q: I thought Open vSwitch was a virtual Ethernet switch, but the documentation
387keeps talking about bridges. What's a bridge?
388
389 A: In networking, the terms "bridge" and "switch" are synonyms. Open
390 vSwitch implements an Ethernet switch, which means that it is also an
391 Ethernet bridge.
392
393Q: What's a VLAN?
394
395 A: See the "VLAN" section below.
396
397Basic Configuration
398-------------------
399
400Q: How do I configure a port as an access port?
401
402 A. Add ``tag=VLAN`` to your ``ovs-vsctl add-port`` command. For example,
403 the following commands configure br0 with eth0 as a trunk port (the
404 default) and tap0 as an access port for VLAN 9:
405
406 ::
407
408 $ ovs-vsctl add-br br0
409 $ ovs-vsctl add-port br0 eth0
410 $ ovs-vsctl add-port br0 tap0 tag=9
411
412 If you want to configure an already added port as an access port, use
413 ``ovs-vsctl set``, e.g.:
414
415 ::
416
417 $ ovs-vsctl set port tap0 tag=9
418
419Q: How do I configure a port as a SPAN port, that is, enable mirroring of all
420traffic to that port?
421
422 A. The following commands configure br0 with eth0 and tap0 as trunk ports.
423 All traffic coming in or going out on eth0 or tap0 is also mirrored to
424 tap1; any traffic arriving on tap1 is dropped:
425
426 ::
427
428 $ ovs-vsctl add-br br0
429 $ ovs-vsctl add-port br0 eth0
430 $ ovs-vsctl add-port br0 tap0
431 $ ovs-vsctl add-port br0 tap1 \
432 -- --id=@p get port tap1 \
433 -- --id=@m create mirror name=m0 select-all=true output-port=@p \
434 -- set bridge br0 mirrors=@m
435
436 To later disable mirroring, run:
437
438 ::
439
440 $ ovs-vsctl clear bridge br0 mirrors
441
442Q: Does Open vSwitch support configuring a port in promiscuous mode?
443
444 A: Yes. How you configure it depends on what you mean by "promiscuous
445 mode":
446
447 - Conventionally, "promiscuous mode" is a feature of a network interface
448 card. Ordinarily, a NIC passes to the CPU only the packets actually
449 destined to its host machine. It discards the rest to avoid wasting
450 memory and CPU cycles. When promiscuous mode is enabled, however, it
451 passes every packet to the CPU. On an old-style shared-media or
452 hub-based network, this allows the host to spy on all packets on the
453 network. But in the switched networks that are almost everywhere these
454 days, promiscuous mode doesn't have much effect, because few packets not
455 destined to a host are delivered to the host's NIC.
456
457 This form of promiscuous mode is configured in the guest OS of the VMs on
458 your bridge, e.g. with "ifconfig".
459
460 - The VMware vSwitch uses a different definition of "promiscuous mode".
461 When you configure promiscuous mode on a VMware vNIC, the vSwitch sends a
462 copy of every packet received by the vSwitch to that vNIC. That has a
463 much bigger effect than just enabling promiscuous mode in a guest OS.
464 Rather than getting a few stray packets for which the switch does not yet
465 know the correct destination, the vNIC gets every packet. The effect is
466 similar to replacing the vSwitch by a virtual hub.
467
468 This "promiscuous mode" is what switches normally call "port mirroring"
469 or "SPAN". For information on how to configure SPAN, see "How do I
470 configure a port as a SPAN port, that is, enable mirroring of all traffic
471 to that port?"
472
473Q: How do I configure a DPDK port as an access port?
474
475 A: Firstly, you must have a DPDK-enabled version of Open vSwitch.
476
477 If your version is DPDK-enabled it will support the other-config:dpdk-init
478 configuration in the database and will display lines with "EAL:..." during
479 startup when other_config:dpdk-init is set to 'true'.
480
481 Secondly, when adding a DPDK port, unlike a system port, the type for the
dc76953f 482 interface must be specified. For example::
edde4861
SF
483
484 $ ovs-vsctl add-br br0
485 $ ovs-vsctl add-port br0 dpdk0 -- set Interface dpdk0 type=dpdk
486
487 Finally, it is required that DPDK port names begin with ``dpdk``.
488
795752a3 489 Refer to the `DPDK installation guide`_ for more information on enabling
edde4861
SF
490 and using DPDK with Open vSwitch.
491
492Q: How do I configure a VLAN as an RSPAN VLAN, that is, enable mirroring of all
493traffic to that VLAN?
494
495 A: The following commands configure br0 with eth0 as a trunk port and tap0
496 as an access port for VLAN 10. All traffic coming in or going out on tap0,
497 as well as traffic coming in or going out on eth0 in VLAN 10, is also
498 mirrored to VLAN 15 on eth0. The original tag for VLAN 10, in cases where
499 one is present, is dropped as part of mirroring:
500
501 ::
502
503 $ ovs-vsctl add-br br0
504 $ ovs-vsctl add-port br0 eth0
505 $ ovs-vsctl add-port br0 tap0 tag=10
506 $ ovs-vsctl \
507 -- --id=@m create mirror name=m0 select-all=true select-vlan=10 \
508 output-vlan=15 \
509 -- set bridge br0 mirrors=@m
510
511 To later disable mirroring, run:
512
513 ::
514
515 $ ovs-vsctl clear bridge br0 mirrors
516
517 Mirroring to a VLAN can disrupt a network that contains unmanaged switches.
518 See ovs-vswitchd.conf.db(5) for details. Mirroring to a GRE tunnel has
519 fewer caveats than mirroring to a VLAN and should generally be preferred.
520
521Q: Can I mirror more than one input VLAN to an RSPAN VLAN?
522
523 A: Yes, but mirroring to a VLAN strips the original VLAN tag in favor of
524 the specified output-vlan. This loss of information may make the mirrored
525 traffic too hard to interpret.
526
527 To mirror multiple VLANs, use the commands above, but specify a
528 comma-separated list of VLANs as the value for select-vlan. To mirror
529 every VLAN, use the commands above, but omit select-vlan and its value
530 entirely.
531
532 When a packet arrives on a VLAN that is used as a mirror output VLAN, the
533 mirror is disregarded. Instead, in standalone mode, OVS floods the packet
534 across all the ports for which the mirror output VLAN is configured. (If
535 an OpenFlow controller is in use, then it can override this behavior
536 through the flow table.) If OVS is used as an intermediate switch, rather
537 than an edge switch, this ensures that the RSPAN traffic is distributed
538 through the network.
539
540 Mirroring to a VLAN can disrupt a network that contains unmanaged switches.
541 See ovs-vswitchd.conf.db(5) for details. Mirroring to a GRE tunnel has
542 fewer caveats than mirroring to a VLAN and should generally be preferred.
543
544Q: How do I configure mirroring of all traffic to a GRE tunnel?
545
546 A: The following commands configure br0 with eth0 and tap0 as trunk ports.
547 All traffic coming in or going out on eth0 or tap0 is also mirrored to
548 gre0, a GRE tunnel to the remote host 192.168.1.10; any traffic arriving on
dc76953f 549 gre0 is dropped::
edde4861
SF
550
551 $ ovs-vsctl add-br br0
552 $ ovs-vsctl add-port br0 eth0
553 $ ovs-vsctl add-port br0 tap0
554 $ ovs-vsctl add-port br0 gre0 \
555 -- set interface gre0 type=gre options:remote_ip=192.168.1.10 \
556 -- --id=@p get port gre0 \
557 -- --id=@m create mirror name=m0 select-all=true output-port=@p \
558 -- set bridge br0 mirrors=@m
559
dc76953f 560 To later disable mirroring and destroy the GRE tunnel::
edde4861
SF
561
562 $ ovs-vsctl clear bridge br0 mirrors
563 $ ovs-vsctl del-port br0 gre0
564
565Q: Does Open vSwitch support ERSPAN?
566
567 A: No. As an alternative, Open vSwitch supports mirroring to a GRE tunnel
568 (see above).
569
570Q: How do I connect two bridges?
571
572 A: First, why do you want to do this? Two connected bridges are not much
573 different from a single bridge, so you might as well just have a single
574 bridge with all your ports on it.
575
576 If you still want to connect two bridges, you can use a pair of patch
577 ports. The following example creates bridges br0 and br1, adds eth0 and
578 tap0 to br0, adds tap1 to br1, and then connects br0 and br1 with a pair of
579 patch ports.
580
581 ::
582
583 $ ovs-vsctl add-br br0
584 $ ovs-vsctl add-port br0 eth0
585 $ ovs-vsctl add-port br0 tap0
586 $ ovs-vsctl add-br br1
587 $ ovs-vsctl add-port br1 tap1
588 $ ovs-vsctl \
589 -- add-port br0 patch0 \
590 -- set interface patch0 type=patch options:peer=patch1 \
591 -- add-port br1 patch1 \
592 -- set interface patch1 type=patch options:peer=patch0
593
594 Bridges connected with patch ports are much like a single bridge. For
595 instance, if the example above also added eth1 to br1, and both eth0 and
596 eth1 happened to be connected to the same next-hop switch, then you could
597 loop your network just as you would if you added eth0 and eth1 to the same
598 bridge (see the "Configuration Problems" section below for more
599 information).
600
601 If you are using Open vSwitch 1.9 or an earlier version, then you need to
602 be using the kernel module bundled with Open vSwitch rather than the one
603 that is integrated into Linux 3.3 and later, because Open vSwitch 1.9 and
604 earlier versions need kernel support for patch ports. This also means that
605 in Open vSwitch 1.9 and earlier, patch ports will not work with the
606 userspace datapath, only with the kernel module.
607
608Q: How do I configure a bridge without an OpenFlow local port? (Local port in
609the sense of OFPP_LOCAL)
610
611 A: Open vSwitch does not support such a configuration. Bridges always have
612 their local ports.
613
614Implementation Details
615----------------------
616
617Q: I hear OVS has a couple of kinds of flows. Can you tell me about them?
618
619 A: Open vSwitch uses different kinds of flows for different purposes:
620
621 - OpenFlow flows are the most important kind of flow. OpenFlow controllers
622 use these flows to define a switch's policy. OpenFlow flows support
623 wildcards, priorities, and multiple tables.
624
625 When in-band control is in use, Open vSwitch sets up a few "hidden"
626 flows, with priority higher than a controller or the user can configure,
627 that are not visible via OpenFlow. (See the "Controller" section of the
628 FAQ for more information about hidden flows.)
629
630 - The Open vSwitch software switch implementation uses a second kind of
631 flow internally. These flows, called "datapath" or "kernel" flows, do
632 not support priorities and comprise only a single table, which makes them
633 suitable for caching. (Like OpenFlow flows, datapath flows do support
634 wildcarding, in Open vSwitch 1.11 and later.) OpenFlow flows and
635 datapath flows also support different actions and number ports
636 differently.
637
638 Datapath flows are an implementation detail that is subject to change in
639 future versions of Open vSwitch. Even with the current version of Open
640 vSwitch, hardware switch implementations do not necessarily use this
641 architecture.
642
643Users and controllers directly control only the OpenFlow flow table. Open
644vSwitch manages the datapath flow table itself, so users should not normally be
645concerned with it.
646
647Q: Why are there so many different ways to dump flows?
648
649 A: Open vSwitch has two kinds of flows (see the previous question), so it
650 has commands with different purposes for dumping each kind of flow:
651
652 - ``ovs-ofctl dump-flows <br>`` dumps OpenFlow flows, excluding hidden
653 flows. This is the most commonly useful form of flow dump. (Unlike the
654 other commands, this should work with any OpenFlow switch, not just Open
655 vSwitch.)
656
657 - ``ovs-appctl bridge/dump-flows <br>`` dumps OpenFlow flows, including
658 hidden flows. This is occasionally useful for troubleshooting suspected
659 issues with in-band control.
660
661 - ``ovs-dpctl dump-flows [dp]`` dumps the datapath flow table entries for a
662 Linux kernel-based datapath. In Open vSwitch 1.10 and later,
663 ovs-vswitchd merges multiple switches into a single datapath, so it will
664 show all the flows on all your kernel-based switches. This command can
665 occasionally be useful for debugging.
666
667 - ``ovs-appctl dpif/dump-flows <br>``, new in Open vSwitch 1.10, dumps
668 datapath flows for only the specified bridge, regardless of the type.
669
670Q: How does multicast snooping works with VLANs?
671
672 A: Open vSwitch maintains snooping tables for each VLAN.
673
674Q: Can OVS populate the kernel flow table in advance instead of in reaction to
675packets?
676
677 A: No. There are several reasons:
678
679 - Kernel flows are not as sophisticated as OpenFlow flows, which means that
680 some OpenFlow policies could require a large number of kernel flows. The
681 "conjunctive match" feature is an extreme example: the number of kernel
682 flows it requires is the product of the number of flows in each
683 dimension.
684
685 - With multiple OpenFlow flow tables and simple sets of actions, the number
686 of kernel flows required can be as large as the product of the number of
687 flows in each dimension. With more sophisticated actions, the number of
688 kernel flows could be even larger.
689
690 - Open vSwitch is designed so that any version of OVS userspace
691 interoperates with any version of the OVS kernel module. This forward
692 and backward compatibility requires that userspace observe how the kernel
693 module parses received packets. This is only possible in a
694 straightforward way when userspace adds kernel flows in reaction to
695 received packets.
696
697 For more relevant information on the architecture of Open vSwitch, please
698 read "The Design and Implementation of Open vSwitch", published in USENIX
699 NSDI 2015.
700
701Performance
702-----------
703
704Q: I just upgraded and I see a performance drop. Why?
705
706 A: The OVS kernel datapath may have been updated to a newer version than
707 the OVS userspace components. Sometimes new versions of OVS kernel module
708 add functionality that is backwards compatible with older userspace
709 components but may cause a drop in performance with them. Especially, if a
710 kernel module from OVS 2.1 or newer is paired with OVS userspace 1.10 or
711 older, there will be a performance drop for TCP traffic.
712
713 Updating the OVS userspace components to the latest released version should
714 fix the performance degradation.
715
716To get the best possible performance and functionality, it is recommended to
717pair the same versions of the kernel module and OVS userspace.
718
719Configuration Problems
720----------------------
721
722Q: I created a bridge and added my Ethernet port to it, using commands like
dc76953f 723these::
edde4861
SF
724
725 ovs-vsctl add-br br0
726 ovs-vsctl add-port br0 eth0
727
728and as soon as I ran the "add-port" command I lost all connectivity through
729eth0. Help!
730
731 A: A physical Ethernet device that is part of an Open vSwitch bridge should
732 not have an IP address. If one does, then that IP address will not be
733 fully functional.
734
735 You can restore functionality by moving the IP address to an Open vSwitch
736 "internal" device, such as the network device named after the bridge
737 itself. For example, assuming that eth0's IP address is 192.168.128.5, you
dc76953f 738 could run the commands below to fix up the situation::
edde4861
SF
739
740 $ ifconfig eth0 0.0.0.0
741 $ ifconfig br0 192.168.128.5
742
743 (If your only connection to the machine running OVS is through the IP
744 address in question, then you would want to run all of these commands on a
745 single command line, or put them into a script.) If there were any
746 additional routes assigned to eth0, then you would also want to use
747 commands to adjust these routes to go through br0.
748
749 If you use DHCP to obtain an IP address, then you should kill the DHCP
750 client that was listening on the physical Ethernet interface (e.g. eth0)
751 and start one listening on the internal interface (e.g. br0). You might
752 still need to manually clear the IP address from the physical interface
753 (e.g. with "ifconfig eth0 0.0.0.0").
754
755 There is no compelling reason why Open vSwitch must work this way.
756 However, this is the way that the Linux kernel bridge module has always
757 worked, so it's a model that those accustomed to Linux bridging are already
758 used to. Also, the model that most people expect is not implementable
759 without kernel changes on all the versions of Linux that Open vSwitch
760 supports.
761
762 By the way, this issue is not specific to physical Ethernet devices. It
763 applies to all network devices except Open vSwitch "internal" devices.
764
765Q: I created a bridge and added a couple of Ethernet ports to it, using
dc76953f 766commands like these::
edde4861
SF
767
768 $ ovs-vsctl add-br br0
769 $ ovs-vsctl add-port br0 eth0
770 $ ovs-vsctl add-port br0 eth1
771
772and now my network seems to have melted: connectivity is unreliable (even
773connectivity that doesn't go through Open vSwitch), all the LEDs on my physical
774switches are blinking, wireshark shows duplicated packets, and CPU usage is
775very high.
776
777 A: More than likely, you've looped your network. Probably, eth0 and eth1
778 are connected to the same physical Ethernet switch. This yields a scenario
779 where OVS receives a broadcast packet on eth0 and sends it out on eth1,
780 then the physical switch connected to eth1 sends the packet back on eth0,
781 and so on forever. More complicated scenarios, involving a loop through
782 multiple switches, are possible too.
783
784 The solution depends on what you are trying to do:
785
786 - If you added eth0 and eth1 to get higher bandwidth or higher reliability
787 between OVS and your physical Ethernet switch, use a bond. The following
dc76953f 788 commands create br0 and then add eth0 and eth1 as a bond::
edde4861
SF
789
790 $ ovs-vsctl add-br br0
791 $ ovs-vsctl add-bond br0 bond0 eth0 eth1
792
793 Bonds have tons of configuration options. Please read the documentation
794 on the Port table in ovs-vswitchd.conf.db(5) for all the details.
795
796 Configuration for DPDK-enabled interfaces is slightly less
795752a3
SF
797 straightforward. Refer to the `DPDK installation guide`_ for more
798 information.
edde4861
SF
799
800 - Perhaps you don't actually need eth0 and eth1 to be on the same bridge.
801 For example, if you simply want to be able to connect each of them to
802 virtual machines, then you can put each of them on a bridge of its own:
803
804 $ ovs-vsctl add-br br0
805 $ ovs-vsctl add-port br0 eth0
806
807 $ ovs-vsctl add-br br1
808 $ ovs-vsctl add-port br1 eth1
809
810 and then connect VMs to br0 and br1. (A potential disadvantage is that
811 traffic cannot directly pass between br0 and br1. Instead, it will go
812 out eth0 and come back in eth1, or vice versa.)
813
814 - If you have a redundant or complex network topology and you want to
815 prevent loops, turn on spanning tree protocol (STP). The following
816 commands create br0, enable STP, and add eth0 and eth1 to the bridge.
817 The order is important because you don't want have to have a loop in your
dc76953f 818 network even transiently::
edde4861
SF
819
820 $ ovs-vsctl add-br br0
821 $ ovs-vsctl set bridge br0 stp_enable=true
822 $ ovs-vsctl add-port br0 eth0
823 $ ovs-vsctl add-port br0 eth1
824
825 The Open vSwitch implementation of STP is not well tested. Report any
826 bugs you observe, but if you'd rather avoid acting as a beta tester then
827 another option might be your best shot.
828
829Q: I can't seem to use Open vSwitch in a wireless network.
830
831 A: Wireless base stations generally only allow packets with the source MAC
832 address of NIC that completed the initial handshake. Therefore, without
833 MAC rewriting, only a single device can communicate over a single wireless
834 link.
835
836 This isn't specific to Open vSwitch, it's enforced by the access point, so
837 the same problems will show up with the Linux bridge or any other way to do
838 bridging.
839
840Q: I can't seem to add my PPP interface to an Open vSwitch bridge.
841
842 A: PPP most commonly carries IP packets, but Open vSwitch works only with
843 Ethernet frames. The correct way to interface PPP to an Ethernet network
844 is usually to use routing instead of switching.
845
846Q: Is there any documentation on the database tables and fields?
847
848 A: Yes. ovs-vswitchd.conf.db(5) is a comprehensive reference.
849
850Q: When I run ovs-dpctl I no longer see the bridges I created. Instead, I only
851see a datapath called "ovs-system". How can I see datapath information about a
852particular bridge?
853
854 A: In version 1.9.0, OVS switched to using a single datapath that is shared
855 by all bridges of that type. The ``ovs-appctl dpif/*`` commands provide
856 similar functionality that is scoped by the bridge.
857
858Q: I created a GRE port using ovs-vsctl so why can't I send traffic or see the
859port in the datapath?
860
861 A: On Linux kernels before 3.11, the OVS GRE module and Linux GRE module
862 cannot be loaded at the same time. It is likely that on your system the
863 Linux GRE module is already loaded and blocking OVS (to confirm, check
864 dmesg for errors regarding GRE registration). To fix this, unload all GRE
865 modules that appear in lsmod as well as the OVS kernel module. You can then
795752a3
SF
866 reload the OVS module following the directions in the `general installation
867 guide`_, which will ensure that dependencies are satisfied.
edde4861
SF
868
869Q: Open vSwitch does not seem to obey my packet filter rules.
870
871 A: It depends on mechanisms and configurations you want to use.
872
873 You cannot usefully use typical packet filters, like iptables, on physical
874 Ethernet ports that you add to an Open vSwitch bridge. This is because
875 Open vSwitch captures packets from the interface at a layer lower below
876 where typical packet-filter implementations install their hooks. (This
877 actually applies to any interface of type "system" that you might add to an
878 Open vSwitch bridge.)
879
880 You can usefully use typical packet filters on Open vSwitch internal ports
881 as they are mostly ordinary interfaces from the point of view of packet
882 filters.
883
884 For example, suppose you create a bridge br0 and add Ethernet port eth0 to
885 it. Then you can usefully add iptables rules to affect the internal
886 interface br0, but not the physical interface eth0. (br0 is also where you
887 would add an IP address, as discussed elsewhere in the FAQ.)
888
889 For simple filtering rules, it might be possible to achieve similar results
0b1545be
HZ
890 by installing appropriate OpenFlow flows instead. The OVS conntrack
891 feature (see the "ct" action in ovs-ofctl(8)) can implement a stateful
892 firewall.
edde4861
SF
893
894 If the use of a particular packet filter setup is essential, Open vSwitch
895 might not be the best choice for you. On Linux, you might want to consider
896 using the Linux Bridge. (This is the only choice if you want to use
897 ebtables rules.) On NetBSD, you might want to consider using the bridge(4)
898 with BRIDGE_IPF option.
899
900Q: It seems that Open vSwitch does nothing when I removed a port and then
901immediately put it back. For example, consider that p1 is a port of
dc76953f 902``type=internal``::
edde4861
SF
903
904 $ ovs-vsctl del-port br0 p1 -- \
905 add-port br0 p1 -- \
906 set interface p1 type=internal
907
908 A: It's an expected behaviour.
909
910 If del-port and add-port happen in a single OVSDB transaction as your
911 example, Open vSwitch always "skips" the intermediate steps. Even if they
912 are done in multiple transactions, it's still allowed for Open vSwitch to
913 skip the intermediate steps and just implement the overall effect. In both
914 cases, your example would be turned into a no-op.
915
916 If you want to make Open vSwitch actually destroy and then re-create the
917 port for some side effects like resetting kernel setting for the
918 corresponding interface, you need to separate operations into multiple
919 OVSDB transactions and ensure that at least the first one does not have
920 ``--no-wait``. In the following example, the first ovs-vsctl will block
dc76953f 921 until Open vSwitch reloads the new configuration and removes the port::
edde4861
SF
922
923 $ ovs-vsctl del-port br0 p1
924 $ ovs-vsctl add-port br0 p1 -- \
925 set interface p1 type=internal
926
927Q: I want to add thousands of ports to an Open vSwitch bridge, but it takes too
928long (minutes or hours) to do it with ovs-vsctl. How can I do it faster?
929
930 A: If you add them one at a time with ovs-vsctl, it can take a long time to
931 add thousands of ports to an Open vSwitch bridge. This is because every
932 invocation of ovs-vsctl first reads the current configuration from OVSDB.
933 As the number of ports grows, this starts to take an appreciable amount of
934 time, and when it is repeated thousands of times the total time becomes
935 significant.
936
937 The solution is to add the ports in one invocation of ovs-vsctl (or a small
dc76953f 938 number of them). For example, using bash::
edde4861
SF
939
940 $ ovs-vsctl add-br br0
941 $ cmds=; for i in {1..5000}; do cmds+=" -- add-port br0 p$i"; done
942 $ ovs-vsctl $cmds
943
944 takes seconds, not minutes or hours, in the OVS sandbox environment.
945
946Q: I created a bridge named br0. My bridge shows up in "ovs-vsctl show", but
947"ovs-ofctl show br0" just prints "br0 is not a bridge or a socket".
948
949 A: Open vSwitch wasn't able to create the bridge. Check the ovs-vswitchd
950 log for details (Debian and Red Hat packaging for Open vSwitch put it in
951 /var/log/openvswitch/ovs-vswitchd.log).
952
953 In general, the Open vSwitch database reflects the desired configuration
954 state. ovs-vswitchd monitors the database and, when it changes,
955 reconfigures the system to reflect the new desired state. This normally
956 happens very quickly. Thus, a discrepancy between the database and the
957 actual state indicates that ovs-vswitchd could not implement the
958 configuration, and so one should check the log to find out why. (Another
959 possible cause is that ovs-vswitchd is not running. This will make
960 ovs-vsctl commands hang, if they change the configuration, unless one
961 specifies ``--no-wait``.)
962
963Q: I have a bridge br0. I added a new port vif1.0, and it shows up in
964"ovs-vsctl show", but "ovs-vsctl list port" says that it has OpenFlow port
965("ofport") -1, and "ovs-ofctl show br0" doesn't show vif1.0 at all.
966
967 A: Open vSwitch wasn't able to create the port. Check the ovs-vswitchd log
968 for details (Debian and Red Hat packaging for Open vSwitch put it in
969 /var/log/openvswitch/ovs-vswitchd.log). Please see the previous question
970 for more information.
971
972 You may want to upgrade to Open vSwitch 2.3 (or later), in which ovs-vsctl
973 will immediately report when there is an issue creating a port.
974
975Q: I created a tap device tap0, configured an IP address on it, and added it to
dc76953f 976a bridge, like this::
edde4861
SF
977
978 $ tunctl -t tap0
979 $ ifconfig tap0 192.168.0.123
980 $ ovs-vsctl add-br br0
981 $ ovs-vsctl add-port br0 tap0
982
983I expected that I could then use this IP address to contact other hosts on the
984network, but it doesn't work. Why not?
985
986 A: The short answer is that this is a misuse of a "tap" device. Use an
987 "internal" device implemented by Open vSwitch, which works differently and
988 is designed for this use. To solve this problem with an internal device,
dc76953f 989 instead run::
edde4861
SF
990
991 $ ovs-vsctl add-br br0
992 $ ovs-vsctl add-port br0 int0 -- set Interface int0 type=internal
993 $ ifconfig int0 192.168.0.123
994
995 Even more simply, you can take advantage of the internal port that every
dc76953f 996 bridge has under the name of the bridge::
edde4861
SF
997
998 $ ovs-vsctl add-br br0
999 $ ifconfig br0 192.168.0.123
1000
1001 In more detail, a "tap" device is an interface between the Linux (or BSD)
1002 network stack and a user program that opens it as a socket. When the "tap"
1003 device transmits a packet, it appears in the socket opened by the userspace
1004 program. Conversely, when the userspace program writes to the "tap"
1005 socket, the kernel TCP/IP stack processes the packet as if it had been
1006 received by the "tap" device.
1007
1008 Consider the configuration above. Given this configuration, if you "ping"
1009 an IP address in the 192.168.0.x subnet, the Linux kernel routing stack
1010 will transmit an ARP on the tap0 device. Open vSwitch userspace treats
1011 "tap" devices just like any other network device; that is, it doesn't open
1012 them as "tap" sockets. That means that the ARP packet will simply get
1013 dropped.
1014
1015 You might wonder why the Open vSwitch kernel module doesn't intercept the
1016 ARP packet and bridge it. After all, Open vSwitch intercepts packets on
1017 other devices. The answer is that Open vSwitch only intercepts *received*
1018 packets, but this is a packet being transmitted. The same thing happens
1019 for all other types of network devices, except for Open vSwitch "internal"
1020 ports. If you, for example, add a physical Ethernet port to an OVS bridge,
1021 configure an IP address on a physical Ethernet port, and then issue a
1022 "ping" to an address in that subnet, the same thing happens: an ARP gets
1023 transmitted on the physical Ethernet port and Open vSwitch never sees it.
1024 (You should not do that, as documented at the beginning of this section.)
1025
1026 It can make sense to add a "tap" device to an Open vSwitch bridge, if some
1027 userspace program (other than Open vSwitch) has opened the tap socket.
1028 This is the case, for example, if the "tap" device was created by KVM (or
1029 QEMU) to simulate a virtual NIC. In such a case, when OVS bridges a packet
1030 to the "tap" device, the kernel forwards that packet to KVM in userspace,
1031 which passes it along to the VM, and in the other direction, when the VM
1032 sends a packet, KVM writes it to the "tap" socket, which causes OVS to
1033 receive it and bridge it to the other OVS ports. Please note that in such
1034 a case no IP address is configured on the "tap" device (there is normally
1035 an IP address configured in the virtual NIC inside the VM, but this is not
1036 visible to the host Linux kernel or to Open vSwitch).
1037
1038 There is one special case in which Open vSwitch does directly read and
1039 write "tap" sockets. This is an implementation detail of the Open vSwitch
1040 userspace switch, which implements its "internal" ports as Linux (or BSD)
1041 "tap" sockets. In such a userspace switch, OVS receives packets sent on
1042 the "tap" device used to implement an "internal" port by reading the
1043 associated "tap" socket, and bridges them to the rest of the switch. In
1044 the other direction, OVS transmits packets bridged to the "internal" port
1045 by writing them to the "tap" socket, causing them to be processed by the
1046 kernel TCP/IP stack as if they had been received on the "tap" device.
1047 Users should not need to be concerned with this implementation detail.
1048
1049 Open vSwitch has a network device type called "tap". This is intended only
1050 for implementing "internal" ports in the OVS userspace switch and should
1051 not be used otherwise. In particular, users should not configure KVM "tap"
1052 devices as type "tap" (use type "system", the default, instead).
1053
1054Q: I observe packet loss at the beginning of RFC2544 tests on a server running
1055few hundred container apps bridged to OVS with traffic generated by HW traffic
1056generator. How can I fix this?
1057
1058 A: This is expected behavior on virtual switches. RFC2544 tests were
1059 designed for hardware switches, which don't have caches on the fastpath
1060 that need to be heated. Traffic generators in order to prime the switch
1061 use learning phase to heat the caches before sending the actual traffic in
1062 test phase. In case of OVS the cache is flushed quickly and to accommodate
1063 the traffic generator's delay between learning and test phase, the max-idle
1064 timeout settings should be changed to 50000 ms.::
1065
1066 $ ovs-vsctl --no-wait set Open_vSwitch . other_config:max-idle=50000
1067
1068Q: How can I configure the bridge internal interface MTU? Why does Open vSwitch
1069keep changing internal ports MTU?
1070
1071 A: By default Open vSwitch overrides the internal interfaces (e.g. br0)
1072 MTU. If you have just an internal interface (e.g. br0) and a physical
1073 interface (e.g. eth0), then every change in MTU to eth0 will be reflected
1074 to br0. Any manual MTU configuration using `ip` or `ifconfig` on internal
1075 interfaces is going to be overridden by Open vSwitch to match the current
1076 bridge minimum.
1077
1078 Sometimes this behavior is not desirable, for example with tunnels. The
1079 MTU of an internal interface can be explicitly set using the following
dc76953f 1080 command::
edde4861
SF
1081
1082 $ ovs-vsctl set int br0 mtu_request=1450
1083
1084 After this, Open vSwitch will configure br0 MTU to 1450. Since this
1085 setting is in the database it will be persistent (compared to what happens
1086 with `ip` or `ifconfig`).
1087
1088 The MTU configuration can be removed to restore the default behavior
dc76953f 1089 with::
edde4861
SF
1090
1091 $ ovs-vsctl set int br0 mtu_request=[]
1092
1093 The mtu_request column can be used to configure MTU even for physical
1094 interfaces (e.g. eth0).
1095
1096QOS
1097---
1098
1099Q: Does OVS support Quality of Service (QoS)?
1100
1101 A: Yes. For traffic that egresses from a switch, OVS supports traffic
1102 shaping; for traffic that ingresses into a switch, OVS support policing.
1103 Policing is a simple form of quality-of-service that simply drops packets
1104 received in excess of the configured rate. Due to its simplicity, policing
1105 is usually less accurate and less effective than egress traffic shaping,
1106 which queues packets.
1107
1108 Keep in mind that ingress and egress are from the perspective of the
1109 switch. That means that egress shaping limits the rate at which traffic is
1110 allowed to transmit from a physical interface, but not the rate at which
1111 traffic will be received on a virtual machine's VIF. For ingress policing,
1112 the behavior is the opposite.
1113
1114Q: How do I configure egress traffic shaping?
1115
1116 A: Suppose that you want to set up bridge br0 connected to physical
1117 Ethernet port eth0 (a 1 Gbps device) and virtual machine interfaces vif1.0
1118 and vif2.0, and that you want to limit traffic from vif1.0 to eth0 to 10
1119 Mbps and from vif2.0 to eth0 to 20 Mbps. Then, you could configure the
dc76953f 1120 bridge this way::
edde4861
SF
1121
1122 $ ovs-vsctl -- \
1123 add-br br0 -- \
1124 add-port br0 eth0 -- \
1125 add-port br0 vif1.0 -- set interface vif1.0 ofport_request=5 -- \
1126 add-port br0 vif2.0 -- set interface vif2.0 ofport_request=6 -- \
1127 set port eth0 qos=@newqos -- \
1128 --id=@newqos create qos type=linux-htb \
1129 other-config:max-rate=1000000000 \
1130 queues:123=@vif10queue \
1131 queues:234=@vif20queue -- \
1132 --id=@vif10queue create queue other-config:max-rate=10000000 -- \
1133 --id=@vif20queue create queue other-config:max-rate=20000000
1134
1135 At this point, bridge br0 is configured with the ports and eth0 is
1136 configured with the queues that you need for QoS, but nothing is actually
1137 directing packets from vif1.0 or vif2.0 to the queues that we have set up
1138 for them. That means that all of the packets to eth0 are going to the
1139 "default queue", which is not what we want.
1140
1141 We use OpenFlow to direct packets from vif1.0 and vif2.0 to the queues
dc76953f 1142 reserved for them::
edde4861
SF
1143
1144 $ ovs-ofctl add-flow br0 in_port=5,actions=set_queue:123,normal
1145 $ ovs-ofctl add-flow br0 in_port=6,actions=set_queue:234,normal
1146
1147 Each of the above flows matches on the input port, sets up the appropriate
1148 queue (123 for vif1.0, 234 for vif2.0), and then executes the "normal"
1149 action, which performs the same switching that Open vSwitch would have done
1150 without any OpenFlow flows being present. (We know that vif1.0 and vif2.0
1151 have OpenFlow port numbers 5 and 6, respectively, because we set their
1152 ofport_request columns above. If we had not done that, then we would have
1153 needed to find out their port numbers before setting up these flows.)
1154
1155 Now traffic going from vif1.0 or vif2.0 to eth0 should be rate-limited.
1156
dc76953f 1157 By the way, if you delete the bridge created by the above commands, with::
edde4861
SF
1158
1159 $ ovs-vsctl del-br br0
1160
1161 then that will leave one unreferenced QoS record and two unreferenced Queue
1162 records in the Open vSwich database. One way to clear them out, assuming
dc76953f 1163 you don't have other QoS or Queue records that you want to keep, is::
edde4861
SF
1164
1165 $ ovs-vsctl -- --all destroy QoS -- --all destroy Queue
1166
1167 If you do want to keep some QoS or Queue records, or the Open vSwitch you
1168 are using is older than version 1.8 (which added the ``--all`` option),
1169 then you will have to destroy QoS and Queue records individually.
1170
1171Q: How do I configure ingress policing?
1172
1173 A: A policing policy can be configured on an interface to drop packets that
1174 arrive at a higher rate than the configured value. For example, the
1175 following commands will rate-limit traffic that vif1.0 may generate to
1176 10Mbps:
1177
1178 $ ovs-vsctl set interface vif1.0 ingress_policing_rate=10000
1179 $ ovs-vsctl set interface vif1.0 ingress_policing_burst=8000
1180
1181 Traffic policing can interact poorly with some network protocols and can
1182 have surprising results. The "Ingress Policing" section of
1183 ovs-vswitchd.conf.db(5) discusses the issues in greater detail.
1184
1185Q: I configured Quality of Service (QoS) in my OpenFlow network by adding
1186records to the QoS and Queue table, but the results aren't what I expect.
1187
1188 A: Did you install OpenFlow flows that use your queues? This is the
1189 primary way to tell Open vSwitch which queues you want to use. If you
1190 don't do this, then the default queue will be used, which will probably not
1191 have the effect you want.
1192
1193 Refer to the previous question for an example.
1194
1195Q: I'd like to take advantage of some QoS feature that Open vSwitch doesn't yet
1196support. How do I do that?
1197
1198 A: Open vSwitch does not implement QoS itself. Instead, it can configure
1199 some, but not all, of the QoS features built into the Linux kernel. If you
1200 need some QoS feature that OVS cannot configure itself, then the first step
1201 is to figure out whether Linux QoS supports that feature. If it does, then
1202 you can submit a patch to support Open vSwitch configuration for that
1203 feature, or you can use "tc" directly to configure the feature in Linux.
1204 (If Linux QoS doesn't support the feature you want, then first you have to
1205 add that support to Linux.)
1206
1207Q: I configured QoS, correctly, but my measurements show that it isn't working
1208as well as I expect.
1209
1210 A: With the Linux kernel, the Open vSwitch implementation of QoS has two
1211 aspects:
1212
1213 - Open vSwitch configures a subset of Linux kernel QoS features, according
1214 to what is in OVSDB. It is possible that this code has bugs. If you
1215 believe that this is so, then you can configure the Linux traffic control
1216 (QoS) stack directly with the "tc" program. If you get better results
1217 that way, you can send a detailed bug report to bugs@openvswitch.org.
1218
1219 It is certain that Open vSwitch cannot configure every Linux kernel QoS
1220 feature. If you need some feature that OVS cannot configure, then you
1221 can also use "tc" directly (or add that feature to OVS).
1222
1223 - The Open vSwitch implementation of OpenFlow allows flows to be directed
1224 to particular queues. This is pretty simple and unlikely to have serious
1225 bugs at this point.
1226
1227 However, most problems with QoS on Linux are not bugs in Open vSwitch at
1228 all. They tend to be either configuration errors (please see the earlier
1229 questions in this section) or issues with the traffic control (QoS) stack
1230 in Linux. The Open vSwitch developers are not experts on Linux traffic
1231 control. We suggest that, if you believe you are encountering a problem
1232 with Linux traffic control, that you consult the tc manpages (e.g. tc(8),
1233 tc-htb(8), tc-hfsc(8)), web resources (e.g. http://lartc.org/), or mailing
1234 lists (e.g. http://vger.kernel.org/vger-lists.html#netdev).
1235
1236Q: Does Open vSwitch support OpenFlow meters?
1237
1238 A: Since version 2.0, Open vSwitch has OpenFlow protocol support for
1239 OpenFlow meters. There is no implementation of meters in the Open vSwitch
1240 software switch (neither the kernel-based nor userspace switches).
1241
1242VLANs
1243-----
1244
1245Q: What's a VLAN?
1246
1247 A: At the simplest level, a VLAN (short for "virtual LAN") is a way to
1248 partition a single switch into multiple switches. Suppose, for example,
1249 that you have two groups of machines, group A and group B. You want the
1250 machines in group A to be able to talk to each other, and you want the
1251 machine in group B to be able to talk to each other, but you don't want the
1252 machines in group A to be able to talk to the machines in group B. You can
1253 do this with two switches, by plugging the machines in group A into one
1254 switch and the machines in group B into the other switch.
1255
1256 If you only have one switch, then you can use VLANs to do the same thing,
1257 by configuring the ports for machines in group A as VLAN "access ports" for
1258 one VLAN and the ports for group B as "access ports" for a different VLAN.
1259 The switch will only forward packets between ports that are assigned to the
1260 same VLAN, so this effectively subdivides your single switch into two
1261 independent switches, one for each group of machines.
1262
1263 So far we haven't said anything about VLAN headers. With access ports,
1264 like we've described so far, no VLAN header is present in the Ethernet
1265 frame. This means that the machines (or switches) connected to access
1266 ports need not be aware that VLANs are involved, just like in the case
1267 where we use two different physical switches.
1268
1269 Now suppose that you have a whole bunch of switches in your network,
1270 instead of just one, and that some machines in group A are connected
1271 directly to both switches 1 and 2. To allow these machines to talk to each
1272 other, you could add an access port for group A's VLAN to switch 1 and
1273 another to switch 2, and then connect an Ethernet cable between those
1274 ports. That works fine, but it doesn't scale well as the number of
1275 switches and the number of VLANs increases, because you use up a lot of
1276 valuable switch ports just connecting together your VLANs.
1277
1278 This is where VLAN headers come in. Instead of using one cable and two
1279 ports per VLAN to connect a pair of switches, we configure a port on each
1280 switch as a VLAN "trunk port". Packets sent and received on a trunk port
1281 carry a VLAN header that says what VLAN the packet belongs to, so that only
1282 two ports total are required to connect the switches, regardless of the
1283 number of VLANs in use. Normally, only switches (either physical or
1284 virtual) are connected to a trunk port, not individual hosts, because
1285 individual hosts don't expect to see a VLAN header in the traffic that they
1286 receive.
1287
1288 None of the above discussion says anything about particular VLAN numbers.
1289 This is because VLAN numbers are completely arbitrary. One must only
1290 ensure that a given VLAN is numbered consistently throughout a network and
1291 that different VLANs are given different numbers. (That said, VLAN 0 is
1292 usually synonymous with a packet that has no VLAN header, and VLAN 4095 is
1293 reserved.)
1294
1295Q: VLANs don't work.
1296
1297 A: Many drivers in Linux kernels before version 3.3 had VLAN-related bugs.
1298 If you are having problems with VLANs that you suspect to be driver
1299 related, then you have several options:
1300
1301 - Upgrade to Linux 3.3 or later.
1302
1303 - Build and install a fixed version of the particular driver that is
1304 causing trouble, if one is available.
1305
1306 - Use a NIC whose driver does not have VLAN problems.
1307
1308 - Use "VLAN splinters", a feature in Open vSwitch 1.4 upto 2.5 that works
1309 around bugs in kernel drivers. To enable VLAN splinters on interface
dc76953f 1310 eth0, use the command::
edde4861
SF
1311
1312 $ ovs-vsctl set interface eth0 other-config:enable-vlan-splinters=true
1313
1314 For VLAN splinters to be effective, Open vSwitch must know which VLANs
1315 are in use. See the "VLAN splinters" section in the Interface table in
1316 ovs-vswitchd.conf.db(5) for details on how Open vSwitch infers in-use
1317 VLANs.
1318
1319 VLAN splinters increase memory use and reduce performance, so use them
1320 only if needed.
1321
1322 - Apply the "vlan workaround" patch from the XenServer kernel patch queue,
1323 build Open vSwitch against this patched kernel, and then use
1324 ovs-vlan-bug-workaround(8) to enable the VLAN workaround for each
1325 interface whose driver is buggy.
1326
1327 (This is a nontrivial exercise, so this option is included only for
1328 completeness.)
1329
1330 It is not always easy to tell whether a Linux kernel driver has buggy VLAN
1331 support. The ovs-vlan-test(8) and ovs-test(8) utilities can help you test.
1332 See their manpages for details. Of the two utilities, ovs-test(8) is newer
1333 and more thorough, but ovs-vlan-test(8) may be easier to use.
1334
1335Q: VLANs still don't work. I've tested the driver so I know that it's OK.
1336
1337 A: Do you have VLANs enabled on the physical switch that OVS is attached
1338 to? Make sure that the port is configured to trunk the VLAN or VLANs that
1339 you are using with OVS.
1340
1341Q: Outgoing VLAN-tagged traffic goes through OVS to my physical switch
1342and to its destination host, but OVS seems to drop incoming return
1343traffic.
1344
1345 A: It's possible that you have the VLAN configured on your physical switch
1346 as the "native" VLAN. In this mode, the switch treats incoming packets
1347 either tagged with the native VLAN or untagged as part of the native VLAN.
1348 It may also send outgoing packets in the native VLAN without a VLAN tag.
1349
1350 If this is the case, you have two choices:
1351
1352 - Change the physical switch port configuration to tag packets it forwards
1353 to OVS with the native VLAN instead of forwarding them untagged.
1354
1355 - Change the OVS configuration for the physical port to a native VLAN mode.
1356 For example, the following sets up a bridge with port eth0 in
dc76953f 1357 "native-tagged" mode in VLAN 9::
edde4861
SF
1358
1359 $ ovs-vsctl add-br br0 $ ovs-vsctl add-port br0 eth0 tag=9
1360 vlan_mode=native-tagged
1361
1362 In this situation, "native-untagged" mode will probably work equally
1363 well. Refer to the documentation for the Port table in
1364 ovs-vswitchd.conf.db(5) for more information.
1365
dc76953f 1366Q: I added a pair of VMs on different VLANs, like this::
edde4861
SF
1367
1368 $ ovs-vsctl add-br br0
1369 $ ovs-vsctl add-port br0 eth0
1370 $ ovs-vsctl add-port br0 tap0 tag=9
1371 $ ovs-vsctl add-port br0 tap1 tag=10
1372
1373but the VMs can't access each other, the external network, or the Internet.
1374
1375 A: It is to be expected that the VMs can't access each other. VLANs are a
1376 means to partition a network. When you configured tap0 and tap1 as access
1377 ports for different VLANs, you indicated that they should be isolated from
1378 each other.
1379
1380 As for the external network and the Internet, it seems likely that the
1381 machines you are trying to access are not on VLAN 9 (or 10) and that the
1382 Internet is not available on VLAN 9 (or 10).
1383
dc76953f 1384Q: I added a pair of VMs on the same VLAN, like this::
edde4861
SF
1385
1386 $ ovs-vsctl add-br br0
1387 $ ovs-vsctl add-port br0 eth0
1388 $ ovs-vsctl add-port br0 tap0 tag=9
1389 $ ovs-vsctl add-port br0 tap1 tag=9
1390
1391The VMs can access each other, but not the external network or the Internet.
1392
1393 A: It seems likely that the machines you are trying to access in the
1394 external network are not on VLAN 9 and that the Internet is not available
1395 on VLAN 9. Also, ensure VLAN 9 is set up as an allowed trunk VLAN on the
1396 upstream switch port to which eth0 is connected.
1397
1398Q: Can I configure an IP address on a VLAN?
1399
1400 A: Yes. Use an "internal port" configured as an access port. For example,
1401 the following configures IP address 192.168.0.7 on VLAN 9. That is, OVS
1402 will forward packets from eth0 to 192.168.0.7 only if they have an 802.1Q
1403 header with VLAN 9. Conversely, traffic forwarded from 192.168.0.7 to eth0
dc76953f 1404 will be tagged with an 802.1Q header with VLAN 9::
edde4861
SF
1405
1406 $ ovs-vsctl add-br br0
1407 $ ovs-vsctl add-port br0 eth0
1408 $ ovs-vsctl add-port br0 vlan9 tag=9 \
1409 -- set interface vlan9 type=internal
1410 $ ifconfig vlan9 192.168.0.7
1411
1412 See also the following question.
1413
dc76953f 1414Q: I configured one IP address on VLAN 0 and another on VLAN 9, like this::
edde4861
SF
1415
1416 $ ovs-vsctl add-br br0
1417 $ ovs-vsctl add-port br0 eth0
1418 $ ifconfig br0 192.168.0.5
1419 $ ovs-vsctl add-port br0 vlan9 tag=9 -- set interface vlan9 type=internal
1420 $ ifconfig vlan9 192.168.0.9
1421
1422but other hosts that are only on VLAN 0 can reach the IP address configured on
1423VLAN 9. What's going on?
1424
1425 A: `RFC 1122 section 3.3.4.2 "Multihoming Requirements"
1426 <https://tools.ietf.org/html/rfc1122#section-3.3.4.2>`__ describes two
1427 approaches to IP address handling in Internet hosts:
1428
1429 - In the "Strong ES Model", where an ES is a host ("End System"), an IP
1430 address is primarily associated with a particular interface. The host
1431 discards packets that arrive on interface A if they are destined for an
1432 IP address that is configured on interface B. The host never sends
1433 packets from interface A using a source address configured on interface
1434 B.
1435
1436 - In the "Weak ES Model", an IP address is primarily associated with a
1437 host. The host accepts packets that arrive on any interface if they are
1438 destined for any of the host's IP addresses, even if the address is
1439 configured on some interface other than the one on which it arrived. The
1440 host does not restrict itself to sending packets from an IP address
1441 associated with the originating interface.
1442
1443 Linux uses the weak ES model. That means that when packets destined to the
1444 VLAN 9 IP address arrive on eth0 and are bridged to br0, the kernel IP
1445 stack accepts them there for the VLAN 9 IP address, even though they were
1446 not received on vlan9, the network device for vlan9.
1447
1448 To simulate the strong ES model on Linux, one may add iptables rule to
1449 filter packets based on source and destination address and adjust ARP
1450 configuration with sysctls.
1451
1452 BSD uses the strong ES model.
1453
1454Q: My OpenFlow controller doesn't see the VLANs that I expect.
1455
1456 A: The configuration for VLANs in the Open vSwitch database (e.g. via
1457 ovs-vsctl) only affects traffic that goes through Open vSwitch's
1458 implementation of the OpenFlow "normal switching" action. By default, when
1459 Open vSwitch isn't connected to a controller and nothing has been manually
1460 configured in the flow table, all traffic goes through the "normal
1461 switching" action. But, if you set up OpenFlow flows on your own, through
1462 a controller or using ovs-ofctl or through other means, then you have to
1463 implement VLAN handling yourself.
1464
1465 You can use "normal switching" as a component of your OpenFlow actions,
1466 e.g. by putting "normal" into the lists of actions on ovs-ofctl or by
1467 outputting to OFPP_NORMAL from an OpenFlow controller. In situations where
1468 this is not suitable, you can implement VLAN handling yourself, e.g.:
1469
1470 - If a packet comes in on an access port, and the flow table needs to send
1471 it out on a trunk port, then the flow can add the appropriate VLAN tag
1472 with the "mod_vlan_vid" action.
1473
1474 - If a packet comes in on a trunk port, and the flow table needs to send it
1475 out on an access port, then the flow can strip the VLAN tag with the
1476 "strip_vlan" action.
1477
1478Q: I configured ports on a bridge as access ports with different VLAN tags,
dc76953f 1479like this::
edde4861
SF
1480
1481 $ ovs-vsctl add-br br0
1482 $ ovs-vsctl set-controller br0 tcp:192.168.0.10:6653
1483 $ ovs-vsctl add-port br0 eth0
1484 $ ovs-vsctl add-port br0 tap0 tag=9
1485 $ ovs-vsctl add-port br0 tap1 tag=10
1486
1487but the VMs running behind tap0 and tap1 can still communicate, that is, they
1488are not isolated from each other even though they are on different VLANs.
1489
1490 A: Do you have a controller configured on br0 (as the commands above do)?
1491 If so, then this is a variant on the previous question, "My OpenFlow
1492 controller doesn't see the VLANs that I expect," and you can refer to the
1493 answer there for more information.
1494
1495Q: How MAC learning works with VLANs?
1496
1497 A: Open vSwitch implements Independent VLAN Learning (IVL) for
1498 ``OFPP_NORMAL`` action, e.g. it logically has separate learning tables for
1499 each VLANs.
1500
1501VXLANs
1502------
1503
1504Q: What's a VXLAN?
1505
1506 A: VXLAN stands for Virtual eXtensible Local Area Network, and is a means
1507 to solve the scaling challenges of VLAN networks in a multi-tenant
1508 environment. VXLAN is an overlay network which transports an L2 network
1509 over an existing L3 network. For more information on VXLAN, please see `RFC
1510 7348 <http://tools.ietf.org/html/rfc7348>`__.
1511
1512Q: How much of the VXLAN protocol does Open vSwitch currently support?
1513
1514 A: Open vSwitch currently supports the framing format for packets on the
1515 wire. There is currently no support for the multicast aspects of VXLAN. To
1516 get around the lack of multicast support, it is possible to pre-provision
1517 MAC to IP address mappings either manually or from a controller.
1518
1519Q: What destination UDP port does the VXLAN implementation in Open vSwitch
1520use?
1521
1522 A: By default, Open vSwitch will use the assigned IANA port for VXLAN,
1523 which is 4789. However, it is possible to configure the destination UDP
1524 port manually on a per-VXLAN tunnel basis. An example of this configuration
1525 is provided below.::
1526
1527 $ ovs-vsctl add-br br0
1528 $ ovs-vsctl add-port br0 vxlan1 -- set interface vxlan1 type=vxlan \
1529 options:remote_ip=192.168.1.2 options:key=flow options:dst_port=8472
1530
1531Using OpenFlow
1532--------------
1533
1534Q: What versions of OpenFlow does Open vSwitch support?
1535
1536 A: The following table lists the versions of OpenFlow supported by each
1537 version of Open vSwitch:
1538
1539 =============== ===== ===== ===== ===== ===== ===== =====
1540 Open vSwitch OF1.0 OF1.1 OF1.2 OF1.3 OF1.4 OF1.5 OF1.6
1541 =============== ===== ===== ===== ===== ===== ===== =====
1542 1.9 and earlier yes --- --- --- --- --- ---
1543 1.10 yes --- (*) (*) --- --- ---
1544 1.11 yes --- (*) (*) --- --- ---
1545 2.0 yes (*) (*) (*) --- --- ---
1546 2.1 yes (*) (*) (*) --- --- ---
1547 2.2 yes (*) (*) (*) (%) (*) ---
1548 2.3 yes yes yes yes (*) (*) ---
1549 2.4 yes yes yes yes (*) (*) ---
1550 2.5 yes yes yes yes (*) (*) (*)
1551 =============== ===== ===== ===== ===== ===== ===== =====
1552
1553 (*) Supported, with one or more missing features.
1554 (%) Experimental, unsafe implementation.
1555
1556 Open vSwitch 2.3 enables OpenFlow 1.0, 1.1, 1.2, and 1.3 by default in
1557 ovs-vswitchd. In Open vSwitch 1.10 through 2.2, OpenFlow 1.1, 1.2, and 1.3
1558 must be enabled manually in ovs-vswitchd.
1559
1560 Some versions of OpenFlow are supported with missing features and therefore
1561 not enabled by default: OpenFlow 1.4 and 1.5, in Open vSwitch 2.3 and
1562 later, as well as OpenFlow 1.6 in Open vSwitch 2.5 and later. Also, the
1563 OpenFlow 1.6 specification is still under development and thus subject to
1564 change.
1565
1566 In any case, the user may override the default:
1567
dc76953f 1568 - To enable OpenFlow 1.0, 1.1, 1.2, and 1.3 on bridge br0::
edde4861
SF
1569
1570 $ ovs-vsctl set bridge br0 \
1571 protocols=OpenFlow10,OpenFlow11,OpenFlow12,OpenFlow13
1572
dc76953f 1573 - To enable OpenFlow 1.0, 1.1, 1.2, 1.3, 1.4, and 1.5 on bridge br0::
edde4861
SF
1574
1575 $ ovs-vsctl set bridge br0 \
1576 protocols=OpenFlow10,OpenFlow11,OpenFlow12,OpenFlow13,OpenFlow14,OpenFlow15
1577
dc76953f 1578 - To enable only OpenFlow 1.0 on bridge br0::
edde4861
SF
1579
1580 $ ovs-vsctl set bridge br0 protocols=OpenFlow10
1581
1582 All current versions of ovs-ofctl enable only OpenFlow 1.0 by default. Use
1583 the -O option to enable support for later versions of OpenFlow in
dc76953f 1584 ovs-ofctl. For example::
edde4861
SF
1585
1586 $ ovs-ofctl -O OpenFlow13 dump-flows br0
1587
1588 (Open vSwitch 2.2 had an experimental implementation of OpenFlow 1.4 that
1589 could cause crashes. We don't recommend enabling it.)
1590
1591 The `OpenFlow guide <OPENFLOW.rst>`__ tracks support for OpenFlow 1.1 and
1592 later features. When support for OpenFlow 1.4 and 1.5 is solidly
1593 implemented, Open vSwitch will enable those version by default.
1594
1595Q: Does Open vSwitch support MPLS?
1596
1597 A: Before version 1.11, Open vSwitch did not support MPLS. That is, these
1598 versions can match on MPLS Ethernet types, but they cannot match, push, or
1599 pop MPLS labels, nor can they look past MPLS labels into the encapsulated
1600 packet.
1601
1602 Open vSwitch versions 1.11, 2.0, and 2.1 have very minimal support for
1603 MPLS. With the userspace datapath only, these versions can match, push, or
1604 pop a single MPLS label, but they still cannot look past MPLS labels (even
1605 after popping them) into the encapsulated packet. Kernel datapath support
1606 is unchanged from earlier versions.
1607
1608 Open vSwitch version 2.3 can match, push, or pop a single MPLS label and
1609 look past the MPLS label into the encapsulated packet. Both userspace and
1610 kernel datapaths will be supported, but MPLS processing always happens in
1611 userspace either way, so kernel datapath performance will be disappointing.
1612
1613 Open vSwitch version 2.4 can match, push, or pop up to 3 MPLS labels and
1614 look past the MPLS label into the encapsulated packet. It will have kernel
1615 support for MPLS, yielding improved performance.
1616
1617Q: I'm getting "error type 45250 code 0". What's that?
1618
1619 A: This is a Open vSwitch extension to OpenFlow error codes. Open vSwitch
1620 uses this extension when it must report an error to an OpenFlow controller
1621 but no standard OpenFlow error code is suitable.
1622
1623 Open vSwitch logs the errors that it sends to controllers, so the easiest
1624 thing to do is probably to look at the ovs-vswitchd log to find out what
1625 the error was.
1626
1627 If you want to dissect the extended error message yourself, the format is
1628 documented in include/openflow/nicira-ext.h in the Open vSwitch source
1629 distribution. The extended error codes are documented in
1630 include/openvswitch/ofp-errors.h.
1631
1632Q: Some of the traffic that I'd expect my OpenFlow controller to see doesn't
1633actually appear through the OpenFlow connection, even though I know that it's
1634going through.
1635
1636 A: By default, Open vSwitch assumes that OpenFlow controllers are connected
1637 "in-band", that is, that the controllers are actually part of the network
1638 that is being controlled. In in-band mode, Open vSwitch sets up special
1639 "hidden" flows to make sure that traffic can make it back and forth between
1640 OVS and the controllers. These hidden flows are higher priority than any
1641 flows that can be set up through OpenFlow, and they are not visible through
1642 normal OpenFlow flow table dumps.
1643
1644 Usually, the hidden flows are desirable and helpful, but occasionally they
1645 can cause unexpected behavior. You can view the full OpenFlow flow table,
dc76953f 1646 including hidden flows, on bridge br0 with the command::
edde4861
SF
1647
1648 $ ovs-appctl bridge/dump-flows br0
1649
1650 to help you debug. The hidden flows are those with priorities
1651 greater than 65535 (the maximum priority that can be set with
1652 OpenFlow).
1653
1654 The DESIGN file at the top level of the Open vSwitch source
1655 distribution describes the in-band model in detail.
1656
1657 If your controllers are not actually in-band (e.g. they are on
1658 localhost via 127.0.0.1, or on a separate network), then you should
1659 configure your controllers in "out-of-band" mode. If you have one
1660 controller on bridge br0, then you can configure out-of-band mode
dc76953f 1661 on it with::
edde4861
SF
1662
1663 $ ovs-vsctl set controller br0 connection-mode=out-of-band
1664
1665Q: Some of the OpenFlow flows that my controller sets up don't seem to apply to
1666certain traffic, especially traffic between OVS and the controller itself.
1667
1668 A: See above.
1669
1670Q: I configured all my controllers for out-of-band control mode but "ovs-appctl
1671bridge/dump-flows" still shows some hidden flows.
1672
1673 A: You probably have a remote manager configured (e.g. with "ovs-vsctl
1674 set-manager"). By default, Open vSwitch assumes that managers need in-band
1675 rules set up on every bridge. You can disable these rules on bridge br0
dc76953f 1676 with::
edde4861
SF
1677
1678 $ ovs-vsctl set bridge br0 other-config:disable-in-band=true
1679
1680 This actually disables in-band control entirely for the bridge, as if all
1681 the bridge's controllers were configured for out-of-band control.
1682
1683Q: My OpenFlow controller doesn't see the VLANs that I expect.
1684
1685 A: See answer under "VLANs", above.
1686
1687Q: I ran ``ovs-ofctl add-flow br0 nw_dst=192.168.0.1,actions=drop`` but I got a
dc76953f 1688funny message like this::
edde4861
SF
1689
1690 ofp_util|INFO|normalization changed ofp_match, details:
1691 ofp_util|INFO| pre: nw_dst=192.168.0.1
1692 ofp_util|INFO|post:
1693
1694and when I ran ``ovs-ofctl dump-flows br0`` I saw that my nw_dst match had
1695disappeared, so that the flow ends up matching every packet.
1696
1697 A: The term "normalization" in the log message means that a flow cannot
1698 match on an L3 field without saying what L3 protocol is in use. The
1699 "ovs-ofctl" command above didn't specify an L3 protocol, so the L3 field
1700 match was dropped.
1701
1702 In this case, the L3 protocol could be IP or ARP. A correct command for
dc76953f 1703 each possibility is, respectively::
edde4861
SF
1704
1705 $ ovs-ofctl add-flow br0 ip,nw_dst=192.168.0.1,actions=drop
1706
dc76953f 1707 and::
edde4861
SF
1708
1709 $ ovs-ofctl add-flow br0 arp,nw_dst=192.168.0.1,actions=drop
1710
1711 Similarly, a flow cannot match on an L4 field without saying what L4
1712 protocol is in use. For example, the flow match ``tp_src=1234`` is, by
1713 itself, meaningless and will be ignored. Instead, to match TCP source port
1714 1234, write ``tcp,tp_src=1234``, or to match UDP source port 1234, write
1715 ``udp,tp_src=1234``.
1716
1717Q: How can I figure out the OpenFlow port number for a given port?
1718
1719 A: The ``OFPT_FEATURES_REQUEST`` message requests an OpenFlow switch to
1720 respond with an ``OFPT_FEATURES_REPLY`` that, among other information,
1721 includes a mapping between OpenFlow port names and numbers. From a command
1722 prompt, ``ovs-ofctl show br0`` makes such a request and prints the response
1723 for switch br0.
1724
1725 The Interface table in the Open vSwitch database also maps OpenFlow port
1726 names to numbers. To print the OpenFlow port number associated with
dc76953f 1727 interface eth0, run::
edde4861
SF
1728
1729 $ ovs-vsctl get Interface eth0 ofport
1730
dc76953f 1731 You can print the entire mapping with::
edde4861
SF
1732
1733 $ ovs-vsctl -- --columns=name,ofport list Interface
1734
1735 but the output mixes together interfaces from all bridges in the database,
1736 so it may be confusing if more than one bridge exists.
1737
1738 In the Open vSwitch database, ofport value ``-1`` means that the interface
1739 could not be created due to an error. (The Open vSwitch log should
1740 indicate the reason.) ofport value ``[]`` (the empty set) means that the
1741 interface hasn't been created yet. The latter is normally an intermittent
1742 condition (unless ovs-vswitchd is not running).
1743
1744Q: I added some flows with my controller or with ovs-ofctl, but when I run
1745"ovs-dpctl dump-flows" I don't see them.
1746
1747 A: ovs-dpctl queries a kernel datapath, not an OpenFlow switch. It won't
1748 display the information that you want. You want to use ``ovs-ofctl
1749 dump-flows`` instead.
1750
1751Q: It looks like each of the interfaces in my bonded port shows up as an
1752individual OpenFlow port. Is that right?
1753
1754 A: Yes, Open vSwitch makes individual bond interfaces visible as OpenFlow
1755 ports, rather than the bond as a whole. The interfaces are treated
1756 together as a bond for only a few purposes:
1757
1758 - Sending a packet to the OFPP_NORMAL port. (When an OpenFlow controller
1759 is not configured, this happens implicitly to every packet.)
1760
1761 - Mirrors configured for output to a bonded port.
1762
1763 It would make a lot of sense for Open vSwitch to present a bond as a single
1764 OpenFlow port. If you want to contribute an implementation of such a
1765 feature, please bring it up on the Open vSwitch development mailing list at
1766 dev@openvswitch.org.
1767
1768Q: I have a sophisticated network setup involving Open vSwitch, VMs or multiple
1769hosts, and other components. The behavior isn't what I expect. Help!
1770
1771 A: To debug network behavior problems, trace the path of a packet,
1772 hop-by-hop, from its origin in one host to a remote host. If that's
1773 correct, then trace the path of the response packet back to the origin.
1774
1775 The open source tool called ``plotnetcfg`` can help to understand the
1776 relationship between the networking devices on a single host.
1777
1778 Usually a simple ICMP echo request and reply (``ping``) packet is good
1779 enough. Start by initiating an ongoing ``ping`` from the origin host to a
1780 remote host. If you are tracking down a connectivity problem, the "ping"
1781 will not display any successful output, but packets are still being sent.
1782 (In this case the packets being sent are likely ARP rather than ICMP.)
1783
1784 Tools available for tracing include the following:
1785
1786 - ``tcpdump`` and ``wireshark`` for observing hops across network devices,
1787 such as Open vSwitch internal devices and physical wires.
1788
1789 - ``ovs-appctl dpif/dump-flows <br>`` in Open vSwitch 1.10 and later or
1790 ``ovs-dpctl dump-flows <br>`` in earlier versions. These tools allow one
1791 to observe the actions being taken on packets in ongoing flows.
1792
1793 See ovs-vswitchd(8) for ``ovs-appctl dpif/dump-flows`` documentation,
1794 ovs-dpctl(8) for ``ovs-dpctl dump-flows`` documentation, and "Why are
1795 there so many different ways to dump flows?" above for some background.
1796
1797 - ``ovs-appctl ofproto/trace`` to observe the logic behind how ovs-vswitchd
1798 treats packets. See ovs-vswitchd(8) for documentation. You can out more
1799 details about a given flow that ``ovs-dpctl dump-flows`` displays, by
1800 cutting and pasting a flow from the output into an ``ovs-appctl
1801 ofproto/trace`` command.
1802
1803 - SPAN, RSPAN, and ERSPAN features of physical switches, to observe what
1804 goes on at these physical hops.
1805
1806 Starting at the origin of a given packet, observe the packet at each hop in
1807 turn. For example, in one plausible scenario, you might:
1808
1809 1. ``tcpdump`` the ``eth`` interface through which an ARP egresses a VM,
1810 from inside the VM.
1811
1812 2. ``tcpdump`` the ``vif`` or ``tap`` interface through which the ARP
1813 ingresses the host machine.
1814
1815 3. Use ``ovs-dpctl dump-flows`` to spot the ARP flow and observe the host
1816 interface through which the ARP egresses the physical machine. You may
1817 need to use ``ovs-dpctl show`` to interpret the port numbers. If the
1818 output seems surprising, you can use ``ovs-appctl ofproto/trace`` to
1819 observe details of how ovs-vswitchd determined the actions in the
1820 ``ovs-dpctl dump-flows`` output.
1821
1822 4. ``tcpdump`` the ``eth`` interface through which the ARP egresses the
1823 physical machine.
1824
1825 5. ``tcpdump`` the ``eth`` interface through which the ARP ingresses the
1826 physical machine, at the remote host that receives the ARP.
1827
1828 6. Use ``ovs-dpctl dump-flows`` to spot the ARP flow on the remote host
1829 remote host that receives the ARP and observe the VM ``vif`` or ``tap``
1830 interface to which the flow is directed. Again, ``ovs-dpctl show`` and
1831 ``ovs-appctl ofproto/trace`` might help.
1832
1833 7. ``tcpdump`` the ``vif`` or ``tap`` interface to which the ARP is
1834 directed.
1835
1836 8. ``tcpdump`` the ``eth`` interface through which the ARP ingresses a VM,
1837 from inside the VM.
1838
1839 It is likely that during one of these steps you will figure out the
1840 problem. If not, then follow the ARP reply back to the origin, in reverse.
1841
1842Q: How do I make a flow drop packets?
1843
1844 A: To drop a packet is to receive it without forwarding it. OpenFlow
1845 explicitly specifies forwarding actions. Thus, a flow with an empty set of
1846 actions does not forward packets anywhere, causing them to be dropped. You
1847 can specify an empty set of actions with ``actions=`` on the ovs-ofctl
dc76953f 1848 command line. For example::
edde4861
SF
1849
1850 $ ovs-ofctl add-flow br0 priority=65535,actions=
1851
1852 would cause every packet entering switch br0 to be dropped.
1853
1854 You can write "drop" explicitly if you like. The effect is the same.
1855 Thus, the following command also causes every packet entering switch br0 to
dc76953f 1856 be dropped::
edde4861
SF
1857
1858 $ ovs-ofctl add-flow br0 priority=65535,actions=drop
1859
1860 ``drop`` is not an action, either in OpenFlow or Open vSwitch. Rather, it
1861 is only a way to say that there are no actions.
1862
dc76953f 1863Q: I added a flow to send packets out the ingress port, like this::
edde4861
SF
1864
1865 $ ovs-ofctl add-flow br0 in_port=2,actions=2
1866
1867but OVS drops the packets instead.
1868
1869 A: Yes, OpenFlow requires a switch to ignore attempts to send a packet out
1870 its ingress port. The rationale is that dropping these packets makes it
1871 harder to loop the network. Sometimes this behavior can even be
1872 convenient, e.g. it is often the desired behavior in a flow that forwards a
1873 packet to several ports ("floods" the packet).
1874
1875 Sometimes one really needs to send a packet out its ingress port
1876 ("hairpin"). In this case, output to ``OFPP_IN_PORT``, which in ovs-ofctl
dc76953f 1877 syntax is expressed as just ``in_port``, e.g.::
edde4861
SF
1878
1879 $ ovs-ofctl add-flow br0 in_port=2,actions=in_port
1880
1881 This also works in some circumstances where the flow doesn't match on the
1882 input port. For example, if you know that your switch has five ports
1883 numbered 2 through 6, then the following will send every received packet
dc76953f 1884 out every port, even its ingress port::
edde4861
SF
1885
1886 $ ovs-ofctl add-flow br0 actions=2,3,4,5,6,in_port
1887
dc76953f 1888 or, equivalently::
edde4861
SF
1889
1890 $ ovs-ofctl add-flow br0 actions=all,in_port
1891
1892 Sometimes, in complicated flow tables with multiple levels of ``resubmit``
1893 actions, a flow needs to output to a particular port that may or may not be
1894 the ingress port. It's difficult to take advantage of ``OFPP_IN_PORT`` in
1895 this situation. To help, Open vSwitch provides, as an OpenFlow extension,
1896 the ability to modify the in_port field. Whatever value is currently in
1897 the in_port field is the port to which outputs will be dropped, as well as
1898 the destination for ``OFPP_IN_PORT``. This means that the following will
dc76953f 1899 reliably output to port 2 or to ports 2 through 6, respectively::
edde4861
SF
1900
1901 $ ovs-ofctl add-flow br0 in_port=2,actions=load:0->NXM_OF_IN_PORT[],2
1902 $ ovs-ofctl add-flow br0 actions=load:0->NXM_OF_IN_PORT[],2,3,4,5,6
1903
1904 If the input port is important, then one may save and restore it on the
1905 stack:
1906
1907 $ ovs-ofctl add-flow br0 actions=push:NXM_OF_IN_PORT[],\
1908 load:0->NXM_OF_IN_PORT[],\
1909 2,3,4,5,6,\
1910 pop:NXM_OF_IN_PORT[]
1911
1912Q: My bridge br0 has host 192.168.0.1 on port 1 and host 192.168.0.2 on port 2.
1913I set up flows to forward only traffic destined to the other host and drop
dc76953f 1914other traffic, like this::
edde4861
SF
1915
1916 priority=5,in_port=1,ip,nw_dst=192.168.0.2,actions=2
1917 priority=5,in_port=2,ip,nw_dst=192.168.0.1,actions=1
1918 priority=0,actions=drop
1919
1920But it doesn't work--I don't get any connectivity when I do this. Why?
1921
1922 A: These flows drop the ARP packets that IP hosts use to establish IP
1923 connectivity over Ethernet. To solve the problem, add flows to allow ARP
dc76953f 1924 to pass between the hosts::
edde4861
SF
1925
1926 priority=5,in_port=1,arp,actions=2
1927 priority=5,in_port=2,arp,actions=1
1928
1929 This issue can manifest other ways, too. The following flows that match on
1930 Ethernet addresses instead of IP addresses will also drop ARP packets,
1931 because ARP requests are broadcast instead of being directed to a specific
dc76953f 1932 host::
edde4861
SF
1933
1934 priority=5,in_port=1,dl_dst=54:00:00:00:00:02,actions=2
1935 priority=5,in_port=2,dl_dst=54:00:00:00:00:01,actions=1
1936 priority=0,actions=drop
1937
1938 The solution already described above will also work in this case. It may
dc76953f 1939 be better to add flows to allow all multicast and broadcast traffic::
edde4861
SF
1940
1941 priority=5,in_port=1,dl_dst=01:00:00:00:00:00/01:00:00:00:00:00,actions=2
1942 priority=5,in_port=2,dl_dst=01:00:00:00:00:00/01:00:00:00:00:00,actions=1
1943
1944Q: My bridge disconnects from my controller on add-port/del-port.
1945
1946 A: Reconfiguring your bridge can change your bridge's datapath-id because
1947 Open vSwitch generates datapath-id from the MAC address of one of its
1948 ports. In that case, Open vSwitch disconnects from controllers because
1949 there's no graceful way to notify controllers about the change of
1950 datapath-id.
1951
1952 To avoid the behaviour, you can configure datapath-id manually.::
1953
1954 $ ovs-vsctl set bridge br0 other-config:datapath-id=0123456789abcdef
1955
1956Q: My controller complains that OVS is not buffering packets.
1957What's going on?
1958
1959 A: "Packet buffering" is an optional OpenFlow feature, and controllers
1960 should detect how many "buffers" an OpenFlow switch implements. It was
1961 recently noticed that OVS implementation of the buffering feature was not
1962 compliant to OpenFlow specifications. Rather than fix it and risk
1963 controller incompatibility, the buffering feature is removed as of OVS 2.7.
1964 Controllers are already expected to work properly in cases where the switch
1965 can not buffer packets, but sends full packets in "packet-in" messages
1966 instead, so this change should not affect existing users. After the change
1967 OVS always sends the ``buffer_id`` as ``0xffffffff`` in "packet-in"
1968 messages and will send an error response if any other value of this field
1969 is included in a "packet-out" or a "flow mod" sent by a controller.
1970
1971Q: How does OVS divide flows among buckets in an OpenFlow "select" group?
1972
1973 A: In Open vSwitch 2.3 and earlier, Open vSwitch used the destination
1974 Ethernet address to choose a bucket in a select group.
1975
1976 Open vSwitch 2.4 and later by default hashes the source and destination
1977 Ethernet address, VLAN ID, Ethernet type, IPv4/v6 source and destination
1978 address and protocol, and for TCP and SCTP only, the source and destination
1979 ports. The hash is "symmetric", meaning that exchanging source and
1980 destination addresses does not change the bucket selection.
1981
1982 Select groups in Open vSwitch 2.4 and later can be configured to use a
1983 different hash function, using a Netronome extension to the OpenFlow 1.5+
1984 group_mod message. For more information, see
1985 Documentation/group-selection-method-property.txt in the Open vSwitch
1986 source tree. (OpenFlow 1.5 support in Open vSwitch is still experimental.)
1987
1988Q: I added a flow to accept packets on VLAN 123 and output them on VLAN 456,
dc76953f 1989like so::
edde4861
SF
1990
1991 $ ovs-ofctl add-flow br0 dl_vlan=123,actions=output:1,mod_vlan_vid:456
1992
1993but the packets are actually being output in VLAN 123. Why?
1994
1995 A: OpenFlow actions are executed in the order specified. Thus, the actions
1996 above first output the packet, then change its VLAN. Since the output
1997 occurs before changing the VLAN, the change in VLAN will have no visible
1998 effect.
1999
2000 To solve this and similar problems, order actions so that changes to
dc76953f 2001 headers happen before output, e.g.::
edde4861
SF
2002
2003 $ ovs-ofctl add-flow br0 dl_vlan=123,actions=mod_vlan_vid:456,output:1
2004
2005Q: The "learn" action can't learn the action I want, can you improve it?
2006
2007 A: By itself, the "learn" action can only put two kinds of actions into the
2008 flows that it creates: "load" and "output" actions. If "learn" is used in
2009 isolation, these are severe limits.
2010
2011 However, "learn" is not meant to be used in isolation. It is a primitive
2012 meant to be used together with other Open vSwitch features to accomplish a
2013 task. Its existing features are enough to accomplish most tasks.
2014
2015 Here is an outline of a typical pipeline structure that allows for
2016 versatile behavior using "learn":
2017
2018 - Flows in table A contain a "learn" action, that populates flows in table
2019 L, that use a "load" action to populate register R with information about
2020 what was learned.
2021
2022 - Flows in table B contain two sequential resubmit actions: one to table L
2023 and another one to table B+1.
2024
2025 - Flows in table B+1 match on register R and act differently depending on
2026 what the flows in table L loaded into it.
2027
2028 This approach can be used to implement many "learn"-based features. For
2029 example:
2030
2031 - Resubmit to a table selected based on learned information, e.g. see:
2032 http://openvswitch.org/pipermail/discuss/2016-June/021694.html
2033
2034 - MAC learning in the middle of a pipeline, as described in `the tutorial
c27b7536 2035 <tutorial/tutorial.rst>`__.
edde4861
SF
2036
2037 - TCP state based firewalling, by learning outgoing connections based on
2038 SYN packets and matching them up with incoming packets.
2039
2040 - At least some of the features described in T. A. Hoff, "Extending Open
2041 vSwitch to Facilitate Creation of Stateful SDN Applications".
2042
2043Development
2044-----------
2045
2046Q: How do I implement a new OpenFlow message?
2047
2048 A: Add your new message to ``enum ofpraw`` and ``enum ofptype`` in
2049 ``lib/ofp-msgs.h``, following the existing pattern. Then recompile and fix
2050 all of the new warnings, implementing new functionality for the new message
2051 as needed. (If you configure with ``--enable-Werror``, as described in the
795752a3 2052 `general installation guide`_, then it is impossible to miss any warnings.)
edde4861
SF
2053
2054 If you need to add an OpenFlow vendor extension message for a vendor that
2055 doesn't yet have any extension messages, then you will also need to edit
2056 ``build-aux/extract-ofp-msgs``.
2057
2058Q: How do I add support for a new field or header?
2059
2060 A: Add new members for your field to ``struct flow`` in ``lib/flow.h``, and
2061 add new enumerations for your new field to ``enum mf_field_id`` in
2062 ``lib/meta-flow.h``, following the existing pattern. Also, add support to
2063 ``miniflow_extract()`` in ``lib/flow.c`` for extracting your new field from
2064 a packet into struct miniflow, and to ``nx_put_raw()`` in
2065 ``lib/nx-match.c`` to output your new field in OXM matches. Then recompile
2066 and fix all of the new warnings, implementing new functionality for the new
2067 field or header as needed. (If you configure with ``--enable-Werror``, as
795752a3
SF
2068 described in the `general installation guide`_, then it is impossible to
2069 miss any warnings.)
edde4861
SF
2070
2071 If you want kernel datapath support for your new field, you also need to
2072 modify the kernel module for the operating systems you are interested in.
2073 This isn't mandatory, since fields understood only by userspace work too
2074 (with a performance penalty), so it's reasonable to start development
2075 without it. If you implement kernel module support for Linux, then the
2076 Linux kernel "netdev" mailing list is the place to submit that support
2077 first; please read up on the Linux kernel development process separately.
2078 The Windows datapath kernel module support, on the other hand, is
2079 maintained within the OVS tree, so patches for that can go directly to
2080 ovs-dev.
2081
2082Q: How do I add support for a new OpenFlow action?
2083
2084 A: Add your new action to ``enum ofp_raw_action_type`` in
2085 ``lib/ofp-actions.c``, following the existing pattern. Then recompile and
2086 fix all of the new warnings, implementing new functionality for the new
2087 action as needed. (If you configure with ``--enable-Werror``, as described
795752a3
SF
2088 in the `general installation guide`_, then it is impossible to miss any
2089 warnings.)
edde4861
SF
2090
2091 If you need to add an OpenFlow vendor extension action for a vendor that
2092 doesn't yet have any extension actions, then you will also need to edit
2093 ``build-aux/extract-ofp-actions``.
795752a3
SF
2094
2095.. _general installation guide: Documentation/intro/install/general.rst
2096.. _DPDK installation guide: Documentation/intro/install/dpdk.rst