]> git.proxmox.com Git - mirror_ovs.git/blame - FAQ.rst
ovs-ofctl: Fix memory leak in ofctl_packet_out().
[mirror_ovs.git] / FAQ.rst
CommitLineData
edde4861
SF
1..
2 Licensed under the Apache License, Version 2.0 (the "License"); you may
3 not use this file except in compliance with the License. You may obtain
4 a copy of the License at
5
6 http://www.apache.org/licenses/LICENSE-2.0
7
8 Unless required by applicable law or agreed to in writing, software
9 distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
10 WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
11 License for the specific language governing permissions and limitations
12 under the License.
13
14 Convention for heading levels in Open vSwitch documentation:
15
16 ======= Heading 0 (reserved for the title in a document)
17 ------- Heading 1
18 ~~~~~~~ Heading 2
19 +++++++ Heading 3
20 ''''''' Heading 4
21
22 Avoid deeper levels because they do not render well.
23
24==========================
25Frequently Asked Questions
26==========================
27
28General
29-------
30
31Q: What is Open vSwitch?
32
33 A: Open vSwitch is a production quality open source software switch
34 designed to be used as a vswitch in virtualized server environments. A
35 vswitch forwards traffic between different VMs on the same physical host
36 and also forwards traffic between VMs and the physical network. Open
37 vSwitch supports standard management interfaces (e.g. sFlow, NetFlow,
38 IPFIX, RSPAN, CLI), and is open to programmatic extension and control using
39 OpenFlow and the OVSDB management protocol.
40
41 Open vSwitch as designed to be compatible with modern switching chipsets.
42 This means that it can be ported to existing high-fanout switches allowing
43 the same flexible control of the physical infrastructure as the virtual
44 infrastructure. It also means that Open vSwitch will be able to take
45 advantage of on-NIC switching chipsets as their functionality matures.
46
47Q: What virtualization platforms can use Open vSwitch?
48
49 A: Open vSwitch can currently run on any Linux-based virtualization
50 platform (kernel 3.10 and newer), including: KVM, VirtualBox, Xen, Xen
51 Cloud Platform, XenServer. As of Linux 3.3 it is part of the mainline
52 kernel. The bulk of the code is written in platform- independent C and is
53 easily ported to other environments. We welcome inquires about integrating
54 Open vSwitch with other virtualization platforms.
55
56Q: How can I try Open vSwitch?
57
58 A: The Open vSwitch source code can be built on a Linux system. You can
59 build and experiment with Open vSwitch on any Linux machine. Packages for
60 various Linux distributions are available on many platforms, including:
61 Debian, Ubuntu, Fedora.
62
63 You may also download and run a virtualization platform that already has
64 Open vSwitch integrated. For example, download a recent ISO for XenServer
65 or Xen Cloud Platform. Be aware that the version integrated with a
66 particular platform may not be the most recent Open vSwitch release.
67
68Q: Does Open vSwitch only work on Linux?
69
70 A: No, Open vSwitch has been ported to a number of different operating
71 systems and hardware platforms. Most of the development work occurs on
72 Linux, but the code should be portable to any POSIX system. We've seen
73 Open vSwitch ported to a number of different platforms, including FreeBSD,
74 Windows, and even non-POSIX embedded systems.
75
76 By definition, the Open vSwitch Linux kernel module only works on Linux and
77 will provide the highest performance. However, a userspace datapath is
78 available that should be very portable.
79
80Q: What's involved with porting Open vSwitch to a new platform or switching ASIC?
81
82 A: The `porting document <PORTING.rst>`__ describes how one would go about
83 porting Open vSwitch to a new operating system or hardware platform.
84
85Q: Why would I use Open vSwitch instead of the Linux bridge?
86
87 A: Open vSwitch is specially designed to make it easier to manage VM
88 network configuration and monitor state spread across many physical hosts
89 in dynamic virtualized environments. Refer to `WHY-OVS <WHY-OVS.rst>`__
90 for a more detailed description of how Open vSwitch relates to the Linux
91 Bridge.
92
93Q: How is Open vSwitch related to distributed virtual switches like the VMware
94vNetwork distributed switch or the Cisco Nexus 1000V?
95
96 A: Distributed vswitch applications (e.g., VMware vNetwork distributed
97 switch, Cisco Nexus 1000V) provide a centralized way to configure and
98 monitor the network state of VMs that are spread across many physical
99 hosts. Open vSwitch is not a distributed vswitch itself, rather it runs on
100 each physical host and supports remote management in a way that makes it
101 easier for developers of virtualization/cloud management platforms to offer
102 distributed vswitch capabilities.
103
104 To aid in distribution, Open vSwitch provides two open protocols that are
105 specially designed for remote management in virtualized network
106 environments: OpenFlow, which exposes flow-based forwarding state, and the
107 OVSDB management protocol, which exposes switch port state. In addition to
108 the switch implementation itself, Open vSwitch includes tools (ovs-ofctl,
109 ovs-vsctl) that developers can script and extend to provide distributed
110 vswitch capabilities that are closely integrated with their virtualization
111 management platform.
112
113Q: Why doesn't Open vSwitch support distribution?
114
115 A: Open vSwitch is intended to be a useful component for building flexible
116 network infrastructure. There are many different approaches to distribution
117 which balance trade-offs between simplicity, scalability, hardware
118 compatibility, convergence times, logical forwarding model, etc. The goal
119 of Open vSwitch is to be able to support all as a primitive building block
120 rather than choose a particular point in the distributed design space.
121
122Q: How can I contribute to the Open vSwitch Community?
123
124 A: You can start by joining the mailing lists and helping to answer
125 questions. You can also suggest improvements to documentation. If you
126 have a feature or bug you would like to work on, send a mail to `one of the
8bb0434a 127 mailing lists <http://openvswitch.org/mlists/>`__.
edde4861
SF
128
129Q: Why can I no longer connect to my OpenFlow controller or OVSDB manager?
130
131 A: Starting in OVS 2.4, we switched the default ports to the IANA-specified
132 port numbers for OpenFlow (6633->6653) and OVSDB (6632->6640). We
133 recommend using these port numbers, but if you cannot, all the programs
134 allow overriding the default port. See the appropriate man page.
135
136Releases
137--------
138
139Q: What does it mean for an Open vSwitch release to be LTS (long-term support)?
140
141 A: All official releases have been through a comprehensive testing process
142 and are suitable for production use. Planned releases occur twice a year.
143 If a significant bug is identified in an LTS release, we will provide an
144 updated release that includes the fix. Releases that are not LTS may not
145 be fixed and may just be supplanted by the next major release. The current
7169d4fd 146 LTS release is 2.5.x.
edde4861 147
8bb0434a 148 For more information on the Open vSwitch release process, refer to `release
f8e5a539 149 process <Documentation/release-process.rst>`__.
edde4861
SF
150
151Q: What Linux kernel versions does each Open vSwitch release work with?
152
153 A: The following table lists the Linux kernel versions against which the
154 given versions of the Open vSwitch kernel module will successfully build.
155 The Linux kernel versions are upstream kernel versions, so Linux kernels
156 modified from the upstream sources may not build in some cases even if they
157 are based on a supported version. This is most notably true of Red Hat
158 Enterprise Linux (RHEL) kernels, which are extensively modified from
159 upstream.
160
161 ============ ==============
162 Open vSwitch Linux kernel
163 ============ ==============
164 1.4.x 2.6.18 to 3.2
165 1.5.x 2.6.18 to 3.2
166 1.6.x 2.6.18 to 3.2
167 1.7.x 2.6.18 to 3.3
168 1.8.x 2.6.18 to 3.4
169 1.9.x 2.6.18 to 3.8
170 1.10.x 2.6.18 to 3.8
171 1.11.x 2.6.18 to 3.8
172 2.0.x 2.6.32 to 3.10
173 2.1.x 2.6.32 to 3.11
174 2.3.x 2.6.32 to 3.14
175 2.4.x 2.6.32 to 4.0
176 2.5.x 2.6.32 to 4.3
177 2.6.x 3.10 to 4.7
178 ============ ==============
179
180 Open vSwitch userspace should also work with the Linux kernel module built
181 into Linux 3.3 and later.
182
183 Open vSwitch userspace is not sensitive to the Linux kernel version. It
184 should build against almost any kernel, certainly against 2.6.32 and later.
185
186Q: Are all features available with all datapaths?
187
188 A: Open vSwitch supports different datapaths on different platforms. Each
189 datapath has a different feature set: the following tables try to summarize
190 the status.
191
192 Supported datapaths:
193
194 Linux upstream
195 The datapath implemented by the kernel module shipped with Linux
196 upstream. Since features have been gradually introduced into the kernel,
197 the table mentions the first Linux release whose OVS module supports the
198 feature.
199
200 Linux OVS tree
201 The datapath implemented by the Linux kernel module distributed with the
202 OVS source tree.
203
204 Userspace
205 Also known as DPDK, dpif-netdev or dummy datapath. It is the only
206 datapath that works on NetBSD, FreeBSD and Mac OSX.
207
208 Hyper-V
209 Also known as the Windows datapath.
210
211 The following table lists the datapath supported features from an Open
212 vSwitch user's perspective.
213
214 ===================== ============== ============== ========= =======
215 Feature Linux upstream Linux OVS tree Userspace Hyper-V
216 ===================== ============== ============== ========= =======
217 NAT 4.6 YES NO NO
218 Connection tracking 4.3 YES PARTIAL PARTIAL
219 Tunnel - LISP NO YES NO NO
220 Tunnel - STT NO YES NO YES
221 Tunnel - GRE 3.11 YES YES YES
222 Tunnel - VXLAN 3.12 YES YES YES
223 Tunnel - Geneve 3.18 YES YES YES
224 Tunnel - GRE-IPv6 NO NO YES NO
225 Tunnel - VXLAN-IPv6 4.3 YES YES NO
226 Tunnel - Geneve-IPv6 4.4 YES YES NO
227 QoS - Policing YES YES YES NO
228 QoS - Shaping YES YES NO NO
229 sFlow YES YES YES NO
230 IPFIX 3.10 YES YES NO
231 Set action YES YES YES PARTIAL
232 NIC Bonding YES YES YES NO
233 Multiple VTEPs YES YES YES NO
234 ===================== ============== ============== ========= =======
235
236 Do note, however:
237
238 * Only a limited set of flow fields is modifiable via the set action by the
239 Hyper-V datapath.
240 * The Hyper-V datapath only supports one physical NIC per datapath. This is
241 why bonding is not supported.
242 * The Hyper-V datapath can have at most one IP address configured as a
243 tunnel endpoint.
244
245 The following table lists features that do not *directly* impact an Open
246 vSwitch user, e.g. because their absence can be hidden by the ofproto layer
247 (usually this comes with a performance penalty).
248
249 ===================== ============== ============== ========= =======
250 Feature Linux upstream Linux OVS tree Userspace Hyper-V
251 ===================== ============== ============== ========= =======
252 SCTP flows 3.12 YES YES YES
253 MPLS 3.19 YES YES YES
254 UFID 4.0 YES YES NO
255 Megaflows 3.12 YES YES NO
256 Masked set action 4.0 YES YES NO
257 Recirculation 3.19 YES YES YES
258 TCP flags matching 3.13 YES YES NO
259 Validate flow actions YES YES N/A NO
260 Multiple datapaths YES YES YES NO
261 Tunnel TSO - STT N/A YES NO YES
262 ===================== ============== ============== ========= =======
263
264Q: What DPDK version does each Open vSwitch release work with?
265
266 A: The following table lists the DPDK version against which the given
267 versions of Open vSwitch will successfully build.
268
269 ============ =====
270 Open vSwitch DPDK
271 ============ =====
272 2.2.x 1.6
273 2.3.x 1.6
274 2.4.x 2.0
275 2.5.x 2.2
276 2.6.x 16.07
277 ============ =====
278
dc76953f 279Q: I get an error like this when I configure Open vSwitch::
edde4861
SF
280
281 configure: error: Linux kernel in <dir> is version <x>, but
282 version newer than <y> is not supported (please refer to the
283 FAQ for advice)
284
285What should I do?
286
287 A: You have the following options:
288
289 - Use the Linux kernel module supplied with the kernel that you are using.
290 (See also the following FAQ.)
291
292 - If there is a newer released version of Open vSwitch, consider building
293 that one, because it may support the kernel that you are building
294 against. (To find out, consult the table in the previous FAQ.)
295
296 - The Open vSwitch "master" branch may support the kernel that you are
297 using, so consider building the kernel module from "master".
298
299 All versions of Open vSwitch userspace are compatible with all versions of
300 the Open vSwitch kernel module, so you do not have to use the kernel module
301 from one source along with the userspace programs from the same source.
302
303Q: What features are not available in the Open vSwitch kernel datapath that
304ships as part of the upstream Linux kernel?
305
306 A: The kernel module in upstream Linux does not include support for LISP.
307 Work is in progress to add support for LISP to the upstream Linux version
308 of the Open vSwitch kernel module. For now, if you need this feature, use
309 the kernel module from the Open vSwitch distribution instead of the
310 upstream Linux kernel module.
311
312 Certain features require kernel support to function or to have reasonable
313 performance. If the ovs-vswitchd log file indicates that a feature is not
314 supported, consider upgrading to a newer upstream Linux release or using
315 the kernel module paired with the userspace distribution.
316
317Q: Why do tunnels not work when using a kernel module other than the one
318packaged with Open vSwitch?
319
320 A: Support for tunnels was added to the upstream Linux kernel module after
321 the rest of Open vSwitch. As a result, some kernels may contain support for
322 Open vSwitch but not tunnels. The minimum kernel version that supports each
323 tunnel protocol is:
324
325 ======== ============
326 Protocol Linux Kernel
327 ======== ============
328 GRE 3.11
329 VXLAN 3.12
330 Geneve 3.18
331 LISP not upstream
332 STT not upstream
333 ======== ============
334
335 If you are using a version of the kernel that is older than the one listed
336 above, it is still possible to use that tunnel protocol. However, you must
337 compile and install the kernel module included with the Open vSwitch
338 distribution rather than the one on your machine. If problems persist after
339 doing this, check to make sure that the module that is loaded is the one
340 you expect.
341
342Q: Why are UDP tunnel checksums not computed for VXLAN or Geneve?
343
344 A: Generating outer UDP checksums requires kernel support that was not part
345 of the initial implementation of these protocols. If using the upstream
346 Linux Open vSwitch module, you must use kernel 4.0 or newer. The
347 out-of-tree modules from Open vSwitch release 2.4 and later support UDP
348 checksums.
349
350Q: What features are not available when using the userspace datapath?
351
352 A: Tunnel virtual ports are not supported, as described in the previous
353 answer. It is also not possible to use queue-related actions. On Linux
354 kernels before 2.6.39, maximum-sized VLAN packets may not be transmitted.
355
356Q: Should userspace or kernel be upgraded first to minimize downtime?
357
358 A. In general, the Open vSwitch userspace should be used with the kernel
359 version included in the same release or with the version from upstream
360 Linux. However, when upgrading between two releases of Open vSwitch it is
361 best to migrate userspace first to reduce the possibility of
362 incompatibilities.
363
364Q: What happened to the bridge compatibility feature?
365
366 A: Bridge compatibility was a feature of Open vSwitch 1.9 and earlier.
367 When it was enabled, Open vSwitch imitated the interface of the Linux
368 kernel "bridge" module. This allowed users to drop Open vSwitch into
369 environments designed to use the Linux kernel bridge module without
370 adapting the environment to use Open vSwitch.
371
372 Open vSwitch 1.10 and later do not support bridge compatibility. The
373 feature was dropped because version 1.10 adopted a new internal
374 architecture that made bridge compatibility difficult to maintain. Now
375 that many environments use OVS directly, it would be rarely useful in any
376 case.
377
378 To use bridge compatibility, install OVS 1.9 or earlier, including the
379 accompanying kernel modules (both the main and bridge compatibility
380 modules), following the instructions that come with the release. Be sure
381 to start the ovs-brcompatd daemon.
382
383Terminology
384-----------
385
386Q: I thought Open vSwitch was a virtual Ethernet switch, but the documentation
387keeps talking about bridges. What's a bridge?
388
389 A: In networking, the terms "bridge" and "switch" are synonyms. Open
390 vSwitch implements an Ethernet switch, which means that it is also an
391 Ethernet bridge.
392
393Q: What's a VLAN?
394
395 A: See the "VLAN" section below.
396
397Basic Configuration
398-------------------
399
400Q: How do I configure a port as an access port?
401
402 A. Add ``tag=VLAN`` to your ``ovs-vsctl add-port`` command. For example,
403 the following commands configure br0 with eth0 as a trunk port (the
404 default) and tap0 as an access port for VLAN 9:
405
406 ::
407
408 $ ovs-vsctl add-br br0
409 $ ovs-vsctl add-port br0 eth0
410 $ ovs-vsctl add-port br0 tap0 tag=9
411
412 If you want to configure an already added port as an access port, use
413 ``ovs-vsctl set``, e.g.:
414
415 ::
416
417 $ ovs-vsctl set port tap0 tag=9
418
419Q: How do I configure a port as a SPAN port, that is, enable mirroring of all
420traffic to that port?
421
422 A. The following commands configure br0 with eth0 and tap0 as trunk ports.
423 All traffic coming in or going out on eth0 or tap0 is also mirrored to
424 tap1; any traffic arriving on tap1 is dropped:
425
426 ::
427
428 $ ovs-vsctl add-br br0
429 $ ovs-vsctl add-port br0 eth0
430 $ ovs-vsctl add-port br0 tap0
431 $ ovs-vsctl add-port br0 tap1 \
432 -- --id=@p get port tap1 \
433 -- --id=@m create mirror name=m0 select-all=true output-port=@p \
434 -- set bridge br0 mirrors=@m
435
436 To later disable mirroring, run:
437
438 ::
439
440 $ ovs-vsctl clear bridge br0 mirrors
441
442Q: Does Open vSwitch support configuring a port in promiscuous mode?
443
444 A: Yes. How you configure it depends on what you mean by "promiscuous
445 mode":
446
447 - Conventionally, "promiscuous mode" is a feature of a network interface
448 card. Ordinarily, a NIC passes to the CPU only the packets actually
449 destined to its host machine. It discards the rest to avoid wasting
450 memory and CPU cycles. When promiscuous mode is enabled, however, it
451 passes every packet to the CPU. On an old-style shared-media or
452 hub-based network, this allows the host to spy on all packets on the
453 network. But in the switched networks that are almost everywhere these
454 days, promiscuous mode doesn't have much effect, because few packets not
455 destined to a host are delivered to the host's NIC.
456
457 This form of promiscuous mode is configured in the guest OS of the VMs on
458 your bridge, e.g. with "ifconfig".
459
460 - The VMware vSwitch uses a different definition of "promiscuous mode".
461 When you configure promiscuous mode on a VMware vNIC, the vSwitch sends a
462 copy of every packet received by the vSwitch to that vNIC. That has a
463 much bigger effect than just enabling promiscuous mode in a guest OS.
464 Rather than getting a few stray packets for which the switch does not yet
465 know the correct destination, the vNIC gets every packet. The effect is
466 similar to replacing the vSwitch by a virtual hub.
467
468 This "promiscuous mode" is what switches normally call "port mirroring"
469 or "SPAN". For information on how to configure SPAN, see "How do I
470 configure a port as a SPAN port, that is, enable mirroring of all traffic
471 to that port?"
472
473Q: How do I configure a DPDK port as an access port?
474
475 A: Firstly, you must have a DPDK-enabled version of Open vSwitch.
476
477 If your version is DPDK-enabled it will support the other-config:dpdk-init
478 configuration in the database and will display lines with "EAL:..." during
479 startup when other_config:dpdk-init is set to 'true'.
480
481 Secondly, when adding a DPDK port, unlike a system port, the type for the
dc76953f 482 interface must be specified. For example::
edde4861
SF
483
484 $ ovs-vsctl add-br br0
485 $ ovs-vsctl add-port br0 dpdk0 -- set Interface dpdk0 type=dpdk
486
487 Finally, it is required that DPDK port names begin with ``dpdk``.
488
489 See `INSTALL.DPDK <INSTALL.DPDK.rst>`__ for more information on enabling
490 and using DPDK with Open vSwitch.
491
492Q: How do I configure a VLAN as an RSPAN VLAN, that is, enable mirroring of all
493traffic to that VLAN?
494
495 A: The following commands configure br0 with eth0 as a trunk port and tap0
496 as an access port for VLAN 10. All traffic coming in or going out on tap0,
497 as well as traffic coming in or going out on eth0 in VLAN 10, is also
498 mirrored to VLAN 15 on eth0. The original tag for VLAN 10, in cases where
499 one is present, is dropped as part of mirroring:
500
501 ::
502
503 $ ovs-vsctl add-br br0
504 $ ovs-vsctl add-port br0 eth0
505 $ ovs-vsctl add-port br0 tap0 tag=10
506 $ ovs-vsctl \
507 -- --id=@m create mirror name=m0 select-all=true select-vlan=10 \
508 output-vlan=15 \
509 -- set bridge br0 mirrors=@m
510
511 To later disable mirroring, run:
512
513 ::
514
515 $ ovs-vsctl clear bridge br0 mirrors
516
517 Mirroring to a VLAN can disrupt a network that contains unmanaged switches.
518 See ovs-vswitchd.conf.db(5) for details. Mirroring to a GRE tunnel has
519 fewer caveats than mirroring to a VLAN and should generally be preferred.
520
521Q: Can I mirror more than one input VLAN to an RSPAN VLAN?
522
523 A: Yes, but mirroring to a VLAN strips the original VLAN tag in favor of
524 the specified output-vlan. This loss of information may make the mirrored
525 traffic too hard to interpret.
526
527 To mirror multiple VLANs, use the commands above, but specify a
528 comma-separated list of VLANs as the value for select-vlan. To mirror
529 every VLAN, use the commands above, but omit select-vlan and its value
530 entirely.
531
532 When a packet arrives on a VLAN that is used as a mirror output VLAN, the
533 mirror is disregarded. Instead, in standalone mode, OVS floods the packet
534 across all the ports for which the mirror output VLAN is configured. (If
535 an OpenFlow controller is in use, then it can override this behavior
536 through the flow table.) If OVS is used as an intermediate switch, rather
537 than an edge switch, this ensures that the RSPAN traffic is distributed
538 through the network.
539
540 Mirroring to a VLAN can disrupt a network that contains unmanaged switches.
541 See ovs-vswitchd.conf.db(5) for details. Mirroring to a GRE tunnel has
542 fewer caveats than mirroring to a VLAN and should generally be preferred.
543
544Q: How do I configure mirroring of all traffic to a GRE tunnel?
545
546 A: The following commands configure br0 with eth0 and tap0 as trunk ports.
547 All traffic coming in or going out on eth0 or tap0 is also mirrored to
548 gre0, a GRE tunnel to the remote host 192.168.1.10; any traffic arriving on
dc76953f 549 gre0 is dropped::
edde4861
SF
550
551 $ ovs-vsctl add-br br0
552 $ ovs-vsctl add-port br0 eth0
553 $ ovs-vsctl add-port br0 tap0
554 $ ovs-vsctl add-port br0 gre0 \
555 -- set interface gre0 type=gre options:remote_ip=192.168.1.10 \
556 -- --id=@p get port gre0 \
557 -- --id=@m create mirror name=m0 select-all=true output-port=@p \
558 -- set bridge br0 mirrors=@m
559
dc76953f 560 To later disable mirroring and destroy the GRE tunnel::
edde4861
SF
561
562 $ ovs-vsctl clear bridge br0 mirrors
563 $ ovs-vsctl del-port br0 gre0
564
565Q: Does Open vSwitch support ERSPAN?
566
567 A: No. As an alternative, Open vSwitch supports mirroring to a GRE tunnel
568 (see above).
569
570Q: How do I connect two bridges?
571
572 A: First, why do you want to do this? Two connected bridges are not much
573 different from a single bridge, so you might as well just have a single
574 bridge with all your ports on it.
575
576 If you still want to connect two bridges, you can use a pair of patch
577 ports. The following example creates bridges br0 and br1, adds eth0 and
578 tap0 to br0, adds tap1 to br1, and then connects br0 and br1 with a pair of
579 patch ports.
580
581 ::
582
583 $ ovs-vsctl add-br br0
584 $ ovs-vsctl add-port br0 eth0
585 $ ovs-vsctl add-port br0 tap0
586 $ ovs-vsctl add-br br1
587 $ ovs-vsctl add-port br1 tap1
588 $ ovs-vsctl \
589 -- add-port br0 patch0 \
590 -- set interface patch0 type=patch options:peer=patch1 \
591 -- add-port br1 patch1 \
592 -- set interface patch1 type=patch options:peer=patch0
593
594 Bridges connected with patch ports are much like a single bridge. For
595 instance, if the example above also added eth1 to br1, and both eth0 and
596 eth1 happened to be connected to the same next-hop switch, then you could
597 loop your network just as you would if you added eth0 and eth1 to the same
598 bridge (see the "Configuration Problems" section below for more
599 information).
600
601 If you are using Open vSwitch 1.9 or an earlier version, then you need to
602 be using the kernel module bundled with Open vSwitch rather than the one
603 that is integrated into Linux 3.3 and later, because Open vSwitch 1.9 and
604 earlier versions need kernel support for patch ports. This also means that
605 in Open vSwitch 1.9 and earlier, patch ports will not work with the
606 userspace datapath, only with the kernel module.
607
608Q: How do I configure a bridge without an OpenFlow local port? (Local port in
609the sense of OFPP_LOCAL)
610
611 A: Open vSwitch does not support such a configuration. Bridges always have
612 their local ports.
613
614Implementation Details
615----------------------
616
617Q: I hear OVS has a couple of kinds of flows. Can you tell me about them?
618
619 A: Open vSwitch uses different kinds of flows for different purposes:
620
621 - OpenFlow flows are the most important kind of flow. OpenFlow controllers
622 use these flows to define a switch's policy. OpenFlow flows support
623 wildcards, priorities, and multiple tables.
624
625 When in-band control is in use, Open vSwitch sets up a few "hidden"
626 flows, with priority higher than a controller or the user can configure,
627 that are not visible via OpenFlow. (See the "Controller" section of the
628 FAQ for more information about hidden flows.)
629
630 - The Open vSwitch software switch implementation uses a second kind of
631 flow internally. These flows, called "datapath" or "kernel" flows, do
632 not support priorities and comprise only a single table, which makes them
633 suitable for caching. (Like OpenFlow flows, datapath flows do support
634 wildcarding, in Open vSwitch 1.11 and later.) OpenFlow flows and
635 datapath flows also support different actions and number ports
636 differently.
637
638 Datapath flows are an implementation detail that is subject to change in
639 future versions of Open vSwitch. Even with the current version of Open
640 vSwitch, hardware switch implementations do not necessarily use this
641 architecture.
642
643Users and controllers directly control only the OpenFlow flow table. Open
644vSwitch manages the datapath flow table itself, so users should not normally be
645concerned with it.
646
647Q: Why are there so many different ways to dump flows?
648
649 A: Open vSwitch has two kinds of flows (see the previous question), so it
650 has commands with different purposes for dumping each kind of flow:
651
652 - ``ovs-ofctl dump-flows <br>`` dumps OpenFlow flows, excluding hidden
653 flows. This is the most commonly useful form of flow dump. (Unlike the
654 other commands, this should work with any OpenFlow switch, not just Open
655 vSwitch.)
656
657 - ``ovs-appctl bridge/dump-flows <br>`` dumps OpenFlow flows, including
658 hidden flows. This is occasionally useful for troubleshooting suspected
659 issues with in-band control.
660
661 - ``ovs-dpctl dump-flows [dp]`` dumps the datapath flow table entries for a
662 Linux kernel-based datapath. In Open vSwitch 1.10 and later,
663 ovs-vswitchd merges multiple switches into a single datapath, so it will
664 show all the flows on all your kernel-based switches. This command can
665 occasionally be useful for debugging.
666
667 - ``ovs-appctl dpif/dump-flows <br>``, new in Open vSwitch 1.10, dumps
668 datapath flows for only the specified bridge, regardless of the type.
669
670Q: How does multicast snooping works with VLANs?
671
672 A: Open vSwitch maintains snooping tables for each VLAN.
673
674Q: Can OVS populate the kernel flow table in advance instead of in reaction to
675packets?
676
677 A: No. There are several reasons:
678
679 - Kernel flows are not as sophisticated as OpenFlow flows, which means that
680 some OpenFlow policies could require a large number of kernel flows. The
681 "conjunctive match" feature is an extreme example: the number of kernel
682 flows it requires is the product of the number of flows in each
683 dimension.
684
685 - With multiple OpenFlow flow tables and simple sets of actions, the number
686 of kernel flows required can be as large as the product of the number of
687 flows in each dimension. With more sophisticated actions, the number of
688 kernel flows could be even larger.
689
690 - Open vSwitch is designed so that any version of OVS userspace
691 interoperates with any version of the OVS kernel module. This forward
692 and backward compatibility requires that userspace observe how the kernel
693 module parses received packets. This is only possible in a
694 straightforward way when userspace adds kernel flows in reaction to
695 received packets.
696
697 For more relevant information on the architecture of Open vSwitch, please
698 read "The Design and Implementation of Open vSwitch", published in USENIX
699 NSDI 2015.
700
701Performance
702-----------
703
704Q: I just upgraded and I see a performance drop. Why?
705
706 A: The OVS kernel datapath may have been updated to a newer version than
707 the OVS userspace components. Sometimes new versions of OVS kernel module
708 add functionality that is backwards compatible with older userspace
709 components but may cause a drop in performance with them. Especially, if a
710 kernel module from OVS 2.1 or newer is paired with OVS userspace 1.10 or
711 older, there will be a performance drop for TCP traffic.
712
713 Updating the OVS userspace components to the latest released version should
714 fix the performance degradation.
715
716To get the best possible performance and functionality, it is recommended to
717pair the same versions of the kernel module and OVS userspace.
718
719Configuration Problems
720----------------------
721
722Q: I created a bridge and added my Ethernet port to it, using commands like
dc76953f 723these::
edde4861
SF
724
725 ovs-vsctl add-br br0
726 ovs-vsctl add-port br0 eth0
727
728and as soon as I ran the "add-port" command I lost all connectivity through
729eth0. Help!
730
731 A: A physical Ethernet device that is part of an Open vSwitch bridge should
732 not have an IP address. If one does, then that IP address will not be
733 fully functional.
734
735 You can restore functionality by moving the IP address to an Open vSwitch
736 "internal" device, such as the network device named after the bridge
737 itself. For example, assuming that eth0's IP address is 192.168.128.5, you
dc76953f 738 could run the commands below to fix up the situation::
edde4861
SF
739
740 $ ifconfig eth0 0.0.0.0
741 $ ifconfig br0 192.168.128.5
742
743 (If your only connection to the machine running OVS is through the IP
744 address in question, then you would want to run all of these commands on a
745 single command line, or put them into a script.) If there were any
746 additional routes assigned to eth0, then you would also want to use
747 commands to adjust these routes to go through br0.
748
749 If you use DHCP to obtain an IP address, then you should kill the DHCP
750 client that was listening on the physical Ethernet interface (e.g. eth0)
751 and start one listening on the internal interface (e.g. br0). You might
752 still need to manually clear the IP address from the physical interface
753 (e.g. with "ifconfig eth0 0.0.0.0").
754
755 There is no compelling reason why Open vSwitch must work this way.
756 However, this is the way that the Linux kernel bridge module has always
757 worked, so it's a model that those accustomed to Linux bridging are already
758 used to. Also, the model that most people expect is not implementable
759 without kernel changes on all the versions of Linux that Open vSwitch
760 supports.
761
762 By the way, this issue is not specific to physical Ethernet devices. It
763 applies to all network devices except Open vSwitch "internal" devices.
764
765Q: I created a bridge and added a couple of Ethernet ports to it, using
dc76953f 766commands like these::
edde4861
SF
767
768 $ ovs-vsctl add-br br0
769 $ ovs-vsctl add-port br0 eth0
770 $ ovs-vsctl add-port br0 eth1
771
772and now my network seems to have melted: connectivity is unreliable (even
773connectivity that doesn't go through Open vSwitch), all the LEDs on my physical
774switches are blinking, wireshark shows duplicated packets, and CPU usage is
775very high.
776
777 A: More than likely, you've looped your network. Probably, eth0 and eth1
778 are connected to the same physical Ethernet switch. This yields a scenario
779 where OVS receives a broadcast packet on eth0 and sends it out on eth1,
780 then the physical switch connected to eth1 sends the packet back on eth0,
781 and so on forever. More complicated scenarios, involving a loop through
782 multiple switches, are possible too.
783
784 The solution depends on what you are trying to do:
785
786 - If you added eth0 and eth1 to get higher bandwidth or higher reliability
787 between OVS and your physical Ethernet switch, use a bond. The following
dc76953f 788 commands create br0 and then add eth0 and eth1 as a bond::
edde4861
SF
789
790 $ ovs-vsctl add-br br0
791 $ ovs-vsctl add-bond br0 bond0 eth0 eth1
792
793 Bonds have tons of configuration options. Please read the documentation
794 on the Port table in ovs-vswitchd.conf.db(5) for all the details.
795
796 Configuration for DPDK-enabled interfaces is slightly less
797 straightforward: see `INSTALL.DPDK <INSTALL.DPDK.rst>`__.
798
799 - Perhaps you don't actually need eth0 and eth1 to be on the same bridge.
800 For example, if you simply want to be able to connect each of them to
801 virtual machines, then you can put each of them on a bridge of its own:
802
803 $ ovs-vsctl add-br br0
804 $ ovs-vsctl add-port br0 eth0
805
806 $ ovs-vsctl add-br br1
807 $ ovs-vsctl add-port br1 eth1
808
809 and then connect VMs to br0 and br1. (A potential disadvantage is that
810 traffic cannot directly pass between br0 and br1. Instead, it will go
811 out eth0 and come back in eth1, or vice versa.)
812
813 - If you have a redundant or complex network topology and you want to
814 prevent loops, turn on spanning tree protocol (STP). The following
815 commands create br0, enable STP, and add eth0 and eth1 to the bridge.
816 The order is important because you don't want have to have a loop in your
dc76953f 817 network even transiently::
edde4861
SF
818
819 $ ovs-vsctl add-br br0
820 $ ovs-vsctl set bridge br0 stp_enable=true
821 $ ovs-vsctl add-port br0 eth0
822 $ ovs-vsctl add-port br0 eth1
823
824 The Open vSwitch implementation of STP is not well tested. Report any
825 bugs you observe, but if you'd rather avoid acting as a beta tester then
826 another option might be your best shot.
827
828Q: I can't seem to use Open vSwitch in a wireless network.
829
830 A: Wireless base stations generally only allow packets with the source MAC
831 address of NIC that completed the initial handshake. Therefore, without
832 MAC rewriting, only a single device can communicate over a single wireless
833 link.
834
835 This isn't specific to Open vSwitch, it's enforced by the access point, so
836 the same problems will show up with the Linux bridge or any other way to do
837 bridging.
838
839Q: I can't seem to add my PPP interface to an Open vSwitch bridge.
840
841 A: PPP most commonly carries IP packets, but Open vSwitch works only with
842 Ethernet frames. The correct way to interface PPP to an Ethernet network
843 is usually to use routing instead of switching.
844
845Q: Is there any documentation on the database tables and fields?
846
847 A: Yes. ovs-vswitchd.conf.db(5) is a comprehensive reference.
848
849Q: When I run ovs-dpctl I no longer see the bridges I created. Instead, I only
850see a datapath called "ovs-system". How can I see datapath information about a
851particular bridge?
852
853 A: In version 1.9.0, OVS switched to using a single datapath that is shared
854 by all bridges of that type. The ``ovs-appctl dpif/*`` commands provide
855 similar functionality that is scoped by the bridge.
856
857Q: I created a GRE port using ovs-vsctl so why can't I send traffic or see the
858port in the datapath?
859
860 A: On Linux kernels before 3.11, the OVS GRE module and Linux GRE module
861 cannot be loaded at the same time. It is likely that on your system the
862 Linux GRE module is already loaded and blocking OVS (to confirm, check
863 dmesg for errors regarding GRE registration). To fix this, unload all GRE
864 modules that appear in lsmod as well as the OVS kernel module. You can then
865 reload the OVS module following the directions in the `installation guide
866 <INSTALL.rst>`__, which will ensure that dependencies are satisfied.
867
868Q: Open vSwitch does not seem to obey my packet filter rules.
869
870 A: It depends on mechanisms and configurations you want to use.
871
872 You cannot usefully use typical packet filters, like iptables, on physical
873 Ethernet ports that you add to an Open vSwitch bridge. This is because
874 Open vSwitch captures packets from the interface at a layer lower below
875 where typical packet-filter implementations install their hooks. (This
876 actually applies to any interface of type "system" that you might add to an
877 Open vSwitch bridge.)
878
879 You can usefully use typical packet filters on Open vSwitch internal ports
880 as they are mostly ordinary interfaces from the point of view of packet
881 filters.
882
883 For example, suppose you create a bridge br0 and add Ethernet port eth0 to
884 it. Then you can usefully add iptables rules to affect the internal
885 interface br0, but not the physical interface eth0. (br0 is also where you
886 would add an IP address, as discussed elsewhere in the FAQ.)
887
888 For simple filtering rules, it might be possible to achieve similar results
0b1545be
HZ
889 by installing appropriate OpenFlow flows instead. The OVS conntrack
890 feature (see the "ct" action in ovs-ofctl(8)) can implement a stateful
891 firewall.
edde4861
SF
892
893 If the use of a particular packet filter setup is essential, Open vSwitch
894 might not be the best choice for you. On Linux, you might want to consider
895 using the Linux Bridge. (This is the only choice if you want to use
896 ebtables rules.) On NetBSD, you might want to consider using the bridge(4)
897 with BRIDGE_IPF option.
898
899Q: It seems that Open vSwitch does nothing when I removed a port and then
900immediately put it back. For example, consider that p1 is a port of
dc76953f 901``type=internal``::
edde4861
SF
902
903 $ ovs-vsctl del-port br0 p1 -- \
904 add-port br0 p1 -- \
905 set interface p1 type=internal
906
907 A: It's an expected behaviour.
908
909 If del-port and add-port happen in a single OVSDB transaction as your
910 example, Open vSwitch always "skips" the intermediate steps. Even if they
911 are done in multiple transactions, it's still allowed for Open vSwitch to
912 skip the intermediate steps and just implement the overall effect. In both
913 cases, your example would be turned into a no-op.
914
915 If you want to make Open vSwitch actually destroy and then re-create the
916 port for some side effects like resetting kernel setting for the
917 corresponding interface, you need to separate operations into multiple
918 OVSDB transactions and ensure that at least the first one does not have
919 ``--no-wait``. In the following example, the first ovs-vsctl will block
dc76953f 920 until Open vSwitch reloads the new configuration and removes the port::
edde4861
SF
921
922 $ ovs-vsctl del-port br0 p1
923 $ ovs-vsctl add-port br0 p1 -- \
924 set interface p1 type=internal
925
926Q: I want to add thousands of ports to an Open vSwitch bridge, but it takes too
927long (minutes or hours) to do it with ovs-vsctl. How can I do it faster?
928
929 A: If you add them one at a time with ovs-vsctl, it can take a long time to
930 add thousands of ports to an Open vSwitch bridge. This is because every
931 invocation of ovs-vsctl first reads the current configuration from OVSDB.
932 As the number of ports grows, this starts to take an appreciable amount of
933 time, and when it is repeated thousands of times the total time becomes
934 significant.
935
936 The solution is to add the ports in one invocation of ovs-vsctl (or a small
dc76953f 937 number of them). For example, using bash::
edde4861
SF
938
939 $ ovs-vsctl add-br br0
940 $ cmds=; for i in {1..5000}; do cmds+=" -- add-port br0 p$i"; done
941 $ ovs-vsctl $cmds
942
943 takes seconds, not minutes or hours, in the OVS sandbox environment.
944
945Q: I created a bridge named br0. My bridge shows up in "ovs-vsctl show", but
946"ovs-ofctl show br0" just prints "br0 is not a bridge or a socket".
947
948 A: Open vSwitch wasn't able to create the bridge. Check the ovs-vswitchd
949 log for details (Debian and Red Hat packaging for Open vSwitch put it in
950 /var/log/openvswitch/ovs-vswitchd.log).
951
952 In general, the Open vSwitch database reflects the desired configuration
953 state. ovs-vswitchd monitors the database and, when it changes,
954 reconfigures the system to reflect the new desired state. This normally
955 happens very quickly. Thus, a discrepancy between the database and the
956 actual state indicates that ovs-vswitchd could not implement the
957 configuration, and so one should check the log to find out why. (Another
958 possible cause is that ovs-vswitchd is not running. This will make
959 ovs-vsctl commands hang, if they change the configuration, unless one
960 specifies ``--no-wait``.)
961
962Q: I have a bridge br0. I added a new port vif1.0, and it shows up in
963"ovs-vsctl show", but "ovs-vsctl list port" says that it has OpenFlow port
964("ofport") -1, and "ovs-ofctl show br0" doesn't show vif1.0 at all.
965
966 A: Open vSwitch wasn't able to create the port. Check the ovs-vswitchd log
967 for details (Debian and Red Hat packaging for Open vSwitch put it in
968 /var/log/openvswitch/ovs-vswitchd.log). Please see the previous question
969 for more information.
970
971 You may want to upgrade to Open vSwitch 2.3 (or later), in which ovs-vsctl
972 will immediately report when there is an issue creating a port.
973
974Q: I created a tap device tap0, configured an IP address on it, and added it to
dc76953f 975a bridge, like this::
edde4861
SF
976
977 $ tunctl -t tap0
978 $ ifconfig tap0 192.168.0.123
979 $ ovs-vsctl add-br br0
980 $ ovs-vsctl add-port br0 tap0
981
982I expected that I could then use this IP address to contact other hosts on the
983network, but it doesn't work. Why not?
984
985 A: The short answer is that this is a misuse of a "tap" device. Use an
986 "internal" device implemented by Open vSwitch, which works differently and
987 is designed for this use. To solve this problem with an internal device,
dc76953f 988 instead run::
edde4861
SF
989
990 $ ovs-vsctl add-br br0
991 $ ovs-vsctl add-port br0 int0 -- set Interface int0 type=internal
992 $ ifconfig int0 192.168.0.123
993
994 Even more simply, you can take advantage of the internal port that every
dc76953f 995 bridge has under the name of the bridge::
edde4861
SF
996
997 $ ovs-vsctl add-br br0
998 $ ifconfig br0 192.168.0.123
999
1000 In more detail, a "tap" device is an interface between the Linux (or BSD)
1001 network stack and a user program that opens it as a socket. When the "tap"
1002 device transmits a packet, it appears in the socket opened by the userspace
1003 program. Conversely, when the userspace program writes to the "tap"
1004 socket, the kernel TCP/IP stack processes the packet as if it had been
1005 received by the "tap" device.
1006
1007 Consider the configuration above. Given this configuration, if you "ping"
1008 an IP address in the 192.168.0.x subnet, the Linux kernel routing stack
1009 will transmit an ARP on the tap0 device. Open vSwitch userspace treats
1010 "tap" devices just like any other network device; that is, it doesn't open
1011 them as "tap" sockets. That means that the ARP packet will simply get
1012 dropped.
1013
1014 You might wonder why the Open vSwitch kernel module doesn't intercept the
1015 ARP packet and bridge it. After all, Open vSwitch intercepts packets on
1016 other devices. The answer is that Open vSwitch only intercepts *received*
1017 packets, but this is a packet being transmitted. The same thing happens
1018 for all other types of network devices, except for Open vSwitch "internal"
1019 ports. If you, for example, add a physical Ethernet port to an OVS bridge,
1020 configure an IP address on a physical Ethernet port, and then issue a
1021 "ping" to an address in that subnet, the same thing happens: an ARP gets
1022 transmitted on the physical Ethernet port and Open vSwitch never sees it.
1023 (You should not do that, as documented at the beginning of this section.)
1024
1025 It can make sense to add a "tap" device to an Open vSwitch bridge, if some
1026 userspace program (other than Open vSwitch) has opened the tap socket.
1027 This is the case, for example, if the "tap" device was created by KVM (or
1028 QEMU) to simulate a virtual NIC. In such a case, when OVS bridges a packet
1029 to the "tap" device, the kernel forwards that packet to KVM in userspace,
1030 which passes it along to the VM, and in the other direction, when the VM
1031 sends a packet, KVM writes it to the "tap" socket, which causes OVS to
1032 receive it and bridge it to the other OVS ports. Please note that in such
1033 a case no IP address is configured on the "tap" device (there is normally
1034 an IP address configured in the virtual NIC inside the VM, but this is not
1035 visible to the host Linux kernel or to Open vSwitch).
1036
1037 There is one special case in which Open vSwitch does directly read and
1038 write "tap" sockets. This is an implementation detail of the Open vSwitch
1039 userspace switch, which implements its "internal" ports as Linux (or BSD)
1040 "tap" sockets. In such a userspace switch, OVS receives packets sent on
1041 the "tap" device used to implement an "internal" port by reading the
1042 associated "tap" socket, and bridges them to the rest of the switch. In
1043 the other direction, OVS transmits packets bridged to the "internal" port
1044 by writing them to the "tap" socket, causing them to be processed by the
1045 kernel TCP/IP stack as if they had been received on the "tap" device.
1046 Users should not need to be concerned with this implementation detail.
1047
1048 Open vSwitch has a network device type called "tap". This is intended only
1049 for implementing "internal" ports in the OVS userspace switch and should
1050 not be used otherwise. In particular, users should not configure KVM "tap"
1051 devices as type "tap" (use type "system", the default, instead).
1052
1053Q: I observe packet loss at the beginning of RFC2544 tests on a server running
1054few hundred container apps bridged to OVS with traffic generated by HW traffic
1055generator. How can I fix this?
1056
1057 A: This is expected behavior on virtual switches. RFC2544 tests were
1058 designed for hardware switches, which don't have caches on the fastpath
1059 that need to be heated. Traffic generators in order to prime the switch
1060 use learning phase to heat the caches before sending the actual traffic in
1061 test phase. In case of OVS the cache is flushed quickly and to accommodate
1062 the traffic generator's delay between learning and test phase, the max-idle
1063 timeout settings should be changed to 50000 ms.::
1064
1065 $ ovs-vsctl --no-wait set Open_vSwitch . other_config:max-idle=50000
1066
1067Q: How can I configure the bridge internal interface MTU? Why does Open vSwitch
1068keep changing internal ports MTU?
1069
1070 A: By default Open vSwitch overrides the internal interfaces (e.g. br0)
1071 MTU. If you have just an internal interface (e.g. br0) and a physical
1072 interface (e.g. eth0), then every change in MTU to eth0 will be reflected
1073 to br0. Any manual MTU configuration using `ip` or `ifconfig` on internal
1074 interfaces is going to be overridden by Open vSwitch to match the current
1075 bridge minimum.
1076
1077 Sometimes this behavior is not desirable, for example with tunnels. The
1078 MTU of an internal interface can be explicitly set using the following
dc76953f 1079 command::
edde4861
SF
1080
1081 $ ovs-vsctl set int br0 mtu_request=1450
1082
1083 After this, Open vSwitch will configure br0 MTU to 1450. Since this
1084 setting is in the database it will be persistent (compared to what happens
1085 with `ip` or `ifconfig`).
1086
1087 The MTU configuration can be removed to restore the default behavior
dc76953f 1088 with::
edde4861
SF
1089
1090 $ ovs-vsctl set int br0 mtu_request=[]
1091
1092 The mtu_request column can be used to configure MTU even for physical
1093 interfaces (e.g. eth0).
1094
1095QOS
1096---
1097
1098Q: Does OVS support Quality of Service (QoS)?
1099
1100 A: Yes. For traffic that egresses from a switch, OVS supports traffic
1101 shaping; for traffic that ingresses into a switch, OVS support policing.
1102 Policing is a simple form of quality-of-service that simply drops packets
1103 received in excess of the configured rate. Due to its simplicity, policing
1104 is usually less accurate and less effective than egress traffic shaping,
1105 which queues packets.
1106
1107 Keep in mind that ingress and egress are from the perspective of the
1108 switch. That means that egress shaping limits the rate at which traffic is
1109 allowed to transmit from a physical interface, but not the rate at which
1110 traffic will be received on a virtual machine's VIF. For ingress policing,
1111 the behavior is the opposite.
1112
1113Q: How do I configure egress traffic shaping?
1114
1115 A: Suppose that you want to set up bridge br0 connected to physical
1116 Ethernet port eth0 (a 1 Gbps device) and virtual machine interfaces vif1.0
1117 and vif2.0, and that you want to limit traffic from vif1.0 to eth0 to 10
1118 Mbps and from vif2.0 to eth0 to 20 Mbps. Then, you could configure the
dc76953f 1119 bridge this way::
edde4861
SF
1120
1121 $ ovs-vsctl -- \
1122 add-br br0 -- \
1123 add-port br0 eth0 -- \
1124 add-port br0 vif1.0 -- set interface vif1.0 ofport_request=5 -- \
1125 add-port br0 vif2.0 -- set interface vif2.0 ofport_request=6 -- \
1126 set port eth0 qos=@newqos -- \
1127 --id=@newqos create qos type=linux-htb \
1128 other-config:max-rate=1000000000 \
1129 queues:123=@vif10queue \
1130 queues:234=@vif20queue -- \
1131 --id=@vif10queue create queue other-config:max-rate=10000000 -- \
1132 --id=@vif20queue create queue other-config:max-rate=20000000
1133
1134 At this point, bridge br0 is configured with the ports and eth0 is
1135 configured with the queues that you need for QoS, but nothing is actually
1136 directing packets from vif1.0 or vif2.0 to the queues that we have set up
1137 for them. That means that all of the packets to eth0 are going to the
1138 "default queue", which is not what we want.
1139
1140 We use OpenFlow to direct packets from vif1.0 and vif2.0 to the queues
dc76953f 1141 reserved for them::
edde4861
SF
1142
1143 $ ovs-ofctl add-flow br0 in_port=5,actions=set_queue:123,normal
1144 $ ovs-ofctl add-flow br0 in_port=6,actions=set_queue:234,normal
1145
1146 Each of the above flows matches on the input port, sets up the appropriate
1147 queue (123 for vif1.0, 234 for vif2.0), and then executes the "normal"
1148 action, which performs the same switching that Open vSwitch would have done
1149 without any OpenFlow flows being present. (We know that vif1.0 and vif2.0
1150 have OpenFlow port numbers 5 and 6, respectively, because we set their
1151 ofport_request columns above. If we had not done that, then we would have
1152 needed to find out their port numbers before setting up these flows.)
1153
1154 Now traffic going from vif1.0 or vif2.0 to eth0 should be rate-limited.
1155
dc76953f 1156 By the way, if you delete the bridge created by the above commands, with::
edde4861
SF
1157
1158 $ ovs-vsctl del-br br0
1159
1160 then that will leave one unreferenced QoS record and two unreferenced Queue
1161 records in the Open vSwich database. One way to clear them out, assuming
dc76953f 1162 you don't have other QoS or Queue records that you want to keep, is::
edde4861
SF
1163
1164 $ ovs-vsctl -- --all destroy QoS -- --all destroy Queue
1165
1166 If you do want to keep some QoS or Queue records, or the Open vSwitch you
1167 are using is older than version 1.8 (which added the ``--all`` option),
1168 then you will have to destroy QoS and Queue records individually.
1169
1170Q: How do I configure ingress policing?
1171
1172 A: A policing policy can be configured on an interface to drop packets that
1173 arrive at a higher rate than the configured value. For example, the
1174 following commands will rate-limit traffic that vif1.0 may generate to
1175 10Mbps:
1176
1177 $ ovs-vsctl set interface vif1.0 ingress_policing_rate=10000
1178 $ ovs-vsctl set interface vif1.0 ingress_policing_burst=8000
1179
1180 Traffic policing can interact poorly with some network protocols and can
1181 have surprising results. The "Ingress Policing" section of
1182 ovs-vswitchd.conf.db(5) discusses the issues in greater detail.
1183
1184Q: I configured Quality of Service (QoS) in my OpenFlow network by adding
1185records to the QoS and Queue table, but the results aren't what I expect.
1186
1187 A: Did you install OpenFlow flows that use your queues? This is the
1188 primary way to tell Open vSwitch which queues you want to use. If you
1189 don't do this, then the default queue will be used, which will probably not
1190 have the effect you want.
1191
1192 Refer to the previous question for an example.
1193
1194Q: I'd like to take advantage of some QoS feature that Open vSwitch doesn't yet
1195support. How do I do that?
1196
1197 A: Open vSwitch does not implement QoS itself. Instead, it can configure
1198 some, but not all, of the QoS features built into the Linux kernel. If you
1199 need some QoS feature that OVS cannot configure itself, then the first step
1200 is to figure out whether Linux QoS supports that feature. If it does, then
1201 you can submit a patch to support Open vSwitch configuration for that
1202 feature, or you can use "tc" directly to configure the feature in Linux.
1203 (If Linux QoS doesn't support the feature you want, then first you have to
1204 add that support to Linux.)
1205
1206Q: I configured QoS, correctly, but my measurements show that it isn't working
1207as well as I expect.
1208
1209 A: With the Linux kernel, the Open vSwitch implementation of QoS has two
1210 aspects:
1211
1212 - Open vSwitch configures a subset of Linux kernel QoS features, according
1213 to what is in OVSDB. It is possible that this code has bugs. If you
1214 believe that this is so, then you can configure the Linux traffic control
1215 (QoS) stack directly with the "tc" program. If you get better results
1216 that way, you can send a detailed bug report to bugs@openvswitch.org.
1217
1218 It is certain that Open vSwitch cannot configure every Linux kernel QoS
1219 feature. If you need some feature that OVS cannot configure, then you
1220 can also use "tc" directly (or add that feature to OVS).
1221
1222 - The Open vSwitch implementation of OpenFlow allows flows to be directed
1223 to particular queues. This is pretty simple and unlikely to have serious
1224 bugs at this point.
1225
1226 However, most problems with QoS on Linux are not bugs in Open vSwitch at
1227 all. They tend to be either configuration errors (please see the earlier
1228 questions in this section) or issues with the traffic control (QoS) stack
1229 in Linux. The Open vSwitch developers are not experts on Linux traffic
1230 control. We suggest that, if you believe you are encountering a problem
1231 with Linux traffic control, that you consult the tc manpages (e.g. tc(8),
1232 tc-htb(8), tc-hfsc(8)), web resources (e.g. http://lartc.org/), or mailing
1233 lists (e.g. http://vger.kernel.org/vger-lists.html#netdev).
1234
1235Q: Does Open vSwitch support OpenFlow meters?
1236
1237 A: Since version 2.0, Open vSwitch has OpenFlow protocol support for
1238 OpenFlow meters. There is no implementation of meters in the Open vSwitch
1239 software switch (neither the kernel-based nor userspace switches).
1240
1241VLANs
1242-----
1243
1244Q: What's a VLAN?
1245
1246 A: At the simplest level, a VLAN (short for "virtual LAN") is a way to
1247 partition a single switch into multiple switches. Suppose, for example,
1248 that you have two groups of machines, group A and group B. You want the
1249 machines in group A to be able to talk to each other, and you want the
1250 machine in group B to be able to talk to each other, but you don't want the
1251 machines in group A to be able to talk to the machines in group B. You can
1252 do this with two switches, by plugging the machines in group A into one
1253 switch and the machines in group B into the other switch.
1254
1255 If you only have one switch, then you can use VLANs to do the same thing,
1256 by configuring the ports for machines in group A as VLAN "access ports" for
1257 one VLAN and the ports for group B as "access ports" for a different VLAN.
1258 The switch will only forward packets between ports that are assigned to the
1259 same VLAN, so this effectively subdivides your single switch into two
1260 independent switches, one for each group of machines.
1261
1262 So far we haven't said anything about VLAN headers. With access ports,
1263 like we've described so far, no VLAN header is present in the Ethernet
1264 frame. This means that the machines (or switches) connected to access
1265 ports need not be aware that VLANs are involved, just like in the case
1266 where we use two different physical switches.
1267
1268 Now suppose that you have a whole bunch of switches in your network,
1269 instead of just one, and that some machines in group A are connected
1270 directly to both switches 1 and 2. To allow these machines to talk to each
1271 other, you could add an access port for group A's VLAN to switch 1 and
1272 another to switch 2, and then connect an Ethernet cable between those
1273 ports. That works fine, but it doesn't scale well as the number of
1274 switches and the number of VLANs increases, because you use up a lot of
1275 valuable switch ports just connecting together your VLANs.
1276
1277 This is where VLAN headers come in. Instead of using one cable and two
1278 ports per VLAN to connect a pair of switches, we configure a port on each
1279 switch as a VLAN "trunk port". Packets sent and received on a trunk port
1280 carry a VLAN header that says what VLAN the packet belongs to, so that only
1281 two ports total are required to connect the switches, regardless of the
1282 number of VLANs in use. Normally, only switches (either physical or
1283 virtual) are connected to a trunk port, not individual hosts, because
1284 individual hosts don't expect to see a VLAN header in the traffic that they
1285 receive.
1286
1287 None of the above discussion says anything about particular VLAN numbers.
1288 This is because VLAN numbers are completely arbitrary. One must only
1289 ensure that a given VLAN is numbered consistently throughout a network and
1290 that different VLANs are given different numbers. (That said, VLAN 0 is
1291 usually synonymous with a packet that has no VLAN header, and VLAN 4095 is
1292 reserved.)
1293
1294Q: VLANs don't work.
1295
1296 A: Many drivers in Linux kernels before version 3.3 had VLAN-related bugs.
1297 If you are having problems with VLANs that you suspect to be driver
1298 related, then you have several options:
1299
1300 - Upgrade to Linux 3.3 or later.
1301
1302 - Build and install a fixed version of the particular driver that is
1303 causing trouble, if one is available.
1304
1305 - Use a NIC whose driver does not have VLAN problems.
1306
1307 - Use "VLAN splinters", a feature in Open vSwitch 1.4 upto 2.5 that works
1308 around bugs in kernel drivers. To enable VLAN splinters on interface
dc76953f 1309 eth0, use the command::
edde4861
SF
1310
1311 $ ovs-vsctl set interface eth0 other-config:enable-vlan-splinters=true
1312
1313 For VLAN splinters to be effective, Open vSwitch must know which VLANs
1314 are in use. See the "VLAN splinters" section in the Interface table in
1315 ovs-vswitchd.conf.db(5) for details on how Open vSwitch infers in-use
1316 VLANs.
1317
1318 VLAN splinters increase memory use and reduce performance, so use them
1319 only if needed.
1320
1321 - Apply the "vlan workaround" patch from the XenServer kernel patch queue,
1322 build Open vSwitch against this patched kernel, and then use
1323 ovs-vlan-bug-workaround(8) to enable the VLAN workaround for each
1324 interface whose driver is buggy.
1325
1326 (This is a nontrivial exercise, so this option is included only for
1327 completeness.)
1328
1329 It is not always easy to tell whether a Linux kernel driver has buggy VLAN
1330 support. The ovs-vlan-test(8) and ovs-test(8) utilities can help you test.
1331 See their manpages for details. Of the two utilities, ovs-test(8) is newer
1332 and more thorough, but ovs-vlan-test(8) may be easier to use.
1333
1334Q: VLANs still don't work. I've tested the driver so I know that it's OK.
1335
1336 A: Do you have VLANs enabled on the physical switch that OVS is attached
1337 to? Make sure that the port is configured to trunk the VLAN or VLANs that
1338 you are using with OVS.
1339
1340Q: Outgoing VLAN-tagged traffic goes through OVS to my physical switch
1341and to its destination host, but OVS seems to drop incoming return
1342traffic.
1343
1344 A: It's possible that you have the VLAN configured on your physical switch
1345 as the "native" VLAN. In this mode, the switch treats incoming packets
1346 either tagged with the native VLAN or untagged as part of the native VLAN.
1347 It may also send outgoing packets in the native VLAN without a VLAN tag.
1348
1349 If this is the case, you have two choices:
1350
1351 - Change the physical switch port configuration to tag packets it forwards
1352 to OVS with the native VLAN instead of forwarding them untagged.
1353
1354 - Change the OVS configuration for the physical port to a native VLAN mode.
1355 For example, the following sets up a bridge with port eth0 in
dc76953f 1356 "native-tagged" mode in VLAN 9::
edde4861
SF
1357
1358 $ ovs-vsctl add-br br0 $ ovs-vsctl add-port br0 eth0 tag=9
1359 vlan_mode=native-tagged
1360
1361 In this situation, "native-untagged" mode will probably work equally
1362 well. Refer to the documentation for the Port table in
1363 ovs-vswitchd.conf.db(5) for more information.
1364
dc76953f 1365Q: I added a pair of VMs on different VLANs, like this::
edde4861
SF
1366
1367 $ ovs-vsctl add-br br0
1368 $ ovs-vsctl add-port br0 eth0
1369 $ ovs-vsctl add-port br0 tap0 tag=9
1370 $ ovs-vsctl add-port br0 tap1 tag=10
1371
1372but the VMs can't access each other, the external network, or the Internet.
1373
1374 A: It is to be expected that the VMs can't access each other. VLANs are a
1375 means to partition a network. When you configured tap0 and tap1 as access
1376 ports for different VLANs, you indicated that they should be isolated from
1377 each other.
1378
1379 As for the external network and the Internet, it seems likely that the
1380 machines you are trying to access are not on VLAN 9 (or 10) and that the
1381 Internet is not available on VLAN 9 (or 10).
1382
dc76953f 1383Q: I added a pair of VMs on the same VLAN, like this::
edde4861
SF
1384
1385 $ ovs-vsctl add-br br0
1386 $ ovs-vsctl add-port br0 eth0
1387 $ ovs-vsctl add-port br0 tap0 tag=9
1388 $ ovs-vsctl add-port br0 tap1 tag=9
1389
1390The VMs can access each other, but not the external network or the Internet.
1391
1392 A: It seems likely that the machines you are trying to access in the
1393 external network are not on VLAN 9 and that the Internet is not available
1394 on VLAN 9. Also, ensure VLAN 9 is set up as an allowed trunk VLAN on the
1395 upstream switch port to which eth0 is connected.
1396
1397Q: Can I configure an IP address on a VLAN?
1398
1399 A: Yes. Use an "internal port" configured as an access port. For example,
1400 the following configures IP address 192.168.0.7 on VLAN 9. That is, OVS
1401 will forward packets from eth0 to 192.168.0.7 only if they have an 802.1Q
1402 header with VLAN 9. Conversely, traffic forwarded from 192.168.0.7 to eth0
dc76953f 1403 will be tagged with an 802.1Q header with VLAN 9::
edde4861
SF
1404
1405 $ ovs-vsctl add-br br0
1406 $ ovs-vsctl add-port br0 eth0
1407 $ ovs-vsctl add-port br0 vlan9 tag=9 \
1408 -- set interface vlan9 type=internal
1409 $ ifconfig vlan9 192.168.0.7
1410
1411 See also the following question.
1412
dc76953f 1413Q: I configured one IP address on VLAN 0 and another on VLAN 9, like this::
edde4861
SF
1414
1415 $ ovs-vsctl add-br br0
1416 $ ovs-vsctl add-port br0 eth0
1417 $ ifconfig br0 192.168.0.5
1418 $ ovs-vsctl add-port br0 vlan9 tag=9 -- set interface vlan9 type=internal
1419 $ ifconfig vlan9 192.168.0.9
1420
1421but other hosts that are only on VLAN 0 can reach the IP address configured on
1422VLAN 9. What's going on?
1423
1424 A: `RFC 1122 section 3.3.4.2 "Multihoming Requirements"
1425 <https://tools.ietf.org/html/rfc1122#section-3.3.4.2>`__ describes two
1426 approaches to IP address handling in Internet hosts:
1427
1428 - In the "Strong ES Model", where an ES is a host ("End System"), an IP
1429 address is primarily associated with a particular interface. The host
1430 discards packets that arrive on interface A if they are destined for an
1431 IP address that is configured on interface B. The host never sends
1432 packets from interface A using a source address configured on interface
1433 B.
1434
1435 - In the "Weak ES Model", an IP address is primarily associated with a
1436 host. The host accepts packets that arrive on any interface if they are
1437 destined for any of the host's IP addresses, even if the address is
1438 configured on some interface other than the one on which it arrived. The
1439 host does not restrict itself to sending packets from an IP address
1440 associated with the originating interface.
1441
1442 Linux uses the weak ES model. That means that when packets destined to the
1443 VLAN 9 IP address arrive on eth0 and are bridged to br0, the kernel IP
1444 stack accepts them there for the VLAN 9 IP address, even though they were
1445 not received on vlan9, the network device for vlan9.
1446
1447 To simulate the strong ES model on Linux, one may add iptables rule to
1448 filter packets based on source and destination address and adjust ARP
1449 configuration with sysctls.
1450
1451 BSD uses the strong ES model.
1452
1453Q: My OpenFlow controller doesn't see the VLANs that I expect.
1454
1455 A: The configuration for VLANs in the Open vSwitch database (e.g. via
1456 ovs-vsctl) only affects traffic that goes through Open vSwitch's
1457 implementation of the OpenFlow "normal switching" action. By default, when
1458 Open vSwitch isn't connected to a controller and nothing has been manually
1459 configured in the flow table, all traffic goes through the "normal
1460 switching" action. But, if you set up OpenFlow flows on your own, through
1461 a controller or using ovs-ofctl or through other means, then you have to
1462 implement VLAN handling yourself.
1463
1464 You can use "normal switching" as a component of your OpenFlow actions,
1465 e.g. by putting "normal" into the lists of actions on ovs-ofctl or by
1466 outputting to OFPP_NORMAL from an OpenFlow controller. In situations where
1467 this is not suitable, you can implement VLAN handling yourself, e.g.:
1468
1469 - If a packet comes in on an access port, and the flow table needs to send
1470 it out on a trunk port, then the flow can add the appropriate VLAN tag
1471 with the "mod_vlan_vid" action.
1472
1473 - If a packet comes in on a trunk port, and the flow table needs to send it
1474 out on an access port, then the flow can strip the VLAN tag with the
1475 "strip_vlan" action.
1476
1477Q: I configured ports on a bridge as access ports with different VLAN tags,
dc76953f 1478like this::
edde4861
SF
1479
1480 $ ovs-vsctl add-br br0
1481 $ ovs-vsctl set-controller br0 tcp:192.168.0.10:6653
1482 $ ovs-vsctl add-port br0 eth0
1483 $ ovs-vsctl add-port br0 tap0 tag=9
1484 $ ovs-vsctl add-port br0 tap1 tag=10
1485
1486but the VMs running behind tap0 and tap1 can still communicate, that is, they
1487are not isolated from each other even though they are on different VLANs.
1488
1489 A: Do you have a controller configured on br0 (as the commands above do)?
1490 If so, then this is a variant on the previous question, "My OpenFlow
1491 controller doesn't see the VLANs that I expect," and you can refer to the
1492 answer there for more information.
1493
1494Q: How MAC learning works with VLANs?
1495
1496 A: Open vSwitch implements Independent VLAN Learning (IVL) for
1497 ``OFPP_NORMAL`` action, e.g. it logically has separate learning tables for
1498 each VLANs.
1499
1500VXLANs
1501------
1502
1503Q: What's a VXLAN?
1504
1505 A: VXLAN stands for Virtual eXtensible Local Area Network, and is a means
1506 to solve the scaling challenges of VLAN networks in a multi-tenant
1507 environment. VXLAN is an overlay network which transports an L2 network
1508 over an existing L3 network. For more information on VXLAN, please see `RFC
1509 7348 <http://tools.ietf.org/html/rfc7348>`__.
1510
1511Q: How much of the VXLAN protocol does Open vSwitch currently support?
1512
1513 A: Open vSwitch currently supports the framing format for packets on the
1514 wire. There is currently no support for the multicast aspects of VXLAN. To
1515 get around the lack of multicast support, it is possible to pre-provision
1516 MAC to IP address mappings either manually or from a controller.
1517
1518Q: What destination UDP port does the VXLAN implementation in Open vSwitch
1519use?
1520
1521 A: By default, Open vSwitch will use the assigned IANA port for VXLAN,
1522 which is 4789. However, it is possible to configure the destination UDP
1523 port manually on a per-VXLAN tunnel basis. An example of this configuration
1524 is provided below.::
1525
1526 $ ovs-vsctl add-br br0
1527 $ ovs-vsctl add-port br0 vxlan1 -- set interface vxlan1 type=vxlan \
1528 options:remote_ip=192.168.1.2 options:key=flow options:dst_port=8472
1529
1530Using OpenFlow
1531--------------
1532
1533Q: What versions of OpenFlow does Open vSwitch support?
1534
1535 A: The following table lists the versions of OpenFlow supported by each
1536 version of Open vSwitch:
1537
1538 =============== ===== ===== ===== ===== ===== ===== =====
1539 Open vSwitch OF1.0 OF1.1 OF1.2 OF1.3 OF1.4 OF1.5 OF1.6
1540 =============== ===== ===== ===== ===== ===== ===== =====
1541 1.9 and earlier yes --- --- --- --- --- ---
1542 1.10 yes --- (*) (*) --- --- ---
1543 1.11 yes --- (*) (*) --- --- ---
1544 2.0 yes (*) (*) (*) --- --- ---
1545 2.1 yes (*) (*) (*) --- --- ---
1546 2.2 yes (*) (*) (*) (%) (*) ---
1547 2.3 yes yes yes yes (*) (*) ---
1548 2.4 yes yes yes yes (*) (*) ---
1549 2.5 yes yes yes yes (*) (*) (*)
1550 =============== ===== ===== ===== ===== ===== ===== =====
1551
1552 (*) Supported, with one or more missing features.
1553 (%) Experimental, unsafe implementation.
1554
1555 Open vSwitch 2.3 enables OpenFlow 1.0, 1.1, 1.2, and 1.3 by default in
1556 ovs-vswitchd. In Open vSwitch 1.10 through 2.2, OpenFlow 1.1, 1.2, and 1.3
1557 must be enabled manually in ovs-vswitchd.
1558
1559 Some versions of OpenFlow are supported with missing features and therefore
1560 not enabled by default: OpenFlow 1.4 and 1.5, in Open vSwitch 2.3 and
1561 later, as well as OpenFlow 1.6 in Open vSwitch 2.5 and later. Also, the
1562 OpenFlow 1.6 specification is still under development and thus subject to
1563 change.
1564
1565 In any case, the user may override the default:
1566
dc76953f 1567 - To enable OpenFlow 1.0, 1.1, 1.2, and 1.3 on bridge br0::
edde4861
SF
1568
1569 $ ovs-vsctl set bridge br0 \
1570 protocols=OpenFlow10,OpenFlow11,OpenFlow12,OpenFlow13
1571
dc76953f 1572 - To enable OpenFlow 1.0, 1.1, 1.2, 1.3, 1.4, and 1.5 on bridge br0::
edde4861
SF
1573
1574 $ ovs-vsctl set bridge br0 \
1575 protocols=OpenFlow10,OpenFlow11,OpenFlow12,OpenFlow13,OpenFlow14,OpenFlow15
1576
dc76953f 1577 - To enable only OpenFlow 1.0 on bridge br0::
edde4861
SF
1578
1579 $ ovs-vsctl set bridge br0 protocols=OpenFlow10
1580
1581 All current versions of ovs-ofctl enable only OpenFlow 1.0 by default. Use
1582 the -O option to enable support for later versions of OpenFlow in
dc76953f 1583 ovs-ofctl. For example::
edde4861
SF
1584
1585 $ ovs-ofctl -O OpenFlow13 dump-flows br0
1586
1587 (Open vSwitch 2.2 had an experimental implementation of OpenFlow 1.4 that
1588 could cause crashes. We don't recommend enabling it.)
1589
1590 The `OpenFlow guide <OPENFLOW.rst>`__ tracks support for OpenFlow 1.1 and
1591 later features. When support for OpenFlow 1.4 and 1.5 is solidly
1592 implemented, Open vSwitch will enable those version by default.
1593
1594Q: Does Open vSwitch support MPLS?
1595
1596 A: Before version 1.11, Open vSwitch did not support MPLS. That is, these
1597 versions can match on MPLS Ethernet types, but they cannot match, push, or
1598 pop MPLS labels, nor can they look past MPLS labels into the encapsulated
1599 packet.
1600
1601 Open vSwitch versions 1.11, 2.0, and 2.1 have very minimal support for
1602 MPLS. With the userspace datapath only, these versions can match, push, or
1603 pop a single MPLS label, but they still cannot look past MPLS labels (even
1604 after popping them) into the encapsulated packet. Kernel datapath support
1605 is unchanged from earlier versions.
1606
1607 Open vSwitch version 2.3 can match, push, or pop a single MPLS label and
1608 look past the MPLS label into the encapsulated packet. Both userspace and
1609 kernel datapaths will be supported, but MPLS processing always happens in
1610 userspace either way, so kernel datapath performance will be disappointing.
1611
1612 Open vSwitch version 2.4 can match, push, or pop up to 3 MPLS labels and
1613 look past the MPLS label into the encapsulated packet. It will have kernel
1614 support for MPLS, yielding improved performance.
1615
1616Q: I'm getting "error type 45250 code 0". What's that?
1617
1618 A: This is a Open vSwitch extension to OpenFlow error codes. Open vSwitch
1619 uses this extension when it must report an error to an OpenFlow controller
1620 but no standard OpenFlow error code is suitable.
1621
1622 Open vSwitch logs the errors that it sends to controllers, so the easiest
1623 thing to do is probably to look at the ovs-vswitchd log to find out what
1624 the error was.
1625
1626 If you want to dissect the extended error message yourself, the format is
1627 documented in include/openflow/nicira-ext.h in the Open vSwitch source
1628 distribution. The extended error codes are documented in
1629 include/openvswitch/ofp-errors.h.
1630
1631Q: Some of the traffic that I'd expect my OpenFlow controller to see doesn't
1632actually appear through the OpenFlow connection, even though I know that it's
1633going through.
1634
1635 A: By default, Open vSwitch assumes that OpenFlow controllers are connected
1636 "in-band", that is, that the controllers are actually part of the network
1637 that is being controlled. In in-band mode, Open vSwitch sets up special
1638 "hidden" flows to make sure that traffic can make it back and forth between
1639 OVS and the controllers. These hidden flows are higher priority than any
1640 flows that can be set up through OpenFlow, and they are not visible through
1641 normal OpenFlow flow table dumps.
1642
1643 Usually, the hidden flows are desirable and helpful, but occasionally they
1644 can cause unexpected behavior. You can view the full OpenFlow flow table,
dc76953f 1645 including hidden flows, on bridge br0 with the command::
edde4861
SF
1646
1647 $ ovs-appctl bridge/dump-flows br0
1648
1649 to help you debug. The hidden flows are those with priorities
1650 greater than 65535 (the maximum priority that can be set with
1651 OpenFlow).
1652
1653 The DESIGN file at the top level of the Open vSwitch source
1654 distribution describes the in-band model in detail.
1655
1656 If your controllers are not actually in-band (e.g. they are on
1657 localhost via 127.0.0.1, or on a separate network), then you should
1658 configure your controllers in "out-of-band" mode. If you have one
1659 controller on bridge br0, then you can configure out-of-band mode
dc76953f 1660 on it with::
edde4861
SF
1661
1662 $ ovs-vsctl set controller br0 connection-mode=out-of-band
1663
1664Q: Some of the OpenFlow flows that my controller sets up don't seem to apply to
1665certain traffic, especially traffic between OVS and the controller itself.
1666
1667 A: See above.
1668
1669Q: I configured all my controllers for out-of-band control mode but "ovs-appctl
1670bridge/dump-flows" still shows some hidden flows.
1671
1672 A: You probably have a remote manager configured (e.g. with "ovs-vsctl
1673 set-manager"). By default, Open vSwitch assumes that managers need in-band
1674 rules set up on every bridge. You can disable these rules on bridge br0
dc76953f 1675 with::
edde4861
SF
1676
1677 $ ovs-vsctl set bridge br0 other-config:disable-in-band=true
1678
1679 This actually disables in-band control entirely for the bridge, as if all
1680 the bridge's controllers were configured for out-of-band control.
1681
1682Q: My OpenFlow controller doesn't see the VLANs that I expect.
1683
1684 A: See answer under "VLANs", above.
1685
1686Q: I ran ``ovs-ofctl add-flow br0 nw_dst=192.168.0.1,actions=drop`` but I got a
dc76953f 1687funny message like this::
edde4861
SF
1688
1689 ofp_util|INFO|normalization changed ofp_match, details:
1690 ofp_util|INFO| pre: nw_dst=192.168.0.1
1691 ofp_util|INFO|post:
1692
1693and when I ran ``ovs-ofctl dump-flows br0`` I saw that my nw_dst match had
1694disappeared, so that the flow ends up matching every packet.
1695
1696 A: The term "normalization" in the log message means that a flow cannot
1697 match on an L3 field without saying what L3 protocol is in use. The
1698 "ovs-ofctl" command above didn't specify an L3 protocol, so the L3 field
1699 match was dropped.
1700
1701 In this case, the L3 protocol could be IP or ARP. A correct command for
dc76953f 1702 each possibility is, respectively::
edde4861
SF
1703
1704 $ ovs-ofctl add-flow br0 ip,nw_dst=192.168.0.1,actions=drop
1705
dc76953f 1706 and::
edde4861
SF
1707
1708 $ ovs-ofctl add-flow br0 arp,nw_dst=192.168.0.1,actions=drop
1709
1710 Similarly, a flow cannot match on an L4 field without saying what L4
1711 protocol is in use. For example, the flow match ``tp_src=1234`` is, by
1712 itself, meaningless and will be ignored. Instead, to match TCP source port
1713 1234, write ``tcp,tp_src=1234``, or to match UDP source port 1234, write
1714 ``udp,tp_src=1234``.
1715
1716Q: How can I figure out the OpenFlow port number for a given port?
1717
1718 A: The ``OFPT_FEATURES_REQUEST`` message requests an OpenFlow switch to
1719 respond with an ``OFPT_FEATURES_REPLY`` that, among other information,
1720 includes a mapping between OpenFlow port names and numbers. From a command
1721 prompt, ``ovs-ofctl show br0`` makes such a request and prints the response
1722 for switch br0.
1723
1724 The Interface table in the Open vSwitch database also maps OpenFlow port
1725 names to numbers. To print the OpenFlow port number associated with
dc76953f 1726 interface eth0, run::
edde4861
SF
1727
1728 $ ovs-vsctl get Interface eth0 ofport
1729
dc76953f 1730 You can print the entire mapping with::
edde4861
SF
1731
1732 $ ovs-vsctl -- --columns=name,ofport list Interface
1733
1734 but the output mixes together interfaces from all bridges in the database,
1735 so it may be confusing if more than one bridge exists.
1736
1737 In the Open vSwitch database, ofport value ``-1`` means that the interface
1738 could not be created due to an error. (The Open vSwitch log should
1739 indicate the reason.) ofport value ``[]`` (the empty set) means that the
1740 interface hasn't been created yet. The latter is normally an intermittent
1741 condition (unless ovs-vswitchd is not running).
1742
1743Q: I added some flows with my controller or with ovs-ofctl, but when I run
1744"ovs-dpctl dump-flows" I don't see them.
1745
1746 A: ovs-dpctl queries a kernel datapath, not an OpenFlow switch. It won't
1747 display the information that you want. You want to use ``ovs-ofctl
1748 dump-flows`` instead.
1749
1750Q: It looks like each of the interfaces in my bonded port shows up as an
1751individual OpenFlow port. Is that right?
1752
1753 A: Yes, Open vSwitch makes individual bond interfaces visible as OpenFlow
1754 ports, rather than the bond as a whole. The interfaces are treated
1755 together as a bond for only a few purposes:
1756
1757 - Sending a packet to the OFPP_NORMAL port. (When an OpenFlow controller
1758 is not configured, this happens implicitly to every packet.)
1759
1760 - Mirrors configured for output to a bonded port.
1761
1762 It would make a lot of sense for Open vSwitch to present a bond as a single
1763 OpenFlow port. If you want to contribute an implementation of such a
1764 feature, please bring it up on the Open vSwitch development mailing list at
1765 dev@openvswitch.org.
1766
1767Q: I have a sophisticated network setup involving Open vSwitch, VMs or multiple
1768hosts, and other components. The behavior isn't what I expect. Help!
1769
1770 A: To debug network behavior problems, trace the path of a packet,
1771 hop-by-hop, from its origin in one host to a remote host. If that's
1772 correct, then trace the path of the response packet back to the origin.
1773
1774 The open source tool called ``plotnetcfg`` can help to understand the
1775 relationship between the networking devices on a single host.
1776
1777 Usually a simple ICMP echo request and reply (``ping``) packet is good
1778 enough. Start by initiating an ongoing ``ping`` from the origin host to a
1779 remote host. If you are tracking down a connectivity problem, the "ping"
1780 will not display any successful output, but packets are still being sent.
1781 (In this case the packets being sent are likely ARP rather than ICMP.)
1782
1783 Tools available for tracing include the following:
1784
1785 - ``tcpdump`` and ``wireshark`` for observing hops across network devices,
1786 such as Open vSwitch internal devices and physical wires.
1787
1788 - ``ovs-appctl dpif/dump-flows <br>`` in Open vSwitch 1.10 and later or
1789 ``ovs-dpctl dump-flows <br>`` in earlier versions. These tools allow one
1790 to observe the actions being taken on packets in ongoing flows.
1791
1792 See ovs-vswitchd(8) for ``ovs-appctl dpif/dump-flows`` documentation,
1793 ovs-dpctl(8) for ``ovs-dpctl dump-flows`` documentation, and "Why are
1794 there so many different ways to dump flows?" above for some background.
1795
1796 - ``ovs-appctl ofproto/trace`` to observe the logic behind how ovs-vswitchd
1797 treats packets. See ovs-vswitchd(8) for documentation. You can out more
1798 details about a given flow that ``ovs-dpctl dump-flows`` displays, by
1799 cutting and pasting a flow from the output into an ``ovs-appctl
1800 ofproto/trace`` command.
1801
1802 - SPAN, RSPAN, and ERSPAN features of physical switches, to observe what
1803 goes on at these physical hops.
1804
1805 Starting at the origin of a given packet, observe the packet at each hop in
1806 turn. For example, in one plausible scenario, you might:
1807
1808 1. ``tcpdump`` the ``eth`` interface through which an ARP egresses a VM,
1809 from inside the VM.
1810
1811 2. ``tcpdump`` the ``vif`` or ``tap`` interface through which the ARP
1812 ingresses the host machine.
1813
1814 3. Use ``ovs-dpctl dump-flows`` to spot the ARP flow and observe the host
1815 interface through which the ARP egresses the physical machine. You may
1816 need to use ``ovs-dpctl show`` to interpret the port numbers. If the
1817 output seems surprising, you can use ``ovs-appctl ofproto/trace`` to
1818 observe details of how ovs-vswitchd determined the actions in the
1819 ``ovs-dpctl dump-flows`` output.
1820
1821 4. ``tcpdump`` the ``eth`` interface through which the ARP egresses the
1822 physical machine.
1823
1824 5. ``tcpdump`` the ``eth`` interface through which the ARP ingresses the
1825 physical machine, at the remote host that receives the ARP.
1826
1827 6. Use ``ovs-dpctl dump-flows`` to spot the ARP flow on the remote host
1828 remote host that receives the ARP and observe the VM ``vif`` or ``tap``
1829 interface to which the flow is directed. Again, ``ovs-dpctl show`` and
1830 ``ovs-appctl ofproto/trace`` might help.
1831
1832 7. ``tcpdump`` the ``vif`` or ``tap`` interface to which the ARP is
1833 directed.
1834
1835 8. ``tcpdump`` the ``eth`` interface through which the ARP ingresses a VM,
1836 from inside the VM.
1837
1838 It is likely that during one of these steps you will figure out the
1839 problem. If not, then follow the ARP reply back to the origin, in reverse.
1840
1841Q: How do I make a flow drop packets?
1842
1843 A: To drop a packet is to receive it without forwarding it. OpenFlow
1844 explicitly specifies forwarding actions. Thus, a flow with an empty set of
1845 actions does not forward packets anywhere, causing them to be dropped. You
1846 can specify an empty set of actions with ``actions=`` on the ovs-ofctl
dc76953f 1847 command line. For example::
edde4861
SF
1848
1849 $ ovs-ofctl add-flow br0 priority=65535,actions=
1850
1851 would cause every packet entering switch br0 to be dropped.
1852
1853 You can write "drop" explicitly if you like. The effect is the same.
1854 Thus, the following command also causes every packet entering switch br0 to
dc76953f 1855 be dropped::
edde4861
SF
1856
1857 $ ovs-ofctl add-flow br0 priority=65535,actions=drop
1858
1859 ``drop`` is not an action, either in OpenFlow or Open vSwitch. Rather, it
1860 is only a way to say that there are no actions.
1861
dc76953f 1862Q: I added a flow to send packets out the ingress port, like this::
edde4861
SF
1863
1864 $ ovs-ofctl add-flow br0 in_port=2,actions=2
1865
1866but OVS drops the packets instead.
1867
1868 A: Yes, OpenFlow requires a switch to ignore attempts to send a packet out
1869 its ingress port. The rationale is that dropping these packets makes it
1870 harder to loop the network. Sometimes this behavior can even be
1871 convenient, e.g. it is often the desired behavior in a flow that forwards a
1872 packet to several ports ("floods" the packet).
1873
1874 Sometimes one really needs to send a packet out its ingress port
1875 ("hairpin"). In this case, output to ``OFPP_IN_PORT``, which in ovs-ofctl
dc76953f 1876 syntax is expressed as just ``in_port``, e.g.::
edde4861
SF
1877
1878 $ ovs-ofctl add-flow br0 in_port=2,actions=in_port
1879
1880 This also works in some circumstances where the flow doesn't match on the
1881 input port. For example, if you know that your switch has five ports
1882 numbered 2 through 6, then the following will send every received packet
dc76953f 1883 out every port, even its ingress port::
edde4861
SF
1884
1885 $ ovs-ofctl add-flow br0 actions=2,3,4,5,6,in_port
1886
dc76953f 1887 or, equivalently::
edde4861
SF
1888
1889 $ ovs-ofctl add-flow br0 actions=all,in_port
1890
1891 Sometimes, in complicated flow tables with multiple levels of ``resubmit``
1892 actions, a flow needs to output to a particular port that may or may not be
1893 the ingress port. It's difficult to take advantage of ``OFPP_IN_PORT`` in
1894 this situation. To help, Open vSwitch provides, as an OpenFlow extension,
1895 the ability to modify the in_port field. Whatever value is currently in
1896 the in_port field is the port to which outputs will be dropped, as well as
1897 the destination for ``OFPP_IN_PORT``. This means that the following will
dc76953f 1898 reliably output to port 2 or to ports 2 through 6, respectively::
edde4861
SF
1899
1900 $ ovs-ofctl add-flow br0 in_port=2,actions=load:0->NXM_OF_IN_PORT[],2
1901 $ ovs-ofctl add-flow br0 actions=load:0->NXM_OF_IN_PORT[],2,3,4,5,6
1902
1903 If the input port is important, then one may save and restore it on the
1904 stack:
1905
1906 $ ovs-ofctl add-flow br0 actions=push:NXM_OF_IN_PORT[],\
1907 load:0->NXM_OF_IN_PORT[],\
1908 2,3,4,5,6,\
1909 pop:NXM_OF_IN_PORT[]
1910
1911Q: My bridge br0 has host 192.168.0.1 on port 1 and host 192.168.0.2 on port 2.
1912I set up flows to forward only traffic destined to the other host and drop
dc76953f 1913other traffic, like this::
edde4861
SF
1914
1915 priority=5,in_port=1,ip,nw_dst=192.168.0.2,actions=2
1916 priority=5,in_port=2,ip,nw_dst=192.168.0.1,actions=1
1917 priority=0,actions=drop
1918
1919But it doesn't work--I don't get any connectivity when I do this. Why?
1920
1921 A: These flows drop the ARP packets that IP hosts use to establish IP
1922 connectivity over Ethernet. To solve the problem, add flows to allow ARP
dc76953f 1923 to pass between the hosts::
edde4861
SF
1924
1925 priority=5,in_port=1,arp,actions=2
1926 priority=5,in_port=2,arp,actions=1
1927
1928 This issue can manifest other ways, too. The following flows that match on
1929 Ethernet addresses instead of IP addresses will also drop ARP packets,
1930 because ARP requests are broadcast instead of being directed to a specific
dc76953f 1931 host::
edde4861
SF
1932
1933 priority=5,in_port=1,dl_dst=54:00:00:00:00:02,actions=2
1934 priority=5,in_port=2,dl_dst=54:00:00:00:00:01,actions=1
1935 priority=0,actions=drop
1936
1937 The solution already described above will also work in this case. It may
dc76953f 1938 be better to add flows to allow all multicast and broadcast traffic::
edde4861
SF
1939
1940 priority=5,in_port=1,dl_dst=01:00:00:00:00:00/01:00:00:00:00:00,actions=2
1941 priority=5,in_port=2,dl_dst=01:00:00:00:00:00/01:00:00:00:00:00,actions=1
1942
1943Q: My bridge disconnects from my controller on add-port/del-port.
1944
1945 A: Reconfiguring your bridge can change your bridge's datapath-id because
1946 Open vSwitch generates datapath-id from the MAC address of one of its
1947 ports. In that case, Open vSwitch disconnects from controllers because
1948 there's no graceful way to notify controllers about the change of
1949 datapath-id.
1950
1951 To avoid the behaviour, you can configure datapath-id manually.::
1952
1953 $ ovs-vsctl set bridge br0 other-config:datapath-id=0123456789abcdef
1954
1955Q: My controller complains that OVS is not buffering packets.
1956What's going on?
1957
1958 A: "Packet buffering" is an optional OpenFlow feature, and controllers
1959 should detect how many "buffers" an OpenFlow switch implements. It was
1960 recently noticed that OVS implementation of the buffering feature was not
1961 compliant to OpenFlow specifications. Rather than fix it and risk
1962 controller incompatibility, the buffering feature is removed as of OVS 2.7.
1963 Controllers are already expected to work properly in cases where the switch
1964 can not buffer packets, but sends full packets in "packet-in" messages
1965 instead, so this change should not affect existing users. After the change
1966 OVS always sends the ``buffer_id`` as ``0xffffffff`` in "packet-in"
1967 messages and will send an error response if any other value of this field
1968 is included in a "packet-out" or a "flow mod" sent by a controller.
1969
1970Q: How does OVS divide flows among buckets in an OpenFlow "select" group?
1971
1972 A: In Open vSwitch 2.3 and earlier, Open vSwitch used the destination
1973 Ethernet address to choose a bucket in a select group.
1974
1975 Open vSwitch 2.4 and later by default hashes the source and destination
1976 Ethernet address, VLAN ID, Ethernet type, IPv4/v6 source and destination
1977 address and protocol, and for TCP and SCTP only, the source and destination
1978 ports. The hash is "symmetric", meaning that exchanging source and
1979 destination addresses does not change the bucket selection.
1980
1981 Select groups in Open vSwitch 2.4 and later can be configured to use a
1982 different hash function, using a Netronome extension to the OpenFlow 1.5+
1983 group_mod message. For more information, see
1984 Documentation/group-selection-method-property.txt in the Open vSwitch
1985 source tree. (OpenFlow 1.5 support in Open vSwitch is still experimental.)
1986
1987Q: I added a flow to accept packets on VLAN 123 and output them on VLAN 456,
dc76953f 1988like so::
edde4861
SF
1989
1990 $ ovs-ofctl add-flow br0 dl_vlan=123,actions=output:1,mod_vlan_vid:456
1991
1992but the packets are actually being output in VLAN 123. Why?
1993
1994 A: OpenFlow actions are executed in the order specified. Thus, the actions
1995 above first output the packet, then change its VLAN. Since the output
1996 occurs before changing the VLAN, the change in VLAN will have no visible
1997 effect.
1998
1999 To solve this and similar problems, order actions so that changes to
dc76953f 2000 headers happen before output, e.g.::
edde4861
SF
2001
2002 $ ovs-ofctl add-flow br0 dl_vlan=123,actions=mod_vlan_vid:456,output:1
2003
2004Q: The "learn" action can't learn the action I want, can you improve it?
2005
2006 A: By itself, the "learn" action can only put two kinds of actions into the
2007 flows that it creates: "load" and "output" actions. If "learn" is used in
2008 isolation, these are severe limits.
2009
2010 However, "learn" is not meant to be used in isolation. It is a primitive
2011 meant to be used together with other Open vSwitch features to accomplish a
2012 task. Its existing features are enough to accomplish most tasks.
2013
2014 Here is an outline of a typical pipeline structure that allows for
2015 versatile behavior using "learn":
2016
2017 - Flows in table A contain a "learn" action, that populates flows in table
2018 L, that use a "load" action to populate register R with information about
2019 what was learned.
2020
2021 - Flows in table B contain two sequential resubmit actions: one to table L
2022 and another one to table B+1.
2023
2024 - Flows in table B+1 match on register R and act differently depending on
2025 what the flows in table L loaded into it.
2026
2027 This approach can be used to implement many "learn"-based features. For
2028 example:
2029
2030 - Resubmit to a table selected based on learned information, e.g. see:
2031 http://openvswitch.org/pipermail/discuss/2016-June/021694.html
2032
2033 - MAC learning in the middle of a pipeline, as described in `the tutorial
c27b7536 2034 <tutorial/tutorial.rst>`__.
edde4861
SF
2035
2036 - TCP state based firewalling, by learning outgoing connections based on
2037 SYN packets and matching them up with incoming packets.
2038
2039 - At least some of the features described in T. A. Hoff, "Extending Open
2040 vSwitch to Facilitate Creation of Stateful SDN Applications".
2041
2042Development
2043-----------
2044
2045Q: How do I implement a new OpenFlow message?
2046
2047 A: Add your new message to ``enum ofpraw`` and ``enum ofptype`` in
2048 ``lib/ofp-msgs.h``, following the existing pattern. Then recompile and fix
2049 all of the new warnings, implementing new functionality for the new message
2050 as needed. (If you configure with ``--enable-Werror``, as described in the
2051 `installation guide <INSTALL.rst>`__, then it is impossible to miss any
2052 warnings.)
2053
2054 If you need to add an OpenFlow vendor extension message for a vendor that
2055 doesn't yet have any extension messages, then you will also need to edit
2056 ``build-aux/extract-ofp-msgs``.
2057
2058Q: How do I add support for a new field or header?
2059
2060 A: Add new members for your field to ``struct flow`` in ``lib/flow.h``, and
2061 add new enumerations for your new field to ``enum mf_field_id`` in
2062 ``lib/meta-flow.h``, following the existing pattern. Also, add support to
2063 ``miniflow_extract()`` in ``lib/flow.c`` for extracting your new field from
2064 a packet into struct miniflow, and to ``nx_put_raw()`` in
2065 ``lib/nx-match.c`` to output your new field in OXM matches. Then recompile
2066 and fix all of the new warnings, implementing new functionality for the new
2067 field or header as needed. (If you configure with ``--enable-Werror``, as
2068 described in the `installation guide <INSTALL.rst>`__, then it is impossible
2069 to miss any warnings.)
2070
2071 If you want kernel datapath support for your new field, you also need to
2072 modify the kernel module for the operating systems you are interested in.
2073 This isn't mandatory, since fields understood only by userspace work too
2074 (with a performance penalty), so it's reasonable to start development
2075 without it. If you implement kernel module support for Linux, then the
2076 Linux kernel "netdev" mailing list is the place to submit that support
2077 first; please read up on the Linux kernel development process separately.
2078 The Windows datapath kernel module support, on the other hand, is
2079 maintained within the OVS tree, so patches for that can go directly to
2080 ovs-dev.
2081
2082Q: How do I add support for a new OpenFlow action?
2083
2084 A: Add your new action to ``enum ofp_raw_action_type`` in
2085 ``lib/ofp-actions.c``, following the existing pattern. Then recompile and
2086 fix all of the new warnings, implementing new functionality for the new
2087 action as needed. (If you configure with ``--enable-Werror``, as described
2088 in the `installation guide <INSTALL.rst>`__, then it is impossible to miss
2089 any warnings.)
2090
2091 If you need to add an OpenFlow vendor extension action for a vendor that
2092 doesn't yet have any extension actions, then you will also need to edit
2093 ``build-aux/extract-ofp-actions``.