]> git.proxmox.com Git - ovs.git/blob - vswitchd/vswitch.xml
ovsschema: Add protected column to Port table
[ovs.git] / vswitchd / vswitch.xml
1 <?xml version="1.0" encoding="utf-8"?>
2 <database name="ovs-vswitchd.conf.db" title="Open vSwitch Configuration Database">
3 <p>
4 A database with this schema holds the configuration for one Open
5 vSwitch daemon. The top-level configuration for the daemon is the
6 <ref table="Open_vSwitch"/> table, which must have exactly one
7 record. Records in other tables are significant only when they
8 can be reached directly or indirectly from the <ref
9 table="Open_vSwitch"/> table. Records that are not reachable from
10 the <ref table="Open_vSwitch"/> table are automatically deleted
11 from the database, except for records in a few distinguished
12 ``root set'' tables.
13 </p>
14
15 <h2>Common Columns</h2>
16
17 <p>
18 Most tables contain two special columns, named <code>other_config</code>
19 and <code>external_ids</code>. These columns have the same form and
20 purpose each place that they appear, so we describe them here to save space
21 later.
22 </p>
23
24 <dl>
25 <dt><code>other_config</code>: map of string-string pairs</dt>
26 <dd>
27 <p>
28 Key-value pairs for configuring rarely used features. Supported keys,
29 along with the forms taken by their values, are documented individually
30 for each table.
31 </p>
32 <p>
33 A few tables do not have <code>other_config</code> columns because no
34 key-value pairs have yet been defined for them.
35 </p>
36 </dd>
37
38 <dt><code>external_ids</code>: map of string-string pairs</dt>
39 <dd>
40 Key-value pairs for use by external frameworks that integrate with Open
41 vSwitch, rather than by Open vSwitch itself. System integrators should
42 either use the Open vSwitch development mailing list to coordinate on
43 common key-value definitions, or choose key names that are likely to be
44 unique. In some cases, where key-value pairs have been defined that are
45 likely to be widely useful, they are documented individually for each
46 table.
47 </dd>
48 </dl>
49
50 <table name="Open_vSwitch" title="Open vSwitch configuration.">
51 Configuration for an Open vSwitch daemon. There must be exactly
52 one record in the <ref table="Open_vSwitch"/> table.
53
54 <group title="Configuration">
55 <column name="bridges">
56 Set of bridges managed by the daemon.
57 </column>
58
59 <column name="ssl">
60 SSL used globally by the daemon.
61 </column>
62
63 <column name="external_ids" key="system-id">
64 A unique identifier for the Open vSwitch's physical host.
65 The form of the identifier depends on the type of the host.
66 On a Citrix XenServer, this will likely be the same as
67 <ref column="external_ids" key="xs-system-uuid"/>.
68 </column>
69
70 <column name="external_ids" key="xs-system-uuid">
71 The Citrix XenServer universally unique identifier for the physical
72 host as displayed by <code>xe host-list</code>.
73 </column>
74
75 <column name="external_ids" key="hostname">
76 The hostname for the host running Open vSwitch.
77 </column>
78
79 <column name="other_config" key="stats-update-interval"
80 type='{"type": "integer", "minInteger": 5000}'>
81 <p>
82 Interval for updating statistics to the database, in milliseconds.
83 This option will affect the update of the <code>statistics</code>
84 column in the following tables: <code>Port</code>, <code>Interface
85 </code>, <code>Mirror</code>.
86 </p>
87 <p>
88 Default value is 5000 ms.
89 </p>
90 <p>
91 Getting statistics more frequently can be achieved via OpenFlow.
92 </p>
93 </column>
94
95 <column name="other_config" key="flow-restore-wait"
96 type='{"type": "boolean"}'>
97 <p>
98 When <code>ovs-vswitchd</code> starts up, it has an empty flow table
99 and therefore it handles all arriving packets in its default fashion
100 according to its configuration, by dropping them or sending them to
101 an OpenFlow controller or switching them as a standalone switch.
102 This behavior is ordinarily desirable. However, if
103 <code>ovs-vswitchd</code> is restarting as part of a ``hot-upgrade,''
104 then this leads to a relatively long period during which packets are
105 mishandled.
106 </p>
107 <p>
108 This option allows for improvement. When <code>ovs-vswitchd</code>
109 starts with this value set as <code>true</code>, it will neither
110 flush or expire previously set datapath flows nor will it send and
111 receive any packets to or from the datapath. When this value is
112 later set to <code>false</code>, <code>ovs-vswitchd</code> will
113 start receiving packets from the datapath and re-setup the flows.
114 </p>
115 <p>
116 Thus, with this option, the procedure for a hot-upgrade of
117 <code>ovs-vswitchd</code> becomes roughly the following:
118 </p>
119 <ol>
120 <li>
121 Stop <code>ovs-vswitchd</code>.
122 </li>
123 <li>
124 Set <ref column="other_config" key="flow-restore-wait"/>
125 to <code>true</code>.
126 </li>
127 <li>
128 Start <code>ovs-vswitchd</code>.
129 </li>
130 <li>
131 Use <code>ovs-ofctl</code> (or some other program, such as an
132 OpenFlow controller) to restore the OpenFlow flow table
133 to the desired state.
134 </li>
135 <li>
136 Set <ref column="other_config" key="flow-restore-wait"/>
137 to <code>false</code> (or remove it entirely from the database).
138 </li>
139 </ol>
140 <p>
141 The <code>ovs-ctl</code>'s ``restart'' and ``force-reload-kmod''
142 functions use the above config option during hot upgrades.
143 </p>
144 </column>
145
146 <column name="other_config" key="flow-limit"
147 type='{"type": "integer", "minInteger": 0}'>
148 <p>
149 The maximum
150 number of flows allowed in the datapath flow table. Internally OVS
151 will choose a flow limit which will likely be lower than this number,
152 based on real time network conditions. Tweaking this value is
153 discouraged unless you know exactly what you're doing.
154 </p>
155 <p>
156 The default is 200000.
157 </p>
158 </column>
159
160 <column name="other_config" key="max-idle"
161 type='{"type": "integer", "minInteger": 500}'>
162 <p>
163 The maximum time (in ms) that idle flows will remain cached in the
164 datapath. Internally OVS will check the validity and activity for
165 datapath flows regularly and may expire flows quicker than this
166 number, based on real time network conditions. Tweaking this
167 value is discouraged unless you know exactly what you're doing.
168 </p>
169 <p>
170 The default is 10000.
171 </p>
172 </column>
173
174 <column name="other_config" key="dpdk-init"
175 type='{"type": "boolean"}'>
176 <p>
177 Set this value to <code>true</code> to enable runtime support for
178 DPDK ports. The vswitch must have compile-time support for DPDK as
179 well.
180 </p>
181 <p>
182 The default value is <code>false</code>. Changing this value requires
183 restarting the daemon
184 </p>
185 <p>
186 If this value is <code>false</code> at startup, any dpdk ports which
187 are configured in the bridge will fail due to memory errors.
188 </p>
189 </column>
190
191 <column name="other_config" key="dpdk-lcore-mask"
192 type='{"type": "integer", "minInteger": 1}'>
193 <p>
194 Specifies the CPU cores where dpdk lcore threads should be spawned.
195 The DPDK lcore threads are used for DPDK library tasks, such as
196 library internal message processing, logging, etc. Value should be in
197 the form of a hex string (so '0x123') similar to the 'taskset' mask
198 input.
199 </p>
200 <p>
201 The lowest order bit corresponds to the first CPU core. A set bit
202 means the corresponding core is available and an lcore thread will be
203 created and pinned to it. If the input does not cover all cores,
204 those uncovered cores are considered not set.
205 </p>
206 <p>
207 For performance reasons, it is best to set this to a single core on
208 the system, rather than allow lcore threads to float.
209 </p>
210 <p>
211 If not specified, the value will be determined by choosing the lowest
212 CPU core from initial cpu affinity list. Otherwise, the value will be
213 passed directly to the DPDK library.
214 </p>
215 </column>
216
217 <column name="other_config" key="pmd-cpu-mask">
218 <p>
219 Specifies CPU mask for setting the cpu affinity of PMD (Poll
220 Mode Driver) threads. Value should be in the form of hex string,
221 similar to the dpdk EAL '-c COREMASK' option input or the 'taskset'
222 mask input.
223 </p>
224 <p>
225 The lowest order bit corresponds to the first CPU core. A set bit
226 means the corresponding core is available and a pmd thread will be
227 created and pinned to it. If the input does not cover all cores,
228 those uncovered cores are considered not set.
229 </p>
230 <p>
231 If not specified, one pmd thread will be created for each numa node
232 and pinned to any available core on the numa node by default.
233 </p>
234 </column>
235
236 <column name="other_config" key="dpdk-alloc-mem"
237 type='{"type": "integer", "minInteger": 0}'>
238 <p>
239 Specifies the amount of memory to preallocate from the hugepage pool,
240 regardless of socket. It is recommended that dpdk-socket-mem is used
241 instead.
242 </p>
243 <p>
244 If not specified, the value is 0. Changing this value requires
245 restarting the daemon.
246 </p>
247 </column>
248
249 <column name="other_config" key="dpdk-socket-mem"
250 type='{"type": "string"}'>
251 <p>
252 Specifies the amount of memory to preallocate from the hugepage pool,
253 on a per-socket basis.
254 </p>
255 <p>
256 The specifier is a comma-separated string, in ascending order of CPU
257 socket (ex: 1024,2048,4096,8192 would set socket 0 to preallocate
258 1024MB, socket 1 to preallocate 2048MB, etc.)
259 </p>
260 <p>
261 If not specified, the default value is 1024,0. Changing this value
262 requires restarting the daemon.
263 </p>
264 </column>
265
266 <column name="other_config" key="dpdk-hugepage-dir"
267 type='{"type": "string"}'>
268 <p>
269 Specifies the path to the hugetlbfs mount point.
270 </p>
271 <p>
272 If not specified, this will be guessed by the DPDK library (default
273 is /dev/hugepages). Changing this value requires restarting the
274 daemon.
275 </p>
276 </column>
277
278 <column name="other_config" key="dpdk-extra"
279 type='{"type": "string"}'>
280 <p>
281 Specifies additional eal command line arguments for DPDK.
282 </p>
283 <p>
284 The default is empty. Changing this value requires restarting the
285 daemon
286 </p>
287 </column>
288
289 <column name="other_config" key="vhost-sock-dir"
290 type='{"type": "string"}'>
291 <p>
292 Specifies the path to the vhost-user unix domain socket files. This
293 path must exist and be a subdirectory tree of the Open vSwitch
294 run directory.
295 </p>
296 <p>
297 Defaults to the working directory of the application. Changing this
298 value requires restarting the daemon.
299 </p>
300 </column>
301
302 <column name="other_config" key="n-handler-threads"
303 type='{"type": "integer", "minInteger": 1}'>
304 <p>
305 Specifies the number of threads for software datapaths to use for
306 handling new flows. The default the number of online CPU cores minus
307 the number of revalidators.
308 </p>
309 <p>
310 This configuration is per datapath. If you have more than one
311 software datapath (e.g. some <code>system</code> bridges and some
312 <code>netdev</code> bridges), then the total number of threads is
313 <code>n-handler-threads</code> times the number of software
314 datapaths.
315 </p>
316 </column>
317
318 <column name="other_config" key="n-revalidator-threads"
319 type='{"type": "integer", "minInteger": 1}'>
320 <p>
321 Specifies the number of threads for software datapaths to use for
322 revalidating flows in the datapath. Typically, there is a direct
323 correlation between the number of revalidator threads, and the number
324 of flows allowed in the datapath. The default is the number of cpu
325 cores divided by four plus one. If <code>n-handler-threads</code> is
326 set, the default changes to the number of cpu cores minus the number
327 of handler threads.
328 </p>
329 <p>
330 This configuration is per datapath. If you have more than one
331 software datapath (e.g. some <code>system</code> bridges and some
332 <code>netdev</code> bridges), then the total number of threads is
333 <code>n-handler-threads</code> times the number of software
334 datapaths.
335 </p>
336 </column>
337 </group>
338
339 <group title="Status">
340 <column name="next_cfg">
341 Sequence number for client to increment. When a client modifies
342 any part of the database configuration and wishes to wait for
343 Open vSwitch to finish applying the changes, it may increment
344 this sequence number.
345 </column>
346
347 <column name="cur_cfg">
348 Sequence number that Open vSwitch sets to the current value of
349 <ref column="next_cfg"/> after it finishes applying a set of
350 configuration changes.
351 </column>
352
353 <group title="Statistics">
354 <p>
355 The <code>statistics</code> column contains key-value pairs that
356 report statistics about a system running an Open vSwitch. These are
357 updated periodically (currently, every 5 seconds). Key-value pairs
358 that cannot be determined or that do not apply to a platform are
359 omitted.
360 </p>
361
362 <column name="other_config" key="enable-statistics"
363 type='{"type": "boolean"}'>
364 Statistics are disabled by default to avoid overhead in the common
365 case when statistics gathering is not useful. Set this value to
366 <code>true</code> to enable populating the <ref column="statistics"/>
367 column or to <code>false</code> to explicitly disable it.
368 </column>
369
370 <column name="statistics" key="cpu"
371 type='{"type": "integer", "minInteger": 1}'>
372 <p>
373 Number of CPU processors, threads, or cores currently online and
374 available to the operating system on which Open vSwitch is running,
375 as an integer. This may be less than the number installed, if some
376 are not online or if they are not available to the operating
377 system.
378 </p>
379 <p>
380 Open vSwitch userspace processes are not multithreaded, but the
381 Linux kernel-based datapath is.
382 </p>
383 </column>
384
385 <column name="statistics" key="load_average">
386 A comma-separated list of three floating-point numbers,
387 representing the system load average over the last 1, 5, and 15
388 minutes, respectively.
389 </column>
390
391 <column name="statistics" key="memory">
392 <p>
393 A comma-separated list of integers, each of which represents a
394 quantity of memory in kilobytes that describes the operating
395 system on which Open vSwitch is running. In respective order,
396 these values are:
397 </p>
398
399 <ol>
400 <li>Total amount of RAM allocated to the OS.</li>
401 <li>RAM allocated to the OS that is in use.</li>
402 <li>RAM that can be flushed out to disk or otherwise discarded
403 if that space is needed for another purpose. This number is
404 necessarily less than or equal to the previous value.</li>
405 <li>Total disk space allocated for swap.</li>
406 <li>Swap space currently in use.</li>
407 </ol>
408
409 <p>
410 On Linux, all five values can be determined and are included. On
411 other operating systems, only the first two values can be
412 determined, so the list will only have two values.
413 </p>
414 </column>
415
416 <column name="statistics" key="process_NAME">
417 <p>
418 One such key-value pair, with <code>NAME</code> replaced by
419 a process name, will exist for each running Open vSwitch
420 daemon process, with <var>name</var> replaced by the
421 daemon's name (e.g. <code>process_ovs-vswitchd</code>). The
422 value is a comma-separated list of integers. The integers
423 represent the following, with memory measured in kilobytes
424 and durations in milliseconds:
425 </p>
426
427 <ol>
428 <li>The process's virtual memory size.</li>
429 <li>The process's resident set size.</li>
430 <li>The amount of user and system CPU time consumed by the
431 process.</li>
432 <li>The number of times that the process has crashed and been
433 automatically restarted by the monitor.</li>
434 <li>The duration since the process was started.</li>
435 <li>The duration for which the process has been running.</li>
436 </ol>
437
438 <p>
439 The interpretation of some of these values depends on whether the
440 process was started with the <option>--monitor</option>. If it
441 was not, then the crash count will always be 0 and the two
442 durations will always be the same. If <option>--monitor</option>
443 was given, then the crash count may be positive; if it is, the
444 latter duration is the amount of time since the most recent crash
445 and restart.
446 </p>
447
448 <p>
449 There will be one key-value pair for each file in Open vSwitch's
450 ``run directory'' (usually <code>/var/run/openvswitch</code>)
451 whose name ends in <code>.pid</code>, whose contents are a
452 process ID, and which is locked by a running process. The
453 <var>name</var> is taken from the pidfile's name.
454 </p>
455
456 <p>
457 Currently Open vSwitch is only able to obtain all of the above
458 detail on Linux systems. On other systems, the same key-value
459 pairs will be present but the values will always be the empty
460 string.
461 </p>
462 </column>
463
464 <column name="statistics" key="file_systems">
465 <p>
466 A space-separated list of information on local, writable file
467 systems. Each item in the list describes one file system and
468 consists in turn of a comma-separated list of the following:
469 </p>
470
471 <ol>
472 <li>Mount point, e.g. <code>/</code> or <code>/var/log</code>.
473 Any spaces or commas in the mount point are replaced by
474 underscores.</li>
475 <li>Total size, in kilobytes, as an integer.</li>
476 <li>Amount of storage in use, in kilobytes, as an integer.</li>
477 </ol>
478
479 <p>
480 This key-value pair is omitted if there are no local, writable
481 file systems or if Open vSwitch cannot obtain the needed
482 information.
483 </p>
484 </column>
485 </group>
486 </group>
487
488 <group title="Version Reporting">
489 <p>
490 These columns report the types and versions of the hardware and
491 software running Open vSwitch. We recommend in general that software
492 should test whether specific features are supported instead of relying
493 on version number checks. These values are primarily intended for
494 reporting to human administrators.
495 </p>
496
497 <column name="ovs_version">
498 The Open vSwitch version number, e.g. <code>1.1.0</code>.
499 </column>
500
501 <column name="db_version">
502 <p>
503 The database schema version number in the form
504 <code><var>major</var>.<var>minor</var>.<var>tweak</var></code>,
505 e.g. <code>1.2.3</code>. Whenever the database schema is changed in
506 a non-backward compatible way (e.g. deleting a column or a table),
507 <var>major</var> is incremented. When the database schema is changed
508 in a backward compatible way (e.g. adding a new column),
509 <var>minor</var> is incremented. When the database schema is changed
510 cosmetically (e.g. reindenting its syntax), <var>tweak</var> is
511 incremented.
512 </p>
513
514 <p>
515 The schema version is part of the database schema, so it can also be
516 retrieved by fetching the schema using the Open vSwitch database
517 protocol.
518 </p>
519 </column>
520
521 <column name="system_type">
522 <p>
523 An identifier for the type of system on top of which Open vSwitch
524 runs, e.g. <code>XenServer</code> or <code>KVM</code>.
525 </p>
526 <p>
527 System integrators are responsible for choosing and setting an
528 appropriate value for this column.
529 </p>
530 </column>
531
532 <column name="system_version">
533 <p>
534 The version of the system identified by <ref column="system_type"/>,
535 e.g. <code>5.6.100-39265p</code> on XenServer 5.6.100 build 39265.
536 </p>
537 <p>
538 System integrators are responsible for choosing and setting an
539 appropriate value for this column.
540 </p>
541 </column>
542
543 </group>
544
545 <group title="Capabilities">
546 <p>
547 These columns report capabilities of the Open vSwitch instance.
548 </p>
549 <column name="datapath_types">
550 <p>
551 This column reports the different dpifs registered with the system.
552 These are the values that this instance supports in the <ref
553 column="datapath_type" table="Bridge"/> column of the <ref
554 table="Bridge"/> table.
555 </p>
556 </column>
557 <column name="iface_types">
558 <p>
559 This column reports the different netdevs registered with the system.
560 These are the values that this instance supports in the <ref
561 column="type" table="Interface"/> column of the <ref
562 table="Interface"/> table.
563 </p>
564 </column>
565 </group>
566
567 <group title="Database Configuration">
568 <p>
569 These columns primarily configure the Open vSwitch database
570 (<code>ovsdb-server</code>), not the Open vSwitch switch
571 (<code>ovs-vswitchd</code>). The OVSDB database also uses the <ref
572 column="ssl"/> settings.
573 </p>
574
575 <p>
576 The Open vSwitch switch does read the database configuration to
577 determine remote IP addresses to which in-band control should apply.
578 </p>
579
580 <column name="manager_options">
581 Database clients to which the Open vSwitch database server should
582 connect or to which it should listen, along with options for how these
583 connection should be configured. See the <ref table="Manager"/> table
584 for more information.
585 </column>
586 </group>
587
588 <group title="Common Columns">
589 The overall purpose of these columns is described under <code>Common
590 Columns</code> at the beginning of this document.
591
592 <column name="other_config"/>
593 <column name="external_ids"/>
594 </group>
595 </table>
596
597 <table name="Bridge">
598 <p>
599 Configuration for a bridge within an
600 <ref table="Open_vSwitch"/>.
601 </p>
602 <p>
603 A <ref table="Bridge"/> record represents an Ethernet switch with one or
604 more ``ports,'' which are the <ref table="Port"/> records pointed to by
605 the <ref table="Bridge"/>'s <ref column="ports"/> column.
606 </p>
607
608 <group title="Core Features">
609 <column name="name">
610 <p>
611 Bridge identifier. Must be unique among the names of ports,
612 interfaces, and bridges on a host.
613 </p>
614
615 <p>
616 The name must be alphanumeric and must not contain forward or
617 backward slashes. The name of a bridge is also the name of an <ref
618 table="Interface"/> (and a <ref table="Port"/>) within the bridge, so
619 the restrictions on the <ref table="Interface" column="name"/> column
620 in the <ref table="Interface"/> table, particularly on length, also
621 apply to bridge names. Refer to the documentation for <ref
622 table="Interface"/> names for details.
623 </p>
624 </column>
625
626 <column name="ports">
627 Ports included in the bridge.
628 </column>
629
630 <column name="mirrors">
631 Port mirroring configuration.
632 </column>
633
634 <column name="netflow">
635 NetFlow configuration.
636 </column>
637
638 <column name="sflow">
639 sFlow(R) configuration.
640 </column>
641
642 <column name="ipfix">
643 IPFIX configuration.
644 </column>
645
646 <column name="flood_vlans">
647 <p>
648 VLAN IDs of VLANs on which MAC address learning should be disabled,
649 so that packets are flooded instead of being sent to specific ports
650 that are believed to contain packets' destination MACs. This should
651 ordinarily be used to disable MAC learning on VLANs used for
652 mirroring (RSPAN VLANs). It may also be useful for debugging.
653 </p>
654 <p>
655 SLB bonding (see the <ref table="Port" column="bond_mode"/> column in
656 the <ref table="Port"/> table) is incompatible with
657 <code>flood_vlans</code>. Consider using another bonding mode or
658 a different type of mirror instead.
659 </p>
660 </column>
661
662 <column name="auto_attach">
663 Auto Attach configuration.
664 </column>
665 </group>
666
667 <group title="OpenFlow Configuration">
668 <column name="controller">
669 <p>
670 OpenFlow controller set. If unset, then no OpenFlow controllers
671 will be used.
672 </p>
673
674 <p>
675 If there are primary controllers, removing all of them clears the
676 flow table. If there are no primary controllers, adding one also
677 clears the flow table. Other changes to the set of controllers, such
678 as adding or removing a service controller, adding another primary
679 controller to supplement an existing primary controller, or removing
680 only one of two primary controllers, have no effect on the flow
681 table.
682 </p>
683 </column>
684
685 <column name="flow_tables">
686 Configuration for OpenFlow tables. Each pair maps from an OpenFlow
687 table ID to configuration for that table.
688 </column>
689
690 <column name="fail_mode">
691 <p>When a controller is configured, it is, ordinarily, responsible
692 for setting up all flows on the switch. Thus, if the connection to
693 the controller fails, no new network connections can be set up.
694 If the connection to the controller stays down long enough,
695 no packets can pass through the switch at all. This setting
696 determines the switch's response to such a situation. It may be set
697 to one of the following:
698 <dl>
699 <dt><code>standalone</code></dt>
700 <dd>If no message is received from the controller for three
701 times the inactivity probe interval
702 (see <ref column="inactivity_probe"/>), then Open vSwitch
703 will take over responsibility for setting up flows. In
704 this mode, Open vSwitch causes the bridge to act like an
705 ordinary MAC-learning switch. Open vSwitch will continue
706 to retry connecting to the controller in the background
707 and, when the connection succeeds, it will discontinue its
708 standalone behavior.</dd>
709 <dt><code>secure</code></dt>
710 <dd>Open vSwitch will not set up flows on its own when the
711 controller connection fails or when no controllers are
712 defined. The bridge will continue to retry connecting to
713 any defined controllers forever.</dd>
714 </dl>
715 </p>
716 <p>
717 The default is <code>standalone</code> if the value is unset, but
718 future versions of Open vSwitch may change the default.
719 </p>
720 <p>
721 The <code>standalone</code> mode can create forwarding loops on a
722 bridge that has more than one uplink port unless STP is enabled. To
723 avoid loops on such a bridge, configure <code>secure</code> mode or
724 enable STP (see <ref column="stp_enable"/>).
725 </p>
726 <p>When more than one controller is configured,
727 <ref column="fail_mode"/> is considered only when none of the
728 configured controllers can be contacted.</p>
729 <p>
730 Changing <ref column="fail_mode"/> when no primary controllers are
731 configured clears the flow table.
732 </p>
733 </column>
734
735 <column name="datapath_id">
736 Reports the OpenFlow datapath ID in use. Exactly 16 hex digits.
737 (Setting this column has no useful effect. Set <ref
738 column="other-config" key="datapath-id"/> instead.)
739 </column>
740
741 <column name="datapath_version">
742 <p>
743 Reports the version number of the Open vSwitch datapath in use.
744 This allows management software to detect and report discrepancies
745 between Open vSwitch userspace and datapath versions. (The <ref
746 column="ovs_version" table="Open_vSwitch"/> column in the <ref
747 table="Open_vSwitch"/> reports the Open vSwitch userspace version.)
748 The version reported depends on the datapath in use:
749 </p>
750
751 <ul>
752 <li>
753 When the kernel module included in the Open vSwitch source tree is
754 used, this column reports the Open vSwitch version from which the
755 module was taken.
756 </li>
757
758 <li>
759 When the kernel module that is part of the upstream Linux kernel is
760 used, this column reports <code>&lt;unknown&gt;</code>.
761 </li>
762
763 <li>
764 When the datapath is built into the <code>ovs-vswitchd</code>
765 binary, this column reports <code>&lt;built-in&gt;</code>. A
766 built-in datapath is by definition the same version as the rest of
767 the Open VSwitch userspace.
768 </li>
769
770 <li>
771 Other datapaths (such as the Hyper-V kernel datapath) currently
772 report <code>&lt;unknown&gt;</code>.
773 </li>
774 </ul>
775
776 <p>
777 A version discrepancy between <code>ovs-vswitchd</code> and the
778 datapath in use is not normally cause for alarm. The Open vSwitch
779 kernel datapaths for Linux and Hyper-V, in particular, are designed
780 for maximum inter-version compatibility: any userspace version works
781 with with any kernel version. Some reasons do exist to insist on
782 particular user/kernel pairings. First, newer kernel versions add
783 new features, that can only be used by new-enough userspace, e.g.
784 VXLAN tunneling requires certain minimal userspace and kernel
785 versions. Second, as an extension to the first reason, some newer
786 kernel versions add new features for enhancing performance that only
787 new-enough userspace versions can take advantage of.
788 </p>
789 </column>
790
791 <column name="other_config" key="datapath-id">
792 Exactly 16 hex digits to set the OpenFlow datapath ID to a specific
793 value. May not be all-zero.
794 </column>
795
796 <column name="other_config" key="dp-desc">
797 Human readable description of datapath. It it a maximum 256
798 byte-long free-form string to describe the datapath for
799 debugging purposes, e.g. <code>switch3 in room 3120</code>.
800 </column>
801
802 <column name="other_config" key="disable-in-band"
803 type='{"type": "boolean"}'>
804 If set to <code>true</code>, disable in-band control on the bridge
805 regardless of controller and manager settings.
806 </column>
807
808 <column name="other_config" key="in-band-queue"
809 type='{"type": "integer", "minInteger": 0, "maxInteger": 4294967295}'>
810 A queue ID as a nonnegative integer. This sets the OpenFlow queue ID
811 that will be used by flows set up by in-band control on this bridge.
812 If unset, or if the port used by an in-band control flow does not have
813 QoS configured, or if the port does not have a queue with the specified
814 ID, the default queue is used instead.
815 </column>
816
817 <column name="protocols">
818 <p>
819 List of OpenFlow protocols that may be used when negotiating
820 a connection with a controller. OpenFlow 1.0, 1.1, 1.2, and
821 1.3 are enabled by default if this column is empty.
822 </p>
823
824 <p>
825 OpenFlow 1.4, 1.5, and 1.6 are not enabled by default because their
826 implementations are missing features. In addition, the OpenFlow 1.6
827 specification is still under development and thus subject to change.
828 </p>
829 </column>
830 </group>
831
832 <group title="Spanning Tree Configuration">
833 <p>
834 The IEEE 802.1D Spanning Tree Protocol (STP) is a network protocol
835 that ensures loop-free topologies. It allows redundant links to
836 be included in the network to provide automatic backup paths if
837 the active links fails.
838 </p>
839
840 <p>
841 These settings configure the slower-to-converge but still widely
842 supported version of Spanning Tree Protocol, sometimes known as
843 802.1D-1998. Open vSwitch also supports the newer Rapid Spanning Tree
844 Protocol (RSTP), documented later in the section titled <code>Rapid
845 Spanning Tree Configuration</code>.
846 </p>
847
848 <group title="STP Configuration">
849 <column name="stp_enable" type='{"type": "boolean"}'>
850 <p>
851 Enable spanning tree on the bridge. By default, STP is disabled
852 on bridges. Bond, internal, and mirror ports are not supported
853 and will not participate in the spanning tree.
854 </p>
855
856 <p>
857 STP and RSTP are mutually exclusive. If both are enabled, RSTP
858 will be used.
859 </p>
860 </column>
861
862 <column name="other_config" key="stp-system-id">
863 The bridge's STP identifier (the lower 48 bits of the bridge-id)
864 in the form
865 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>.
866 By default, the identifier is the MAC address of the bridge.
867 </column>
868
869 <column name="other_config" key="stp-priority"
870 type='{"type": "integer", "minInteger": 0, "maxInteger": 65535}'>
871 The bridge's relative priority value for determining the root
872 bridge (the upper 16 bits of the bridge-id). A bridge with the
873 lowest bridge-id is elected the root. By default, the priority
874 is 0x8000.
875 </column>
876
877 <column name="other_config" key="stp-hello-time"
878 type='{"type": "integer", "minInteger": 1, "maxInteger": 10}'>
879 The interval between transmissions of hello messages by
880 designated ports, in seconds. By default the hello interval is
881 2 seconds.
882 </column>
883
884 <column name="other_config" key="stp-max-age"
885 type='{"type": "integer", "minInteger": 6, "maxInteger": 40}'>
886 The maximum age of the information transmitted by the bridge
887 when it is the root bridge, in seconds. By default, the maximum
888 age is 20 seconds.
889 </column>
890
891 <column name="other_config" key="stp-forward-delay"
892 type='{"type": "integer", "minInteger": 4, "maxInteger": 30}'>
893 The delay to wait between transitioning root and designated
894 ports to <code>forwarding</code>, in seconds. By default, the
895 forwarding delay is 15 seconds.
896 </column>
897
898 <column name="other_config" key="mcast-snooping-aging-time"
899 type='{"type": "integer", "minInteger": 1}'>
900 <p>
901 The maximum number of seconds to retain a multicast snooping entry for
902 which no packets have been seen. The default is currently 300
903 seconds (5 minutes). The value, if specified, is forced into a
904 reasonable range, currently 15 to 3600 seconds.
905 </p>
906 </column>
907
908 <column name="other_config" key="mcast-snooping-table-size"
909 type='{"type": "integer", "minInteger": 1}'>
910 <p>
911 The maximum number of multicast snooping addresses to learn. The
912 default is currently 2048. The value, if specified, is forced into
913 a reasonable range, currently 10 to 1,000,000.
914 </p>
915 </column>
916 <column name="other_config" key="mcast-snooping-disable-flood-unregistered"
917 type='{"type": "boolean"}'>
918 <p>
919 If set to <code>false</code>, unregistered multicast packets are forwarded
920 to all ports.
921 If set to <code>true</code>, unregistered multicast packets are forwarded
922 to ports connected to multicast routers.
923 </p>
924 </column>
925 </group>
926
927 <group title="STP Status">
928 <p>
929 These key-value pairs report the status of 802.1D-1998. They are
930 present only if STP is enabled (via the <ref column="stp_enable"/>
931 column).
932 </p>
933 <column name="status" key="stp_bridge_id">
934 The bridge ID used in spanning tree advertisements, in the form
935 <var>xxxx</var>.<var>yyyyyyyyyyyy</var> where the <var>x</var>s are
936 the STP priority, the <var>y</var>s are the STP system ID, and each
937 <var>x</var> and <var>y</var> is a hex digit.
938 </column>
939 <column name="status" key="stp_designated_root">
940 The designated root for this spanning tree, in the same form as <ref
941 column="status" key="stp_bridge_id"/>. If this bridge is the root,
942 this will have the same value as <ref column="status"
943 key="stp_bridge_id"/>, otherwise it will differ.
944 </column>
945 <column name="status" key="stp_root_path_cost">
946 The path cost of reaching the designated bridge. A lower number is
947 better. The value is 0 if this bridge is the root, otherwise it is
948 higher.
949 </column>
950 </group>
951 </group>
952
953 <group title="Rapid Spanning Tree">
954 <p>
955 Rapid Spanning Tree Protocol (RSTP), like STP, is a network protocol
956 that ensures loop-free topologies. RSTP superseded STP with the
957 publication of 802.1D-2004. Compared to STP, RSTP converges more
958 quickly and recovers more quickly from failures.
959 </p>
960
961 <group title="RSTP Configuration">
962 <column name="rstp_enable" type='{"type": "boolean"}'>
963 <p>
964 Enable Rapid Spanning Tree on the bridge. By default, RSTP is disabled
965 on bridges. Bond, internal, and mirror ports are not supported
966 and will not participate in the spanning tree.
967 </p>
968
969 <p>
970 STP and RSTP are mutually exclusive. If both are enabled, RSTP
971 will be used.
972 </p>
973 </column>
974
975 <column name="other_config" key="rstp-address">
976 The bridge's RSTP address (the lower 48 bits of the bridge-id)
977 in the form
978 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>.
979 By default, the address is the MAC address of the bridge.
980 </column>
981
982 <column name="other_config" key="rstp-priority"
983 type='{"type": "integer", "minInteger": 0, "maxInteger": 61440}'>
984 The bridge's relative priority value for determining the root
985 bridge (the upper 16 bits of the bridge-id). A bridge with the
986 lowest bridge-id is elected the root. By default, the priority
987 is 0x8000 (32768). This value needs to be a multiple of 4096,
988 otherwise it's rounded to the nearest inferior one.
989 </column>
990
991 <column name="other_config" key="rstp-ageing-time"
992 type='{"type": "integer", "minInteger": 10, "maxInteger": 1000000}'>
993 The Ageing Time parameter for the Bridge. The default value
994 is 300 seconds.
995 </column>
996
997 <column name="other_config" key="rstp-force-protocol-version"
998 type='{"type": "integer"}'>
999 The Force Protocol Version parameter for the Bridge. This
1000 can take the value 0 (STP Compatibility mode) or 2
1001 (the default, normal operation).
1002 </column>
1003
1004 <column name="other_config" key="rstp-max-age"
1005 type='{"type": "integer", "minInteger": 6, "maxInteger": 40}'>
1006 The maximum age of the information transmitted by the Bridge
1007 when it is the Root Bridge. The default value is 20.
1008 </column>
1009
1010 <column name="other_config" key="rstp-forward-delay"
1011 type='{"type": "integer", "minInteger": 4, "maxInteger": 30}'>
1012 The delay used by STP Bridges to transition Root and Designated
1013 Ports to Forwarding. The default value is 15.
1014 </column>
1015
1016 <column name="other_config" key="rstp-transmit-hold-count"
1017 type='{"type": "integer", "minInteger": 1, "maxInteger": 10}'>
1018 The Transmit Hold Count used by the Port Transmit state machine
1019 to limit transmission rate. The default value is 6.
1020 </column>
1021 </group>
1022
1023 <group title="RSTP Status">
1024 <p>
1025 These key-value pairs report the status of 802.1D-2004. They are
1026 present only if RSTP is enabled (via the <ref column="rstp_enable"/>
1027 column).
1028 </p>
1029 <column name="rstp_status" key="rstp_bridge_id">
1030 The bridge ID used in rapid spanning tree advertisements, in the form
1031 <var>x</var>.<var>yyy</var>.<var>zzzzzzzzzzzz</var> where
1032 <var>x</var> is the RSTP priority, the <var>y</var>s are a locally
1033 assigned system ID extension, the <var>z</var>s are the STP system
1034 ID, and each <var>x</var>, <var>y</var>, or <var>z</var> is a hex
1035 digit.
1036 </column>
1037 <column name="rstp_status" key="rstp_root_id">
1038 The root of this spanning tree, in the same form as <ref
1039 column="rstp_status" key="rstp_bridge_id"/>. If this bridge is the
1040 root, this will have the same value as <ref column="rstp_status"
1041 key="rstp_bridge_id"/>, otherwise it will differ.
1042 </column>
1043 <column name="rstp_status" key="rstp_root_path_cost"
1044 type='{"type": "integer", "minInteger": 0}'>
1045 The path cost of reaching the root. A lower number is better. The
1046 value is 0 if this bridge is the root, otherwise it is higher.
1047 </column>
1048 <column name="rstp_status" key="rstp_designated_id">
1049 The RSTP designated ID, in the same form as <ref column="rstp_status"
1050 key="rstp_bridge_id"/>.
1051 </column>
1052 <column name="rstp_status" key="rstp_designated_port_id">
1053 The RSTP designated port ID, as a 4-digit hex number.
1054 </column>
1055 <column name="rstp_status" key="rstp_bridge_port_id">
1056 The RSTP bridge port ID, as a 4-digit hex number.
1057 </column>
1058 </group>
1059 </group>
1060
1061 <group title="Multicast Snooping Configuration">
1062 Multicast snooping (RFC 4541) monitors the Internet Group Management
1063 Protocol (IGMP) and Multicast Listener Discovery traffic between hosts
1064 and multicast routers. The switch uses what IGMP and MLD snooping
1065 learns to forward multicast traffic only to interfaces that are connected
1066 to interested receivers. Currently it supports IGMPv1, IGMPv2, IGMPv3,
1067 MLDv1 and MLDv2 protocols.
1068
1069 <column name="mcast_snooping_enable">
1070 Enable multicast snooping on the bridge. For now, the default
1071 is disabled.
1072 </column>
1073 </group>
1074
1075 <group title="Other Features">
1076 <column name="datapath_type">
1077 Name of datapath provider. The kernel datapath has type
1078 <code>system</code>. The userspace datapath has type
1079 <code>netdev</code>. A manager may refer to the <ref
1080 table="Open_vSwitch" column="datapath_types"/> column of the <ref
1081 table="Open_vSwitch"/> table for a list of the types accepted by this
1082 Open vSwitch instance.
1083 </column>
1084
1085 <column name="external_ids" key="bridge-id">
1086 A unique identifier of the bridge. On Citrix XenServer this will
1087 commonly be the same as
1088 <ref column="external_ids" key="xs-network-uuids"/>.
1089 </column>
1090
1091 <column name="external_ids" key="xs-network-uuids">
1092 Semicolon-delimited set of universally unique identifier(s) for the
1093 network with which this bridge is associated on a Citrix XenServer
1094 host. The network identifiers are RFC 4122 UUIDs as displayed by,
1095 e.g., <code>xe network-list</code>.
1096 </column>
1097
1098 <column name="other_config" key="hwaddr">
1099 An Ethernet address in the form
1100 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>
1101 to set the hardware address of the local port and influence the
1102 datapath ID.
1103 </column>
1104
1105 <column name="other_config" key="forward-bpdu"
1106 type='{"type": "boolean"}'>
1107
1108 <p>
1109 Controls forwarding of BPDUs and other network control frames when
1110 NORMAL action is invoked. When this option is <code>false</code> or
1111 unset, frames with reserved Ethernet addresses (see table below) will
1112 not be forwarded. When this option is <code>true</code>, such frames
1113 will not be treated specially.
1114 </p>
1115
1116 <p>
1117 The above general rule has the following exceptions:
1118 </p>
1119
1120 <ul>
1121 <li>
1122 If STP is enabled on the bridge (see the <ref column="stp_enable"
1123 table="Bridge"/> column in the <ref table="Bridge"/> table), the
1124 bridge processes all received STP packets and never passes them to
1125 OpenFlow or forwards them. This is true even if STP is disabled on
1126 an individual port.
1127 </li>
1128
1129 <li>
1130 If LLDP is enabled on an interface (see the <ref column="lldp"
1131 table="Interface"/> column in the <ref table="Interface"/> table),
1132 the interface processes received LLDP packets and never passes them
1133 to OpenFlow or forwards them.
1134 </li>
1135 </ul>
1136
1137 <p>
1138 Set this option to <code>true</code> if the Open vSwitch bridge
1139 connects different Ethernet networks and is not configured to
1140 participate in STP.
1141 </p>
1142
1143 <p>
1144 This option affects packets with the following destination MAC
1145 addresses:
1146 </p>
1147
1148 <dl>
1149 <dt><code>01:80:c2:00:00:00</code></dt>
1150 <dd>IEEE 802.1D Spanning Tree Protocol (STP).</dd>
1151
1152 <dt><code>01:80:c2:00:00:01</code></dt>
1153 <dd>IEEE Pause frame.</dd>
1154
1155 <dt><code>01:80:c2:00:00:0<var>x</var></code></dt>
1156 <dd>Other reserved protocols.</dd>
1157
1158 <dt><code>00:e0:2b:00:00:00</code></dt>
1159 <dd>Extreme Discovery Protocol (EDP).</dd>
1160
1161 <dt>
1162 <code>00:e0:2b:00:00:04</code> and <code>00:e0:2b:00:00:06</code>
1163 </dt>
1164 <dd>Ethernet Automatic Protection Switching (EAPS).</dd>
1165
1166 <dt><code>01:00:0c:cc:cc:cc</code></dt>
1167 <dd>
1168 Cisco Discovery Protocol (CDP), VLAN Trunking Protocol (VTP),
1169 Dynamic Trunking Protocol (DTP), Port Aggregation Protocol (PAgP),
1170 and others.
1171 </dd>
1172
1173 <dt><code>01:00:0c:cc:cc:cd</code></dt>
1174 <dd>Cisco Shared Spanning Tree Protocol PVSTP+.</dd>
1175
1176 <dt><code>01:00:0c:cd:cd:cd</code></dt>
1177 <dd>Cisco STP Uplink Fast.</dd>
1178
1179 <dt><code>01:00:0c:00:00:00</code></dt>
1180 <dd>Cisco Inter Switch Link.</dd>
1181
1182 <dt><code>01:00:0c:cc:cc:c<var>x</var></code></dt>
1183 <dd>Cisco CFM.</dd>
1184 </dl>
1185 </column>
1186
1187 <column name="other_config" key="mac-aging-time"
1188 type='{"type": "integer", "minInteger": 1}'>
1189 <p>
1190 The maximum number of seconds to retain a MAC learning entry for
1191 which no packets have been seen. The default is currently 300
1192 seconds (5 minutes). The value, if specified, is forced into a
1193 reasonable range, currently 15 to 3600 seconds.
1194 </p>
1195
1196 <p>
1197 A short MAC aging time allows a network to more quickly detect that a
1198 host is no longer connected to a switch port. However, it also makes
1199 it more likely that packets will be flooded unnecessarily, when they
1200 are addressed to a connected host that rarely transmits packets. To
1201 reduce the incidence of unnecessary flooding, use a MAC aging time
1202 longer than the maximum interval at which a host will ordinarily
1203 transmit packets.
1204 </p>
1205 </column>
1206
1207 <column name="other_config" key="mac-table-size"
1208 type='{"type": "integer", "minInteger": 1}'>
1209 <p>
1210 The maximum number of MAC addresses to learn. The default is
1211 currently 2048. The value, if specified, is forced into a reasonable
1212 range, currently 10 to 1,000,000.
1213 </p>
1214 </column>
1215 </group>
1216
1217 <group title="Common Columns">
1218 The overall purpose of these columns is described under <code>Common
1219 Columns</code> at the beginning of this document.
1220
1221 <column name="other_config"/>
1222 <column name="external_ids"/>
1223 </group>
1224 </table>
1225
1226 <table name="Port" table="Port or bond configuration.">
1227 <p>A port within a <ref table="Bridge"/>.</p>
1228 <p>Most commonly, a port has exactly one ``interface,'' pointed to by its
1229 <ref column="interfaces"/> column. Such a port logically
1230 corresponds to a port on a physical Ethernet switch. A port
1231 with more than one interface is a ``bonded port'' (see
1232 <ref group="Bonding Configuration"/>).</p>
1233 <p>Some properties that one might think as belonging to a port are actually
1234 part of the port's <ref table="Interface"/> members.</p>
1235
1236 <column name="name">
1237 Port name. For a non-bonded port, this should be the same as its
1238 interface's name. Port names must otherwise be unique among the names of
1239 ports, interfaces, and bridges on a host. Because port and interfaces
1240 names are usually the same, the restrictions on the <ref
1241 table="Interface" column="name"/> column in the <ref table="Interface"/>
1242 table, particularly on length, also apply to port names. Refer to the
1243 documentation for <ref table="Interface"/> names for details.
1244 </column>
1245
1246 <column name="interfaces">
1247 The port's interfaces. If there is more than one, this is a
1248 bonded Port.
1249 </column>
1250
1251 <group title="VLAN Configuration">
1252 <p>Bridge ports support the following types of VLAN configuration:</p>
1253 <dl>
1254 <dt>trunk</dt>
1255 <dd>
1256 <p>
1257 A trunk port carries packets on one or more specified VLANs
1258 specified in the <ref column="trunks"/> column (often, on every
1259 VLAN). A packet that ingresses on a trunk port is in the VLAN
1260 specified in its 802.1Q header, or VLAN 0 if the packet has no
1261 802.1Q header. A packet that egresses through a trunk port will
1262 have an 802.1Q header if it has a nonzero VLAN ID.
1263 </p>
1264
1265 <p>
1266 Any packet that ingresses on a trunk port tagged with a VLAN that
1267 the port does not trunk is dropped.
1268 </p>
1269 </dd>
1270
1271 <dt>access</dt>
1272 <dd>
1273 <p>
1274 An access port carries packets on exactly one VLAN specified in the
1275 <ref column="tag"/> column. Packets egressing on an access port
1276 have no 802.1Q header.
1277 </p>
1278
1279 <p>
1280 Any packet with an 802.1Q header with a nonzero VLAN ID that
1281 ingresses on an access port is dropped, regardless of whether the
1282 VLAN ID in the header is the access port's VLAN ID.
1283 </p>
1284 </dd>
1285
1286 <dt>native-tagged</dt>
1287 <dd>
1288 A native-tagged port resembles a trunk port, with the exception that
1289 a packet without an 802.1Q header that ingresses on a native-tagged
1290 port is in the ``native VLAN'' (specified in the <ref column="tag"/>
1291 column).
1292 </dd>
1293
1294 <dt>native-untagged</dt>
1295 <dd>
1296 A native-untagged port resembles a native-tagged port, with the
1297 exception that a packet that egresses on a native-untagged port in
1298 the native VLAN will not have an 802.1Q header.
1299 </dd>
1300 </dl>
1301 <p>
1302 A packet will only egress through bridge ports that carry the VLAN of
1303 the packet, as described by the rules above.
1304 </p>
1305
1306 <column name="vlan_mode">
1307 <p>
1308 The VLAN mode of the port, as described above. When this column is
1309 empty, a default mode is selected as follows:
1310 </p>
1311 <ul>
1312 <li>
1313 If <ref column="tag"/> contains a value, the port is an access
1314 port. The <ref column="trunks"/> column should be empty.
1315 </li>
1316 <li>
1317 Otherwise, the port is a trunk port. The <ref column="trunks"/>
1318 column value is honored if it is present.
1319 </li>
1320 </ul>
1321 </column>
1322
1323 <column name="tag">
1324 <p>
1325 For an access port, the port's implicitly tagged VLAN. For a
1326 native-tagged or native-untagged port, the port's native VLAN. Must
1327 be empty if this is a trunk port.
1328 </p>
1329 </column>
1330
1331 <column name="trunks">
1332 <p>
1333 For a trunk, native-tagged, or native-untagged port, the 802.1Q VLAN
1334 or VLANs that this port trunks; if it is empty, then the port trunks
1335 all VLANs. Must be empty if this is an access port.
1336 </p>
1337 <p>
1338 A native-tagged or native-untagged port always trunks its native
1339 VLAN, regardless of whether <ref column="trunks"/> includes that
1340 VLAN.
1341 </p>
1342 </column>
1343
1344 <column name="other_config" key="priority-tags"
1345 type='{"type": "boolean"}'>
1346 <p>
1347 An 802.1Q header contains two important pieces of information: a VLAN
1348 ID and a priority. A frame with a zero VLAN ID, called a
1349 ``priority-tagged'' frame, is supposed to be treated the same way as
1350 a frame without an 802.1Q header at all (except for the priority).
1351 </p>
1352
1353 <p>
1354 However, some network elements ignore any frame that has 802.1Q
1355 header at all, even when the VLAN ID is zero. Therefore, by default
1356 Open vSwitch does not output priority-tagged frames, instead omitting
1357 the 802.1Q header entirely if the VLAN ID is zero. Set this key to
1358 <code>true</code> to enable priority-tagged frames on a port.
1359 </p>
1360
1361 <p>
1362 Regardless of this setting, Open vSwitch omits the 802.1Q header on
1363 output if both the VLAN ID and priority would be zero.
1364 </p>
1365
1366 <p>
1367 All frames output to native-tagged ports have a nonzero VLAN ID, so
1368 this setting is not meaningful on native-tagged ports.
1369 </p>
1370 </column>
1371 </group>
1372
1373 <group title="Bonding Configuration">
1374 <p>A port that has more than one interface is a ``bonded port.'' Bonding
1375 allows for load balancing and fail-over.</p>
1376
1377 <p>
1378 The following types of bonding will work with any kind of upstream
1379 switch. On the upstream switch, do not configure the interfaces as a
1380 bond:
1381 </p>
1382
1383 <dl>
1384 <dt><code>balance-slb</code></dt>
1385 <dd>
1386 Balances flows among slaves based on source MAC address and output
1387 VLAN, with periodic rebalancing as traffic patterns change.
1388 </dd>
1389
1390 <dt><code>active-backup</code></dt>
1391 <dd>
1392 Assigns all flows to one slave, failing over to a backup slave when
1393 the active slave is disabled. This is the only bonding mode in which
1394 interfaces may be plugged into different upstream switches.
1395 </dd>
1396 </dl>
1397
1398 <p>
1399 The following modes require the upstream switch to support 802.3ad with
1400 successful LACP negotiation. If LACP negotiation fails and
1401 other-config:lacp-fallback-ab is true, then <code>active-backup</code>
1402 mode is used:
1403 </p>
1404
1405 <dl>
1406 <dt><code>balance-tcp</code></dt>
1407 <dd>
1408 Balances flows among slaves based on L2, L3, and L4 protocol
1409 information such as destination MAC address, IP address, and TCP
1410 port.
1411 </dd>
1412 </dl>
1413
1414 <p>These columns apply only to bonded ports. Their values are
1415 otherwise ignored.</p>
1416
1417 <column name="bond_mode">
1418 <p>The type of bonding used for a bonded port. Defaults to
1419 <code>active-backup</code> if unset.
1420 </p>
1421 </column>
1422
1423 <column name="other_config" key="bond-hash-basis"
1424 type='{"type": "integer"}'>
1425 An integer hashed along with flows when choosing output slaves in load
1426 balanced bonds. When changed, all flows will be assigned different
1427 hash values possibly causing slave selection decisions to change. Does
1428 not affect bonding modes which do not employ load balancing such as
1429 <code>active-backup</code>.
1430 </column>
1431
1432 <group title="Link Failure Detection">
1433 <p>
1434 An important part of link bonding is detecting that links are down so
1435 that they may be disabled. These settings determine how Open vSwitch
1436 detects link failure.
1437 </p>
1438
1439 <column name="other_config" key="bond-detect-mode"
1440 type='{"type": "string", "enum": ["set", ["carrier", "miimon"]]}'>
1441 The means used to detect link failures. Defaults to
1442 <code>carrier</code> which uses each interface's carrier to detect
1443 failures. When set to <code>miimon</code>, will check for failures
1444 by polling each interface's MII.
1445 </column>
1446
1447 <column name="other_config" key="bond-miimon-interval"
1448 type='{"type": "integer"}'>
1449 The interval, in milliseconds, between successive attempts to poll
1450 each interface's MII. Relevant only when <ref column="other_config"
1451 key="bond-detect-mode"/> is <code>miimon</code>.
1452 </column>
1453
1454 <column name="bond_updelay">
1455 <p>
1456 The number of milliseconds for which the link must stay up on an
1457 interface before the interface is considered to be up. Specify
1458 <code>0</code> to enable the interface immediately.
1459 </p>
1460
1461 <p>
1462 This setting is honored only when at least one bonded interface is
1463 already enabled. When no interfaces are enabled, then the first
1464 bond interface to come up is enabled immediately.
1465 </p>
1466 </column>
1467
1468 <column name="bond_downdelay">
1469 The number of milliseconds for which the link must stay down on an
1470 interface before the interface is considered to be down. Specify
1471 <code>0</code> to disable the interface immediately.
1472 </column>
1473 </group>
1474
1475 <group title="LACP Configuration">
1476 <p>
1477 LACP, the Link Aggregation Control Protocol, is an IEEE standard that
1478 allows switches to automatically detect that they are connected by
1479 multiple links and aggregate across those links. These settings
1480 control LACP behavior.
1481 </p>
1482
1483 <column name="lacp">
1484 Configures LACP on this port. LACP allows directly connected
1485 switches to negotiate which links may be bonded. LACP may be enabled
1486 on non-bonded ports for the benefit of any switches they may be
1487 connected to. <code>active</code> ports are allowed to initiate LACP
1488 negotiations. <code>passive</code> ports are allowed to participate
1489 in LACP negotiations initiated by a remote switch, but not allowed to
1490 initiate such negotiations themselves. If LACP is enabled on a port
1491 whose partner switch does not support LACP, the bond will be
1492 disabled, unless other-config:lacp-fallback-ab is set to true.
1493 Defaults to <code>off</code> if unset.
1494 </column>
1495
1496 <column name="other_config" key="lacp-system-id">
1497 The LACP system ID of this <ref table="Port"/>. The system ID of a
1498 LACP bond is used to identify itself to its partners. Must be a
1499 nonzero MAC address. Defaults to the bridge Ethernet address if
1500 unset.
1501 </column>
1502
1503 <column name="other_config" key="lacp-system-priority"
1504 type='{"type": "integer", "minInteger": 1, "maxInteger": 65535}'>
1505 The LACP system priority of this <ref table="Port"/>. In LACP
1506 negotiations, link status decisions are made by the system with the
1507 numerically lower priority.
1508 </column>
1509
1510 <column name="other_config" key="lacp-time"
1511 type='{"type": "string", "enum": ["set", ["fast", "slow"]]}'>
1512 <p>
1513 The LACP timing which should be used on this <ref table="Port"/>.
1514 By default <code>slow</code> is used. When configured to be
1515 <code>fast</code> LACP heartbeats are requested at a rate of once
1516 per second causing connectivity problems to be detected more
1517 quickly. In <code>slow</code> mode, heartbeats are requested at a
1518 rate of once every 30 seconds.
1519 </p>
1520 </column>
1521
1522 <column name="other_config" key="lacp-fallback-ab"
1523 type='{"type": "boolean"}'>
1524 <p>
1525 Determines the behavior of openvswitch bond in LACP mode. If
1526 the partner switch does not support LACP, setting this option
1527 to <code>true</code> allows openvswitch to fallback to
1528 active-backup. If the option is set to <code>false</code>, the
1529 bond will be disabled. In both the cases, once the partner switch
1530 is configured to LACP mode, the bond will use LACP.
1531 </p>
1532 </column>
1533 </group>
1534
1535 <group title="Rebalancing Configuration">
1536 <p>
1537 These settings control behavior when a bond is in
1538 <code>balance-slb</code> or <code>balance-tcp</code> mode.
1539 </p>
1540
1541 <column name="other_config" key="bond-rebalance-interval"
1542 type='{"type": "integer", "minInteger": 0, "maxInteger": 10000}'>
1543 For a load balanced bonded port, the number of milliseconds between
1544 successive attempts to rebalance the bond, that is, to move flows
1545 from one interface on the bond to another in an attempt to keep usage
1546 of each interface roughly equal. If zero, load balancing is disabled
1547 on the bond (link failure still cause flows to move). If
1548 less than 1000ms, the rebalance interval will be 1000ms.
1549 </column>
1550 </group>
1551
1552 <column name="bond_fake_iface">
1553 For a bonded port, whether to create a fake internal interface with the
1554 name of the port. Use only for compatibility with legacy software that
1555 requires this.
1556 </column>
1557 </group>
1558
1559 <group title="Spanning Tree Protocol">
1560 <p>
1561 The configuration here is only meaningful, and the status is only
1562 populated, when 802.1D-1998 Spanning Tree Protocol is enabled on the
1563 port's <ref column="Bridge"/> with its <ref column="stp_enable"/>
1564 column.
1565 </p>
1566
1567 <group title="STP Configuration">
1568 <column name="other_config" key="stp-enable"
1569 type='{"type": "boolean"}'>
1570 When STP is enabled on a bridge, it is enabled by default on all of
1571 the bridge's ports except bond, internal, and mirror ports (which do
1572 not work with STP). If this column's value is <code>false</code>,
1573 STP is disabled on the port.
1574 </column>
1575
1576 <column name="other_config" key="stp-port-num"
1577 type='{"type": "integer", "minInteger": 1, "maxInteger": 255}'>
1578 The port number used for the lower 8 bits of the port-id. By
1579 default, the numbers will be assigned automatically. If any
1580 port's number is manually configured on a bridge, then they
1581 must all be.
1582 </column>
1583
1584 <column name="other_config" key="stp-port-priority"
1585 type='{"type": "integer", "minInteger": 0, "maxInteger": 255}'>
1586 The port's relative priority value for determining the root
1587 port (the upper 8 bits of the port-id). A port with a lower
1588 port-id will be chosen as the root port. By default, the
1589 priority is 0x80.
1590 </column>
1591
1592 <column name="other_config" key="stp-path-cost"
1593 type='{"type": "integer", "minInteger": 0, "maxInteger": 65535}'>
1594 Spanning tree path cost for the port. A lower number indicates
1595 a faster link. By default, the cost is based on the maximum
1596 speed of the link.
1597 </column>
1598 </group>
1599
1600 <group title="STP Status">
1601 <column name="status" key="stp_port_id">
1602 The port ID used in spanning tree advertisements for this port, as 4
1603 hex digits. Configuring the port ID is described in the
1604 <code>stp-port-num</code> and <code>stp-port-priority</code> keys of
1605 the <code>other_config</code> section earlier.
1606 </column>
1607 <column name="status" key="stp_state"
1608 type='{"type": "string", "enum": ["set",
1609 ["disabled", "listening", "learning",
1610 "forwarding", "blocking"]]}'>
1611 STP state of the port.
1612 </column>
1613 <column name="status" key="stp_sec_in_state"
1614 type='{"type": "integer", "minInteger": 0}'>
1615 The amount of time this port has been in the current STP state, in
1616 seconds.
1617 </column>
1618 <column name="status" key="stp_role"
1619 type='{"type": "string", "enum": ["set",
1620 ["root", "designated", "alternate"]]}'>
1621 STP role of the port.
1622 </column>
1623 </group>
1624 </group>
1625
1626 <group title="Rapid Spanning Tree Protocol">
1627 <p>
1628 The configuration here is only meaningful, and the status and
1629 statistics are only populated, when 802.1D-1998 Spanning Tree Protocol
1630 is enabled on the port's <ref column="Bridge"/> with its <ref
1631 column="stp_enable"/> column.
1632 </p>
1633
1634 <group title="RSTP Configuration">
1635 <column name="other_config" key="rstp-enable"
1636 type='{"type": "boolean"}'>
1637 When RSTP is enabled on a bridge, it is enabled by default on all of
1638 the bridge's ports except bond, internal, and mirror ports (which do
1639 not work with RSTP). If this column's value is <code>false</code>,
1640 RSTP is disabled on the port.
1641 </column>
1642
1643 <column name="other_config" key="rstp-port-priority"
1644 type='{"type": "integer", "minInteger": 0, "maxInteger": 240}'>
1645 The port's relative priority value for determining the root port, in
1646 multiples of 16. By default, the port priority is 0x80 (128). Any
1647 value in the lower 4 bits is rounded off. The significant upper 4
1648 bits become the upper 4 bits of the port-id. A port with the lowest
1649 port-id is elected as the root.
1650 </column>
1651
1652 <column name="other_config" key="rstp-port-num"
1653 type='{"type": "integer", "minInteger": 1, "maxInteger": 4095}'>
1654 The local RSTP port number, used as the lower 12 bits of the port-id.
1655 By default the port numbers are assigned automatically, and typically
1656 may not correspond to the OpenFlow port numbers. A port with the
1657 lowest port-id is elected as the root.
1658 </column>
1659
1660 <column name="other_config" key="rstp-port-path-cost"
1661 type='{"type": "integer"}'>
1662 The port path cost. The Port's contribution, when it is
1663 the Root Port, to the Root Path Cost for the Bridge. By default the
1664 cost is automatically calculated from the port's speed.
1665 </column>
1666
1667 <column name="other_config" key="rstp-port-admin-edge"
1668 type='{"type": "boolean"}'>
1669 The admin edge port parameter for the Port. Default is
1670 <code>false</code>.
1671 </column>
1672
1673 <column name="other_config" key="rstp-port-auto-edge"
1674 type='{"type": "boolean"}'>
1675 The auto edge port parameter for the Port. Default is
1676 <code>true</code>.
1677 </column>
1678
1679 <column name="other_config" key="rstp-port-mcheck"
1680 type='{"type": "boolean"}'>
1681 <p>
1682 The mcheck port parameter for the Port. Default is
1683 <code>false</code>. May be set to force the Port Protocol
1684 Migration state machine to transmit RST BPDUs for a
1685 MigrateTime period, to test whether all STP Bridges on the
1686 attached LAN have been removed and the Port can continue to
1687 transmit RSTP BPDUs. Setting mcheck has no effect if the
1688 Bridge is operating in STP Compatibility mode.
1689 </p>
1690 <p>
1691 Changing the value from <code>true</code> to
1692 <code>false</code> has no effect, but needs to be done if
1693 this behavior is to be triggered again by subsequently
1694 changing the value from <code>false</code> to
1695 <code>true</code>.
1696 </p>
1697 </column>
1698 </group>
1699
1700 <group title="RSTP Status">
1701 <column name="rstp_status" key="rstp_port_id">
1702 The port ID used in spanning tree advertisements for this port, as 4
1703 hex digits. Configuring the port ID is described in the
1704 <code>rstp-port-num</code> and <code>rstp-port-priority</code> keys
1705 of the <code>other_config</code> section earlier.
1706 </column>
1707 <column name="rstp_status" key="rstp_port_role"
1708 type='{"type": "string", "enum": ["set",
1709 ["Root", "Designated", "Alternate", "Backup", "Disabled"]]}'>
1710 RSTP role of the port.
1711 </column>
1712 <column name="rstp_status" key="rstp_port_state"
1713 type='{"type": "string", "enum": ["set",
1714 ["Disabled", "Learning", "Forwarding", "Discarding"]]}'>
1715 RSTP state of the port.
1716 </column>
1717 <column name="rstp_status" key="rstp_designated_bridge_id">
1718 The port's RSTP designated bridge ID, in the same form as <ref
1719 column="rstp_status" key="rstp_bridge_id"/> in the <ref
1720 table="Bridge"/> table.
1721 </column>
1722 <column name="rstp_status" key="rstp_designated_port_id">
1723 The port's RSTP designated port ID, as 4 hex digits.
1724 </column>
1725 <column name="rstp_status" key="rstp_designated_path_cost"
1726 type='{"type": "integer"}'>
1727 The port's RSTP designated path cost. Lower is better.
1728 </column>
1729 </group>
1730
1731 <group title="RSTP Statistics">
1732 <column name="rstp_statistics" key="rstp_tx_count">
1733 Number of RSTP BPDUs transmitted through this port.
1734 </column>
1735 <column name="rstp_statistics" key="rstp_rx_count">
1736 Number of valid RSTP BPDUs received by this port.
1737 </column>
1738 <column name="rstp_statistics" key="rstp_error_count">
1739 Number of invalid RSTP BPDUs received by this port.
1740 </column>
1741 <column name="rstp_statistics" key="rstp_uptime">
1742 The duration covered by the other RSTP statistics, in seconds.
1743 </column>
1744 </group>
1745 </group>
1746
1747 <group title="Multicast Snooping">
1748 <column name="other_config" key="mcast-snooping-flood"
1749 type='{"type": "boolean"}'>
1750 <p>
1751 If set to <code>true</code>, multicast packets (except Reports) are
1752 unconditionally forwarded to the specific port.
1753 </p>
1754 </column>
1755 <column name="other_config" key="mcast-snooping-flood-reports"
1756 type='{"type": "boolean"}'>
1757 <p>
1758 If set to <code>true</code>, multicast Reports are unconditionally
1759 forwarded to the specific port.
1760 </p>
1761 </column>
1762 </group>
1763
1764 <group title="Other Features">
1765 <column name="qos">
1766 Quality of Service configuration for this port.
1767 </column>
1768
1769 <column name="mac">
1770 The MAC address to use for this port for the purpose of choosing the
1771 bridge's MAC address. This column does not necessarily reflect the
1772 port's actual MAC address, nor will setting it change the port's actual
1773 MAC address.
1774 </column>
1775
1776 <column name="fake_bridge">
1777 Does this port represent a sub-bridge for its tagged VLAN within the
1778 Bridge? See ovs-vsctl(8) for more information.
1779 </column>
1780
1781 <column name="protected" type='{"type": "boolean"}'>
1782 The protected ports feature allows certain ports to be designated as
1783 protected. Traffic between protected ports is blocked. Protected
1784 ports can send traffic to unprotected ports. Unprotected ports can
1785 send traffic to any port.
1786 Default is false.
1787 </column>
1788
1789 <column name="external_ids" key="fake-bridge-id-*">
1790 External IDs for a fake bridge (see the <ref column="fake_bridge"/>
1791 column) are defined by prefixing a <ref table="Bridge"/> <ref
1792 table="Bridge" column="external_ids"/> key with
1793 <code>fake-bridge-</code>,
1794 e.g. <code>fake-bridge-xs-network-uuids</code>.
1795 </column>
1796
1797 <column name="other_config" key="transient"
1798 type='{"type": "boolean"}'>
1799 <p>
1800 If set to <code>true</code>, the port will be removed when
1801 <code>ovs-ctl start --delete-transient-ports</code> is used.
1802 </p>
1803 </column>
1804 </group>
1805
1806 <column name="bond_active_slave">
1807 For a bonded port, record the mac address of the current active slave.
1808 </column>
1809
1810 <group title="Port Statistics">
1811 <p>
1812 Key-value pairs that report port statistics. The update period
1813 is controlled by <ref column="other_config"
1814 key="stats-update-interval"/> in the <code>Open_vSwitch</code> table.
1815 </p>
1816 <group title="Statistics: STP transmit and receive counters">
1817 <column name="statistics" key="stp_tx_count">
1818 Number of STP BPDUs sent on this port by the spanning
1819 tree library.
1820 </column>
1821 <column name="statistics" key="stp_rx_count">
1822 Number of STP BPDUs received on this port and accepted by the
1823 spanning tree library.
1824 </column>
1825 <column name="statistics" key="stp_error_count">
1826 Number of bad STP BPDUs received on this port. Bad BPDUs
1827 include runt packets and those with an unexpected protocol ID.
1828 </column>
1829 </group>
1830 </group>
1831
1832 <group title="Common Columns">
1833 The overall purpose of these columns is described under <code>Common
1834 Columns</code> at the beginning of this document.
1835
1836 <column name="other_config"/>
1837 <column name="external_ids"/>
1838 </group>
1839 </table>
1840
1841 <table name="Interface" title="One physical network device in a Port.">
1842 An interface within a <ref table="Port"/>.
1843
1844 <group title="Core Features">
1845 <column name="name">
1846 <p>
1847 Interface name. Should be alphanumeric. For non-bonded port, this
1848 should be the same as the port name. It must otherwise be unique
1849 among the names of ports, interfaces, and bridges on a host.
1850 </p>
1851
1852 <p>
1853 The maximum length of an interface name depends on the underlying
1854 datapath:
1855 </p>
1856
1857 <ul>
1858 <li>
1859 The names of interfaces implemented as Linux and BSD network
1860 devices, including interfaces with type <code>internal</code>,
1861 <code>tap</code>, or <code>system</code> plus the different types
1862 of tunnel ports, are limited to 15 bytes. Windows limits these
1863 names to 255 bytes.
1864 </li>
1865
1866 <li>
1867 The names of patch ports are not used in the underlying datapath,
1868 so operating system restrictions do not apply. Thus, they may have
1869 arbitrary length.
1870 </li>
1871 </ul>
1872
1873 <p>
1874 Regardless of other restrictions, OpenFlow only supports 15-byte
1875 names, which means that <code>ovs-ofctl</code> and OpenFlow
1876 controllers will show names truncated to 15 bytes.
1877 </p>
1878 </column>
1879
1880 <column name="ifindex">
1881 A positive interface index as defined for SNMP MIB-II in RFCs 1213 and
1882 2863, if the interface has one, otherwise 0. The ifindex is useful for
1883 seamless integration with protocols such as SNMP and sFlow.
1884 </column>
1885
1886 <column name="mac_in_use">
1887 The MAC address in use by this interface.
1888 </column>
1889
1890 <column name="mac">
1891 <p>Ethernet address to set for this interface. If unset then the
1892 default MAC address is used:</p>
1893 <ul>
1894 <li>For the local interface, the default is the lowest-numbered MAC
1895 address among the other bridge ports, either the value of the
1896 <ref table="Port" column="mac"/> in its <ref table="Port"/> record,
1897 if set, or its actual MAC (for bonded ports, the MAC of its slave
1898 whose name is first in alphabetical order). Internal ports and
1899 bridge ports that are used as port mirroring destinations (see the
1900 <ref table="Mirror"/> table) are ignored.</li>
1901 <li>For other internal interfaces, the default MAC is randomly
1902 generated.</li>
1903 <li>External interfaces typically have a MAC address associated with
1904 their hardware.</li>
1905 </ul>
1906 <p>Some interfaces may not have a software-controllable MAC
1907 address.</p>
1908 </column>
1909
1910 <column name="error">
1911 If the configuration of the port failed, as indicated by -1 in <ref
1912 column="ofport"/>, Open vSwitch sets this column to an error
1913 description in human readable form. Otherwise, Open vSwitch clears
1914 this column.
1915 </column>
1916
1917 <group title="OpenFlow Port Number">
1918 <p>
1919 When a client adds a new interface, Open vSwitch chooses an OpenFlow
1920 port number for the new port. If the client that adds the port fills
1921 in <ref column="ofport_request"/>, then Open vSwitch tries to use its
1922 value as the OpenFlow port number. Otherwise, or if the requested
1923 port number is already in use or cannot be used for another reason,
1924 Open vSwitch automatically assigns a free port number. Regardless of
1925 how the port number was obtained, Open vSwitch then reports in <ref
1926 column="ofport"/> the port number actually assigned.
1927 </p>
1928
1929 <p>
1930 Open vSwitch limits the port numbers that it automatically assigns to
1931 the range 1 through 32,767, inclusive. Controllers therefore have
1932 free use of ports 32,768 and up.
1933 </p>
1934
1935 <column name="ofport">
1936 <p>
1937 OpenFlow port number for this interface. Open vSwitch sets this
1938 column's value, so other clients should treat it as read-only.
1939 </p>
1940 <p>
1941 The OpenFlow ``local'' port (<code>OFPP_LOCAL</code>) is 65,534.
1942 The other valid port numbers are in the range 1 to 65,279,
1943 inclusive. Value -1 indicates an error adding the interface.
1944 </p>
1945 </column>
1946
1947 <column name="ofport_request"
1948 type='{"type": "integer", "minInteger": 1, "maxInteger": 65279}'>
1949 <p>
1950 Requested OpenFlow port number for this interface.
1951 </p>
1952
1953 <p>
1954 A client should ideally set this column's value in the same
1955 database transaction that it uses to create the interface. Open
1956 vSwitch version 2.1 and later will honor a later request for a
1957 specific port number, althuogh it might confuse some controllers:
1958 OpenFlow does not have a way to announce a port number change, so
1959 Open vSwitch represents it over OpenFlow as a port deletion
1960 followed immediately by a port addition.
1961 </p>
1962
1963 <p>
1964 If <ref column="ofport_request"/> is set or changed to some other
1965 port's automatically assigned port number, Open vSwitch chooses a
1966 new port number for the latter port.
1967 </p>
1968 </column>
1969 </group>
1970 </group>
1971
1972 <group title="System-Specific Details">
1973 <column name="type">
1974 <p>
1975 The interface type. The types supported by a particular instance of
1976 Open vSwitch are listed in the <ref table="Open_vSwitch"
1977 column="iface_types"/> column in the <ref table="Open_vSwitch"/>
1978 table. The following types are defined:
1979 </p>
1980
1981 <dl>
1982 <dt><code>system</code></dt>
1983 <dd>An ordinary network device, e.g. <code>eth0</code> on Linux.
1984 Sometimes referred to as ``external interfaces'' since they are
1985 generally connected to hardware external to that on which the Open
1986 vSwitch is running. The empty string is a synonym for
1987 <code>system</code>.</dd>
1988
1989 <dt><code>internal</code></dt>
1990 <dd>A simulated network device that sends and receives traffic. An
1991 internal interface whose <ref column="name"/> is the same as its
1992 bridge's <ref table="Open_vSwitch" column="name"/> is called the
1993 ``local interface.'' It does not make sense to bond an internal
1994 interface, so the terms ``port'' and ``interface'' are often used
1995 imprecisely for internal interfaces.</dd>
1996
1997 <dt><code>tap</code></dt>
1998 <dd>A TUN/TAP device managed by Open vSwitch.</dd>
1999
2000 <dt><code>geneve</code></dt>
2001 <dd>
2002 An Ethernet over Geneve (<code>http://tools.ietf.org/html/draft-ietf-nvo3-geneve</code>)
2003 IPv4/IPv6 tunnel.
2004
2005 A description of how to match and set Geneve options can be found
2006 in the <code>ovs-ofctl</code> manual page.
2007 </dd>
2008
2009 <dt><code>gre</code></dt>
2010 <dd>
2011 An Ethernet over RFC 2890 Generic Routing Encapsulation over IPv4/IPv6
2012 tunnel.
2013 </dd>
2014
2015 <dt><code>vxlan</code></dt>
2016 <dd>
2017 <p>
2018 An Ethernet tunnel over the UDP-based VXLAN protocol described in
2019 RFC 7348.
2020 </p>
2021 <p>
2022 Open vSwitch uses IANA-assigned UDP destination port 4789. The
2023 source port used for VXLAN traffic varies on a per-flow basis
2024 and is in the ephemeral port range.
2025 </p>
2026 </dd>
2027
2028 <dt><code>lisp</code></dt>
2029 <dd>
2030 <p>
2031 A layer 3 tunnel over the experimental, UDP-based Locator/ID
2032 Separation Protocol (RFC 6830).
2033 </p>
2034 <p>
2035 Only IPv4 and IPv6 packets are supported by the protocol, and
2036 they are sent and received without an Ethernet header. Traffic
2037 to/from LISP ports is expected to be configured explicitly, and
2038 the ports are not intended to participate in learning based
2039 switching. As such, they are always excluded from packet
2040 flooding.
2041 </p>
2042 </dd>
2043
2044 <dt><code>stt</code></dt>
2045 <dd>
2046 The Stateless TCP Tunnel (STT) is particularly useful when tunnel
2047 endpoints are in end-systems, as it utilizes the capabilities of
2048 standard network interface cards to improve performance. STT utilizes
2049 a TCP-like header inside the IP header. It is stateless, i.e., there is
2050 no TCP connection state of any kind associated with the tunnel. The
2051 TCP-like header is used to leverage the capabilities of existing
2052 network interface cards, but should not be interpreted as implying
2053 any sort of connection state between endpoints.
2054 Since the STT protocol does not engage in the usual TCP 3-way handshake,
2055 so it will have difficulty traversing stateful firewalls.
2056 The protocol is documented at
2057 https://tools.ietf.org/html/draft-davie-stt
2058
2059 All traffic uses a default destination port of 7471.
2060 </dd>
2061
2062 <dt><code>patch</code></dt>
2063 <dd>
2064 A pair of virtual devices that act as a patch cable.
2065 </dd>
2066
2067 <dt><code>null</code></dt>
2068 <dd>An ignored interface. Deprecated and slated for removal in
2069 February 2013.</dd>
2070 </dl>
2071 </column>
2072 </group>
2073
2074 <group title="Tunnel Options">
2075 <p>
2076 These options apply to interfaces with <ref column="type"/> of
2077 <code>geneve</code>, <code>gre</code>, <code>vxlan</code>,
2078 <code>lisp</code> and <code>stt</code>.
2079 </p>
2080
2081 <p>
2082 Each tunnel must be uniquely identified by the combination of <ref
2083 column="type"/>, <ref column="options" key="remote_ip"/>, <ref
2084 column="options" key="local_ip"/>, and <ref column="options"
2085 key="in_key"/>. If two ports are defined that are the same except one
2086 has an optional identifier and the other does not, the more specific
2087 one is matched first. <ref column="options" key="in_key"/> is
2088 considered more specific than <ref column="options" key="local_ip"/> if
2089 a port defines one and another port defines the other.
2090 </p>
2091
2092 <column name="options" key="remote_ip">
2093 <p>Required. The remote tunnel endpoint, one of:</p>
2094
2095 <ul>
2096 <li>
2097 An IPv4 or IPv6 address (not a DNS name), e.g. <code>192.168.0.123</code>.
2098 Only unicast endpoints are supported.
2099 </li>
2100 <li>
2101 The word <code>flow</code>. The tunnel accepts packets from any
2102 remote tunnel endpoint. To process only packets from a specific
2103 remote tunnel endpoint, the flow entries may match on the
2104 <code>tun_src</code> or <code>tun_ipv6_src</code>field. When
2105 sending packets to a <code>remote_ip=flow</code> tunnel, the flow
2106 actions must explicitly set the <code>tun_dst</code> or
2107 <code>tun_ipv6_dst</code> field to the IP address of the desired
2108 remote tunnel endpoint, e.g. with a <code>set_field</code> action.
2109 </li>
2110 </ul>
2111
2112 <p>
2113 The remote tunnel endpoint for any packet received from a tunnel
2114 is available in the <code>tun_src</code> field for matching in the
2115 flow table.
2116 </p>
2117 </column>
2118
2119 <column name="options" key="local_ip">
2120 <p>
2121 Optional. The tunnel destination IP that received packets must
2122 match. Default is to match all addresses. If specified, may be one
2123 of:
2124 </p>
2125
2126 <ul>
2127 <li>
2128 An IPv4/IPv6 address (not a DNS name), e.g. <code>192.168.12.3</code>.
2129 </li>
2130 <li>
2131 The word <code>flow</code>. The tunnel accepts packets sent to any
2132 of the local IP addresses of the system running OVS. To process
2133 only packets sent to a specific IP address, the flow entries may
2134 match on the <code>tun_dst</code> or <code>tun_ipv6_dst</code> field.
2135 When sending packets to a <code>local_ip=flow</code> tunnel, the flow
2136 actions may explicitly set the <code>tun_src</code> or <code>tun_ipv6_src</code>
2137 field to the desired IP address, e.g. with a <code>set_field</code> action.
2138 However, while routing the tunneled packet out, the local system may
2139 override the specified address with the local IP address configured for the
2140 outgoing system interface.
2141
2142 <p>
2143 This option is valid only for tunnels also configured with the
2144 <code>remote_ip=flow</code> option.
2145 </p>
2146 </li>
2147 </ul>
2148
2149 <p>
2150 The tunnel destination IP address for any packet received from a
2151 tunnel is available in the <code>tun_dst</code> or <code>tun_ipv6_dst</code>
2152 field for matching in the flow table.
2153 </p>
2154 </column>
2155
2156 <column name="options" key="in_key">
2157 <p>Optional. The key that received packets must contain, one of:</p>
2158
2159 <ul>
2160 <li>
2161 <code>0</code>. The tunnel receives packets with no key or with a
2162 key of 0. This is equivalent to specifying no <ref column="options"
2163 key="in_key"/> at all.
2164 </li>
2165 <li>
2166 A positive 24-bit (for Geneve, VXLAN, and LISP), 32-bit (for GRE)
2167 or 64-bit (for STT) number. The tunnel receives only
2168 packets with the specified key.
2169 </li>
2170 <li>
2171 The word <code>flow</code>. The tunnel accepts packets with any
2172 key. The key will be placed in the <code>tun_id</code> field for
2173 matching in the flow table. The <code>ovs-ofctl</code> manual page
2174 contains additional information about matching fields in OpenFlow
2175 flows.
2176 </li>
2177 </ul>
2178
2179 <p>
2180 </p>
2181 </column>
2182
2183 <column name="options" key="out_key">
2184 <p>Optional. The key to be set on outgoing packets, one of:</p>
2185
2186 <ul>
2187 <li>
2188 <code>0</code>. Packets sent through the tunnel will have no key.
2189 This is equivalent to specifying no <ref column="options"
2190 key="out_key"/> at all.
2191 </li>
2192 <li>
2193 A positive 24-bit (for Geneve, VXLAN and LISP), 32-bit (for GRE) or
2194 64-bit (for STT) number. Packets sent through the tunnel
2195 will have the specified key.
2196 </li>
2197 <li>
2198 The word <code>flow</code>. Packets sent through the tunnel will
2199 have the key set using the <code>set_tunnel</code> Nicira OpenFlow
2200 vendor extension (0 is used in the absence of an action). The
2201 <code>ovs-ofctl</code> manual page contains additional information
2202 about the Nicira OpenFlow vendor extensions.
2203 </li>
2204 </ul>
2205 </column>
2206
2207 <column name="options" key="key">
2208 Optional. Shorthand to set <code>in_key</code> and
2209 <code>out_key</code> at the same time.
2210 </column>
2211
2212 <column name="options" key="tos">
2213 Optional. The value of the ToS bits to be set on the encapsulating
2214 packet. ToS is interpreted as DSCP and ECN bits, ECN part must be
2215 zero. It may also be the word <code>inherit</code>, in which case
2216 the ToS will be copied from the inner packet if it is IPv4 or IPv6
2217 (otherwise it will be 0). The ECN fields are always inherited.
2218 Default is 0.
2219 </column>
2220
2221 <column name="options" key="ttl">
2222 Optional. The TTL to be set on the encapsulating packet. It may also
2223 be the word <code>inherit</code>, in which case the TTL will be copied
2224 from the inner packet if it is IPv4 or IPv6 (otherwise it will be the
2225 system default, typically 64). Default is the system default TTL.
2226 </column>
2227
2228 <column name="options" key="df_default"
2229 type='{"type": "boolean"}'>
2230 Optional. If enabled, the Don't Fragment bit will be set on tunnel
2231 outer headers to allow path MTU discovery. Default is enabled; set
2232 to <code>false</code> to disable.
2233 </column>
2234
2235 <group title="Tunnel Options: vxlan only">
2236
2237 <column name="options" key="exts">
2238 <p>Optional. Comma separated list of optional VXLAN extensions to
2239 enable. The following extensions are supported:</p>
2240
2241 <ul>
2242 <li>
2243 <code>gbp</code>: VXLAN-GBP allows to transport the group policy
2244 context of a packet across the VXLAN tunnel to other network
2245 peers. See the field description of <code>tun_gbp_id</code> and
2246 <code>tun_gbp_flags</code> in ovs-ofctl(8) for additional
2247 information.
2248 (<code>https://tools.ietf.org/html/draft-smith-vxlan-group-policy</code>)
2249 </li>
2250 </ul>
2251 </column>
2252
2253 </group>
2254
2255 <group title="Tunnel Options: gre, geneve, and vxlan">
2256 <p>
2257 <code>gre</code>, <code>geneve</code>, and
2258 <code>vxlan</code> interfaces support these options.
2259 </p>
2260
2261 <column name="options" key="csum" type='{"type": "boolean"}'>
2262 <p>
2263 Optional. Compute encapsulation header (either GRE or UDP)
2264 checksums on outgoing packets. Default is disabled, set to
2265 <code>true</code> to enable. Checksums present on incoming
2266 packets will be validated regardless of this setting.
2267 </p>
2268
2269 <p>
2270 When using the upstream Linux kernel module, computation of
2271 checksums for <code>geneve</code> and <code>vxlan</code> requires
2272 Linux kernel version 4.0 or higher. <code>gre</code> supports
2273 checksums for all versions of Open vSwitch that support GRE.
2274 The out of tree kernel module distributed as part of OVS
2275 can compute all tunnel checksums on any kernel version that it
2276 is compatible with.
2277 </p>
2278
2279 </column>
2280 </group>
2281 </group>
2282
2283 <group title="Patch Options">
2284 <p>
2285 Only <code>patch</code> interfaces support these options.
2286 </p>
2287
2288 <column name="options" key="peer">
2289 The <ref column="name"/> of the <ref table="Interface"/> for the other
2290 side of the patch. The named <ref table="Interface"/>'s own
2291 <code>peer</code> option must specify this <ref table="Interface"/>'s
2292 name. That is, the two patch interfaces must have reversed <ref
2293 column="name"/> and <code>peer</code> values.
2294 </column>
2295 </group>
2296
2297 <group title="PMD (Poll Mode Driver) Options">
2298 <p>
2299 Only PMD netdevs support these options.
2300 </p>
2301
2302 <column name="options" key="n_rxq"
2303 type='{"type": "integer", "minInteger": 1}'>
2304 <p>
2305 Specifies the maximum number of rx queues to be created for PMD
2306 netdev. If not specified or specified to 0, one rx queue will
2307 be created by default.
2308 Not supported by DPDK vHost interfaces.
2309 </p>
2310 </column>
2311
2312 <column name="other_config" key="pmd-rxq-affinity">
2313 <p>Specifies mapping of RX queues of this interface to CPU cores.</p>
2314 <p>Value should be set in the following form:</p>
2315 <p>
2316 <code>other_config:pmd-rxq-affinity=&lt;rxq-affinity-list&gt;</code>
2317 </p>
2318 <p>where</p>
2319 <p>
2320 <ul>
2321 <li>
2322 &lt;rxq-affinity-list&gt; ::= NULL | &lt;non-empty-list&gt;
2323 </li>
2324 <li>
2325 &lt;non-empty-list&gt; ::= &lt;affinity-pair&gt; |
2326 &lt;affinity-pair&gt; , &lt;non-empty-list&gt;
2327 </li>
2328 <li>
2329 &lt;affinity-pair&gt; ::= &lt;queue-id&gt; : &lt;core-id&gt;
2330 </li>
2331 </ul>
2332 </p>
2333 </column>
2334
2335 <column name="options" key="vhost-server-path"
2336 type='{"type": "string"}'>
2337 <p>
2338 The value specifies the path to the socket associated with a vHost
2339 User client mode device that has been or will be created by QEMU.
2340 Only supported by dpdkvhostuserclient interfaces.
2341 </p>
2342 </column>
2343
2344 <column name="options" key="n_rxq_desc"
2345 type='{"type": "integer", "minInteger": 1, "maxInteger": 4096}'>
2346 <p>
2347 Specifies the rx queue size (number rx descriptors) for dpdk ports.
2348 The value must be a power of 2, less than 4096 and supported
2349 by the hardware of the device being configured.
2350 If not specified or an incorrect value is specified, 2048 rx
2351 descriptors will be used by default.
2352 </p>
2353 </column>
2354
2355 <column name="options" key="n_txq_desc"
2356 type='{"type": "integer", "minInteger": 1, "maxInteger": 4096}'>
2357 <p>
2358 Specifies the tx queue size (number tx descriptors) for dpdk ports.
2359 The value must be a power of 2, less than 4096 and supported
2360 by the hardware of the device being configured.
2361 If not specified or an incorrect value is specified, 2048 tx
2362 descriptors will be used by default.
2363 </p>
2364 </column>
2365 </group>
2366
2367 <group title="MTU">
2368 <p>
2369 The MTU (maximum transmission unit) is the largest amount of data
2370 that can fit into a single Ethernet frame. The standard Ethernet
2371 MTU is 1500 bytes. Some physical media and many kinds of virtual
2372 interfaces can be configured with higher MTUs.
2373 </p>
2374
2375 <p>
2376 A client may change an interface MTU by filling in
2377 <ref column="mtu_request"/>. Open vSwitch then reports in
2378 <ref column="mtu"/> the currently configured value.
2379 </p>
2380
2381 <column name="mtu">
2382 <p>
2383 The currently configured MTU for the interface.
2384 </p>
2385
2386 <p>
2387 This column will be empty for an interface that does not
2388 have an MTU as, for example, some kinds of tunnels do not.
2389 </p>
2390
2391 <p>
2392 Open vSwitch sets this column's value, so other clients should treat
2393 it as read-only.
2394 </p>
2395 </column>
2396
2397 <column name="mtu_request"
2398 type='{"type": "integer", "minInteger": 1}'>
2399 <p>
2400 Requested MTU (Maximum Transmission Unit) for the interface. A client
2401 can fill this column to change the MTU of an interface.
2402 </p>
2403
2404 <p>
2405 If this is not set and if the interface has <code>internal</code>
2406 type, Open vSwitch will change the MTU to match the minimum of the
2407 other interfaces in the bridge.
2408 </p>
2409 </column>
2410
2411 </group>
2412
2413 <group title="Interface Status">
2414 <p>
2415 Status information about interfaces attached to bridges, updated every
2416 5 seconds. Not all interfaces have all of these properties; virtual
2417 interfaces don't have a link speed, for example. Non-applicable
2418 columns will have empty values.
2419 </p>
2420 <column name="admin_state">
2421 <p>
2422 The administrative state of the physical network link.
2423 </p>
2424 </column>
2425
2426 <column name="link_state">
2427 <p>
2428 The observed state of the physical network link. This is ordinarily
2429 the link's carrier status. If the interface's <ref table="Port"/> is
2430 a bond configured for miimon monitoring, it is instead the network
2431 link's miimon status.
2432 </p>
2433 </column>
2434
2435 <column name="link_resets">
2436 <p>
2437 The number of times Open vSwitch has observed the
2438 <ref column="link_state"/> of this <ref table="Interface"/> change.
2439 </p>
2440 </column>
2441
2442 <column name="link_speed">
2443 <p>
2444 The negotiated speed of the physical network link.
2445 Valid values are positive integers greater than 0.
2446 </p>
2447 </column>
2448
2449 <column name="duplex">
2450 <p>
2451 The duplex mode of the physical network link.
2452 </p>
2453 </column>
2454
2455 <column name="lacp_current">
2456 Boolean value indicating LACP status for this interface. If true, this
2457 interface has current LACP information about its LACP partner. This
2458 information may be used to monitor the health of interfaces in a LACP
2459 enabled port. This column will be empty if LACP is not enabled.
2460 </column>
2461
2462 <column name="status">
2463 Key-value pairs that report port status. Supported status values are
2464 <ref column="type"/>-dependent; some interfaces may not have a valid
2465 <ref column="status" key="driver_name"/>, for example.
2466 </column>
2467
2468 <column name="status" key="driver_name">
2469 The name of the device driver controlling the network adapter.
2470 </column>
2471
2472 <column name="status" key="driver_version">
2473 The version string of the device driver controlling the network
2474 adapter.
2475 </column>
2476
2477 <column name="status" key="firmware_version">
2478 The version string of the network adapter's firmware, if available.
2479 </column>
2480
2481 <column name="status" key="source_ip">
2482 The source IP address used for an IPv4/IPv6 tunnel end-point, such as
2483 <code>gre</code>.
2484 </column>
2485
2486 <column name="status" key="tunnel_egress_iface">
2487 Egress interface for tunnels. Currently only relevant for tunnels
2488 on Linux systems, this column will show the name of the interface
2489 which is responsible for routing traffic destined for the configured
2490 <ref column="options" key="remote_ip"/>. This could be an internal
2491 interface such as a bridge port.
2492 </column>
2493
2494 <column name="status" key="tunnel_egress_iface_carrier"
2495 type='{"type": "string", "enum": ["set", ["down", "up"]]}'>
2496 Whether carrier is detected on <ref column="status"
2497 key="tunnel_egress_iface"/>.
2498 </column>
2499 </group>
2500
2501 <group title="Statistics">
2502 <p>
2503 Key-value pairs that report interface statistics. The current
2504 implementation updates these counters periodically. The update period
2505 is controlled by <ref column="other_config"
2506 key="stats-update-interval"/> in the <code>Open_vSwitch</code> table.
2507 Future implementations may update them when an interface is created,
2508 when they are queried (e.g. using an OVSDB <code>select</code>
2509 operation), and just before an interface is deleted due to virtual
2510 interface hot-unplug or VM shutdown, and perhaps at other times, but
2511 not on any regular periodic basis.
2512 </p>
2513 <p>
2514 These are the same statistics reported by OpenFlow in its <code>struct
2515 ofp_port_stats</code> structure. If an interface does not support a
2516 given statistic, then that pair is omitted.
2517 </p>
2518 <group title="Statistics: Successful transmit and receive counters">
2519 <column name="statistics" key="rx_packets">
2520 Number of received packets.
2521 </column>
2522 <column name="statistics" key="rx_bytes">
2523 Number of received bytes.
2524 </column>
2525 <column name="statistics" key="tx_packets">
2526 Number of transmitted packets.
2527 </column>
2528 <column name="statistics" key="tx_bytes">
2529 Number of transmitted bytes.
2530 </column>
2531 </group>
2532 <group title="Statistics: Receive errors">
2533 <column name="statistics" key="rx_dropped">
2534 Number of packets dropped by RX.
2535 </column>
2536 <column name="statistics" key="rx_frame_err">
2537 Number of frame alignment errors.
2538 </column>
2539 <column name="statistics" key="rx_over_err">
2540 Number of packets with RX overrun.
2541 </column>
2542 <column name="statistics" key="rx_crc_err">
2543 Number of CRC errors.
2544 </column>
2545 <column name="statistics" key="rx_errors">
2546 Total number of receive errors, greater than or equal to the sum of
2547 the above.
2548 </column>
2549 </group>
2550 <group title="Statistics: Transmit errors">
2551 <column name="statistics" key="tx_dropped">
2552 Number of packets dropped by TX.
2553 </column>
2554 <column name="statistics" key="collisions">
2555 Number of collisions.
2556 </column>
2557 <column name="statistics" key="tx_errors">
2558 Total number of transmit errors, greater than or equal to the sum of
2559 the above.
2560 </column>
2561 </group>
2562 </group>
2563
2564 <group title="Ingress Policing">
2565 <p>
2566 These settings control ingress policing for packets received on this
2567 interface. On a physical interface, this limits the rate at which
2568 traffic is allowed into the system from the outside; on a virtual
2569 interface (one connected to a virtual machine), this limits the rate at
2570 which the VM is able to transmit.
2571 </p>
2572 <p>
2573 Policing is a simple form of quality-of-service that simply drops
2574 packets received in excess of the configured rate. Due to its
2575 simplicity, policing is usually less accurate and less effective than
2576 egress QoS (which is configured using the <ref table="QoS"/> and <ref
2577 table="Queue"/> tables).
2578 </p>
2579 <p>
2580 Policing is currently implemented on Linux and OVS with DPDK. Both
2581 implementations use a simple ``token bucket'' approach:
2582 </p>
2583 <ul>
2584 <li>
2585 The size of the bucket corresponds to <ref
2586 column="ingress_policing_burst"/>. Initially the bucket is full.
2587 </li>
2588 <li>
2589 Whenever a packet is received, its size (converted to tokens) is
2590 compared to the number of tokens currently in the bucket. If the
2591 required number of tokens are available, they are removed and the
2592 packet is forwarded. Otherwise, the packet is dropped.
2593 </li>
2594 <li>
2595 Whenever it is not full, the bucket is refilled with tokens at the
2596 rate specified by <ref column="ingress_policing_rate"/>.
2597 </li>
2598 </ul>
2599 <p>
2600 Policing interacts badly with some network protocols, and especially
2601 with fragmented IP packets. Suppose that there is enough network
2602 activity to keep the bucket nearly empty all the time. Then this token
2603 bucket algorithm will forward a single packet every so often, with the
2604 period depending on packet size and on the configured rate. All of the
2605 fragments of an IP packets are normally transmitted back-to-back, as a
2606 group. In such a situation, therefore, only one of these fragments
2607 will be forwarded and the rest will be dropped. IP does not provide
2608 any way for the intended recipient to ask for only the remaining
2609 fragments. In such a case there are two likely possibilities for what
2610 will happen next: either all of the fragments will eventually be
2611 retransmitted (as TCP will do), in which case the same problem will
2612 recur, or the sender will not realize that its packet has been dropped
2613 and data will simply be lost (as some UDP-based protocols will do).
2614 Either way, it is possible that no forward progress will ever occur.
2615 </p>
2616 <column name="ingress_policing_rate">
2617 <p>
2618 Maximum rate for data received on this interface, in kbps. Data
2619 received faster than this rate is dropped. Set to <code>0</code>
2620 (the default) to disable policing.
2621 </p>
2622 </column>
2623
2624 <column name="ingress_policing_burst">
2625 <p>Maximum burst size for data received on this interface, in kb. The
2626 default burst size if set to <code>0</code> is 8000 kbit. This value
2627 has no effect if <ref column="ingress_policing_rate"/>
2628 is <code>0</code>.</p>
2629 <p>
2630 Specifying a larger burst size lets the algorithm be more forgiving,
2631 which is important for protocols like TCP that react severely to
2632 dropped packets. The burst size should be at least the size of the
2633 interface's MTU. Specifying a value that is numerically at least as
2634 large as 80% of <ref column="ingress_policing_rate"/> helps TCP come
2635 closer to achieving the full rate.
2636 </p>
2637 </column>
2638 </group>
2639
2640 <group title="Bidirectional Forwarding Detection (BFD)">
2641 <p>
2642 BFD, defined in RFC 5880 and RFC 5881, allows point-to-point
2643 detection of connectivity failures by occasional transmission of
2644 BFD control messages. Open vSwitch implements BFD to serve
2645 as a more popular and standards compliant alternative to CFM.
2646 </p>
2647
2648 <p>
2649 BFD operates by regularly transmitting BFD control messages at a rate
2650 negotiated independently in each direction. Each endpoint specifies
2651 the rate at which it expects to receive control messages, and the rate
2652 at which it is willing to transmit them. Open vSwitch uses a detection
2653 multiplier of three, meaning that an endpoint signals a connectivity
2654 fault if three consecutive BFD control messages fail to arrive. In the
2655 case of a unidirectional connectivity issue, the system not receiving
2656 BFD control messages signals the problem to its peer in the messages it
2657 transmits.
2658 </p>
2659
2660 <p>
2661 The Open vSwitch implementation of BFD aims to comply faithfully
2662 with RFC 5880 requirements. Open vSwitch does not implement the
2663 optional Authentication or ``Echo Mode'' features.
2664 </p>
2665
2666 <group title="BFD Configuration">
2667 <p>
2668 A controller sets up key-value pairs in the <ref column="bfd"/>
2669 column to enable and configure BFD.
2670 </p>
2671
2672 <column name="bfd" key="enable" type='{"type": "boolean"}'>
2673 True to enable BFD on this <ref table="Interface"/>. If not
2674 specified, BFD will not be enabled by default.
2675 </column>
2676
2677 <column name="bfd" key="min_rx"
2678 type='{"type": "integer", "minInteger": 1}'>
2679 The shortest interval, in milliseconds, at which this BFD session
2680 offers to receive BFD control messages. The remote endpoint may
2681 choose to send messages at a slower rate. Defaults to
2682 <code>1000</code>.
2683 </column>
2684
2685 <column name="bfd" key="min_tx"
2686 type='{"type": "integer", "minInteger": 1}'>
2687 The shortest interval, in milliseconds, at which this BFD session is
2688 willing to transmit BFD control messages. Messages will actually be
2689 transmitted at a slower rate if the remote endpoint is not willing to
2690 receive as quickly as specified. Defaults to <code>100</code>.
2691 </column>
2692
2693 <column name="bfd" key="decay_min_rx" type='{"type": "integer"}'>
2694 An alternate receive interval, in milliseconds, that must be greater
2695 than or equal to <ref column="bfd" key="min_rx"/>. The
2696 implementation switches from <ref column="bfd" key="min_rx"/> to <ref
2697 column="bfd" key="decay_min_rx"/> when there is no obvious incoming
2698 data traffic at the interface, to reduce the CPU and bandwidth cost
2699 of monitoring an idle interface. This feature may be disabled by
2700 setting a value of 0. This feature is reset whenever <ref
2701 column="bfd" key="decay_min_rx"/> or <ref column="bfd" key="min_rx"/>
2702 changes.
2703 </column>
2704
2705 <column name="bfd" key="forwarding_if_rx" type='{"type": "boolean"}'>
2706 When <code>true</code>, traffic received on the
2707 <ref table="Interface"/> is used to indicate the capability of packet
2708 I/O. BFD control packets are still transmitted and received. At
2709 least one BFD control packet must be received every 100 * <ref
2710 column="bfd" key="min_rx"/> amount of time. Otherwise, even if
2711 traffic are received, the <ref column="bfd" key="forwarding"/>
2712 will be <code>false</code>.
2713 </column>
2714
2715 <column name="bfd" key="cpath_down" type='{"type": "boolean"}'>
2716 Set to true to notify the remote endpoint that traffic should not be
2717 forwarded to this system for some reason other than a connectivty
2718 failure on the interface being monitored. The typical underlying
2719 reason is ``concatenated path down,'' that is, that connectivity
2720 beyond the local system is down. Defaults to false.
2721 </column>
2722
2723 <column name="bfd" key="check_tnl_key" type='{"type": "boolean"}'>
2724 Set to true to make BFD accept only control messages with a tunnel
2725 key of zero. By default, BFD accepts control messages with any
2726 tunnel key.
2727 </column>
2728
2729 <column name="bfd" key="bfd_local_src_mac">
2730 Set to an Ethernet address in the form
2731 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>
2732 to set the MAC used as source for transmitted BFD packets. The
2733 default is the mac address of the BFD enabled interface.
2734 </column>
2735
2736 <column name="bfd" key="bfd_local_dst_mac">
2737 Set to an Ethernet address in the form
2738 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>
2739 to set the MAC used as destination for transmitted BFD packets. The
2740 default is <code>00:23:20:00:00:01</code>.
2741 </column>
2742
2743 <column name="bfd" key="bfd_remote_dst_mac">
2744 Set to an Ethernet address in the form
2745 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>
2746 to set the MAC used for checking the destination of received BFD packets.
2747 Packets with different destination MAC will not be considered as BFD packets.
2748 If not specified the destination MAC address of received BFD packets
2749 are not checked.
2750 </column>
2751
2752 <column name="bfd" key="bfd_src_ip">
2753 Set to an IPv4 address to set the IP address used as source for
2754 transmitted BFD packets. The default is <code>169.254.1.1</code>.
2755 </column>
2756
2757 <column name="bfd" key="bfd_dst_ip">
2758 Set to an IPv4 address to set the IP address used as destination
2759 for transmitted BFD packets. The default is <code>169.254.1.0</code>.
2760 </column>
2761
2762 <column name="bfd" key="oam">
2763 Some tunnel protocols (such as Geneve) include a bit in the header
2764 to indicate that the encapsulated packet is an OAM frame. By setting
2765 this to true, BFD packets will be marked as OAM if encapsulated in
2766 one of these tunnels.
2767 </column>
2768 </group>
2769
2770 <group title="BFD Status">
2771 <p>
2772 The switch sets key-value pairs in the <ref column="bfd_status"/>
2773 column to report the status of BFD on this interface. When BFD is
2774 not enabled, with <ref column="bfd" key="enable"/>, the switch clears
2775 all key-value pairs from <ref column="bfd_status"/>.
2776 </p>
2777
2778 <column name="bfd_status" key="state"
2779 type='{"type": "string",
2780 "enum": ["set", ["admin_down", "down", "init", "up"]]}'>
2781 Reports the state of the BFD session. The BFD session is fully
2782 healthy and negotiated if <code>UP</code>.
2783 </column>
2784
2785 <column name="bfd_status" key="forwarding" type='{"type": "boolean"}'>
2786 Reports whether the BFD session believes this <ref
2787 table="Interface"/> may be used to forward traffic. Typically this
2788 means the local session is signaling <code>UP</code>, and the remote
2789 system isn't signaling a problem such as concatenated path down.
2790 </column>
2791
2792 <column name="bfd_status" key="diagnostic">
2793 A diagnostic code specifying the local system's reason for the
2794 last change in session state. The error messages are defined in
2795 section 4.1 of [RFC 5880].
2796 </column>
2797
2798 <column name="bfd_status" key="remote_state"
2799 type='{"type": "string",
2800 "enum": ["set", ["admin_down", "down", "init", "up"]]}'>
2801 Reports the state of the remote endpoint's BFD session.
2802 </column>
2803
2804 <column name="bfd_status" key="remote_diagnostic">
2805 A diagnostic code specifying the remote system's reason for the
2806 last change in session state. The error messages are defined in
2807 section 4.1 of [RFC 5880].
2808 </column>
2809
2810 <column name="bfd_status" key="flap_count"
2811 type='{"type": "integer", "minInteger": 0}'>
2812 Counts the number of <ref column="bfd_status" key="forwarding" />
2813 flaps since start. A flap is considered as a change of the
2814 <ref column="bfd_status" key="forwarding" /> value.
2815 </column>
2816 </group>
2817 </group>
2818
2819 <group title="Connectivity Fault Management">
2820 <p>
2821 802.1ag Connectivity Fault Management (CFM) allows a group of
2822 Maintenance Points (MPs) called a Maintenance Association (MA) to
2823 detect connectivity problems with each other. MPs within a MA should
2824 have complete and exclusive interconnectivity. This is verified by
2825 occasionally broadcasting Continuity Check Messages (CCMs) at a
2826 configurable transmission interval.
2827 </p>
2828
2829 <p>
2830 According to the 802.1ag specification, each Maintenance Point should
2831 be configured out-of-band with a list of Remote Maintenance Points it
2832 should have connectivity to. Open vSwitch differs from the
2833 specification in this area. It simply assumes the link is faulted if
2834 no Remote Maintenance Points are reachable, and considers it not
2835 faulted otherwise.
2836 </p>
2837
2838 <p>
2839 When operating over tunnels which have no <code>in_key</code>, or an
2840 <code>in_key</code> of <code>flow</code>. CFM will only accept CCMs
2841 with a tunnel key of zero.
2842 </p>
2843
2844 <column name="cfm_mpid">
2845 <p>
2846 A Maintenance Point ID (MPID) uniquely identifies each endpoint
2847 within a Maintenance Association. The MPID is used to identify this
2848 endpoint to other Maintenance Points in the MA. Each end of a link
2849 being monitored should have a different MPID. Must be configured to
2850 enable CFM on this <ref table="Interface"/>.
2851 </p>
2852 <p>
2853 According to the 802.1ag specification, MPIDs can only range between
2854 [1, 8191]. However, extended mode (see <ref column="other_config"
2855 key="cfm_extended"/>) supports eight byte MPIDs.
2856 </p>
2857 </column>
2858
2859 <column name="cfm_flap_count">
2860 Counts the number of cfm fault flapps since boot. A flap is
2861 considered to be a change of the <ref column="cfm_fault"/> value.
2862 </column>
2863
2864 <column name="cfm_fault">
2865 <p>
2866 Indicates a connectivity fault triggered by an inability to receive
2867 heartbeats from any remote endpoint. When a fault is triggered on
2868 <ref table="Interface"/>s participating in bonds, they will be
2869 disabled.
2870 </p>
2871 <p>
2872 Faults can be triggered for several reasons. Most importantly they
2873 are triggered when no CCMs are received for a period of 3.5 times the
2874 transmission interval. Faults are also triggered when any CCMs
2875 indicate that a Remote Maintenance Point is not receiving CCMs but
2876 able to send them. Finally, a fault is triggered if a CCM is
2877 received which indicates unexpected configuration. Notably, this
2878 case arises when a CCM is received which advertises the local MPID.
2879 </p>
2880 </column>
2881
2882 <column name="cfm_fault_status" key="recv">
2883 Indicates a CFM fault was triggered due to a lack of CCMs received on
2884 the <ref table="Interface"/>.
2885 </column>
2886
2887 <column name="cfm_fault_status" key="rdi">
2888 Indicates a CFM fault was triggered due to the reception of a CCM with
2889 the RDI bit flagged. Endpoints set the RDI bit in their CCMs when they
2890 are not receiving CCMs themselves. This typically indicates a
2891 unidirectional connectivity failure.
2892 </column>
2893
2894 <column name="cfm_fault_status" key="maid">
2895 Indicates a CFM fault was triggered due to the reception of a CCM with
2896 a MAID other than the one Open vSwitch uses. CFM broadcasts are tagged
2897 with an identification number in addition to the MPID called the MAID.
2898 Open vSwitch only supports receiving CCM broadcasts tagged with the
2899 MAID it uses internally.
2900 </column>
2901
2902 <column name="cfm_fault_status" key="loopback">
2903 Indicates a CFM fault was triggered due to the reception of a CCM
2904 advertising the same MPID configured in the <ref column="cfm_mpid"/>
2905 column of this <ref table="Interface"/>. This may indicate a loop in
2906 the network.
2907 </column>
2908
2909 <column name="cfm_fault_status" key="overflow">
2910 Indicates a CFM fault was triggered because the CFM module received
2911 CCMs from more remote endpoints than it can keep track of.
2912 </column>
2913
2914 <column name="cfm_fault_status" key="override">
2915 Indicates a CFM fault was manually triggered by an administrator using
2916 an <code>ovs-appctl</code> command.
2917 </column>
2918
2919 <column name="cfm_fault_status" key="interval">
2920 Indicates a CFM fault was triggered due to the reception of a CCM
2921 frame having an invalid interval.
2922 </column>
2923
2924 <column name="cfm_remote_opstate">
2925 <p>When in extended mode, indicates the operational state of the
2926 remote endpoint as either <code>up</code> or <code>down</code>. See
2927 <ref column="other_config" key="cfm_opstate"/>.
2928 </p>
2929 </column>
2930
2931 <column name="cfm_health">
2932 <p>
2933 Indicates the health of the interface as a percentage of CCM frames
2934 received over 21 <ref column="other_config" key="cfm_interval"/>s.
2935 The health of an interface is undefined if it is communicating with
2936 more than one <ref column="cfm_remote_mpids"/>. It reduces if
2937 healthy heartbeats are not received at the expected rate, and
2938 gradually improves as healthy heartbeats are received at the desired
2939 rate. Every 21 <ref column="other_config" key="cfm_interval"/>s, the
2940 health of the interface is refreshed.
2941 </p>
2942 <p>
2943 As mentioned above, the faults can be triggered for several reasons.
2944 The link health will deteriorate even if heartbeats are received but
2945 they are reported to be unhealthy. An unhealthy heartbeat in this
2946 context is a heartbeat for which either some fault is set or is out
2947 of sequence. The interface health can be 100 only on receiving
2948 healthy heartbeats at the desired rate.
2949 </p>
2950 </column>
2951
2952 <column name="cfm_remote_mpids">
2953 When CFM is properly configured, Open vSwitch will occasionally
2954 receive CCM broadcasts. These broadcasts contain the MPID of the
2955 sending Maintenance Point. The list of MPIDs from which this
2956 <ref table="Interface"/> is receiving broadcasts from is regularly
2957 collected and written to this column.
2958 </column>
2959
2960 <column name="other_config" key="cfm_interval"
2961 type='{"type": "integer"}'>
2962 <p>
2963 The interval, in milliseconds, between transmissions of CFM
2964 heartbeats. Three missed heartbeat receptions indicate a
2965 connectivity fault.
2966 </p>
2967
2968 <p>
2969 In standard operation only intervals of 3, 10, 100, 1,000, 10,000,
2970 60,000, or 600,000 ms are supported. Other values will be rounded
2971 down to the nearest value on the list. Extended mode (see <ref
2972 column="other_config" key="cfm_extended"/>) supports any interval up
2973 to 65,535 ms. In either mode, the default is 1000 ms.
2974 </p>
2975
2976 <p>We do not recommend using intervals less than 100 ms.</p>
2977 </column>
2978
2979 <column name="other_config" key="cfm_extended"
2980 type='{"type": "boolean"}'>
2981 When <code>true</code>, the CFM module operates in extended mode. This
2982 causes it to use a nonstandard destination address to avoid conflicting
2983 with compliant implementations which may be running concurrently on the
2984 network. Furthermore, extended mode increases the accuracy of the
2985 <code>cfm_interval</code> configuration parameter by breaking wire
2986 compatibility with 802.1ag compliant implementations. And extended
2987 mode allows eight byte MPIDs. Defaults to <code>false</code>.
2988 </column>
2989
2990 <column name="other_config" key="cfm_demand" type='{"type": "boolean"}'>
2991 <p>
2992 When <code>true</code>, and
2993 <ref column="other_config" key="cfm_extended"/> is true, the CFM
2994 module operates in demand mode. When in demand mode, traffic
2995 received on the <ref table="Interface"/> is used to indicate
2996 liveness. CCMs are still transmitted and received. At least one
2997 CCM must be received every 100 * <ref column="other_config"
2998 key="cfm_interval"/> amount of time. Otherwise, even if traffic
2999 are received, the CFM module will raise the connectivity fault.
3000 </p>
3001
3002 <p>
3003 Demand mode has a couple of caveats:
3004 <ul>
3005 <li>
3006 To ensure that ovs-vswitchd has enough time to pull statistics
3007 from the datapath, the fault detection interval is set to
3008 3.5 * MAX(<ref column="other_config" key="cfm_interval"/>, 500)
3009 ms.
3010 </li>
3011
3012 <li>
3013 To avoid ambiguity, demand mode disables itself when there are
3014 multiple remote maintenance points.
3015 </li>
3016
3017 <li>
3018 If the <ref table="Interface"/> is heavily congested, CCMs
3019 containing the <ref column="other_config" key="cfm_opstate"/>
3020 status may be dropped causing changes in the operational state to
3021 be delayed. Similarly, if CCMs containing the RDI bit are not
3022 received, unidirectional link failures may not be detected.
3023 </li>
3024 </ul>
3025 </p>
3026 </column>
3027
3028 <column name="other_config" key="cfm_opstate"
3029 type='{"type": "string", "enum": ["set", ["down", "up"]]}'>
3030 When <code>down</code>, the CFM module marks all CCMs it generates as
3031 operationally down without triggering a fault. This allows remote
3032 maintenance points to choose not to forward traffic to the
3033 <ref table="Interface"/> on which this CFM module is running.
3034 Currently, in Open vSwitch, the opdown bit of CCMs affects
3035 <ref table="Interface"/>s participating in bonds, and the bundle
3036 OpenFlow action. This setting is ignored when CFM is not in extended
3037 mode. Defaults to <code>up</code>.
3038 </column>
3039
3040 <column name="other_config" key="cfm_ccm_vlan"
3041 type='{"type": "integer", "minInteger": 1, "maxInteger": 4095}'>
3042 When set, the CFM module will apply a VLAN tag to all CCMs it generates
3043 with the given value. May be the string <code>random</code> in which
3044 case each CCM will be tagged with a different randomly generated VLAN.
3045 </column>
3046
3047 <column name="other_config" key="cfm_ccm_pcp"
3048 type='{"type": "integer", "minInteger": 1, "maxInteger": 7}'>
3049 When set, the CFM module will apply a VLAN tag to all CCMs it generates
3050 with the given PCP value, the VLAN ID of the tag is governed by the
3051 value of <ref column="other_config" key="cfm_ccm_vlan"/>. If
3052 <ref column="other_config" key="cfm_ccm_vlan"/> is unset, a VLAN ID of
3053 zero is used.
3054 </column>
3055
3056 </group>
3057
3058 <group title="Bonding Configuration">
3059 <column name="other_config" key="lacp-port-id"
3060 type='{"type": "integer", "minInteger": 1, "maxInteger": 65535}'>
3061 The LACP port ID of this <ref table="Interface"/>. Port IDs are
3062 used in LACP negotiations to identify individual ports
3063 participating in a bond.
3064 </column>
3065
3066 <column name="other_config" key="lacp-port-priority"
3067 type='{"type": "integer", "minInteger": 1, "maxInteger": 65535}'>
3068 The LACP port priority of this <ref table="Interface"/>. In LACP
3069 negotiations <ref table="Interface"/>s with numerically lower
3070 priorities are preferred for aggregation.
3071 </column>
3072
3073 <column name="other_config" key="lacp-aggregation-key"
3074 type='{"type": "integer", "minInteger": 1, "maxInteger": 65535}'>
3075 The LACP aggregation key of this <ref table="Interface"/>. <ref
3076 table="Interface"/>s with different aggregation keys may not be active
3077 within a given <ref table="Port"/> at the same time.
3078 </column>
3079 </group>
3080
3081 <group title="Virtual Machine Identifiers">
3082 <p>
3083 These key-value pairs specifically apply to an interface that
3084 represents a virtual Ethernet interface connected to a virtual
3085 machine. These key-value pairs should not be present for other types
3086 of interfaces. Keys whose names end in <code>-uuid</code> have
3087 values that uniquely identify the entity in question. For a Citrix
3088 XenServer hypervisor, these values are UUIDs in RFC 4122 format.
3089 Other hypervisors may use other formats.
3090 </p>
3091
3092 <column name="external_ids" key="attached-mac">
3093 The MAC address programmed into the ``virtual hardware'' for this
3094 interface, in the form
3095 <var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>.
3096 For Citrix XenServer, this is the value of the <code>MAC</code> field
3097 in the VIF record for this interface.
3098 </column>
3099
3100 <column name="external_ids" key="iface-id">
3101 A system-unique identifier for the interface. On XenServer, this will
3102 commonly be the same as <ref column="external_ids" key="xs-vif-uuid"/>.
3103 </column>
3104
3105 <column name="external_ids" key="iface-status"
3106 type='{"type": "string",
3107 "enum": ["set", ["active", "inactive"]]}'>
3108 <p>
3109 Hypervisors may sometimes have more than one interface associated
3110 with a given <ref column="external_ids" key="iface-id"/>, only one of
3111 which is actually in use at a given time. For example, in some
3112 circumstances XenServer has both a ``tap'' and a ``vif'' interface
3113 for a single <ref column="external_ids" key="iface-id"/>, but only
3114 uses one of them at a time. A hypervisor that behaves this way must
3115 mark the currently in use interface <code>active</code> and the
3116 others <code>inactive</code>. A hypervisor that never has more than
3117 one interface for a given <ref column="external_ids" key="iface-id"/>
3118 may mark that interface <code>active</code> or omit <ref
3119 column="external_ids" key="iface-status"/> entirely.
3120 </p>
3121
3122 <p>
3123 During VM migration, a given <ref column="external_ids"
3124 key="iface-id"/> might transiently be marked <code>active</code> on
3125 two different hypervisors. That is, <code>active</code> means that
3126 this <ref column="external_ids" key="iface-id"/> is the active
3127 instance within a single hypervisor, not in a broader scope.
3128 There is one exception: some hypervisors support ``migration'' from a
3129 given hypervisor to itself (most often for test purposes). During
3130 such a ``migration,'' two instances of a single <ref
3131 column="external_ids" key="iface-id"/> might both be briefly marked
3132 <code>active</code> on a single hypervisor.
3133 </p>
3134 </column>
3135
3136 <column name="external_ids" key="xs-vif-uuid">
3137 The virtual interface associated with this interface.
3138 </column>
3139
3140 <column name="external_ids" key="xs-network-uuid">
3141 The virtual network to which this interface is attached.
3142 </column>
3143
3144 <column name="external_ids" key="vm-id">
3145 The VM to which this interface belongs. On XenServer, this will be the
3146 same as <ref column="external_ids" key="xs-vm-uuid"/>.
3147 </column>
3148
3149 <column name="external_ids" key="xs-vm-uuid">
3150 The VM to which this interface belongs.
3151 </column>
3152 </group>
3153
3154 <group title="Auto Attach Configuration">
3155 <p>
3156 Auto Attach configuration for a particular interface.
3157 </p>
3158
3159 <column name="lldp" key="enable" type='{"type": "boolean"}'>
3160 True to enable LLDP on this <ref table="Interface"/>. If not
3161 specified, LLDP will be disabled by default.
3162 </column>
3163 </group>
3164
3165 <group title="Flow control Configuration">
3166 <p>
3167 Ethernet flow control defined in IEEE 802.1Qbb provides link level flow
3168 control using MAC pause frames. Implemented only for interfaces with
3169 type <code>dpdk</code>.
3170 </p>
3171
3172 <column name="options" key="rx-flow-ctrl" type='{"type": "boolean"}'>
3173 Set to <code>true</code> to enable Rx flow control on physical ports.
3174 By default, Rx flow control is disabled.
3175 </column>
3176
3177 <column name="options" key="tx-flow-ctrl" type='{"type": "boolean"}'>
3178 Set to <code>true</code> to enable Tx flow control on physical ports.
3179 By default, Tx flow control is disabled.
3180 </column>
3181
3182 <column name="options" key="flow-ctrl-autoneg"
3183 type='{"type": "boolean"}'>
3184 Set to <code>true</code> to enable flow control auto negotiation on
3185 physical ports. By default, auto-neg is disabled.
3186 </column>
3187 </group>
3188
3189 <group title="Common Columns">
3190 The overall purpose of these columns is described under <code>Common
3191 Columns</code> at the beginning of this document.
3192
3193 <column name="other_config"/>
3194 <column name="external_ids"/>
3195 </group>
3196 </table>
3197
3198 <table name="Flow_Table" title="OpenFlow table configuration">
3199 <p>Configuration for a particular OpenFlow table.</p>
3200
3201 <column name="name">
3202 The table's name. Set this column to change the name that controllers
3203 will receive when they request table statistics, e.g. <code>ovs-ofctl
3204 dump-tables</code>. The name does not affect switch behavior.
3205 </column>
3206
3207 <group title="Eviction Policy">
3208 <p>
3209 Open vSwitch supports limiting the number of flows that may be
3210 installed in a flow table, via the <ref column="flow_limit"/> column.
3211 When adding a flow would exceed this limit, by default Open vSwitch
3212 reports an error, but there are two ways to configure Open vSwitch to
3213 instead delete (``evict'') a flow to make room for the new one:
3214 </p>
3215
3216 <ul>
3217 <li>
3218 Set the <ref column="overflow_policy"/> column to <code>evict</code>.
3219 </li>
3220
3221 <li>
3222 Send an OpenFlow 1.4+ ``table mod request'' to enable eviction for
3223 the flow table (e.g. <code>ovs-ofctl -O OpenFlow14 mod-table br0 0
3224 evict</code> to enable eviction on flow table 0 of bridge
3225 <code>br0</code>).
3226 </li>
3227 </ul>
3228
3229 <p>
3230 When a flow must be evicted due to overflow, the flow to evict is
3231 chosen through an approximation of the following algorithm. This
3232 algorithm is used regardless of how eviction was enabled:
3233 </p>
3234
3235 <ol>
3236 <li>
3237 Divide the flows in the table into groups based on the values of the
3238 fields or subfields specified in the <ref column="groups"/> column,
3239 so that all of the flows in a given group have the same values for
3240 those fields. If a flow does not specify a given field, that field's
3241 value is treated as 0. If <ref column="groups"/> is empty, then all
3242 of the flows in the flow table are treated as a single group.
3243 </li>
3244
3245 <li>
3246 Consider the flows in the largest group, that is, the group that
3247 contains the greatest number of flows. If two or more groups all
3248 have the same largest number of flows, consider the flows in all of
3249 those groups.
3250 </li>
3251
3252 <li>
3253 If the flows under consideration have different importance values,
3254 eliminate from consideration any flows except those with the lowest
3255 importance. (``Importance,'' a 16-bit integer value attached to each
3256 flow, was introduced in OpenFlow 1.4. Flows inserted with older
3257 versions of OpenFlow always have an importance of 0.)
3258 </li>
3259
3260 <li>
3261 Among the flows under consideration, choose the flow that expires
3262 soonest for eviction.
3263 </li>
3264 </ol>
3265
3266 <p>
3267 The eviction process only considers flows that have an idle timeout
3268 or a hard timeout. That is, eviction never deletes permanent flows.
3269 (Permanent flows do count against <ref column="flow_limit"/>.)
3270 </p>
3271
3272 <column name="flow_limit">
3273 If set, limits the number of flows that may be added to the table.
3274 Open vSwitch may limit the number of flows in a table for other
3275 reasons, e.g. due to hardware limitations or for resource availability
3276 or performance reasons.
3277 </column>
3278
3279 <column name="overflow_policy">
3280 <p>
3281 Controls the switch's behavior when an OpenFlow flow table
3282 modification request would add flows in excess of <ref
3283 column="flow_limit"/>. The supported values are:
3284 </p>
3285
3286 <dl>
3287 <dt><code>refuse</code></dt>
3288 <dd>
3289 Refuse to add the flow or flows. This is also the default policy
3290 when <ref column="overflow_policy"/> is unset.
3291 </dd>
3292
3293 <dt><code>evict</code></dt>
3294 <dd>
3295 Delete a flow chosen according to the algorithm described above.
3296 </dd>
3297 </dl>
3298 </column>
3299
3300 <column name="groups">
3301 <p>
3302 When <ref column="overflow_policy"/> is <code>evict</code>, this
3303 controls how flows are chosen for eviction when the flow table would
3304 otherwise exceed <ref column="flow_limit"/> flows. Its value is a
3305 set of NXM fields or sub-fields, each of which takes one of the forms
3306 <code><var>field</var>[]</code> or
3307 <code><var>field</var>[<var>start</var>..<var>end</var>]</code>,
3308 e.g. <code>NXM_OF_IN_PORT[]</code>. Please see
3309 <code>meta-flow.h</code> for a complete list of NXM field names.
3310 </p>
3311
3312 <p>
3313 Open vSwitch ignores any invalid or unknown field specifications.
3314 </p>
3315
3316 <p>
3317 When eviction is not enabled, via <ref column="overflow_policy"/> or
3318 an OpenFlow 1.4+ ``table mod,'' this column has no effect.
3319 </p>
3320 </column>
3321 </group>
3322
3323 <group title="Classifier Optimization">
3324 <column name="prefixes">
3325 <p>
3326 This string set specifies which fields should be used for
3327 address prefix tracking. Prefix tracking allows the
3328 classifier to skip rules with longer than necessary prefixes,
3329 resulting in better wildcarding for datapath flows.
3330 </p>
3331 <p>
3332 Prefix tracking may be beneficial when a flow table contains
3333 matches on IP address fields with different prefix lengths.
3334 For example, when a flow table contains IP address matches on
3335 both full addresses and proper prefixes, the full address
3336 matches will typically cause the datapath flow to un-wildcard
3337 the whole address field (depending on flow entry priorities).
3338 In this case each packet with a different address gets handed
3339 to the userspace for flow processing and generates its own
3340 datapath flow. With prefix tracking enabled for the address
3341 field in question packets with addresses matching shorter
3342 prefixes would generate datapath flows where the irrelevant
3343 address bits are wildcarded, allowing the same datapath flow
3344 to handle all the packets within the prefix in question. In
3345 this case many userspace upcalls can be avoided and the
3346 overall performance can be better.
3347 </p>
3348 <p>
3349 This is a performance optimization only, so packets will
3350 receive the same treatment with or without prefix tracking.
3351 </p>
3352 <p>
3353 The supported fields are: <code>tun_id</code>,
3354 <code>tun_src</code>, <code>tun_dst</code>,
3355 <code>tun_ipv6_src</code>, <code>tun_ipv6_dst</code>,
3356 <code>nw_src</code>, <code>nw_dst</code> (or aliases
3357 <code>ip_src</code> and <code>ip_dst</code>),
3358 <code>ipv6_src</code>, and <code>ipv6_dst</code>. (Using this
3359 feature for <code>tun_id</code> would only make sense if the
3360 tunnel IDs have prefix structure similar to IP addresses.)
3361 </p>
3362
3363 <p>
3364 By default, the <code>prefixes=ip_dst,ip_src</code> are used
3365 on each flow table. This instructs the flow classifier to
3366 track the IP destination and source addresses used by the
3367 rules in this specific flow table.
3368 </p>
3369
3370 <p>
3371 The keyword <code>none</code> is recognized as an explicit
3372 override of the default values, causing no prefix fields to be
3373 tracked.
3374 </p>
3375
3376 <p>
3377 To set the prefix fields, the flow table record needs to
3378 exist:
3379 </p>
3380
3381 <dl>
3382 <dt><code>ovs-vsctl set Bridge br0 flow_tables:0=@N1 -- --id=@N1 create Flow_Table name=table0</code></dt>
3383 <dd>
3384 Creates a flow table record for the OpenFlow table number 0.
3385 </dd>
3386
3387 <dt><code>ovs-vsctl set Flow_Table table0 prefixes=ip_dst,ip_src</code></dt>
3388 <dd>
3389 Enables prefix tracking for IP source and destination
3390 address fields.
3391 </dd>
3392 </dl>
3393
3394 <p>
3395 There is a maximum number of fields that can be enabled for any
3396 one flow table. Currently this limit is 3.
3397 </p>
3398 </column>
3399 </group>
3400
3401 <group title="Common Columns">
3402 The overall purpose of these columns is described under <code>Common
3403 Columns</code> at the beginning of this document.
3404
3405 <column name="external_ids"/>
3406 </group>
3407 </table>
3408
3409 <table name="QoS" title="Quality of Service configuration">
3410 <p>Quality of Service (QoS) configuration for each Port that
3411 references it.</p>
3412
3413 <column name="type">
3414 <p>The type of QoS to implement. The currently defined types are
3415 listed below:</p>
3416 <dl>
3417 <dt><code>linux-htb</code></dt>
3418 <dd>
3419 Linux ``hierarchy token bucket'' classifier. See tc-htb(8) (also at
3420 <code>http://linux.die.net/man/8/tc-htb</code>) and the HTB manual
3421 (<code>http://luxik.cdi.cz/~devik/qos/htb/manual/userg.htm</code>)
3422 for information on how this classifier works and how to configure it.
3423 </dd>
3424
3425 <dt><code>linux-hfsc</code></dt>
3426 <dd>
3427 Linux "Hierarchical Fair Service Curve" classifier.
3428 See <code>http://linux-ip.net/articles/hfsc.en/</code> for
3429 information on how this classifier works.
3430 </dd>
3431
3432 <dt><code>linux-sfq</code></dt>
3433 <dd>
3434 Linux ``Stochastic Fairness Queueing'' classifier. See
3435 <code>tc-sfq</code>(8) (also at
3436 <code>http://linux.die.net/man/8/tc-sfq</code>) for information on
3437 how this classifier works.
3438 </dd>
3439
3440 <dt><code>linux-codel</code></dt>
3441 <dd>
3442 Linux ``Controlled Delay'' classifier. See <code>tc-codel</code>(8)
3443 (also at
3444 <code>http://man7.org/linux/man-pages/man8/tc-codel.8.html</code>)
3445 for information on how this classifier works.
3446 </dd>
3447
3448 <dt><code>linux-fq_codel</code></dt>
3449 <dd>
3450 Linux ``Fair Queuing with Controlled Delay'' classifier. See
3451 <code>tc-fq_codel</code>(8) (also at
3452 <code>http://man7.org/linux/man-pages/man8/tc-fq_codel.8.html</code>)
3453 for information on how this classifier works.
3454 </dd>
3455
3456 <dt><code>linux-noop</code></dt>
3457 <dd>
3458 Linux ``No operation.'' By default, Open vSwitch manages quality of
3459 service on all of its configured ports. This can be helpful, but
3460 sometimes administrators prefer to use other software to manage QoS.
3461 This <ref column="type"/> prevents Open vSwitch from changing the QoS
3462 configuration for a port.
3463 </dd>
3464
3465 <dt><code>egress-policer</code></dt>
3466 <dd>
3467 A DPDK egress policer algorithm using the DPDK
3468 rte_meter library. The rte_meter library provides an implementation
3469 which allows the metering and policing of traffic. The implementation
3470 in OVS essentially creates a single token bucket used to police
3471 traffic. It should be noted that when the rte_meter is configured as
3472 part of QoS there will be a performance overhead as the rte_meter
3473 itself will consume CPU cycles in order to police traffic. These CPU
3474 cycles ordinarily are used for packet proccessing. As such the drop
3475 in performance will be noticed in terms of overall aggregate traffic
3476 throughput.
3477 </dd>
3478 </dl>
3479 </column>
3480
3481 <column name="queues">
3482 <p>A map from queue numbers to <ref table="Queue"/> records. The
3483 supported range of queue numbers depend on <ref column="type"/>. The
3484 queue numbers are the same as the <code>queue_id</code> used in
3485 OpenFlow in <code>struct ofp_action_enqueue</code> and other
3486 structures.</p>
3487
3488 <p>
3489 Queue 0 is the ``default queue.'' It is used by OpenFlow output
3490 actions when no specific queue has been set. When no configuration for
3491 queue 0 is present, it is automatically configured as if a <ref
3492 table="Queue"/> record with empty <ref table="Queue" column="dscp"/>
3493 and <ref table="Queue" column="other_config"/> columns had been
3494 specified.
3495 (Before version 1.6, Open vSwitch would leave queue 0 unconfigured in
3496 this case. With some queuing disciplines, this dropped all packets
3497 destined for the default queue.)
3498 </p>
3499 </column>
3500
3501 <group title="Configuration for linux-htb and linux-hfsc">
3502 <p>
3503 The <code>linux-htb</code> and <code>linux-hfsc</code> classes support
3504 the following key-value pair:
3505 </p>
3506
3507 <column name="other_config" key="max-rate" type='{"type": "integer"}'>
3508 Maximum rate shared by all queued traffic, in bit/s. Optional. If not
3509 specified, for physical interfaces, the default is the link rate. For
3510 other interfaces or if the link rate cannot be determined, the default
3511 is currently 100 Mbps.
3512 </column>
3513 </group>
3514
3515 <group title="Configuration for egress-policer QoS">
3516 <p>
3517 <ref table="QoS"/> <ref table="QoS" column="type"/>
3518 <code>egress-policer</code> provides egress policing for userspace
3519 port types with DPDK.
3520
3521 It has the following key-value pairs defined.
3522 </p>
3523
3524 <column name="other_config" key="cir" type='{"type": "integer"}'>
3525 The Committed Information Rate (CIR) is measured in bytes of IP
3526 packets per second, i.e. it includes the IP header, but not link
3527 specific (e.g. Ethernet) headers. This represents the bytes per second
3528 rate at which the token bucket will be updated. The cir value is
3529 calculated by (pps x packet data size). For example assuming a user
3530 wishes to limit a stream consisting of 64 byte packets to 1 million
3531 packets per second the CIR would be set to to to 46000000. This value
3532 can be broken into '1,000,000 x 46'. Where 1,000,000 is the policing
3533 rate for the number of packets per second and 46 represents the size
3534 of the packet data for a 64 byte ip packet.
3535 </column>
3536 <column name="other_config" key="cbs" type='{"type": "integer"}'>
3537 The Committed Burst Size (CBS) is measured in bytes and represents a
3538 token bucket. At a minimum this value should be be set to the expected
3539 largest size packet in the traffic stream. In practice larger values
3540 may be used to increase the size of the token bucket. If a packet can
3541 be transmitted then the cbs will be decremented by the number of
3542 bytes/tokens of the packet. If there are not enough tokens in the cbs
3543 bucket the packet will be dropped.
3544 </column>
3545 </group>
3546
3547 <group title="Common Columns">
3548 The overall purpose of these columns is described under <code>Common
3549 Columns</code> at the beginning of this document.
3550
3551 <column name="other_config"/>
3552 <column name="external_ids"/>
3553 </group>
3554 </table>
3555
3556 <table name="Queue" title="QoS output queue.">
3557 <p>A configuration for a port output queue, used in configuring Quality of
3558 Service (QoS) features. May be referenced by <ref column="queues"
3559 table="QoS"/> column in <ref table="QoS"/> table.</p>
3560
3561 <column name="dscp">
3562 If set, Open vSwitch will mark all traffic egressing this
3563 <ref table="Queue"/> with the given DSCP bits. Traffic egressing the
3564 default <ref table="Queue"/> is only marked if it was explicitly selected
3565 as the <ref table="Queue"/> at the time the packet was output. If unset,
3566 the DSCP bits of traffic egressing this <ref table="Queue"/> will remain
3567 unchanged.
3568 </column>
3569
3570 <group title="Configuration for linux-htb QoS">
3571 <p>
3572 <ref table="QoS"/> <ref table="QoS" column="type"/>
3573 <code>linux-htb</code> may use <code>queue_id</code>s less than 61440.
3574 It has the following key-value pairs defined.
3575 </p>
3576
3577 <column name="other_config" key="min-rate"
3578 type='{"type": "integer", "minInteger": 1}'>
3579 Minimum guaranteed bandwidth, in bit/s.
3580 </column>
3581
3582 <column name="other_config" key="max-rate"
3583 type='{"type": "integer", "minInteger": 1}'>
3584 Maximum allowed bandwidth, in bit/s. Optional. If specified, the
3585 queue's rate will not be allowed to exceed the specified value, even
3586 if excess bandwidth is available. If unspecified, defaults to no
3587 limit.
3588 </column>
3589
3590 <column name="other_config" key="burst"
3591 type='{"type": "integer", "minInteger": 1}'>
3592 Burst size, in bits. This is the maximum amount of ``credits'' that a
3593 queue can accumulate while it is idle. Optional. Details of the
3594 <code>linux-htb</code> implementation require a minimum burst size, so
3595 a too-small <code>burst</code> will be silently ignored.
3596 </column>
3597
3598 <column name="other_config" key="priority"
3599 type='{"type": "integer", "minInteger": 0, "maxInteger": 4294967295}'>
3600 A queue with a smaller <code>priority</code> will receive all the
3601 excess bandwidth that it can use before a queue with a larger value
3602 receives any. Specific priority values are unimportant; only relative
3603 ordering matters. Defaults to 0 if unspecified.
3604 </column>
3605 </group>
3606
3607 <group title="Configuration for linux-hfsc QoS">
3608 <p>
3609 <ref table="QoS"/> <ref table="QoS" column="type"/>
3610 <code>linux-hfsc</code> may use <code>queue_id</code>s less than 61440.
3611 It has the following key-value pairs defined.
3612 </p>
3613
3614 <column name="other_config" key="min-rate"
3615 type='{"type": "integer", "minInteger": 1}'>
3616 Minimum guaranteed bandwidth, in bit/s.
3617 </column>
3618
3619 <column name="other_config" key="max-rate"
3620 type='{"type": "integer", "minInteger": 1}'>
3621 Maximum allowed bandwidth, in bit/s. Optional. If specified, the
3622 queue's rate will not be allowed to exceed the specified value, even if
3623 excess bandwidth is available. If unspecified, defaults to no
3624 limit.
3625 </column>
3626 </group>
3627
3628 <group title="Common Columns">
3629 The overall purpose of these columns is described under <code>Common
3630 Columns</code> at the beginning of this document.
3631
3632 <column name="other_config"/>
3633 <column name="external_ids"/>
3634 </group>
3635 </table>
3636
3637 <table name="Mirror" title="Port mirroring.">
3638 <p>A port mirror within a <ref table="Bridge"/>.</p>
3639 <p>A port mirror configures a bridge to send selected frames to special
3640 ``mirrored'' ports, in addition to their normal destinations. Mirroring
3641 traffic may also be referred to as SPAN or RSPAN, depending on how
3642 the mirrored traffic is sent.</p>
3643
3644 <p>
3645 When a packet enters an Open vSwitch bridge, it becomes eligible for
3646 mirroring based on its ingress port and VLAN. As the packet travels
3647 through the flow tables, each time it is output to a port, it becomes
3648 eligible for mirroring based on the egress port and VLAN. In Open
3649 vSwitch 2.5 and later, mirroring occurs just after a packet first becomes
3650 eligible, using the packet as it exists at that point; in Open vSwitch
3651 2.4 and earlier, mirroring occurs only after a packet has traversed all
3652 the flow tables, using the original packet as it entered the bridge.
3653 This makes a difference only when the flow table modifies the packet: in
3654 Open vSwitch 2.4, the modifications are never visible to mirrors, whereas
3655 in Open vSwitch 2.5 and later modifications made before the first output
3656 that makes it eligible for mirroring to a particular destination are
3657 visible.
3658 </p>
3659
3660 <p>
3661 A packet that enters an Open vSwitch bridge is mirrored to a particular
3662 destination only once, even if it is eligible for multiple reasons. For
3663 example, a packet would be mirrored to a particular <ref
3664 column="output_port"/> only once, even if it is selected for mirroring to
3665 that port by <ref column="select_dst_port"/> and <ref
3666 column="select_src_port"/> in the same or different <ref table="Mirror"/>
3667 records.
3668 </p>
3669
3670 <column name="name">
3671 Arbitrary identifier for the <ref table="Mirror"/>.
3672 </column>
3673
3674 <group title="Selecting Packets for Mirroring">
3675 <p>
3676 To be selected for mirroring, a given packet must enter or leave the
3677 bridge through a selected port and it must also be in one of the
3678 selected VLANs.
3679 </p>
3680
3681 <column name="select_all">
3682 If true, every packet arriving or departing on any port is
3683 selected for mirroring.
3684 </column>
3685
3686 <column name="select_dst_port">
3687 Ports on which departing packets are selected for mirroring.
3688 </column>
3689
3690 <column name="select_src_port">
3691 Ports on which arriving packets are selected for mirroring.
3692 </column>
3693
3694 <column name="select_vlan">
3695 VLANs on which packets are selected for mirroring. An empty set
3696 selects packets on all VLANs.
3697 </column>
3698 </group>
3699
3700 <group title="Mirroring Destination Configuration">
3701 <p>
3702 These columns are mutually exclusive. Exactly one of them must be
3703 nonempty.
3704 </p>
3705
3706 <column name="output_port">
3707 <p>Output port for selected packets, if nonempty.</p>
3708 <p>Specifying a port for mirror output reserves that port exclusively
3709 for mirroring. No frames other than those selected for mirroring
3710 via this column
3711 will be forwarded to the port, and any frames received on the port
3712 will be discarded.</p>
3713 <p>
3714 The output port may be any kind of port supported by Open vSwitch.
3715 It may be, for example, a physical port (sometimes called SPAN) or a
3716 GRE tunnel.
3717 </p>
3718 </column>
3719
3720 <column name="output_vlan">
3721 <p>Output VLAN for selected packets, if nonempty.</p>
3722 <p>The frames will be sent out all ports that trunk
3723 <ref column="output_vlan"/>, as well as any ports with implicit VLAN
3724 <ref column="output_vlan"/>. When a mirrored frame is sent out a
3725 trunk port, the frame's VLAN tag will be set to
3726 <ref column="output_vlan"/>, replacing any existing tag; when it is
3727 sent out an implicit VLAN port, the frame will not be tagged. This
3728 type of mirroring is sometimes called RSPAN.</p>
3729 <p>
3730 See the documentation for
3731 <ref column="other_config" key="forward-bpdu"/> in the
3732 <ref table="Interface"/> table for a list of destination MAC
3733 addresses which will not be mirrored to a VLAN to avoid confusing
3734 switches that interpret the protocols that they represent.
3735 </p>
3736 <p><em>Please note:</em> Mirroring to a VLAN can disrupt a network that
3737 contains unmanaged switches. Consider an unmanaged physical switch
3738 with two ports: port 1, connected to an end host, and port 2,
3739 connected to an Open vSwitch configured to mirror received packets
3740 into VLAN 123 on port 2. Suppose that the end host sends a packet on
3741 port 1 that the physical switch forwards to port 2. The Open vSwitch
3742 forwards this packet to its destination and then reflects it back on
3743 port 2 in VLAN 123. This reflected packet causes the unmanaged
3744 physical switch to replace the MAC learning table entry, which
3745 correctly pointed to port 1, with one that incorrectly points to port
3746 2. Afterward, the physical switch will direct packets destined for
3747 the end host to the Open vSwitch on port 2, instead of to the end
3748 host on port 1, disrupting connectivity. If mirroring to a VLAN is
3749 desired in this scenario, then the physical switch must be replaced
3750 by one that learns Ethernet addresses on a per-VLAN basis. In
3751 addition, learning should be disabled on the VLAN containing mirrored
3752 traffic. If this is not done then intermediate switches will learn
3753 the MAC address of each end host from the mirrored traffic. If
3754 packets being sent to that end host are also mirrored, then they will
3755 be dropped since the switch will attempt to send them out the input
3756 port. Disabling learning for the VLAN will cause the switch to
3757 correctly send the packet out all ports configured for that VLAN. If
3758 Open vSwitch is being used as an intermediate switch, learning can be
3759 disabled by adding the mirrored VLAN to <ref column="flood_vlans"/>
3760 in the appropriate <ref table="Bridge"/> table or tables.</p>
3761 <p>
3762 Mirroring to a GRE tunnel has fewer caveats than mirroring to a
3763 VLAN and should generally be preferred.
3764 </p>
3765 </column>
3766
3767 <column name="snaplen">
3768 <p>Maximum per-packet number of bytes to mirror.</p>
3769 <p>A mirrored packet with size larger than <ref column="snaplen"/>
3770 will be truncated in datapath to <ref column="snaplen"/> bytes
3771 before sending to the mirror output port. If omitted, packets
3772 are not truncated.
3773 </p>
3774 </column>
3775 </group>
3776
3777 <group title="Statistics: Mirror counters">
3778 <p>
3779 Key-value pairs that report mirror statistics. The update period
3780 is controlled by <ref column="other_config"
3781 key="stats-update-interval"/> in the <code>Open_vSwitch</code> table.
3782 </p>
3783 <column name="statistics" key="tx_packets">
3784 Number of packets transmitted through this mirror.
3785 </column>
3786 <column name="statistics" key="tx_bytes">
3787 Number of bytes transmitted through this mirror.
3788 </column>
3789 </group>
3790
3791 <group title="Common Columns">
3792 The overall purpose of these columns is described under <code>Common
3793 Columns</code> at the beginning of this document.
3794
3795 <column name="external_ids"/>
3796 </group>
3797 </table>
3798
3799 <table name="Controller" title="OpenFlow controller configuration.">
3800 <p>An OpenFlow controller.</p>
3801
3802 <p>
3803 Open vSwitch supports two kinds of OpenFlow controllers:
3804 </p>
3805
3806 <dl>
3807 <dt>Primary controllers</dt>
3808 <dd>
3809 <p>
3810 This is the kind of controller envisioned by the OpenFlow 1.0
3811 specification. Usually, a primary controller implements a network
3812 policy by taking charge of the switch's flow table.
3813 </p>
3814
3815 <p>
3816 Open vSwitch initiates and maintains persistent connections to
3817 primary controllers, retrying the connection each time it fails or
3818 drops. The <ref table="Bridge" column="fail_mode"/> column in the
3819 <ref table="Bridge"/> table applies to primary controllers.
3820 </p>
3821
3822 <p>
3823 Open vSwitch permits a bridge to have any number of primary
3824 controllers. When multiple controllers are configured, Open
3825 vSwitch connects to all of them simultaneously. Because
3826 OpenFlow 1.0 does not specify how multiple controllers
3827 coordinate in interacting with a single switch, more than
3828 one primary controller should be specified only if the
3829 controllers are themselves designed to coordinate with each
3830 other. (The Nicira-defined <code>NXT_ROLE</code> OpenFlow
3831 vendor extension may be useful for this.)
3832 </p>
3833 </dd>
3834 <dt>Service controllers</dt>
3835 <dd>
3836 <p>
3837 These kinds of OpenFlow controller connections are intended for
3838 occasional support and maintenance use, e.g. with
3839 <code>ovs-ofctl</code>. Usually a service controller connects only
3840 briefly to inspect or modify some of a switch's state.
3841 </p>
3842
3843 <p>
3844 Open vSwitch listens for incoming connections from service
3845 controllers. The service controllers initiate and, if necessary,
3846 maintain the connections from their end. The <ref table="Bridge"
3847 column="fail_mode"/> column in the <ref table="Bridge"/> table does
3848 not apply to service controllers.
3849 </p>
3850
3851 <p>
3852 Open vSwitch supports configuring any number of service controllers.
3853 </p>
3854 </dd>
3855 </dl>
3856
3857 <p>
3858 The <ref column="target"/> determines the type of controller.
3859 </p>
3860
3861 <group title="Core Features">
3862 <column name="target">
3863 <p>Connection method for controller.</p>
3864 <p>
3865 The following connection methods are currently supported for primary
3866 controllers:
3867 </p>
3868 <dl>
3869 <dt><code>ssl:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
3870 <dd>
3871 <p>The specified SSL <var>port</var> on the host at the
3872 given <var>ip</var>, which must be expressed as an IP
3873 address (not a DNS name). The <ref table="Open_vSwitch"
3874 column="ssl"/> column in the <ref table="Open_vSwitch"/>
3875 table must point to a valid SSL configuration when this form
3876 is used.</p>
3877 <p>If <var>port</var> is not specified, it defaults to 6653.</p>
3878 <p>SSL support is an optional feature that is not always built as
3879 part of Open vSwitch.</p>
3880 </dd>
3881 <dt><code>tcp:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
3882 <dd>
3883 <p>
3884 The specified TCP <var>port</var> on the host at the given
3885 <var>ip</var>, which must be expressed as an IP address (not a
3886 DNS name), where <var>ip</var> can be IPv4 or IPv6 address. If
3887 <var>ip</var> is an IPv6 address, wrap it in square brackets,
3888 e.g. <code>tcp:[::1]:6653</code>.
3889 </p>
3890 <p>
3891 If <var>port</var> is not specified, it defaults to 6653.
3892 </p>
3893 </dd>
3894 </dl>
3895 <p>
3896 The following connection methods are currently supported for service
3897 controllers:
3898 </p>
3899 <dl>
3900 <dt><code>pssl:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
3901 <dd>
3902 <p>
3903 Listens for SSL connections on the specified TCP <var>port</var>.
3904 If <var>ip</var>, which must be expressed as an IP address (not a
3905 DNS name), is specified, then connections are restricted to the
3906 specified local IP address (either IPv4 or IPv6). If
3907 <var>ip</var> is an IPv6 address, wrap it in square brackets,
3908 e.g. <code>pssl:6653:[::1]</code>.
3909 </p>
3910 <p>
3911 If <var>port</var> is not specified, it defaults to
3912 6653. If <var>ip</var> is not specified then it listens only on
3913 IPv4 (but not IPv6) addresses. The
3914 <ref table="Open_vSwitch" column="ssl"/>
3915 column in the <ref table="Open_vSwitch"/> table must point to a
3916 valid SSL configuration when this form is used.
3917 </p>
3918 <p>
3919 If <var>port</var> is not specified, it currently to 6653.
3920 </p>
3921 <p>
3922 SSL support is an optional feature that is not always built as
3923 part of Open vSwitch.
3924 </p>
3925 </dd>
3926 <dt><code>ptcp:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
3927 <dd>
3928 <p>
3929 Listens for connections on the specified TCP <var>port</var>. If
3930 <var>ip</var>, which must be expressed as an IP address (not a
3931 DNS name), is specified, then connections are restricted to the
3932 specified local IP address (either IPv4 or IPv6). If
3933 <var>ip</var> is an IPv6 address, wrap it in square brackets,
3934 e.g. <code>ptcp:6653:[::1]</code>. If <var>ip</var> is not
3935 specified then it listens only on IPv4 addresses.
3936 </p>
3937 <p>
3938 If <var>port</var> is not specified, it defaults to 6653.
3939 </p>
3940 </dd>
3941 </dl>
3942 <p>When multiple controllers are configured for a single bridge, the
3943 <ref column="target"/> values must be unique. Duplicate
3944 <ref column="target"/> values yield unspecified results.</p>
3945 </column>
3946
3947 <column name="connection_mode">
3948 <p>If it is specified, this setting must be one of the following
3949 strings that describes how Open vSwitch contacts this OpenFlow
3950 controller over the network:</p>
3951
3952 <dl>
3953 <dt><code>in-band</code></dt>
3954 <dd>In this mode, this controller's OpenFlow traffic travels over the
3955 bridge associated with the controller. With this setting, Open
3956 vSwitch allows traffic to and from the controller regardless of the
3957 contents of the OpenFlow flow table. (Otherwise, Open vSwitch
3958 would never be able to connect to the controller, because it did
3959 not have a flow to enable it.) This is the most common connection
3960 mode because it is not necessary to maintain two independent
3961 networks.</dd>
3962 <dt><code>out-of-band</code></dt>
3963 <dd>In this mode, OpenFlow traffic uses a control network separate
3964 from the bridge associated with this controller, that is, the
3965 bridge does not use any of its own network devices to communicate
3966 with the controller. The control network must be configured
3967 separately, before or after <code>ovs-vswitchd</code> is started.
3968 </dd>
3969 </dl>
3970
3971 <p>If not specified, the default is implementation-specific.</p>
3972 </column>
3973 </group>
3974
3975 <group title="Controller Failure Detection and Handling">
3976 <column name="max_backoff">
3977 Maximum number of milliseconds to wait between connection attempts.
3978 Default is implementation-specific.
3979 </column>
3980
3981 <column name="inactivity_probe">
3982 Maximum number of milliseconds of idle time on connection to
3983 controller before sending an inactivity probe message. If Open
3984 vSwitch does not communicate with the controller for the specified
3985 number of seconds, it will send a probe. If a response is not
3986 received for the same additional amount of time, Open vSwitch
3987 assumes the connection has been broken and attempts to reconnect.
3988 Default is implementation-specific. A value of 0 disables
3989 inactivity probes.
3990 </column>
3991 </group>
3992
3993 <group title="Asynchronous Messages">
3994 <p>
3995 OpenFlow switches send certain messages to controllers spontanenously,
3996 that is, not in response to any request from the controller. These
3997 messages are called ``asynchronous messages.'' These columns allow
3998 asynchronous messages to be limited or disabled to ensure the best use
3999 of network resources.
4000 </p>
4001
4002 <column name="enable_async_messages">
4003 The OpenFlow protocol enables asynchronous messages at time of
4004 connection establishment, which means that a controller can receive
4005 asynchronous messages, potentially many of them, even if it turns them
4006 off immediately after connecting. Set this column to
4007 <code>false</code> to change Open vSwitch behavior to disable, by
4008 default, all asynchronous messages. The controller can use the
4009 <code>NXT_SET_ASYNC_CONFIG</code> Nicira extension to OpenFlow to turn
4010 on any messages that it does want to receive, if any.
4011 </column>
4012
4013 <group title="Controller Rate Limiting">
4014 <p>
4015 A switch can forward packets to a controller over the OpenFlow
4016 protocol. Forwarding packets this way at too high a rate can
4017 overwhelm a controller, frustrate use of the OpenFlow connection for
4018 other purposes, increase the latency of flow setup, and use an
4019 unreasonable amount of bandwidth. Therefore, Open vSwitch supports
4020 limiting the rate of packet forwarding to a controller.
4021 </p>
4022
4023 <p>
4024 There are two main reasons in OpenFlow for a packet to be sent to a
4025 controller: either the packet ``misses'' in the flow table, that is,
4026 there is no matching flow, or a flow table action says to send the
4027 packet to the controller. Open vSwitch limits the rate of each kind
4028 of packet separately at the configured rate. Therefore, the actual
4029 rate that packets are sent to the controller can be up to twice the
4030 configured rate, when packets are sent for both reasons.
4031 </p>
4032
4033 <p>
4034 This feature is specific to forwarding packets over an OpenFlow
4035 connection. It is not general-purpose QoS. See the <ref
4036 table="QoS"/> table for quality of service configuration, and <ref
4037 column="ingress_policing_rate" table="Interface"/> in the <ref
4038 table="Interface"/> table for ingress policing configuration.
4039 </p>
4040
4041 <column name="controller_rate_limit">
4042 <p>
4043 The maximum rate at which the switch will forward packets to the
4044 OpenFlow controller, in packets per second. If no value is
4045 specified, rate limiting is disabled.
4046 </p>
4047 </column>
4048
4049 <column name="controller_burst_limit">
4050 <p>
4051 When a high rate triggers rate-limiting, Open vSwitch queues
4052 packets to the controller for each port and transmits them to the
4053 controller at the configured rate. This value limits the number of
4054 queued packets. Ports on a bridge share the packet queue fairly.
4055 </p>
4056
4057 <p>
4058 This value has no effect unless <ref
4059 column="controller_rate_limit"/> is configured. The current
4060 default when this value is not specified is one-quarter of <ref
4061 column="controller_rate_limit"/>, meaning that queuing can delay
4062 forwarding a packet to the controller by up to 250 ms.
4063 </p>
4064 </column>
4065
4066 <group title="Controller Rate Limiting Statistics">
4067 <p>
4068 These values report the effects of rate limiting. Their values are
4069 relative to establishment of the most recent OpenFlow connection,
4070 or since rate limiting was enabled, whichever happened more
4071 recently. Each consists of two values, one with <code>TYPE</code>
4072 replaced by <code>miss</code> for rate limiting flow table misses,
4073 and the other with <code>TYPE</code> replaced by
4074 <code>action</code> for rate limiting packets sent by OpenFlow
4075 actions.
4076 </p>
4077
4078 <p>
4079 These statistics are reported only when controller rate limiting is
4080 enabled.
4081 </p>
4082
4083 <column name="status" key="packet-in-TYPE-bypassed"
4084 type='{"type": "integer", "minInteger": 0}'>
4085 Number of packets sent directly to the controller, without queuing,
4086 because the rate did not exceed the configured maximum.
4087 </column>
4088
4089 <column name="status" key="packet-in-TYPE-queued"
4090 type='{"type": "integer", "minInteger": 0}'>
4091 Number of packets added to the queue to send later.
4092 </column>
4093
4094 <column name="status" key="packet-in-TYPE-dropped"
4095 type='{"type": "integer", "minInteger": 0}'>
4096 Number of packets added to the queue that were later dropped due to
4097 overflow. This value is less than or equal to <ref column="status"
4098 key="packet-in-TYPE-queued"/>.
4099 </column>
4100
4101 <column name="status" key="packet-in-TYPE-backlog"
4102 type='{"type": "integer", "minInteger": 0}'>
4103 Number of packets currently queued. The other statistics increase
4104 monotonically, but this one fluctuates between 0 and the <ref
4105 column="controller_burst_limit"/> as conditions change.
4106 </column>
4107 </group>
4108 </group>
4109 </group>
4110
4111 <group title="Additional In-Band Configuration">
4112 <p>These values are considered only in in-band control mode (see
4113 <ref column="connection_mode"/>).</p>
4114
4115 <p>When multiple controllers are configured on a single bridge, there
4116 should be only one set of unique values in these columns. If different
4117 values are set for these columns in different controllers, the effect
4118 is unspecified.</p>
4119
4120 <column name="local_ip">
4121 The IP address to configure on the local port,
4122 e.g. <code>192.168.0.123</code>. If this value is unset, then
4123 <ref column="local_netmask"/> and <ref column="local_gateway"/> are
4124 ignored.
4125 </column>
4126
4127 <column name="local_netmask">
4128 The IP netmask to configure on the local port,
4129 e.g. <code>255.255.255.0</code>. If <ref column="local_ip"/> is set
4130 but this value is unset, then the default is chosen based on whether
4131 the IP address is class A, B, or C.
4132 </column>
4133
4134 <column name="local_gateway">
4135 The IP address of the gateway to configure on the local port, as a
4136 string, e.g. <code>192.168.0.1</code>. Leave this column unset if
4137 this network has no gateway.
4138 </column>
4139 </group>
4140
4141 <group title="Controller Status">
4142 <column name="is_connected">
4143 <code>true</code> if currently connected to this controller,
4144 <code>false</code> otherwise.
4145 </column>
4146
4147 <column name="role"
4148 type='{"type": "string", "enum": ["set", ["other", "master", "slave"]]}'>
4149 <p>The level of authority this controller has on the associated
4150 bridge. Possible values are:</p>
4151 <dl>
4152 <dt><code>other</code></dt>
4153 <dd>Allows the controller access to all OpenFlow features.</dd>
4154 <dt><code>master</code></dt>
4155 <dd>Equivalent to <code>other</code>, except that there may be at
4156 most one master controller at a time. When a controller configures
4157 itself as <code>master</code>, any existing master is demoted to
4158 the <code>slave</code> role.</dd>
4159 <dt><code>slave</code></dt>
4160 <dd>Allows the controller read-only access to OpenFlow features.
4161 Attempts to modify the flow table will be rejected with an
4162 error. Slave controllers do not receive OFPT_PACKET_IN or
4163 OFPT_FLOW_REMOVED messages, but they do receive OFPT_PORT_STATUS
4164 messages.</dd>
4165 </dl>
4166 </column>
4167
4168 <column name="status" key="last_error">
4169 A human-readable description of the last error on the connection
4170 to the controller; i.e. <code>strerror(errno)</code>. This key
4171 will exist only if an error has occurred.
4172 </column>
4173
4174 <column name="status" key="state"
4175 type='{"type": "string", "enum": ["set", ["VOID", "BACKOFF", "CONNECTING", "ACTIVE", "IDLE"]]}'>
4176 <p>
4177 The state of the connection to the controller:
4178 </p>
4179 <dl>
4180 <dt><code>VOID</code></dt>
4181 <dd>Connection is disabled.</dd>
4182
4183 <dt><code>BACKOFF</code></dt>
4184 <dd>Attempting to reconnect at an increasing period.</dd>
4185
4186 <dt><code>CONNECTING</code></dt>
4187 <dd>Attempting to connect.</dd>
4188
4189 <dt><code>ACTIVE</code></dt>
4190 <dd>Connected, remote host responsive.</dd>
4191
4192 <dt><code>IDLE</code></dt>
4193 <dd>Connection is idle. Waiting for response to keep-alive.</dd>
4194 </dl>
4195 <p>
4196 These values may change in the future. They are provided only for
4197 human consumption.
4198 </p>
4199 </column>
4200
4201 <column name="status" key="sec_since_connect"
4202 type='{"type": "integer", "minInteger": 0}'>
4203 The amount of time since this controller last successfully connected to
4204 the switch (in seconds). Value is empty if controller has never
4205 successfully connected.
4206 </column>
4207
4208 <column name="status" key="sec_since_disconnect"
4209 type='{"type": "integer", "minInteger": 1}'>
4210 The amount of time since this controller last disconnected from
4211 the switch (in seconds). Value is empty if controller has never
4212 disconnected.
4213 </column>
4214 </group>
4215
4216 <group title="Connection Parameters">
4217 <p>
4218 Additional configuration for a connection between the controller
4219 and the Open vSwitch.
4220 </p>
4221
4222 <column name="other_config" key="dscp"
4223 type='{"type": "integer"}'>
4224 The Differentiated Service Code Point (DSCP) is specified using 6 bits
4225 in the Type of Service (TOS) field in the IP header. DSCP provides a
4226 mechanism to classify the network traffic and provide Quality of
4227 Service (QoS) on IP networks.
4228
4229 The DSCP value specified here is used when establishing the connection
4230 between the controller and the Open vSwitch. If no value is specified,
4231 a default value of 48 is chosen. Valid DSCP values must be in the
4232 range 0 to 63.
4233 </column>
4234 </group>
4235
4236
4237 <group title="Common Columns">
4238 The overall purpose of these columns is described under <code>Common
4239 Columns</code> at the beginning of this document.
4240
4241 <column name="external_ids"/>
4242 <column name="other_config"/>
4243 </group>
4244 </table>
4245
4246 <table name="Manager" title="OVSDB management connection.">
4247 <p>
4248 Configuration for a database connection to an Open vSwitch database
4249 (OVSDB) client.
4250 </p>
4251
4252 <p>
4253 This table primarily configures the Open vSwitch database
4254 (<code>ovsdb-server</code>), not the Open vSwitch switch
4255 (<code>ovs-vswitchd</code>). The switch does read the table to determine
4256 what connections should be treated as in-band.
4257 </p>
4258
4259 <p>
4260 The Open vSwitch database server can initiate and maintain active
4261 connections to remote clients. It can also listen for database
4262 connections.
4263 </p>
4264
4265 <group title="Core Features">
4266 <column name="target">
4267 <p>Connection method for managers.</p>
4268 <p>
4269 The following connection methods are currently supported:
4270 </p>
4271 <dl>
4272 <dt><code>ssl:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
4273 <dd>
4274 <p>
4275 The specified SSL <var>port</var> on the host at the given
4276 <var>ip</var>, which must be expressed as an IP address
4277 (not a DNS name). The <ref table="Open_vSwitch"
4278 column="ssl"/> column in the <ref table="Open_vSwitch"/>
4279 table must point to a valid SSL configuration when this
4280 form is used.
4281 </p>
4282 <p>
4283 If <var>port</var> is not specified, it defaults to 6640.
4284 </p>
4285 <p>
4286 SSL support is an optional feature that is not always
4287 built as part of Open vSwitch.
4288 </p>
4289 </dd>
4290
4291 <dt><code>tcp:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
4292 <dd>
4293 <p>
4294 The specified TCP <var>port</var> on the host at the given
4295 <var>ip</var>, which must be expressed as an IP address (not a
4296 DNS name), where <var>ip</var> can be IPv4 or IPv6 address. If
4297 <var>ip</var> is an IPv6 address, wrap it in square brackets,
4298 e.g. <code>tcp:[::1]:6640</code>.
4299 </p>
4300 <p>
4301 If <var>port</var> is not specified, it defaults to 6640.
4302 </p>
4303 </dd>
4304 <dt><code>pssl:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
4305 <dd>
4306 <p>
4307 Listens for SSL connections on the specified TCP <var>port</var>.
4308 Specify 0 for <var>port</var> to have the kernel automatically
4309 choose an available port. If <var>ip</var>, which must be
4310 expressed as an IP address (not a DNS name), is specified, then
4311 connections are restricted to the specified local IP address
4312 (either IPv4 or IPv6 address). If <var>ip</var> is an IPv6
4313 address, wrap in square brackets,
4314 e.g. <code>pssl:6640:[::1]</code>. If <var>ip</var> is not
4315 specified then it listens only on IPv4 (but not IPv6) addresses.
4316 The <ref table="Open_vSwitch" column="ssl"/> column in the <ref
4317 table="Open_vSwitch"/> table must point to a valid SSL
4318 configuration when this form is used.
4319 </p>
4320 <p>
4321 If <var>port</var> is not specified, it defaults to 6640.
4322 </p>
4323 <p>
4324 SSL support is an optional feature that is not always built as
4325 part of Open vSwitch.
4326 </p>
4327 </dd>
4328 <dt><code>ptcp:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
4329 <dd>
4330 <p>
4331 Listens for connections on the specified TCP <var>port</var>.
4332 Specify 0 for <var>port</var> to have the kernel automatically
4333 choose an available port. If <var>ip</var>, which must be
4334 expressed as an IP address (not a DNS name), is specified, then
4335 connections are restricted to the specified local IP address
4336 (either IPv4 or IPv6 address). If <var>ip</var> is an IPv6
4337 address, wrap it in square brackets,
4338 e.g. <code>ptcp:6640:[::1]</code>. If <var>ip</var> is not
4339 specified then it listens only on IPv4 addresses.
4340 </p>
4341 <p>
4342 If <var>port</var> is not specified, it defaults to 6640.
4343 </p>
4344 </dd>
4345 </dl>
4346 <p>When multiple managers are configured, the <ref column="target"/>
4347 values must be unique. Duplicate <ref column="target"/> values yield
4348 unspecified results.</p>
4349 </column>
4350
4351 <column name="connection_mode">
4352 <p>
4353 If it is specified, this setting must be one of the following strings
4354 that describes how Open vSwitch contacts this OVSDB client over the
4355 network:
4356 </p>
4357
4358 <dl>
4359 <dt><code>in-band</code></dt>
4360 <dd>
4361 In this mode, this connection's traffic travels over a bridge
4362 managed by Open vSwitch. With this setting, Open vSwitch allows
4363 traffic to and from the client regardless of the contents of the
4364 OpenFlow flow table. (Otherwise, Open vSwitch would never be able
4365 to connect to the client, because it did not have a flow to enable
4366 it.) This is the most common connection mode because it is not
4367 necessary to maintain two independent networks.
4368 </dd>
4369 <dt><code>out-of-band</code></dt>
4370 <dd>
4371 In this mode, the client's traffic uses a control network separate
4372 from that managed by Open vSwitch, that is, Open vSwitch does not
4373 use any of its own network devices to communicate with the client.
4374 The control network must be configured separately, before or after
4375 <code>ovs-vswitchd</code> is started.
4376 </dd>
4377 </dl>
4378
4379 <p>
4380 If not specified, the default is implementation-specific.
4381 </p>
4382 </column>
4383 </group>
4384
4385 <group title="Client Failure Detection and Handling">
4386 <column name="max_backoff">
4387 Maximum number of milliseconds to wait between connection attempts.
4388 Default is implementation-specific.
4389 </column>
4390
4391 <column name="inactivity_probe">
4392 Maximum number of milliseconds of idle time on connection to the client
4393 before sending an inactivity probe message. If Open vSwitch does not
4394 communicate with the client for the specified number of seconds, it
4395 will send a probe. If a response is not received for the same
4396 additional amount of time, Open vSwitch assumes the connection has been
4397 broken and attempts to reconnect. Default is implementation-specific.
4398 A value of 0 disables inactivity probes.
4399 </column>
4400 </group>
4401
4402 <group title="Status">
4403 <p>
4404 Key-value pair of <ref column="is_connected"/> is always updated.
4405 Other key-value pairs in the status columns may be updated depends
4406 on the <ref column="target"/> type.
4407 </p>
4408
4409 <p>
4410 When <ref column="target"/> specifies a connection method that
4411 listens for inbound connections (e.g. <code>ptcp:</code> or
4412 <code>punix:</code>), both <ref column="n_connections"/> and
4413 <ref column="is_connected"/> may also be updated while the
4414 remaining key-value pairs are omitted.
4415 </p>
4416
4417 <p>
4418 On the other hand, when <ref column="target"/> specifies an
4419 outbound connection, all key-value pairs may be updated, except
4420 the above-mentioned two key-value pairs associated with inbound
4421 connection targets. They are omitted.
4422 </p>
4423
4424 <column name="is_connected">
4425 <code>true</code> if currently connected to this manager,
4426 <code>false</code> otherwise.
4427 </column>
4428
4429 <column name="status" key="last_error">
4430 A human-readable description of the last error on the connection
4431 to the manager; i.e. <code>strerror(errno)</code>. This key
4432 will exist only if an error has occurred.
4433 </column>
4434
4435 <column name="status" key="state"
4436 type='{"type": "string", "enum": ["set", ["VOID", "BACKOFF", "CONNECTING", "ACTIVE", "IDLE"]]}'>
4437 <p>
4438 The state of the connection to the manager:
4439 </p>
4440 <dl>
4441 <dt><code>VOID</code></dt>
4442 <dd>Connection is disabled.</dd>
4443
4444 <dt><code>BACKOFF</code></dt>
4445 <dd>Attempting to reconnect at an increasing period.</dd>
4446
4447 <dt><code>CONNECTING</code></dt>
4448 <dd>Attempting to connect.</dd>
4449
4450 <dt><code>ACTIVE</code></dt>
4451 <dd>Connected, remote host responsive.</dd>
4452
4453 <dt><code>IDLE</code></dt>
4454 <dd>Connection is idle. Waiting for response to keep-alive.</dd>
4455 </dl>
4456 <p>
4457 These values may change in the future. They are provided only for
4458 human consumption.
4459 </p>
4460 </column>
4461
4462 <column name="status" key="sec_since_connect"
4463 type='{"type": "integer", "minInteger": 0}'>
4464 The amount of time since this manager last successfully connected
4465 to the database (in seconds). Value is empty if manager has never
4466 successfully connected.
4467 </column>
4468
4469 <column name="status" key="sec_since_disconnect"
4470 type='{"type": "integer", "minInteger": 0}'>
4471 The amount of time since this manager last disconnected from the
4472 database (in seconds). Value is empty if manager has never
4473 disconnected.
4474 </column>
4475
4476 <column name="status" key="locks_held">
4477 Space-separated list of the names of OVSDB locks that the connection
4478 holds. Omitted if the connection does not hold any locks.
4479 </column>
4480
4481 <column name="status" key="locks_waiting">
4482 Space-separated list of the names of OVSDB locks that the connection is
4483 currently waiting to acquire. Omitted if the connection is not waiting
4484 for any locks.
4485 </column>
4486
4487 <column name="status" key="locks_lost">
4488 Space-separated list of the names of OVSDB locks that the connection
4489 has had stolen by another OVSDB client. Omitted if no locks have been
4490 stolen from this connection.
4491 </column>
4492
4493 <column name="status" key="n_connections"
4494 type='{"type": "integer", "minInteger": 2}'>
4495 When <ref column="target"/> specifies a connection method that
4496 listens for inbound connections (e.g. <code>ptcp:</code> or
4497 <code>pssl:</code>) and more than one connection is actually active,
4498 the value is the number of active connections. Otherwise, this
4499 key-value pair is omitted.
4500 </column>
4501
4502 <column name="status" key="bound_port" type='{"type": "integer"}'>
4503 When <ref column="target"/> is <code>ptcp:</code> or
4504 <code>pssl:</code>, this is the TCP port on which the OVSDB server is
4505 listening. (This is particularly useful when <ref
4506 column="target"/> specifies a port of 0, allowing the kernel to
4507 choose any available port.)
4508 </column>
4509 </group>
4510
4511 <group title="Connection Parameters">
4512 <p>
4513 Additional configuration for a connection between the manager
4514 and the Open vSwitch Database.
4515 </p>
4516
4517 <column name="other_config" key="dscp"
4518 type='{"type": "integer"}'>
4519 The Differentiated Service Code Point (DSCP) is specified using 6 bits
4520 in the Type of Service (TOS) field in the IP header. DSCP provides a
4521 mechanism to classify the network traffic and provide Quality of
4522 Service (QoS) on IP networks.
4523
4524 The DSCP value specified here is used when establishing the connection
4525 between the manager and the Open vSwitch. If no value is specified, a
4526 default value of 48 is chosen. Valid DSCP values must be in the range
4527 0 to 63.
4528 </column>
4529 </group>
4530
4531 <group title="Common Columns">
4532 The overall purpose of these columns is described under <code>Common
4533 Columns</code> at the beginning of this document.
4534
4535 <column name="external_ids"/>
4536 <column name="other_config"/>
4537 </group>
4538 </table>
4539
4540 <table name="NetFlow">
4541 A NetFlow target. NetFlow is a protocol that exports a number of
4542 details about terminating IP flows, such as the principals involved
4543 and duration.
4544
4545 <column name="targets">
4546 NetFlow targets in the form
4547 <code><var>ip</var>:<var>port</var></code>. The <var>ip</var>
4548 must be specified numerically, not as a DNS name.
4549 </column>
4550
4551 <column name="engine_id">
4552 Engine ID to use in NetFlow messages. Defaults to datapath index
4553 if not specified.
4554 </column>
4555
4556 <column name="engine_type">
4557 Engine type to use in NetFlow messages. Defaults to datapath
4558 index if not specified.
4559 </column>
4560
4561 <column name="active_timeout">
4562 <p>
4563 The interval at which NetFlow records are sent for flows that
4564 are still active, in seconds. A value of <code>0</code>
4565 requests the default timeout (currently 600 seconds); a value
4566 of <code>-1</code> disables active timeouts.
4567 </p>
4568
4569 <p>
4570 The NetFlow passive timeout, for flows that become inactive,
4571 is not configurable. It will vary depending on the Open
4572 vSwitch version, the forms and contents of the OpenFlow flow
4573 tables, CPU and memory usage, and network activity. A typical
4574 passive timeout is about a second.
4575 </p>
4576 </column>
4577
4578 <column name="add_id_to_interface">
4579 <p>If this column's value is <code>false</code>, the ingress and egress
4580 interface fields of NetFlow flow records are derived from OpenFlow port
4581 numbers. When it is <code>true</code>, the 7 most significant bits of
4582 these fields will be replaced by the least significant 7 bits of the
4583 engine id. This is useful because many NetFlow collectors do not
4584 expect multiple switches to be sending messages from the same host, so
4585 they do not store the engine information which could be used to
4586 disambiguate the traffic.</p>
4587 <p>When this option is enabled, a maximum of 508 ports are supported.</p>
4588 </column>
4589
4590 <group title="Common Columns">
4591 The overall purpose of these columns is described under <code>Common
4592 Columns</code> at the beginning of this document.
4593
4594 <column name="external_ids"/>
4595 </group>
4596 </table>
4597
4598 <table name="SSL">
4599 SSL configuration for an Open_vSwitch.
4600
4601 <column name="private_key">
4602 Name of a PEM file containing the private key used as the switch's
4603 identity for SSL connections to the controller.
4604 </column>
4605
4606 <column name="certificate">
4607 Name of a PEM file containing a certificate, signed by the
4608 certificate authority (CA) used by the controller and manager,
4609 that certifies the switch's private key, identifying a trustworthy
4610 switch.
4611 </column>
4612
4613 <column name="ca_cert">
4614 Name of a PEM file containing the CA certificate used to verify
4615 that the switch is connected to a trustworthy controller.
4616 </column>
4617
4618 <column name="bootstrap_ca_cert">
4619 If set to <code>true</code>, then Open vSwitch will attempt to
4620 obtain the CA certificate from the controller on its first SSL
4621 connection and save it to the named PEM file. If it is successful,
4622 it will immediately drop the connection and reconnect, and from then
4623 on all SSL connections must be authenticated by a certificate signed
4624 by the CA certificate thus obtained. <em>This option exposes the
4625 SSL connection to a man-in-the-middle attack obtaining the initial
4626 CA certificate.</em> It may still be useful for bootstrapping.
4627 </column>
4628
4629 <group title="Common Columns">
4630 The overall purpose of these columns is described under <code>Common
4631 Columns</code> at the beginning of this document.
4632
4633 <column name="external_ids"/>
4634 </group>
4635 </table>
4636
4637 <table name="sFlow">
4638 <p>A set of sFlow(R) targets. sFlow is a protocol for remote
4639 monitoring of switches.</p>
4640
4641 <column name="agent">
4642 Name of the network device whose IP address should be reported as the
4643 ``agent address'' to collectors. If not specified, the agent device is
4644 figured from the first target address and the routing table. If the
4645 routing table does not contain a route to the target, the IP address
4646 defaults to the <ref table="Controller" column="local_ip"/> in the
4647 collector's <ref table="Controller"/>. If an agent IP address cannot be
4648 determined any of these ways, sFlow is disabled.
4649 </column>
4650
4651 <column name="header">
4652 Number of bytes of a sampled packet to send to the collector.
4653 If not specified, the default is 128 bytes.
4654 </column>
4655
4656 <column name="polling">
4657 Polling rate in seconds to send port statistics to the collector.
4658 If not specified, defaults to 30 seconds.
4659 </column>
4660
4661 <column name="sampling">
4662 Rate at which packets should be sampled and sent to the collector.
4663 If not specified, defaults to 400, which means one out of 400
4664 packets, on average, will be sent to the collector.
4665 </column>
4666
4667 <column name="targets">
4668 sFlow targets in the form
4669 <code><var>ip</var>:<var>port</var></code>.
4670 </column>
4671
4672 <group title="Common Columns">
4673 The overall purpose of these columns is described under <code>Common
4674 Columns</code> at the beginning of this document.
4675
4676 <column name="external_ids"/>
4677 </group>
4678 </table>
4679
4680 <table name="IPFIX">
4681 <p>Configuration for sending packets to IPFIX collectors.</p>
4682
4683 <p>
4684 IPFIX is a protocol that exports a number of details about flows. The
4685 IPFIX implementation in Open vSwitch samples packets at a configurable
4686 rate, extracts flow information from those packets, optionally caches and
4687 aggregates the flow information, and sends the result to one or more
4688 collectors.
4689 </p>
4690
4691 <p>
4692 IPFIX in Open vSwitch can be configured two different ways:
4693 </p>
4694
4695 <ul>
4696 <li>
4697 With <em>per-bridge sampling</em>, Open vSwitch performs IPFIX sampling
4698 automatically on all packets that pass through a bridge. To configure
4699 per-bridge sampling, create an <ref table="IPFIX"/> record and point a
4700 <ref table="Bridge"/> table's <ref table="Bridge" column="ipfix"/>
4701 column to it. The <ref table="Flow_Sample_Collector_Set"/> table is
4702 not used for per-bridge sampling.
4703 </li>
4704
4705 <li>
4706 <p>
4707 With <em>flow-based sampling</em>, <code>sample</code> actions in the
4708 OpenFlow flow table drive IPFIX sampling. See
4709 <code>ovs-ofctl</code>(8) for a description of the
4710 <code>sample</code> action.
4711 </p>
4712
4713 <p>
4714 Flow-based sampling also requires database configuration: create a
4715 <ref table="IPFIX"/> record that describes the IPFIX configuration
4716 and a <ref table="Flow_Sample_Collector_Set"/> record that points to
4717 the <ref table="Bridge"/> whose flow table holds the
4718 <code>sample</code> actions and to <ref table="IPFIX"/> record. The
4719 <ref table="Bridge" column="ipfix"/> in the <ref table="Bridge"/>
4720 table is not used for flow-based sampling.
4721 </p>
4722 </li>
4723 </ul>
4724
4725 <column name="targets">
4726 IPFIX target collectors in the form
4727 <code><var>ip</var>:<var>port</var></code>.
4728 </column>
4729
4730 <column name="cache_active_timeout">
4731 The maximum period in seconds for which an IPFIX flow record is
4732 cached and aggregated before being sent. If not specified,
4733 defaults to 0. If 0, caching is disabled.
4734 </column>
4735
4736 <column name="cache_max_flows">
4737 The maximum number of IPFIX flow records that can be cached at a
4738 time. If not specified, defaults to 0. If 0, caching is
4739 disabled.
4740 </column>
4741
4742 <column name="other_config" key="enable-tunnel-sampling"
4743 type='{"type": "boolean"}'>
4744 <p>
4745 Set to <code>true</code> to enable sampling and reporting tunnel
4746 header 7-tuples in IPFIX flow records. Tunnel sampling is enabled
4747 by default.
4748 </p>
4749
4750 <p>
4751 The following enterprise entities report the sampled tunnel info:
4752 </p>
4753
4754 <dl>
4755 <dt>tunnelType:</dt>
4756 <dd>
4757 <p>ID: 891, and enterprise ID 6876 (VMware).</p>
4758 <p>type: unsigned 8-bit integer.</p>
4759 <p>data type semantics: identifier.</p>
4760 <p>description: Identifier of the layer 2 network overlay network
4761 encapsulation type: 0x01 VxLAN, 0x02 GRE, 0x03 LISP, 0x07 GENEVE.</p>
4762 </dd>
4763 <dt>tunnelKey:</dt>
4764 <dd>
4765 <p>ID: 892, and enterprise ID 6876 (VMware).</p>
4766 <p>type: variable-length octetarray.</p>
4767 <p>data type semantics: identifier.</p>
4768 <p>description: Key which is used for identifying an individual
4769 traffic flow within a VxLAN (24-bit VNI), GENEVE (24-bit VNI),
4770 GRE (32-bit key), or LISP (24-bit instance ID) tunnel. The
4771 key is encoded in this octetarray as a 3-, 4-, or 8-byte integer
4772 ID in network byte order.</p>
4773 </dd>
4774 <dt>tunnelSourceIPv4Address:</dt>
4775 <dd>
4776 <p>ID: 893, and enterprise ID 6876 (VMware).</p>
4777 <p>type: unsigned 32-bit integer.</p>
4778 <p>data type semantics: identifier.</p>
4779 <p>description: The IPv4 source address in the tunnel IP packet
4780 header.</p>
4781 </dd>
4782 <dt>tunnelDestinationIPv4Address:</dt>
4783 <dd>
4784 <p>ID: 894, and enterprise ID 6876 (VMware).</p>
4785 <p>type: unsigned 32-bit integer.</p>
4786 <p>data type semantics: identifier.</p>
4787 <p>description: The IPv4 destination address in the tunnel IP
4788 packet header.</p>
4789 </dd>
4790 <dt>tunnelProtocolIdentifier:</dt>
4791 <dd>
4792 <p>ID: 895, and enterprise ID 6876 (VMware).</p>
4793 <p>type: unsigned 8-bit integer.</p>
4794 <p>data type semantics: identifier.</p>
4795 <p>description: The value of the protocol number in the tunnel
4796 IP packet header. The protocol number identifies the tunnel IP
4797 packet payload type.</p>
4798 </dd>
4799 <dt>tunnelSourceTransportPort:</dt>
4800 <dd>
4801 <p>ID: 896, and enterprise ID 6876 (VMware).</p>
4802 <p>type: unsigned 16-bit integer.</p>
4803 <p>data type semantics: identifier.</p>
4804 <p>description: The source port identifier in the tunnel transport
4805 header. For the transport protocols UDP, TCP, and SCTP, this is
4806 the source port number given in the respective header.</p>
4807 </dd>
4808 <dt>tunnelDestinationTransportPort:</dt>
4809 <dd>
4810 <p>ID: 897, and enterprise ID 6876 (VMware).</p>
4811 <p>type: unsigned 16-bit integer.</p>
4812 <p>data type semantics: identifier.</p>
4813 <p>description: The destination port identifier in the tunnel
4814 transport header. For the transport protocols UDP, TCP, and SCTP,
4815 this is the destination port number given in the respective header.
4816 </p>
4817 </dd>
4818 </dl>
4819
4820 <p>
4821 Before Open vSwitch 2.5.90, <ref column="other_config"
4822 key="enable-tunnel-sampling"/> was only supported with per-bridge
4823 sampling, and ignored otherwise. Open vSwitch 2.5.90 and later support
4824 <ref column="other_config" key="enable-tunnel-sampling"/> for
4825 per-bridge and per-flow sampling.
4826 </p>
4827 </column>
4828
4829 <column name="other_config" key="virtual_obs_id"
4830 type='{"type": "string"}'>
4831 <p>
4832 A string that accompanies each IPFIX flow record. Its intended use is
4833 for the ``virtual observation ID,'' an identifier of a virtual
4834 observation point that is locally unique in a virtual network. It
4835 describes a location in the virtual network where IP packets can be
4836 observed. The maximum length is 254 bytes. If not specified, the
4837 field is omitted from the IPFIX flow record.
4838 </p>
4839
4840 <p>
4841 The following enterprise entity reports the specified virtual
4842 observation ID:
4843 </p>
4844
4845 <dl>
4846 <dt>virtualObsID:</dt>
4847 <dd>
4848 <p>ID: 898, and enterprise ID 6876 (VMware).</p>
4849 <p>type: variable-length string.</p>
4850 <p>data type semantics: identifier.</p>
4851 <p>description: A virtual observation domain ID that is locally
4852 unique in a virtual network.
4853 </p>
4854 </dd>
4855 </dl>
4856
4857 <p>
4858 This feature was introduced in Open vSwitch 2.5.90.
4859 </p>
4860 </column>
4861
4862 <group title="Per-Bridge Sampling">
4863 <p>
4864 These values affect only per-bridge sampling. See above for a
4865 description of the differences between per-bridge and flow-based
4866 sampling.
4867 </p>
4868
4869 <column name="sampling">
4870 The rate at which packets should be sampled and sent to each target
4871 collector. If not specified, defaults to 400, which means one out of
4872 400 packets, on average, will be sent to each target collector.
4873 </column>
4874
4875 <column name="obs_domain_id">
4876 The IPFIX Observation Domain ID sent in each IPFIX packet. If not
4877 specified, defaults to 0.
4878 </column>
4879
4880 <column name="obs_point_id">
4881 The IPFIX Observation Point ID sent in each IPFIX flow record. If not
4882 specified, defaults to 0.
4883 </column>
4884
4885 <column name="other_config" key="enable-input-sampling"
4886 type='{"type": "boolean"}'>
4887 By default, Open vSwitch samples and reports flows at bridge port input
4888 in IPFIX flow records. Set this column to <code>false</code> to
4889 disable input sampling.
4890 </column>
4891
4892 <column name="other_config" key="enable-output-sampling"
4893 type='{"type": "boolean"}'>
4894 By default, Open vSwitch samples and reports flows at bridge port
4895 output in IPFIX flow records. Set this column to <code>false</code> to
4896 disable output sampling.
4897 </column>
4898 </group>
4899
4900 <group title="Common Columns">
4901 The overall purpose of these columns is described under <code>Common
4902 Columns</code> at the beginning of this document.
4903
4904 <column name="external_ids"/>
4905 </group>
4906 </table>
4907
4908 <table name="Flow_Sample_Collector_Set">
4909 <p>
4910 A set of IPFIX collectors of packet samples generated by OpenFlow
4911 <code>sample</code> actions. This table is used only for IPFIX
4912 flow-based sampling, not for per-bridge sampling (see the <ref
4913 table="IPFIX"/> table for a description of the two forms).
4914 </p>
4915
4916 <column name="id">
4917 The ID of this collector set, unique among the bridge's
4918 collector sets, to be used as the <code>collector_set_id</code>
4919 in OpenFlow <code>sample</code> actions.
4920 </column>
4921
4922 <column name="bridge">
4923 The bridge into which OpenFlow <code>sample</code> actions can
4924 be added to send packet samples to this set of IPFIX collectors.
4925 </column>
4926
4927 <column name="ipfix">
4928 Configuration of the set of IPFIX collectors to send one flow
4929 record per sampled packet to.
4930 </column>
4931
4932 <group title="Common Columns">
4933 The overall purpose of these columns is described under <code>Common
4934 Columns</code> at the beginning of this document.
4935
4936 <column name="external_ids"/>
4937 </group>
4938 </table>
4939
4940 <table name="AutoAttach">
4941 <p>
4942 Auto Attach configuration within a bridge. The IETF Auto-Attach SPBM
4943 draft standard describes a compact method of using IEEE 802.1AB Link
4944 Layer Discovery Protocol (LLDP) together with a IEEE 802.1aq Shortest
4945 Path Bridging (SPB) network to automatically attach network devices
4946 to individual services in a SPB network. The intent here is to allow
4947 network applications and devices using OVS to be able to easily take
4948 advantage of features offered by industry standard SPB networks.
4949 </p>
4950
4951 <p>
4952 Auto Attach (AA) uses LLDP to communicate between a directly connected
4953 Auto Attach Client (AAC) and Auto Attach Server (AAS). The LLDP protocol
4954 is extended to add two new Type-Length-Value tuples (TLVs). The first
4955 new TLV supports the ongoing discovery of directly connected AA
4956 correspondents. Auto Attach operates by regularly transmitting AA
4957 discovery TLVs between the AA client and AA server. By exchanging these
4958 discovery messages, both the AAC and AAS learn the system name and
4959 system description of their peer. In the OVS context, OVS operates as
4960 the AA client and the AA server resides on a switch at the edge of the
4961 SPB network.
4962 </p>
4963
4964 <p>
4965 Once AA discovery has been completed the AAC then uses the second new TLV
4966 to deliver identifier mappings from the AAC to the AAS. A primary feature
4967 of Auto Attach is to facilitate the mapping of VLANs defined outside the
4968 SPB network onto service ids (ISIDs) defined within the SPM network. By
4969 doing so individual external VLANs can be mapped onto specific SPB
4970 network services. These VLAN id to ISID mappings can be configured and
4971 managed locally using new options added to the ovs-vsctl command.
4972 </p>
4973
4974 <p>
4975 The Auto Attach OVS feature does not provide a full implementation of
4976 the LLDP protocol. Support for the mandatory TLVs as defined by the LLDP
4977 standard and support for the AA TLV extensions is provided. LLDP
4978 protocol support in OVS can be enabled or disabled on a port by port
4979 basis. LLDP support is disabled by default.
4980 </p>
4981
4982 <column name="system_name">
4983 The system_name string is exported in LLDP messages. It should uniquely
4984 identify the bridge in the network.
4985 </column>
4986
4987 <column name="system_description">
4988 The system_description string is exported in LLDP messages. It should
4989 describe the type of software and hardware.
4990 </column>
4991
4992 <column name="mappings">
4993 A mapping from SPB network Individual Service Identifier (ISID) to VLAN
4994 id.
4995 </column>
4996 </table>
4997 </database>