]> git.proxmox.com Git - mirror_ubuntu-bionic-kernel.git/blame - Documentation/networking/ip-sysctl.txt
net/mlx4_en: fix error return code in mlx4_en_get_qp()
[mirror_ubuntu-bionic-kernel.git] / Documentation / networking / ip-sysctl.txt
CommitLineData
1da177e4
LT
1/proc/sys/net/ipv4/* Variables:
2
3ip_forward - BOOLEAN
4 0 - disabled (default)
e18f5feb 5 not 0 - enabled
1da177e4
LT
6
7 Forward Packets between interfaces.
8
9 This variable is special, its change resets all configuration
10 parameters to their default state (RFC1122 for hosts, RFC1812
11 for routers)
12
13ip_default_ttl - INTEGER
cc6f02dd
ED
14 Default value of TTL field (Time To Live) for outgoing (but not
15 forwarded) IP packets. Should be between 1 and 255 inclusive.
16 Default: 64 (as recommended by RFC1700)
1da177e4 17
cd174e67
HFS
18ip_no_pmtu_disc - INTEGER
19 Disable Path MTU Discovery. If enabled in mode 1 and a
188b04d5
HFS
20 fragmentation-required ICMP is received, the PMTU to this
21 destination will be set to min_pmtu (see below). You will need
22 to raise min_pmtu to the smallest interface MTU on your system
23 manually if you want to avoid locally generated fragments.
cd174e67
HFS
24
25 In mode 2 incoming Path MTU Discovery messages will be
26 discarded. Outgoing frames are handled the same as in mode 1,
27 implicitly setting IP_PMTUDISC_DONT on every created socket.
28
29 Possible values: 0-2
188b04d5 30 Default: FALSE
1da177e4
LT
31
32min_pmtu - INTEGER
20db93c3 33 default 552 - minimum discovered Path MTU
1da177e4 34
cbaf087a
BG
35route/max_size - INTEGER
36 Maximum number of routes allowed in the kernel. Increase
37 this when using large numbers of interfaces and/or routes.
38
2724680b
YH
39neigh/default/gc_thresh1 - INTEGER
40 Minimum number of entries to keep. Garbage collector will not
41 purge entries if there are fewer than this number.
b66c66dc 42 Default: 128
2724680b 43
cbaf087a
BG
44neigh/default/gc_thresh3 - INTEGER
45 Maximum number of neighbor entries allowed. Increase this
46 when using large numbers of interfaces and when communicating
47 with large numbers of directly-connected peers.
cc868028 48 Default: 1024
cbaf087a 49
8b5c171b
ED
50neigh/default/unres_qlen_bytes - INTEGER
51 The maximum number of bytes which may be used by packets
52 queued for each unresolved address by other network layers.
53 (added in linux 3.3)
3b09adcb 54 Setting negative value is meaningless and will return error.
cc868028 55 Default: 65536 Bytes(64KB)
8b5c171b
ED
56
57neigh/default/unres_qlen - INTEGER
58 The maximum number of packets which may be queued for each
59 unresolved address by other network layers.
60 (deprecated in linux 3.3) : use unres_qlen_bytes instead.
cc868028 61 Prior to linux 3.3, the default value is 3 which may cause
5d248c49 62 unexpected packet loss. The current default value is calculated
cc868028
SW
63 according to default value of unres_qlen_bytes and true size of
64 packet.
65 Default: 31
8b5c171b 66
1da177e4
LT
67mtu_expires - INTEGER
68 Time, in seconds, that cached PMTU information is kept.
69
70min_adv_mss - INTEGER
71 The advertised MSS depends on the first hop route MTU, but will
72 never be lower than this setting.
73
74IP Fragmentation:
75
76ipfrag_high_thresh - INTEGER
e18f5feb 77 Maximum memory used to reassemble IP fragments. When
1da177e4
LT
78 ipfrag_high_thresh bytes of memory is allocated for this purpose,
79 the fragment handler will toss packets until ipfrag_low_thresh
80 is reached.
e18f5feb 81
1da177e4 82ipfrag_low_thresh - INTEGER
e18f5feb 83 See ipfrag_high_thresh
1da177e4
LT
84
85ipfrag_time - INTEGER
e18f5feb 86 Time in seconds to keep an IP fragment in memory.
1da177e4
LT
87
88ipfrag_secret_interval - INTEGER
e18f5feb 89 Regeneration interval (in seconds) of the hash secret (or lifetime
1da177e4
LT
90 for the hash secret) for IP fragments.
91 Default: 600
92
89cee8b1 93ipfrag_max_dist - INTEGER
e18f5feb
JDB
94 ipfrag_max_dist is a non-negative integer value which defines the
95 maximum "disorder" which is allowed among fragments which share a
96 common IP source address. Note that reordering of packets is
97 not unusual, but if a large number of fragments arrive from a source
98 IP address while a particular fragment queue remains incomplete, it
99 probably indicates that one or more fragments belonging to that queue
100 have been lost. When ipfrag_max_dist is positive, an additional check
101 is done on fragments before they are added to a reassembly queue - if
102 ipfrag_max_dist (or more) fragments have arrived from a particular IP
103 address between additions to any IP fragment queue using that source
104 address, it's presumed that one or more fragments in the queue are
105 lost. The existing fragment queue will be dropped, and a new one
89cee8b1
HX
106 started. An ipfrag_max_dist value of zero disables this check.
107
108 Using a very small value, e.g. 1 or 2, for ipfrag_max_dist can
109 result in unnecessarily dropping fragment queues when normal
e18f5feb
JDB
110 reordering of packets occurs, which could lead to poor application
111 performance. Using a very large value, e.g. 50000, increases the
112 likelihood of incorrectly reassembling IP fragments that originate
89cee8b1
HX
113 from different IP datagrams, which could result in data corruption.
114 Default: 64
115
1da177e4
LT
116INET peer storage:
117
118inet_peer_threshold - INTEGER
e18f5feb 119 The approximate size of the storage. Starting from this threshold
1da177e4
LT
120 entries will be thrown aggressively. This threshold also determines
121 entries' time-to-live and time intervals between garbage collection
122 passes. More entries, less time-to-live, less GC interval.
123
124inet_peer_minttl - INTEGER
125 Minimum time-to-live of entries. Should be enough to cover fragment
126 time-to-live on the reassembling side. This minimum time-to-live is
127 guaranteed if the pool size is less than inet_peer_threshold.
77a538d5 128 Measured in seconds.
1da177e4
LT
129
130inet_peer_maxttl - INTEGER
131 Maximum time-to-live of entries. Unused entries will expire after
132 this period of time if there is no memory pressure on the pool (i.e.
133 when the number of entries in the pool is very small).
77a538d5 134 Measured in seconds.
1da177e4 135
e18f5feb 136TCP variables:
1da177e4 137
ef56e622
SH
138somaxconn - INTEGER
139 Limit of socket listen() backlog, known in userspace as SOMAXCONN.
140 Defaults to 128. See also tcp_max_syn_backlog for additional tuning
141 for TCP sockets.
142
ef56e622
SH
143tcp_abort_on_overflow - BOOLEAN
144 If listening service is too slow to accept new connections,
145 reset them. Default state is FALSE. It means that if overflow
146 occurred due to a burst, connection will recover. Enable this
147 option _only_ if you are really sure that listening daemon
148 cannot be tuned to accept connections faster. Enabling this
149 option can harm clients of your server.
1da177e4 150
ef56e622
SH
151tcp_adv_win_scale - INTEGER
152 Count buffering overhead as bytes/2^tcp_adv_win_scale
153 (if tcp_adv_win_scale > 0) or bytes-bytes/2^(-tcp_adv_win_scale),
154 if it is <= 0.
0147fc05 155 Possible values are [-31, 31], inclusive.
b49960a0 156 Default: 1
1da177e4 157
ef56e622
SH
158tcp_allowed_congestion_control - STRING
159 Show/set the congestion control choices available to non-privileged
160 processes. The list is a subset of those listed in
161 tcp_available_congestion_control.
162 Default is "reno" and the default setting (tcp_congestion_control).
1da177e4 163
ef56e622
SH
164tcp_app_win - INTEGER
165 Reserve max(window/2^tcp_app_win, mss) of window for application
166 buffer. Value 0 is special, it means that nothing is reserved.
167 Default: 31
1da177e4 168
f54b3111
ED
169tcp_autocorking - BOOLEAN
170 Enable TCP auto corking :
171 When applications do consecutive small write()/sendmsg() system calls,
172 we try to coalesce these small writes as much as possible, to lower
173 total amount of sent packets. This is done if at least one prior
174 packet for the flow is waiting in Qdisc queues or device transmit
175 queue. Applications can still use TCP_CORK for optimal behavior
176 when they know how/when to uncork their sockets.
177 Default : 1
178
ef56e622
SH
179tcp_available_congestion_control - STRING
180 Shows the available congestion control choices that are registered.
181 More congestion control algorithms may be available as modules,
182 but not loaded.
1da177e4 183
71599cd1 184tcp_base_mss - INTEGER
4edc2f34
SH
185 The initial value of search_low to be used by the packetization layer
186 Path MTU discovery (MTU probing). If MTU probing is enabled,
187 this is the initial MSS used by the connection.
71599cd1 188
ef56e622
SH
189tcp_congestion_control - STRING
190 Set the congestion control algorithm to be used for new
191 connections. The algorithm "reno" is always available, but
192 additional choices may be available based on kernel configuration.
193 Default is set as part of kernel configuration.
d8a6e65f
ED
194 For passive connections, the listener congestion control choice
195 is inherited.
196 [see setsockopt(listenfd, SOL_TCP, TCP_CONGESTION, "name" ...) ]
1da177e4 197
ef56e622
SH
198tcp_dsack - BOOLEAN
199 Allows TCP to send "duplicate" SACKs.
1da177e4 200
eed530b6
YC
201tcp_early_retrans - INTEGER
202 Enable Early Retransmit (ER), per RFC 5827. ER lowers the threshold
203 for triggering fast retransmit when the amount of outstanding data is
204 small and when no previously unsent data can be transmitted (such
6ba8a3b1 205 that limited transmit could be used). Also controls the use of
3dd17ede 206 Tail loss probe (TLP) that converts RTOs occurring due to tail
6ba8a3b1 207 losses into fast recovery (draft-dukkipati-tcpm-tcp-loss-probe-01).
eed530b6
YC
208 Possible values:
209 0 disables ER
210 1 enables ER
211 2 enables ER but delays fast recovery and fast retransmit
212 by a fourth of RTT. This mitigates connection falsely
213 recovers when network has a small degree of reordering
214 (less than 3 packets).
6ba8a3b1
ND
215 3 enables delayed ER and TLP.
216 4 enables TLP only.
217 Default: 3
eed530b6 218
34a6ef38 219tcp_ecn - INTEGER
7e3a2dc5
RJ
220 Control use of Explicit Congestion Notification (ECN) by TCP.
221 ECN is used only when both ends of the TCP connection indicate
222 support for it. This feature is useful in avoiding losses due
223 to congestion by allowing supporting routers to signal
224 congestion before having to drop packets.
255cac91 225 Possible values are:
7e3a2dc5 226 0 Disable ECN. Neither initiate nor accept ECN.
3d55b323
VS
227 1 Enable ECN when requested by incoming connections and
228 also request ECN on outgoing connection attempts.
229 2 Enable ECN when requested by incoming connections
7e3a2dc5 230 but do not request ECN on outgoing connections.
255cac91 231 Default: 2
ef56e622
SH
232
233tcp_fack - BOOLEAN
234 Enable FACK congestion avoidance and fast retransmission.
235 The value is not used, if tcp_sack is not enabled.
1da177e4
LT
236
237tcp_fin_timeout - INTEGER
d825da2e
RJ
238 The length of time an orphaned (no longer referenced by any
239 application) connection will remain in the FIN_WAIT_2 state
240 before it is aborted at the local end. While a perfectly
241 valid "receive only" state for an un-orphaned connection, an
242 orphaned connection in FIN_WAIT_2 state could otherwise wait
243 forever for the remote to close its end of the connection.
244 Cf. tcp_max_orphans
245 Default: 60 seconds
1da177e4 246
89808060 247tcp_frto - INTEGER
e33099f9 248 Enables Forward RTO-Recovery (F-RTO) defined in RFC5682.
cd99889c 249 F-RTO is an enhanced recovery algorithm for TCP retransmission
e33099f9
YC
250 timeouts. It is particularly beneficial in networks where the
251 RTT fluctuates (e.g., wireless). F-RTO is sender-side only
252 modification. It does not require any support from the peer.
253
254 By default it's enabled with a non-zero value. 0 disables F-RTO.
1da177e4 255
ef56e622
SH
256tcp_keepalive_time - INTEGER
257 How often TCP sends out keepalive messages when keepalive is enabled.
258 Default: 2hours.
1da177e4 259
ef56e622
SH
260tcp_keepalive_probes - INTEGER
261 How many keepalive probes TCP sends out, until it decides that the
262 connection is broken. Default value: 9.
263
264tcp_keepalive_intvl - INTEGER
265 How frequently the probes are send out. Multiplied by
266 tcp_keepalive_probes it is time to kill not responding connection,
267 after probes started. Default value: 75sec i.e. connection
268 will be aborted after ~11 minutes of retries.
269
270tcp_low_latency - BOOLEAN
271 If set, the TCP stack makes decisions that prefer lower
272 latency as opposed to higher throughput. By default, this
273 option is not set meaning that higher throughput is preferred.
274 An example of an application where this default should be
275 changed would be a Beowulf compute cluster.
276 Default: 0
1da177e4
LT
277
278tcp_max_orphans - INTEGER
279 Maximal number of TCP sockets not attached to any user file handle,
280 held by system. If this number is exceeded orphaned connections are
281 reset immediately and warning is printed. This limit exists
282 only to prevent simple DoS attacks, you _must_ not rely on this
283 or lower the limit artificially, but rather increase it
284 (probably, after increasing installed memory),
285 if network conditions require more than default value,
286 and tune network services to linger and kill such states
287 more aggressively. Let me to remind again: each orphan eats
288 up to ~64K of unswappable memory.
289
1da177e4 290tcp_max_syn_backlog - INTEGER
99b53bdd
PP
291 Maximal number of remembered connection requests, which have not
292 received an acknowledgment from connecting client.
293 The minimal value is 128 for low memory machines, and it will
294 increase in proportion to the memory of machine.
295 If server suffers from overload, try increasing this number.
1da177e4 296
ef56e622
SH
297tcp_max_tw_buckets - INTEGER
298 Maximal number of timewait sockets held by system simultaneously.
299 If this number is exceeded time-wait socket is immediately destroyed
300 and warning is printed. This limit exists only to prevent
301 simple DoS attacks, you _must_ not lower the limit artificially,
302 but rather increase it (probably, after increasing installed memory),
303 if network conditions require more than default value.
1da177e4 304
ef56e622
SH
305tcp_mem - vector of 3 INTEGERs: min, pressure, max
306 min: below this number of pages TCP is not bothered about its
307 memory appetite.
1da177e4 308
ef56e622
SH
309 pressure: when amount of memory allocated by TCP exceeds this number
310 of pages, TCP moderates its memory consumption and enters memory
311 pressure mode, which is exited when memory consumption falls
312 under "min".
1da177e4 313
ef56e622 314 max: number of pages allowed for queueing by all TCP sockets.
1da177e4 315
ef56e622
SH
316 Defaults are calculated at boot time from amount of available
317 memory.
1da177e4 318
71599cd1 319tcp_moderate_rcvbuf - BOOLEAN
4edc2f34 320 If set, TCP performs receive buffer auto-tuning, attempting to
71599cd1
JH
321 automatically size the buffer (no greater than tcp_rmem[2]) to
322 match the size required by the path for full throughput. Enabled by
323 default.
324
325tcp_mtu_probing - INTEGER
326 Controls TCP Packetization-Layer Path MTU Discovery. Takes three
327 values:
328 0 - Disabled
329 1 - Disabled by default, enabled when an ICMP black hole detected
330 2 - Always enabled, use initial MSS of tcp_base_mss.
331
332tcp_no_metrics_save - BOOLEAN
333 By default, TCP saves various connection metrics in the route cache
334 when the connection closes, so that connections established in the
335 near future can use these to set initial conditions. Usually, this
336 increases overall performance, but may sometimes cause performance
0f035b8e 337 degradation. If set, TCP will not cache metrics on closing
71599cd1
JH
338 connections.
339
ef56e622 340tcp_orphan_retries - INTEGER
5d789229
DL
341 This value influences the timeout of a locally closed TCP connection,
342 when RTO retransmissions remain unacknowledged.
343 See tcp_retries2 for more details.
344
06b8fc5d 345 The default value is 8.
5d789229 346 If your machine is a loaded WEB server,
ef56e622
SH
347 you should think about lowering this value, such sockets
348 may consume significant resources. Cf. tcp_max_orphans.
1da177e4
LT
349
350tcp_reordering - INTEGER
351 Maximal reordering of packets in a TCP stream.
e18f5feb 352 Default: 3
1da177e4
LT
353
354tcp_retrans_collapse - BOOLEAN
355 Bug-to-bug compatibility with some broken printers.
356 On retransmit try to send bigger packets to work around bugs in
357 certain TCP stacks.
358
ef56e622 359tcp_retries1 - INTEGER
5d789229
DL
360 This value influences the time, after which TCP decides, that
361 something is wrong due to unacknowledged RTO retransmissions,
362 and reports this suspicion to the network layer.
363 See tcp_retries2 for more details.
364
365 RFC 1122 recommends at least 3 retransmissions, which is the
366 default.
1da177e4 367
ef56e622 368tcp_retries2 - INTEGER
5d789229
DL
369 This value influences the timeout of an alive TCP connection,
370 when RTO retransmissions remain unacknowledged.
371 Given a value of N, a hypothetical TCP connection following
372 exponential backoff with an initial RTO of TCP_RTO_MIN would
373 retransmit N times before killing the connection at the (N+1)th RTO.
374
375 The default value of 15 yields a hypothetical timeout of 924.6
376 seconds and is a lower bound for the effective timeout.
377 TCP will effectively time out at the first RTO which exceeds the
378 hypothetical timeout.
379
380 RFC 1122 recommends at least 100 seconds for the timeout,
381 which corresponds to a value of at least 8.
1da177e4 382
ef56e622
SH
383tcp_rfc1337 - BOOLEAN
384 If set, the TCP stack behaves conforming to RFC1337. If unset,
385 we are not conforming to RFC, but prevent TCP TIME_WAIT
386 assassination.
387 Default: 0
1da177e4
LT
388
389tcp_rmem - vector of 3 INTEGERs: min, default, max
390 min: Minimal size of receive buffer used by TCP sockets.
391 It is guaranteed to each TCP socket, even under moderate memory
392 pressure.
6539fefd 393 Default: 1 page
1da177e4 394
53025f5e 395 default: initial size of receive buffer used by TCP sockets.
1da177e4
LT
396 This value overrides net.core.rmem_default used by other protocols.
397 Default: 87380 bytes. This value results in window of 65535 with
398 default setting of tcp_adv_win_scale and tcp_app_win:0 and a bit
399 less for default tcp_app_win. See below about these variables.
400
401 max: maximal size of receive buffer allowed for automatically
402 selected receiver buffers for TCP socket. This value does not override
53025f5e
BF
403 net.core.rmem_max. Calling setsockopt() with SO_RCVBUF disables
404 automatic tuning of that socket's receive buffer size, in which
405 case this value is ignored.
b49960a0 406 Default: between 87380B and 6MB, depending on RAM size.
1da177e4 407
ef56e622
SH
408tcp_sack - BOOLEAN
409 Enable select acknowledgments (SACKS).
1da177e4 410
ef56e622
SH
411tcp_slow_start_after_idle - BOOLEAN
412 If set, provide RFC2861 behavior and time out the congestion
413 window after an idle period. An idle period is defined at
414 the current RTO. If unset, the congestion window will not
415 be timed out after an idle period.
416 Default: 1
1da177e4 417
ef56e622 418tcp_stdurg - BOOLEAN
4edc2f34 419 Use the Host requirements interpretation of the TCP urgent pointer field.
ef56e622
SH
420 Most hosts use the older BSD interpretation, so if you turn this on
421 Linux might not communicate correctly with them.
422 Default: FALSE
1da177e4 423
ef56e622
SH
424tcp_synack_retries - INTEGER
425 Number of times SYNACKs for a passive TCP connection attempt will
426 be retransmitted. Should not be higher than 255. Default value
6c9ff979
AB
427 is 5, which corresponds to 31seconds till the last retransmission
428 with the current initial RTO of 1second. With this the final timeout
429 for a passive TCP connection will happen after 63seconds.
1da177e4 430
ef56e622 431tcp_syncookies - BOOLEAN
a3c910d2 432 Only valid when the kernel was compiled with CONFIG_SYN_COOKIES
ef56e622 433 Send out syncookies when the syn backlog queue of a socket
4edc2f34 434 overflows. This is to prevent against the common 'SYN flood attack'
a3c910d2 435 Default: 1
1da177e4 436
ef56e622
SH
437 Note, that syncookies is fallback facility.
438 It MUST NOT be used to help highly loaded servers to stand
4edc2f34 439 against legal connection rate. If you see SYN flood warnings
ef56e622
SH
440 in your logs, but investigation shows that they occur
441 because of overload with legal connections, you should tune
442 another parameters until this warning disappear.
443 See: tcp_max_syn_backlog, tcp_synack_retries, tcp_abort_on_overflow.
1da177e4 444
ef56e622
SH
445 syncookies seriously violate TCP protocol, do not allow
446 to use TCP extensions, can result in serious degradation
447 of some services (f.e. SMTP relaying), visible not by you,
448 but your clients and relays, contacting you. While you see
4edc2f34 449 SYN flood warnings in logs not being really flooded, your server
ef56e622 450 is seriously misconfigured.
1da177e4 451
5ad37d5d
HFS
452 If you want to test which effects syncookies have to your
453 network connections you can set this knob to 2 to enable
454 unconditionally generation of syncookies.
455
cf60af03
YC
456tcp_fastopen - INTEGER
457 Enable TCP Fast Open feature (draft-ietf-tcpm-fastopen) to send data
458 in the opening SYN packet. To use this feature, the client application
10467163
JC
459 must use sendmsg() or sendto() with MSG_FASTOPEN flag rather than
460 connect() to perform a TCP handshake automatically.
461
462 The values (bitmap) are
0d41cca4 463 1: Enables sending data in the opening SYN on the client w/ MSG_FASTOPEN.
10467163
JC
464 2: Enables TCP Fast Open on the server side, i.e., allowing data in
465 a SYN packet to be accepted and passed to the application before
466 3-way hand shake finishes.
467 4: Send data in the opening SYN regardless of cookie availability and
468 without a cookie option.
469 0x100: Accept SYN data w/o validating the cookie.
470 0x200: Accept data-in-SYN w/o any cookie option present.
471 0x400/0x800: Enable Fast Open on all listeners regardless of the
472 TCP_FASTOPEN socket option. The two different flags designate two
473 different ways of setting max_qlen without the TCP_FASTOPEN socket
474 option.
cf60af03 475
0d41cca4 476 Default: 1
cf60af03 477
10467163
JC
478 Note that the client & server side Fast Open flags (1 and 2
479 respectively) must be also enabled before the rest of flags can take
480 effect.
481
482 See include/net/tcp.h and the code for more details.
483
ef56e622
SH
484tcp_syn_retries - INTEGER
485 Number of times initial SYNs for an active TCP connection attempt
486 will be retransmitted. Should not be higher than 255. Default value
3b09adcb 487 is 6, which corresponds to 63seconds till the last retransmission
6c9ff979
AB
488 with the current initial RTO of 1second. With this the final timeout
489 for an active TCP connection attempt will happen after 127seconds.
ef56e622
SH
490
491tcp_timestamps - BOOLEAN
492 Enable timestamps as defined in RFC1323.
1da177e4 493
95bd09eb
ED
494tcp_min_tso_segs - INTEGER
495 Minimal number of segments per TSO frame.
496 Since linux-3.12, TCP does an automatic sizing of TSO frames,
497 depending on flow rate, instead of filling 64Kbytes packets.
498 For specific usages, it's possible to force TCP to build big
499 TSO frames. Note that TCP stack might split too big TSO packets
500 if available window is too small.
501 Default: 2
502
1da177e4 503tcp_tso_win_divisor - INTEGER
ef56e622
SH
504 This allows control over what percentage of the congestion window
505 can be consumed by a single TSO frame.
506 The setting of this parameter is a choice between burstiness and
507 building larger TSO frames.
508 Default: 3
1da177e4 509
ef56e622
SH
510tcp_tw_recycle - BOOLEAN
511 Enable fast recycling TIME-WAIT sockets. Default value is 0.
512 It should not be changed without advice/request of technical
513 experts.
1da177e4 514
ef56e622
SH
515tcp_tw_reuse - BOOLEAN
516 Allow to reuse TIME-WAIT sockets for new connections when it is
517 safe from protocol viewpoint. Default value is 0.
518 It should not be changed without advice/request of technical
519 experts.
ce7bc3bf 520
ef56e622
SH
521tcp_window_scaling - BOOLEAN
522 Enable window scaling as defined in RFC1323.
3ff825b2 523
ef56e622 524tcp_wmem - vector of 3 INTEGERs: min, default, max
53025f5e 525 min: Amount of memory reserved for send buffers for TCP sockets.
ef56e622 526 Each TCP socket has rights to use it due to fact of its birth.
6539fefd 527 Default: 1 page
9d7bcfc6 528
53025f5e
BF
529 default: initial size of send buffer used by TCP sockets. This
530 value overrides net.core.wmem_default used by other protocols.
531 It is usually lower than net.core.wmem_default.
ef56e622
SH
532 Default: 16K
533
53025f5e
BF
534 max: Maximal amount of memory allowed for automatically tuned
535 send buffers for TCP sockets. This value does not override
536 net.core.wmem_max. Calling setsockopt() with SO_SNDBUF disables
537 automatic tuning of that socket's send buffer size, in which case
538 this value is ignored.
539 Default: between 64K and 4MB, depending on RAM size.
1da177e4 540
c9bee3b7
ED
541tcp_notsent_lowat - UNSIGNED INTEGER
542 A TCP socket can control the amount of unsent bytes in its write queue,
543 thanks to TCP_NOTSENT_LOWAT socket option. poll()/select()/epoll()
544 reports POLLOUT events if the amount of unsent bytes is below a per
545 socket value, and if the write queue is not full. sendmsg() will
546 also not add new buffers if the limit is hit.
547
548 This global variable controls the amount of unsent data for
549 sockets not using TCP_NOTSENT_LOWAT. For these sockets, a change
550 to the global variable has immediate effect.
551
552 Default: UINT_MAX (0xFFFFFFFF)
553
15d99e02
RJ
554tcp_workaround_signed_windows - BOOLEAN
555 If set, assume no receipt of a window scaling option means the
556 remote TCP is broken and treats the window as a signed quantity.
557 If unset, assume the remote TCP is not broken even if we do
558 not receive a window scaling option from them.
559 Default: 0
560
72d0b7a8
CL
561tcp_dma_copybreak - INTEGER
562 Lower limit, in bytes, of the size of socket reads that will be
563 offloaded to a DMA copy engine, if one is present in the system
564 and CONFIG_NET_DMA is enabled.
565 Default: 4096
566
36e31b0a
AP
567tcp_thin_linear_timeouts - BOOLEAN
568 Enable dynamic triggering of linear timeouts for thin streams.
569 If set, a check is performed upon retransmission by timeout to
570 determine if the stream is thin (less than 4 packets in flight).
571 As long as the stream is found to be thin, up to 6 linear
572 timeouts may be performed before exponential backoff mode is
573 initiated. This improves retransmission latency for
574 non-aggressive thin streams, often found to be time-dependent.
575 For more information on thin streams, see
576 Documentation/networking/tcp-thin.txt
577 Default: 0
578
7e380175
AP
579tcp_thin_dupack - BOOLEAN
580 Enable dynamic triggering of retransmissions after one dupACK
581 for thin streams. If set, a check is performed upon reception
582 of a dupACK to determine if the stream is thin (less than 4
583 packets in flight). As long as the stream is found to be thin,
584 data is retransmitted on the first received dupACK. This
585 improves retransmission latency for non-aggressive thin
586 streams, often found to be time-dependent.
587 For more information on thin streams, see
588 Documentation/networking/tcp-thin.txt
589 Default: 0
590
46d3ceab
ED
591tcp_limit_output_bytes - INTEGER
592 Controls TCP Small Queue limit per tcp socket.
593 TCP bulk sender tends to increase packets in flight until it
594 gets losses notifications. With SNDBUF autotuning, this can
595 result in a large amount of packets queued in qdisc/device
596 on the local machine, hurting latency of other flows, for
597 typical pfifo_fast qdiscs.
598 tcp_limit_output_bytes limits the number of bytes on qdisc
599 or device to reduce artificial RTT/cwnd and reduce bufferbloat.
46d3ceab
ED
600 Default: 131072
601
282f23c6
ED
602tcp_challenge_ack_limit - INTEGER
603 Limits number of Challenge ACK sent per second, as recommended
604 in RFC 5961 (Improving TCP's Robustness to Blind In-Window Attacks)
605 Default: 100
606
95766fff
HA
607UDP variables:
608
609udp_mem - vector of 3 INTEGERs: min, pressure, max
610 Number of pages allowed for queueing by all UDP sockets.
611
612 min: Below this number of pages UDP is not bothered about its
613 memory appetite. When amount of memory allocated by UDP exceeds
614 this number, UDP starts to moderate memory usage.
615
616 pressure: This value was introduced to follow format of tcp_mem.
617
618 max: Number of pages allowed for queueing by all UDP sockets.
619
620 Default is calculated at boot time from amount of available memory.
621
622udp_rmem_min - INTEGER
623 Minimal size of receive buffer used by UDP sockets in moderation.
624 Each UDP socket is able to use the size for receiving data, even if
625 total pages of UDP sockets exceed udp_mem pressure. The unit is byte.
6539fefd 626 Default: 1 page
95766fff
HA
627
628udp_wmem_min - INTEGER
629 Minimal size of send buffer used by UDP sockets in moderation.
630 Each UDP socket is able to use the size for sending data, even if
631 total pages of UDP sockets exceed udp_mem pressure. The unit is byte.
6539fefd 632 Default: 1 page
95766fff 633
8802f616
PM
634CIPSOv4 Variables:
635
636cipso_cache_enable - BOOLEAN
637 If set, enable additions to and lookups from the CIPSO label mapping
638 cache. If unset, additions are ignored and lookups always result in a
639 miss. However, regardless of the setting the cache is still
640 invalidated when required when means you can safely toggle this on and
641 off and the cache will always be "safe".
642 Default: 1
643
644cipso_cache_bucket_size - INTEGER
645 The CIPSO label cache consists of a fixed size hash table with each
646 hash bucket containing a number of cache entries. This variable limits
647 the number of entries in each hash bucket; the larger the value the
648 more CIPSO label mappings that can be cached. When the number of
649 entries in a given hash bucket reaches this limit adding new entries
650 causes the oldest entry in the bucket to be removed to make room.
651 Default: 10
652
653cipso_rbm_optfmt - BOOLEAN
654 Enable the "Optimized Tag 1 Format" as defined in section 3.4.2.6 of
655 the CIPSO draft specification (see Documentation/netlabel for details).
656 This means that when set the CIPSO tag will be padded with empty
657 categories in order to make the packet data 32-bit aligned.
658 Default: 0
659
660cipso_rbm_structvalid - BOOLEAN
661 If set, do a very strict check of the CIPSO option when
662 ip_options_compile() is called. If unset, relax the checks done during
663 ip_options_compile(). Either way is "safe" as errors are caught else
664 where in the CIPSO processing code but setting this to 0 (False) should
665 result in less work (i.e. it should be faster) but could cause problems
666 with other implementations that require strict checking.
667 Default: 0
668
1da177e4
LT
669IP Variables:
670
671ip_local_port_range - 2 INTEGERS
672 Defines the local port range that is used by TCP and UDP to
e18f5feb 673 choose the local port. The first number is the first, the
5d6bd861
FLVC
674 second the last local port number. The default values are
675 32768 and 61000 respectively.
1da177e4 676
e3826f1e
AW
677ip_local_reserved_ports - list of comma separated ranges
678 Specify the ports which are reserved for known third-party
679 applications. These ports will not be used by automatic port
680 assignments (e.g. when calling connect() or bind() with port
681 number 0). Explicit port allocation behavior is unchanged.
682
683 The format used for both input and output is a comma separated
684 list of ranges (e.g. "1,2-4,10-10" for ports 1, 2, 3, 4 and
685 10). Writing to the file will clear all previously reserved
686 ports and update the current list with the one given in the
687 input.
688
689 Note that ip_local_port_range and ip_local_reserved_ports
690 settings are independent and both are considered by the kernel
691 when determining which ports are available for automatic port
692 assignments.
693
694 You can reserve ports which are not in the current
695 ip_local_port_range, e.g.:
696
697 $ cat /proc/sys/net/ipv4/ip_local_port_range
698 32000 61000
699 $ cat /proc/sys/net/ipv4/ip_local_reserved_ports
700 8080,9148
701
702 although this is redundant. However such a setting is useful
703 if later the port range is changed to a value that will
704 include the reserved ports.
705
706 Default: Empty
707
1da177e4
LT
708ip_nonlocal_bind - BOOLEAN
709 If set, allows processes to bind() to non-local IP addresses,
710 which can be quite useful - but may break some applications.
711 Default: 0
712
713ip_dynaddr - BOOLEAN
714 If set non-zero, enables support for dynamic addresses.
715 If set to a non-zero value larger than 1, a kernel log
716 message will be printed when dynamic address rewriting
717 occurs.
718 Default: 0
719
e3d73bce
CW
720ip_early_demux - BOOLEAN
721 Optimize input packet processing down to one demux for
722 certain kinds of local sockets. Currently we only do this
723 for established TCP sockets.
724
725 It may add an additional cost for pure routing workloads that
726 reduces overall throughput, in such case you should disable it.
727 Default: 1
728
1da177e4 729icmp_echo_ignore_all - BOOLEAN
7ce31246
DM
730 If set non-zero, then the kernel will ignore all ICMP ECHO
731 requests sent to it.
732 Default: 0
733
1da177e4 734icmp_echo_ignore_broadcasts - BOOLEAN
7ce31246
DM
735 If set non-zero, then the kernel will ignore all ICMP ECHO and
736 TIMESTAMP requests sent to it via broadcast/multicast.
737 Default: 1
1da177e4
LT
738
739icmp_ratelimit - INTEGER
740 Limit the maximal rates for sending ICMP packets whose type matches
741 icmp_ratemask (see below) to specific targets.
6dbf4bca
SH
742 0 to disable any limiting,
743 otherwise the minimal space between responses in milliseconds.
744 Default: 1000
1da177e4
LT
745
746icmp_ratemask - INTEGER
747 Mask made of ICMP types for which rates are being limited.
748 Significant bits: IHGFEDCBA9876543210
749 Default mask: 0000001100000011000 (6168)
750
751 Bit definitions (see include/linux/icmp.h):
752 0 Echo Reply
753 3 Destination Unreachable *
754 4 Source Quench *
755 5 Redirect
756 8 Echo Request
757 B Time Exceeded *
758 C Parameter Problem *
759 D Timestamp Request
760 E Timestamp Reply
761 F Info Request
762 G Info Reply
763 H Address Mask Request
764 I Address Mask Reply
765
766 * These are rate limited by default (see default mask above)
767
768icmp_ignore_bogus_error_responses - BOOLEAN
769 Some routers violate RFC1122 by sending bogus responses to broadcast
770 frames. Such violations are normally logged via a kernel warning.
771 If this is set to TRUE, the kernel will not give such warnings, which
772 will avoid log file clutter.
e8b265e8 773 Default: 1
1da177e4 774
95f7daf1
H
775icmp_errors_use_inbound_ifaddr - BOOLEAN
776
777 If zero, icmp error messages are sent with the primary address of
778 the exiting interface.
e18f5feb 779
95f7daf1
H
780 If non-zero, the message will be sent with the primary address of
781 the interface that received the packet that caused the icmp error.
782 This is the behaviour network many administrators will expect from
783 a router. And it can make debugging complicated network layouts
e18f5feb 784 much easier.
95f7daf1
H
785
786 Note that if no primary address exists for the interface selected,
787 then the primary address of the first non-loopback interface that
d6bc8ac9 788 has one will be used regardless of this setting.
95f7daf1
H
789
790 Default: 0
791
1da177e4
LT
792igmp_max_memberships - INTEGER
793 Change the maximum number of multicast groups we can subscribe to.
794 Default: 20
795
d67ef35f
JE
796 Theoretical maximum value is bounded by having to send a membership
797 report in a single datagram (i.e. the report can't span multiple
798 datagrams, or risk confusing the switch and leaving groups you don't
799 intend to).
1da177e4 800
d67ef35f
JE
801 The number of supported groups 'M' is bounded by the number of group
802 report entries you can fit into a single datagram of 65535 bytes.
803
804 M = 65536-sizeof (ip header)/(sizeof(Group record))
805
806 Group records are variable length, with a minimum of 12 bytes.
807 So net.ipv4.igmp_max_memberships should not be set higher than:
808
809 (65536-24) / 12 = 5459
810
811 The value 5459 assumes no IP header options, so in practice
812 this number may be lower.
813
814 conf/interface/* changes special settings per interface (where
815 "interface" is the name of your network interface)
816
817 conf/all/* is special, changes the settings for all interfaces
1da177e4
LT
818
819log_martians - BOOLEAN
820 Log packets with impossible addresses to kernel log.
821 log_martians for the interface will be enabled if at least one of
822 conf/{all,interface}/log_martians is set to TRUE,
823 it will be disabled otherwise
824
825accept_redirects - BOOLEAN
826 Accept ICMP redirect messages.
827 accept_redirects for the interface will be enabled if:
e18f5feb
JDB
828 - both conf/{all,interface}/accept_redirects are TRUE in the case
829 forwarding for the interface is enabled
1da177e4 830 or
e18f5feb
JDB
831 - at least one of conf/{all,interface}/accept_redirects is TRUE in the
832 case forwarding for the interface is disabled
1da177e4
LT
833 accept_redirects for the interface will be disabled otherwise
834 default TRUE (host)
835 FALSE (router)
836
837forwarding - BOOLEAN
838 Enable IP forwarding on this interface.
839
840mc_forwarding - BOOLEAN
841 Do multicast routing. The kernel needs to be compiled with CONFIG_MROUTE
842 and a multicast routing daemon is required.
e18f5feb
JDB
843 conf/all/mc_forwarding must also be set to TRUE to enable multicast
844 routing for the interface
1da177e4
LT
845
846medium_id - INTEGER
847 Integer value used to differentiate the devices by the medium they
848 are attached to. Two devices can have different id values when
849 the broadcast packets are received only on one of them.
850 The default value 0 means that the device is the only interface
851 to its medium, value of -1 means that medium is not known.
e18f5feb 852
1da177e4
LT
853 Currently, it is used to change the proxy_arp behavior:
854 the proxy_arp feature is enabled for packets forwarded between
855 two devices attached to different media.
856
857proxy_arp - BOOLEAN
858 Do proxy arp.
859 proxy_arp for the interface will be enabled if at least one of
860 conf/{all,interface}/proxy_arp is set to TRUE,
861 it will be disabled otherwise
862
65324144
JDB
863proxy_arp_pvlan - BOOLEAN
864 Private VLAN proxy arp.
865 Basically allow proxy arp replies back to the same interface
866 (from which the ARP request/solicitation was received).
867
868 This is done to support (ethernet) switch features, like RFC
869 3069, where the individual ports are NOT allowed to
870 communicate with each other, but they are allowed to talk to
871 the upstream router. As described in RFC 3069, it is possible
872 to allow these hosts to communicate through the upstream
873 router by proxy_arp'ing. Don't need to be used together with
874 proxy_arp.
875
876 This technology is known by different names:
877 In RFC 3069 it is called VLAN Aggregation.
878 Cisco and Allied Telesyn call it Private VLAN.
879 Hewlett-Packard call it Source-Port filtering or port-isolation.
880 Ericsson call it MAC-Forced Forwarding (RFC Draft).
881
1da177e4
LT
882shared_media - BOOLEAN
883 Send(router) or accept(host) RFC1620 shared media redirects.
884 Overrides ip_secure_redirects.
885 shared_media for the interface will be enabled if at least one of
886 conf/{all,interface}/shared_media is set to TRUE,
887 it will be disabled otherwise
888 default TRUE
889
890secure_redirects - BOOLEAN
891 Accept ICMP redirect messages only for gateways,
892 listed in default gateway list.
893 secure_redirects for the interface will be enabled if at least one of
894 conf/{all,interface}/secure_redirects is set to TRUE,
895 it will be disabled otherwise
896 default TRUE
897
898send_redirects - BOOLEAN
899 Send redirects, if router.
900 send_redirects for the interface will be enabled if at least one of
901 conf/{all,interface}/send_redirects is set to TRUE,
902 it will be disabled otherwise
903 Default: TRUE
904
905bootp_relay - BOOLEAN
906 Accept packets with source address 0.b.c.d destined
907 not to this host as local ones. It is supposed, that
908 BOOTP relay daemon will catch and forward such packets.
909 conf/all/bootp_relay must also be set to TRUE to enable BOOTP relay
910 for the interface
911 default FALSE
912 Not Implemented Yet.
913
914accept_source_route - BOOLEAN
915 Accept packets with SRR option.
916 conf/all/accept_source_route must also be set to TRUE to accept packets
917 with SRR option on the interface
918 default TRUE (router)
919 FALSE (host)
920
8153a10c 921accept_local - BOOLEAN
c801e3cc
DM
922 Accept packets with local source addresses. In combination
923 with suitable routing, this can be used to direct packets
924 between two local interfaces over the wire and have them
925 accepted properly.
926
927 rp_filter must be set to a non-zero value in order for
928 accept_local to have an effect.
929
8153a10c
PM
930 default FALSE
931
d0daebc3
TG
932route_localnet - BOOLEAN
933 Do not consider loopback addresses as martian source or destination
934 while routing. This enables the use of 127/8 for local routing purposes.
935 default FALSE
936
c1cf8422 937rp_filter - INTEGER
1da177e4 938 0 - No source validation.
c1cf8422
SH
939 1 - Strict mode as defined in RFC3704 Strict Reverse Path
940 Each incoming packet is tested against the FIB and if the interface
941 is not the best reverse path the packet check will fail.
942 By default failed packets are discarded.
943 2 - Loose mode as defined in RFC3704 Loose Reverse Path
944 Each incoming packet's source address is also tested against the FIB
945 and if the source address is not reachable via any interface
946 the packet check will fail.
947
e18f5feb 948 Current recommended practice in RFC3704 is to enable strict mode
bf869c30 949 to prevent IP spoofing from DDos attacks. If using asymmetric routing
e18f5feb 950 or other complicated routing, then loose mode is recommended.
c1cf8422 951
1f5865e7
SW
952 The max value from conf/{all,interface}/rp_filter is used
953 when doing source validation on the {interface}.
1da177e4
LT
954
955 Default value is 0. Note that some distributions enable it
956 in startup scripts.
957
958arp_filter - BOOLEAN
959 1 - Allows you to have multiple network interfaces on the same
960 subnet, and have the ARPs for each interface be answered
961 based on whether or not the kernel would route a packet from
962 the ARP'd IP out that interface (therefore you must use source
963 based routing for this to work). In other words it allows control
964 of which cards (usually 1) will respond to an arp request.
965
966 0 - (default) The kernel can respond to arp requests with addresses
967 from other interfaces. This may seem wrong but it usually makes
968 sense, because it increases the chance of successful communication.
969 IP addresses are owned by the complete host on Linux, not by
970 particular interfaces. Only for more complex setups like load-
971 balancing, does this behaviour cause problems.
972
973 arp_filter for the interface will be enabled if at least one of
974 conf/{all,interface}/arp_filter is set to TRUE,
975 it will be disabled otherwise
976
977arp_announce - INTEGER
978 Define different restriction levels for announcing the local
979 source IP address from IP packets in ARP requests sent on
980 interface:
981 0 - (default) Use any local address, configured on any interface
982 1 - Try to avoid local addresses that are not in the target's
983 subnet for this interface. This mode is useful when target
984 hosts reachable via this interface require the source IP
985 address in ARP requests to be part of their logical network
986 configured on the receiving interface. When we generate the
987 request we will check all our subnets that include the
988 target IP and will preserve the source address if it is from
989 such subnet. If there is no such subnet we select source
990 address according to the rules for level 2.
991 2 - Always use the best local address for this target.
992 In this mode we ignore the source address in the IP packet
993 and try to select local address that we prefer for talks with
994 the target host. Such local address is selected by looking
995 for primary IP addresses on all our subnets on the outgoing
996 interface that include the target IP address. If no suitable
997 local address is found we select the first local address
998 we have on the outgoing interface or on all other interfaces,
999 with the hope we will receive reply for our request and
1000 even sometimes no matter the source IP address we announce.
1001
1002 The max value from conf/{all,interface}/arp_announce is used.
1003
1004 Increasing the restriction level gives more chance for
1005 receiving answer from the resolved target while decreasing
1006 the level announces more valid sender's information.
1007
1008arp_ignore - INTEGER
1009 Define different modes for sending replies in response to
1010 received ARP requests that resolve local target IP addresses:
1011 0 - (default): reply for any local target IP address, configured
1012 on any interface
1013 1 - reply only if the target IP address is local address
1014 configured on the incoming interface
1015 2 - reply only if the target IP address is local address
1016 configured on the incoming interface and both with the
1017 sender's IP address are part from same subnet on this interface
1018 3 - do not reply for local addresses configured with scope host,
1019 only resolutions for global and link addresses are replied
1020 4-7 - reserved
1021 8 - do not reply for all local addresses
1022
1023 The max value from conf/{all,interface}/arp_ignore is used
1024 when ARP request is received on the {interface}
1025
eefef1cf
SH
1026arp_notify - BOOLEAN
1027 Define mode for notification of address and device changes.
1028 0 - (default): do nothing
3f8dc236 1029 1 - Generate gratuitous arp requests when device is brought up
eefef1cf
SH
1030 or hardware address changes.
1031
c1b1bce8 1032arp_accept - BOOLEAN
6d955180
OP
1033 Define behavior for gratuitous ARP frames who's IP is not
1034 already present in the ARP table:
1035 0 - don't create new entries in the ARP table
1036 1 - create new entries in the ARP table
1037
1038 Both replies and requests type gratuitous arp will trigger the
1039 ARP table to be updated, if this setting is on.
1040
1041 If the ARP table already contains the IP address of the
1042 gratuitous arp frame, the arp table will be updated regardless
1043 if this setting is on or off.
1044
c1b1bce8 1045
1da177e4
LT
1046app_solicit - INTEGER
1047 The maximum number of probes to send to the user space ARP daemon
1048 via netlink before dropping back to multicast probes (see
1049 mcast_solicit). Defaults to 0.
1050
1051disable_policy - BOOLEAN
1052 Disable IPSEC policy (SPD) for this interface
1053
1054disable_xfrm - BOOLEAN
1055 Disable IPSEC encryption on this interface, whatever the policy
1056
fc4eba58
HFS
1057igmpv2_unsolicited_report_interval - INTEGER
1058 The interval in milliseconds in which the next unsolicited
1059 IGMPv1 or IGMPv2 report retransmit will take place.
1060 Default: 10000 (10 seconds)
1da177e4 1061
fc4eba58
HFS
1062igmpv3_unsolicited_report_interval - INTEGER
1063 The interval in milliseconds in which the next unsolicited
1064 IGMPv3 report retransmit will take place.
1065 Default: 1000 (1 seconds)
1da177e4
LT
1066
1067tag - INTEGER
1068 Allows you to write a number, which can be used as required.
1069 Default value is 0.
1070
1da177e4
LT
1071Alexey Kuznetsov.
1072kuznet@ms2.inr.ac.ru
1073
1074Updated by:
1075Andi Kleen
1076ak@muc.de
1077Nicolas Delon
1078delon.nicolas@wanadoo.fr
1079
1080
1081
1082
1083/proc/sys/net/ipv6/* Variables:
1084
1085IPv6 has no global variables such as tcp_*. tcp_* settings under ipv4/ also
1086apply to IPv6 [XXX?].
1087
1088bindv6only - BOOLEAN
1089 Default value for IPV6_V6ONLY socket option,
e18f5feb 1090 which restricts use of the IPv6 socket to IPv6 communication
1da177e4
LT
1091 only.
1092 TRUE: disable IPv4-mapped address feature
1093 FALSE: enable IPv4-mapped address feature
1094
d5c073ca 1095 Default: FALSE (as specified in RFC3493)
1da177e4
LT
1096
1097IPv6 Fragmentation:
1098
1099ip6frag_high_thresh - INTEGER
e18f5feb 1100 Maximum memory used to reassemble IPv6 fragments. When
1da177e4
LT
1101 ip6frag_high_thresh bytes of memory is allocated for this purpose,
1102 the fragment handler will toss packets until ip6frag_low_thresh
1103 is reached.
e18f5feb 1104
1da177e4 1105ip6frag_low_thresh - INTEGER
e18f5feb 1106 See ip6frag_high_thresh
1da177e4
LT
1107
1108ip6frag_time - INTEGER
1109 Time in seconds to keep an IPv6 fragment in memory.
1110
1111ip6frag_secret_interval - INTEGER
e18f5feb 1112 Regeneration interval (in seconds) of the hash secret (or lifetime
1da177e4
LT
1113 for the hash secret) for IPv6 fragments.
1114 Default: 600
1115
1116conf/default/*:
1117 Change the interface-specific default settings.
1118
1119
1120conf/all/*:
e18f5feb 1121 Change all the interface-specific settings.
1da177e4
LT
1122
1123 [XXX: Other special features than forwarding?]
1124
1125conf/all/forwarding - BOOLEAN
e18f5feb 1126 Enable global IPv6 forwarding between all interfaces.
1da177e4 1127
e18f5feb 1128 IPv4 and IPv6 work differently here; e.g. netfilter must be used
1da177e4
LT
1129 to control which interfaces may forward packets and which not.
1130
e18f5feb 1131 This also sets all interfaces' Host/Router setting
1da177e4
LT
1132 'forwarding' to the specified value. See below for details.
1133
1134 This referred to as global forwarding.
1135
fbea49e1
YH
1136proxy_ndp - BOOLEAN
1137 Do proxy ndp.
1138
1da177e4
LT
1139conf/interface/*:
1140 Change special settings per interface.
1141
e18f5feb 1142 The functional behaviour for certain settings is different
1da177e4
LT
1143 depending on whether local forwarding is enabled or not.
1144
605b91c8 1145accept_ra - INTEGER
1da177e4 1146 Accept Router Advertisements; autoconfigure using them.
e18f5feb 1147
026359bc
TA
1148 It also determines whether or not to transmit Router
1149 Solicitations. If and only if the functional setting is to
1150 accept Router Advertisements, Router Solicitations will be
1151 transmitted.
1152
ae8abfa0
TG
1153 Possible values are:
1154 0 Do not accept Router Advertisements.
1155 1 Accept Router Advertisements if forwarding is disabled.
1156 2 Overrule forwarding behaviour. Accept Router Advertisements
1157 even if forwarding is enabled.
1158
1da177e4
LT
1159 Functional default: enabled if local forwarding is disabled.
1160 disabled if local forwarding is enabled.
1161
65f5c7c1
YH
1162accept_ra_defrtr - BOOLEAN
1163 Learn default router in Router Advertisement.
1164
1165 Functional default: enabled if accept_ra is enabled.
1166 disabled if accept_ra is disabled.
1167
c4fd30eb 1168accept_ra_pinfo - BOOLEAN
2fe0ae78 1169 Learn Prefix Information in Router Advertisement.
c4fd30eb
YH
1170
1171 Functional default: enabled if accept_ra is enabled.
1172 disabled if accept_ra is disabled.
1173
09c884d4
YH
1174accept_ra_rt_info_max_plen - INTEGER
1175 Maximum prefix length of Route Information in RA.
1176
1177 Route Information w/ prefix larger than or equal to this
1178 variable shall be ignored.
1179
1180 Functional default: 0 if accept_ra_rtr_pref is enabled.
1181 -1 if accept_ra_rtr_pref is disabled.
1182
930d6ff2
YH
1183accept_ra_rtr_pref - BOOLEAN
1184 Accept Router Preference in RA.
1185
1186 Functional default: enabled if accept_ra is enabled.
1187 disabled if accept_ra is disabled.
1188
1da177e4
LT
1189accept_redirects - BOOLEAN
1190 Accept Redirects.
1191
1192 Functional default: enabled if local forwarding is disabled.
1193 disabled if local forwarding is enabled.
1194
0bcbc926
YH
1195accept_source_route - INTEGER
1196 Accept source routing (routing extension header).
1197
bb4dbf9e 1198 >= 0: Accept only routing header type 2.
0bcbc926
YH
1199 < 0: Do not accept routing header.
1200
1201 Default: 0
1202
1da177e4 1203autoconf - BOOLEAN
e18f5feb 1204 Autoconfigure addresses using Prefix Information in Router
1da177e4
LT
1205 Advertisements.
1206
c4fd30eb
YH
1207 Functional default: enabled if accept_ra_pinfo is enabled.
1208 disabled if accept_ra_pinfo is disabled.
1da177e4
LT
1209
1210dad_transmits - INTEGER
1211 The amount of Duplicate Address Detection probes to send.
1212 Default: 1
e18f5feb 1213
605b91c8 1214forwarding - INTEGER
e18f5feb 1215 Configure interface-specific Host/Router behaviour.
1da177e4 1216
e18f5feb 1217 Note: It is recommended to have the same setting on all
1da177e4
LT
1218 interfaces; mixed router/host scenarios are rather uncommon.
1219
ae8abfa0
TG
1220 Possible values are:
1221 0 Forwarding disabled
1222 1 Forwarding enabled
ae8abfa0
TG
1223
1224 FALSE (0):
1da177e4
LT
1225
1226 By default, Host behaviour is assumed. This means:
1227
1228 1. IsRouter flag is not set in Neighbour Advertisements.
026359bc
TA
1229 2. If accept_ra is TRUE (default), transmit Router
1230 Solicitations.
e18f5feb 1231 3. If accept_ra is TRUE (default), accept Router
1da177e4
LT
1232 Advertisements (and do autoconfiguration).
1233 4. If accept_redirects is TRUE (default), accept Redirects.
1234
ae8abfa0 1235 TRUE (1):
1da177e4 1236
e18f5feb 1237 If local forwarding is enabled, Router behaviour is assumed.
1da177e4
LT
1238 This means exactly the reverse from the above:
1239
1240 1. IsRouter flag is set in Neighbour Advertisements.
026359bc 1241 2. Router Solicitations are not sent unless accept_ra is 2.
ae8abfa0 1242 3. Router Advertisements are ignored unless accept_ra is 2.
1da177e4
LT
1243 4. Redirects are ignored.
1244
ae8abfa0
TG
1245 Default: 0 (disabled) if global forwarding is disabled (default),
1246 otherwise 1 (enabled).
1da177e4
LT
1247
1248hop_limit - INTEGER
1249 Default Hop Limit to set.
1250 Default: 64
1251
1252mtu - INTEGER
1253 Default Maximum Transfer Unit
1254 Default: 1280 (IPv6 required minimum)
1255
52e16356
YH
1256router_probe_interval - INTEGER
1257 Minimum interval (in seconds) between Router Probing described
1258 in RFC4191.
1259
1260 Default: 60
1261
1da177e4
LT
1262router_solicitation_delay - INTEGER
1263 Number of seconds to wait after interface is brought up
1264 before sending Router Solicitations.
1265 Default: 1
1266
1267router_solicitation_interval - INTEGER
1268 Number of seconds to wait between Router Solicitations.
1269 Default: 4
1270
1271router_solicitations - INTEGER
e18f5feb 1272 Number of Router Solicitations to send until assuming no
1da177e4
LT
1273 routers are present.
1274 Default: 3
1275
1276use_tempaddr - INTEGER
1277 Preference for Privacy Extensions (RFC3041).
1278 <= 0 : disable Privacy Extensions
1279 == 1 : enable Privacy Extensions, but prefer public
1280 addresses over temporary addresses.
1281 > 1 : enable Privacy Extensions and prefer temporary
1282 addresses over public addresses.
1283 Default: 0 (for most devices)
1284 -1 (for point-to-point devices and loopback devices)
1285
1286temp_valid_lft - INTEGER
1287 valid lifetime (in seconds) for temporary addresses.
1288 Default: 604800 (7 days)
1289
1290temp_prefered_lft - INTEGER
1291 Preferred lifetime (in seconds) for temporary addresses.
1292 Default: 86400 (1 day)
1293
1294max_desync_factor - INTEGER
1295 Maximum value for DESYNC_FACTOR, which is a random value
e18f5feb 1296 that ensures that clients don't synchronize with each
1da177e4
LT
1297 other and generate new addresses at exactly the same time.
1298 value is in seconds.
1299 Default: 600
e18f5feb 1300
1da177e4
LT
1301regen_max_retry - INTEGER
1302 Number of attempts before give up attempting to generate
1303 valid temporary addresses.
1304 Default: 5
1305
1306max_addresses - INTEGER
e79dc484
BH
1307 Maximum number of autoconfigured addresses per interface. Setting
1308 to zero disables the limitation. It is not recommended to set this
1309 value too large (or to zero) because it would be an easy way to
1310 crash the kernel by allowing too many addresses to be created.
1da177e4
LT
1311 Default: 16
1312
778d80be 1313disable_ipv6 - BOOLEAN
9bdd8d40
BH
1314 Disable IPv6 operation. If accept_dad is set to 2, this value
1315 will be dynamically set to TRUE if DAD fails for the link-local
1316 address.
778d80be
YH
1317 Default: FALSE (enable IPv6 operation)
1318
56d417b1
BH
1319 When this value is changed from 1 to 0 (IPv6 is being enabled),
1320 it will dynamically create a link-local address on the given
1321 interface and start Duplicate Address Detection, if necessary.
1322
1323 When this value is changed from 0 to 1 (IPv6 is being disabled),
1324 it will dynamically delete all address on the given interface.
1325
1b34be74
YH
1326accept_dad - INTEGER
1327 Whether to accept DAD (Duplicate Address Detection).
1328 0: Disable DAD
1329 1: Enable DAD (default)
1330 2: Enable DAD, and disable IPv6 operation if MAC-based duplicate
1331 link-local address has been found.
1332
f7734fdf
OP
1333force_tllao - BOOLEAN
1334 Enable sending the target link-layer address option even when
1335 responding to a unicast neighbor solicitation.
1336 Default: FALSE
1337
1338 Quoting from RFC 2461, section 4.4, Target link-layer address:
1339
1340 "The option MUST be included for multicast solicitations in order to
1341 avoid infinite Neighbor Solicitation "recursion" when the peer node
1342 does not have a cache entry to return a Neighbor Advertisements
1343 message. When responding to unicast solicitations, the option can be
1344 omitted since the sender of the solicitation has the correct link-
1345 layer address; otherwise it would not have be able to send the unicast
1346 solicitation in the first place. However, including the link-layer
1347 address in this case adds little overhead and eliminates a potential
1348 race condition where the sender deletes the cached link-layer address
1349 prior to receiving a response to a previous solicitation."
1350
db2b620a
HFS
1351ndisc_notify - BOOLEAN
1352 Define mode for notification of address and device changes.
1353 0 - (default): do nothing
1354 1 - Generate unsolicited neighbour advertisements when device is brought
1355 up or hardware address changes.
1356
fc4eba58
HFS
1357mldv1_unsolicited_report_interval - INTEGER
1358 The interval in milliseconds in which the next unsolicited
1359 MLDv1 report retransmit will take place.
1360 Default: 10000 (10 seconds)
1361
1362mldv2_unsolicited_report_interval - INTEGER
1363 The interval in milliseconds in which the next unsolicited
1364 MLDv2 report retransmit will take place.
1365 Default: 1000 (1 second)
1366
f2127810
DB
1367force_mld_version - INTEGER
1368 0 - (default) No enforcement of a MLD version, MLDv1 fallback allowed
1369 1 - Enforce to use MLD version 1
1370 2 - Enforce to use MLD version 2
1371
b800c3b9
HFS
1372suppress_frag_ndisc - INTEGER
1373 Control RFC 6980 (Security Implications of IPv6 Fragmentation
1374 with IPv6 Neighbor Discovery) behavior:
1375 1 - (default) discard fragmented neighbor discovery packets
1376 0 - allow fragmented neighbor discovery packets
1377
1da177e4
LT
1378icmp/*:
1379ratelimit - INTEGER
1380 Limit the maximal rates for sending ICMPv6 packets.
6dbf4bca
SH
1381 0 to disable any limiting,
1382 otherwise the minimal space between responses in milliseconds.
1383 Default: 1000
1da177e4
LT
1384
1385
1386IPv6 Update by:
1387Pekka Savola <pekkas@netcore.fi>
1388YOSHIFUJI Hideaki / USAGI Project <yoshfuji@linux-ipv6.org>
1389
1390
1391/proc/sys/net/bridge/* Variables:
1392
1393bridge-nf-call-arptables - BOOLEAN
1394 1 : pass bridged ARP traffic to arptables' FORWARD chain.
1395 0 : disable this.
1396 Default: 1
1397
1398bridge-nf-call-iptables - BOOLEAN
1399 1 : pass bridged IPv4 traffic to iptables' chains.
1400 0 : disable this.
1401 Default: 1
1402
1403bridge-nf-call-ip6tables - BOOLEAN
1404 1 : pass bridged IPv6 traffic to ip6tables' chains.
1405 0 : disable this.
1406 Default: 1
1407
1408bridge-nf-filter-vlan-tagged - BOOLEAN
516299d2
MM
1409 1 : pass bridged vlan-tagged ARP/IP/IPv6 traffic to {arp,ip,ip6}tables.
1410 0 : disable this.
4981682c 1411 Default: 0
516299d2
MM
1412
1413bridge-nf-filter-pppoe-tagged - BOOLEAN
1414 1 : pass bridged pppoe-tagged IP/IPv6 traffic to {ip,ip6}tables.
1da177e4 1415 0 : disable this.
4981682c 1416 Default: 0
1da177e4 1417
4981682c
PNA
1418bridge-nf-pass-vlan-input-dev - BOOLEAN
1419 1: if bridge-nf-filter-vlan-tagged is enabled, try to find a vlan
1420 interface on the bridge and set the netfilter input device to the vlan.
1421 This allows use of e.g. "iptables -i br0.1" and makes the REDIRECT
1422 target work with vlan-on-top-of-bridge interfaces. When no matching
1423 vlan interface is found, or this switch is off, the input device is
1424 set to the bridge interface.
1425 0: disable bridge netfilter vlan interface lookup.
1426 Default: 0
1da177e4 1427
32e8d494
VY
1428proc/sys/net/sctp/* Variables:
1429
1430addip_enable - BOOLEAN
1431 Enable or disable extension of Dynamic Address Reconfiguration
1432 (ADD-IP) functionality specified in RFC5061. This extension provides
1433 the ability to dynamically add and remove new addresses for the SCTP
1434 associations.
1435
1436 1: Enable extension.
1437
1438 0: Disable extension.
1439
1440 Default: 0
1441
1442addip_noauth_enable - BOOLEAN
1443 Dynamic Address Reconfiguration (ADD-IP) requires the use of
1444 authentication to protect the operations of adding or removing new
1445 addresses. This requirement is mandated so that unauthorized hosts
1446 would not be able to hijack associations. However, older
1447 implementations may not have implemented this requirement while
1448 allowing the ADD-IP extension. For reasons of interoperability,
1449 we provide this variable to control the enforcement of the
1450 authentication requirement.
1451
1452 1: Allow ADD-IP extension to be used without authentication. This
1453 should only be set in a closed environment for interoperability
1454 with older implementations.
1455
1456 0: Enforce the authentication requirement
1457
1458 Default: 0
1459
1460auth_enable - BOOLEAN
1461 Enable or disable Authenticated Chunks extension. This extension
1462 provides the ability to send and receive authenticated chunks and is
1463 required for secure operation of Dynamic Address Reconfiguration
1464 (ADD-IP) extension.
1465
1466 1: Enable this extension.
1467 0: Disable this extension.
1468
1469 Default: 0
1470
1471prsctp_enable - BOOLEAN
1472 Enable or disable the Partial Reliability extension (RFC3758) which
1473 is used to notify peers that a given DATA should no longer be expected.
1474
1475 1: Enable extension
1476 0: Disable
1477
1478 Default: 1
1479
1480max_burst - INTEGER
1481 The limit of the number of new packets that can be initially sent. It
1482 controls how bursty the generated traffic can be.
1483
1484 Default: 4
1485
1486association_max_retrans - INTEGER
1487 Set the maximum number for retransmissions that an association can
1488 attempt deciding that the remote end is unreachable. If this value
1489 is exceeded, the association is terminated.
1490
1491 Default: 10
1492
1493max_init_retransmits - INTEGER
1494 The maximum number of retransmissions of INIT and COOKIE-ECHO chunks
1495 that an association will attempt before declaring the destination
1496 unreachable and terminating.
1497
1498 Default: 8
1499
1500path_max_retrans - INTEGER
1501 The maximum number of retransmissions that will be attempted on a given
1502 path. Once this threshold is exceeded, the path is considered
1503 unreachable, and new traffic will use a different path when the
1504 association is multihomed.
1505
1506 Default: 5
1507
5aa93bcf
NH
1508pf_retrans - INTEGER
1509 The number of retransmissions that will be attempted on a given path
1510 before traffic is redirected to an alternate transport (should one
1511 exist). Note this is distinct from path_max_retrans, as a path that
1512 passes the pf_retrans threshold can still be used. Its only
1513 deprioritized when a transmission path is selected by the stack. This
1514 setting is primarily used to enable fast failover mechanisms without
1515 having to reduce path_max_retrans to a very low value. See:
1516 http://www.ietf.org/id/draft-nishida-tsvwg-sctp-failover-05.txt
1517 for details. Note also that a value of pf_retrans > path_max_retrans
1518 disables this feature
1519
1520 Default: 0
1521
32e8d494
VY
1522rto_initial - INTEGER
1523 The initial round trip timeout value in milliseconds that will be used
1524 in calculating round trip times. This is the initial time interval
1525 for retransmissions.
1526
1527 Default: 3000
1da177e4 1528
32e8d494
VY
1529rto_max - INTEGER
1530 The maximum value (in milliseconds) of the round trip timeout. This
1531 is the largest time interval that can elapse between retransmissions.
1532
1533 Default: 60000
1534
1535rto_min - INTEGER
1536 The minimum value (in milliseconds) of the round trip timeout. This
1537 is the smallest time interval the can elapse between retransmissions.
1538
1539 Default: 1000
1540
1541hb_interval - INTEGER
1542 The interval (in milliseconds) between HEARTBEAT chunks. These chunks
1543 are sent at the specified interval on idle paths to probe the state of
1544 a given path between 2 associations.
1545
1546 Default: 30000
1547
1548sack_timeout - INTEGER
1549 The amount of time (in milliseconds) that the implementation will wait
1550 to send a SACK.
1551
1552 Default: 200
1553
1554valid_cookie_life - INTEGER
1555 The default lifetime of the SCTP cookie (in milliseconds). The cookie
1556 is used during association establishment.
1557
1558 Default: 60000
1559
1560cookie_preserve_enable - BOOLEAN
1561 Enable or disable the ability to extend the lifetime of the SCTP cookie
1562 that is used during the establishment phase of SCTP association
1563
1564 1: Enable cookie lifetime extension.
1565 0: Disable
1566
1567 Default: 1
1568
3c68198e
NH
1569cookie_hmac_alg - STRING
1570 Select the hmac algorithm used when generating the cookie value sent by
1571 a listening sctp socket to a connecting client in the INIT-ACK chunk.
1572 Valid values are:
1573 * md5
1574 * sha1
1575 * none
1576 Ability to assign md5 or sha1 as the selected alg is predicated on the
3b09adcb 1577 configuration of those algorithms at build time (CONFIG_CRYPTO_MD5 and
3c68198e
NH
1578 CONFIG_CRYPTO_SHA1).
1579
1580 Default: Dependent on configuration. MD5 if available, else SHA1 if
1581 available, else none.
1582
32e8d494
VY
1583rcvbuf_policy - INTEGER
1584 Determines if the receive buffer is attributed to the socket or to
1585 association. SCTP supports the capability to create multiple
1586 associations on a single socket. When using this capability, it is
1587 possible that a single stalled association that's buffering a lot
1588 of data may block other associations from delivering their data by
1589 consuming all of the receive buffer space. To work around this,
1590 the rcvbuf_policy could be set to attribute the receiver buffer space
1591 to each association instead of the socket. This prevents the described
1592 blocking.
1593
1594 1: rcvbuf space is per association
3b09adcb 1595 0: rcvbuf space is per socket
32e8d494
VY
1596
1597 Default: 0
1598
1599sndbuf_policy - INTEGER
1600 Similar to rcvbuf_policy above, this applies to send buffer space.
1601
1602 1: Send buffer is tracked per association
1603 0: Send buffer is tracked per socket.
1604
1605 Default: 0
1606
1607sctp_mem - vector of 3 INTEGERs: min, pressure, max
1608 Number of pages allowed for queueing by all SCTP sockets.
1609
1610 min: Below this number of pages SCTP is not bothered about its
1611 memory appetite. When amount of memory allocated by SCTP exceeds
1612 this number, SCTP starts to moderate memory usage.
1613
1614 pressure: This value was introduced to follow format of tcp_mem.
1615
1616 max: Number of pages allowed for queueing by all SCTP sockets.
1617
1618 Default is calculated at boot time from amount of available memory.
1619
1620sctp_rmem - vector of 3 INTEGERs: min, default, max
a6e1204b
MM
1621 Only the first value ("min") is used, "default" and "max" are
1622 ignored.
1623
1624 min: Minimal size of receive buffer used by SCTP socket.
1625 It is guaranteed to each SCTP socket (but not association) even
1626 under moderate memory pressure.
1627
1628 Default: 1 page
32e8d494
VY
1629
1630sctp_wmem - vector of 3 INTEGERs: min, default, max
a6e1204b 1631 Currently this tunable has no effect.
32e8d494 1632
72388433
BD
1633addr_scope_policy - INTEGER
1634 Control IPv4 address scoping - draft-stewart-tsvwg-sctp-ipv4-00
1635
1636 0 - Disable IPv4 address scoping
1637 1 - Enable IPv4 address scoping
1638 2 - Follow draft but allow IPv4 private addresses
1639 3 - Follow draft but allow IPv4 link local addresses
1640
1641 Default: 1
1642
1da177e4 1643
4edc2f34 1644/proc/sys/net/core/*
c60f6aa8 1645 Please see: Documentation/sysctl/net.txt for descriptions of these entries.
705efc3b 1646
4edc2f34
SH
1647
1648/proc/sys/net/unix/*
705efc3b
WT
1649max_dgram_qlen - INTEGER
1650 The maximum length of dgram socket receive queue
1651
1652 Default: 10
1653
1654
1655UNDOCUMENTED:
4edc2f34
SH
1656
1657/proc/sys/net/irda/*
1658 fast_poll_increase FIXME
1659 warn_noreply_time FIXME
1660 discovery_slots FIXME
1661 slot_timeout FIXME
1662 max_baud_rate FIXME
1663 discovery_timeout FIXME
1664 lap_keepalive_time FIXME
1665 max_noreply_time FIXME
1666 max_tx_data_size FIXME
1667 max_tx_window FIXME
1668 min_tx_turn_time FIXME