==============
DMA attributes
==============

This document describes the semantics of the DMA attributes that are
defined in linux/dma-mapping.h.

DMA_ATTR_WRITE_BARRIER
----------------------

DMA_ATTR_WRITE_BARRIER is a (write) barrier attribute for DMA.  DMA
to a memory region with the DMA_ATTR_WRITE_BARRIER attribute forces
all pending DMA writes to complete, and thus provides a mechanism to
strictly order DMA from a device across all intervening busses and
bridges.  This barrier is not specific to a particular type of
interconnect; it applies to the system as a whole, and so its
implementation must account for the idiosyncrasies of the system all
the way from the DMA device to memory.

As an example of a situation where DMA_ATTR_WRITE_BARRIER would be
useful, suppose that a device does a DMA write to indicate that data is
ready and available in memory.  The DMA of the "completion indication"
could race with data DMA.  Mapping the memory used for completion
indications with DMA_ATTR_WRITE_BARRIER would prevent the race.

DMA_ATTR_WEAK_ORDERING
----------------------

DMA_ATTR_WEAK_ORDERING specifies that reads and writes to the mapping
may be weakly ordered; that is, reads and writes may pass each other.

Since it is optional for platforms to implement DMA_ATTR_WEAK_ORDERING,
those that do not will simply ignore the attribute and exhibit default
behavior.

DMA_ATTR_WRITE_COMBINE
----------------------

DMA_ATTR_WRITE_COMBINE specifies that writes to the mapping may be
buffered to improve performance.

Since it is optional for platforms to implement DMA_ATTR_WRITE_COMBINE,
those that do not will simply ignore the attribute and exhibit default
behavior.

DMA_ATTR_NON_CONSISTENT
-----------------------

DMA_ATTR_NON_CONSISTENT lets the platform choose to return either
consistent or non-consistent memory as it sees fit.  By using this
attribute, you are guaranteeing to the platform that the driver has all
the correct and necessary sync points for this memory.

DMA_ATTR_NO_KERNEL_MAPPING
--------------------------

DMA_ATTR_NO_KERNEL_MAPPING lets the platform avoid creating a kernel
virtual mapping for the allocated buffer. On some architectures creating
such a mapping is a non-trivial task and consumes very limited resources
(like kernel virtual address space or DMA consistent address space).
Buffers allocated with this attribute can only be passed to user space
by calling dma_mmap_attrs(). By using this attribute, you are guaranteeing
that you won't dereference the pointer returned by dma_alloc_attrs(). You
can treat it as a cookie that must be passed to dma_mmap_attrs() and
dma_free_attrs(). Make sure that both of these also get this attribute
set on each call.

Since it is optional for platforms to implement
DMA_ATTR_NO_KERNEL_MAPPING, those that do not will simply ignore the
attribute and exhibit default behavior.
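
The cookie discipline described above can be sketched as follows. This is
a userspace model rather than kernel code: ``struct device``, ``gfp_t``,
``dma_addr_t``, ``struct mock_vma`` and the three ``dma_*_attrs()``
functions are stand-in stubs that only mimic the shape of the kernel API
so that the example compiles on its own, ``struct user_buf`` and its
helpers are hypothetical names, and the flag value is assumed to match
linux/dma-mapping.h. The point of the sketch is that the attribute is
stored next to the cookie, so the same value reaches every later call.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumed to mirror linux/dma-mapping.h; check your kernel's header. */
#define DMA_ATTR_NO_KERNEL_MAPPING (1UL << 4)

/* --- Userspace stand-ins for kernel types and calls (not the real API) --- */
struct device { int id; };
struct mock_vma { int mapped; };        /* placeholder for vm_area_struct */
typedef uint64_t dma_addr_t;
typedef unsigned int gfp_t;
#define GFP_KERNEL 0u                   /* placeholder value */

static unsigned long last_mmap_attrs;   /* attrs last seen by the mmap stub */
static unsigned long last_free_attrs;   /* attrs last seen by the free stub */

static void *dma_alloc_attrs(struct device *dev, size_t size,
                             dma_addr_t *handle, gfp_t gfp, unsigned long attrs)
{
    (void)dev; (void)gfp; (void)attrs;
    void *cookie = malloc(size);        /* in the kernel this need not be a CPU address */
    if (cookie)
        *handle = (dma_addr_t)(uintptr_t)cookie;
    return cookie;
}

static int dma_mmap_attrs(struct device *dev, struct mock_vma *vma, void *cpu_addr,
                          dma_addr_t dma_addr, size_t size, unsigned long attrs)
{
    (void)dev; (void)cpu_addr; (void)dma_addr; (void)size;
    last_mmap_attrs = attrs;
    vma->mapped = 1;
    return 0;
}

static void dma_free_attrs(struct device *dev, size_t size, void *cpu_addr,
                           dma_addr_t dma_addr, unsigned long attrs)
{
    (void)dev; (void)size; (void)dma_addr;
    last_free_attrs = attrs;
    free(cpu_addr);
}

/* --- Driver-side pattern (hypothetical helper names) --- */
struct user_buf {
    void *cookie;           /* opaque: never dereferenced by the driver */
    dma_addr_t handle;
    size_t size;
    unsigned long attrs;    /* kept so mmap and free see the same attrs */
};

static int user_buf_alloc(struct device *dev, struct user_buf *b, size_t size)
{
    b->attrs = DMA_ATTR_NO_KERNEL_MAPPING;
    b->size = size;
    b->cookie = dma_alloc_attrs(dev, size, &b->handle, GFP_KERNEL, b->attrs);
    return b->cookie ? 0 : -1;
}

static int user_buf_mmap(struct device *dev, struct user_buf *b, struct mock_vma *vma)
{
    /* The cookie is only handed back to the DMA API, together with attrs. */
    return dma_mmap_attrs(dev, vma, b->cookie, b->handle, b->size, b->attrs);
}

static void user_buf_free(struct device *dev, struct user_buf *b)
{
    dma_free_attrs(dev, b->size, b->cookie, b->handle, b->attrs);
    b->cookie = NULL;
}
```

In a real driver the stubs disappear and the three helpers would call the
kernel functions of the same names; error codes such as -ENOMEM would
replace the -1 used here.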

DMA_ATTR_SKIP_CPU_SYNC
----------------------

By default, the dma_map_{single,page,sg} family of functions transfers a
given buffer from the CPU domain to the device domain. Some advanced use
cases might require sharing a buffer between more than one device. This
requires a separate mapping for each device and is usually done by
calling dma_map_{single,page,sg} more than once for the given buffer,
passing the device pointer of each device taking part in the buffer
sharing. The first call transfers the buffer from the 'CPU' domain to the
'device' domain, which synchronizes the CPU caches for the given region
(usually this means that the cache is flushed or invalidated, depending
on the DMA direction). However, subsequent calls to
dma_map_{single,page,sg}() for other devices will perform exactly the
same synchronization operation on the CPU cache. CPU cache synchronization
might be a time-consuming operation, especially if the buffers are
large, so it is highly recommended to avoid it if possible.

DMA_ATTR_SKIP_CPU_SYNC allows platform code to skip synchronization of
the CPU cache for the given buffer, assuming that it has already been
transferred to the 'device' domain. This attribute can also be used with
the dma_unmap_{single,page,sg} family of functions to force the buffer to
stay in the device domain after its mapping is released. Use this
attribute with care!
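
The buffer-sharing case can be modelled as below. This is a userspace
sketch under stated assumptions, not kernel code: ``struct device``,
``dma_addr_t`` and ``dma_map_single_attrs()`` are stand-in stubs with the
same shape as the kernel API, the ``cache_syncs`` counter and the
``share_buffer()`` helper are hypothetical, and the flag value is assumed
to match linux/dma-mapping.h. The stub counts one cache synchronization
per call unless DMA_ATTR_SKIP_CPU_SYNC is set, which is the behavior this
section describes.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumed to mirror linux/dma-mapping.h; check your kernel's header. */
#define DMA_ATTR_SKIP_CPU_SYNC (1UL << 5)

/* --- Userspace stand-ins for kernel types and calls (not the real API) --- */
enum dma_data_direction {
    DMA_BIDIRECTIONAL = 0,
    DMA_TO_DEVICE = 1,
    DMA_FROM_DEVICE = 2,
};
struct device { int id; };
typedef uint64_t dma_addr_t;

static int cache_syncs;     /* number of modelled CPU cache flush/invalidate passes */

static dma_addr_t dma_map_single_attrs(struct device *dev, void *ptr, size_t size,
                                       enum dma_data_direction dir,
                                       unsigned long attrs)
{
    (void)dev; (void)size; (void)dir;
    if (!(attrs & DMA_ATTR_SKIP_CPU_SYNC))
        cache_syncs++;      /* models the CPU-to-device domain transfer */
    return (dma_addr_t)(uintptr_t)ptr;  /* fake bus address */
}

/*
 * Map one buffer for n devices. Only the first mapping synchronizes the
 * CPU cache; the later ones assume the buffer is already in the device
 * domain and pass DMA_ATTR_SKIP_CPU_SYNC.
 */
static void share_buffer(struct device **devs, dma_addr_t *handles, int n,
                         void *buf, size_t size)
{
    for (int i = 0; i < n; i++) {
        unsigned long attrs = (i == 0) ? 0 : DMA_ATTR_SKIP_CPU_SYNC;

        handles[i] = dma_map_single_attrs(devs[i], buf, size,
                                          DMA_TO_DEVICE, attrs);
    }
}
```

A real driver would also check each returned handle with
dma_mapping_error() and must be certain that the CPU does not touch the
buffer between the first mapping and the later ones; otherwise skipping
the synchronization is unsafe.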

DMA_ATTR_FORCE_CONTIGUOUS
-------------------------

By default, the DMA-mapping subsystem is allowed to assemble the buffer
allocated by the dma_alloc_attrs() function from individual pages if it
can be mapped as a contiguous chunk into the device's DMA address space.
Specifying this attribute forces the allocated buffer to be contiguous
in physical memory as well.

DMA_ATTR_ALLOC_SINGLE_PAGES
---------------------------

This is a hint to the DMA-mapping subsystem that it's probably not worth
the time to try to allocate memory in a way that gives better TLB
efficiency (AKA it's not worth trying to build the mapping out of larger
pages).  You might want to specify this if:

- You know that the accesses to this memory won't thrash the TLB.
  You might know that the accesses are likely to be sequential or
  that they aren't sequential but it's unlikely you'll ping-pong
  between many addresses that are likely to be in different physical
  pages.
- You know that the penalty of TLB misses while accessing the
  memory will be small enough to be inconsequential.  If you are
  doing a heavy operation like decryption or decompression this
  might be the case.
- You know that the DMA mapping is fairly transitory.  If you expect
  the mapping to have a short lifetime then it may be worth it to
  optimize allocation (avoid coming up with large pages) instead of
  getting the slight performance win of larger pages.

Setting this hint doesn't guarantee that you won't get huge pages, but it
means that we won't try quite as hard to get them.

.. note:: At the moment DMA_ATTR_ALLOC_SINGLE_PAGES is only implemented on ARM,
	  though ARM64 patches will likely be posted soon.

DMA_ATTR_NO_WARN
----------------

This tells the DMA-mapping subsystem to suppress allocation failure reports
(similarly to __GFP_NOWARN).

On some architectures allocation failures are reported with error messages
to the system logs.  Although this can help to identify and debug problems,
drivers that handle failures themselves (e.g., by retrying later) do not
need these reports, and, depending on how the retry mechanism is
implemented, they can flood the system logs with error messages that do
not indicate any real problem.

So, this provides a way for drivers to avoid those error messages on calls
where allocation failures are not a problem and should not clutter the logs.

.. note:: At the moment DMA_ATTR_NO_WARN is only implemented on PowerPC.
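
The retry scenario above can be modelled as follows. This is a userspace
sketch, not kernel code: ``struct device``, ``gfp_t``, ``dma_addr_t`` and
``dma_alloc_attrs()`` are stand-in stubs with the same shape as the kernel
API, the ``failures_left`` and ``warnings_logged`` counters and the
``alloc_with_retry()`` helper are hypothetical, and the flag value is
assumed to match linux/dma-mapping.h. The stub "logs" a warning on each
failed allocation unless DMA_ATTR_NO_WARN is passed.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumed to mirror linux/dma-mapping.h; check your kernel's header. */
#define DMA_ATTR_NO_WARN (1UL << 8)

/* --- Userspace stand-ins for kernel types and calls (not the real API) --- */
struct device { int id; };
typedef uint64_t dma_addr_t;
typedef unsigned int gfp_t;
#define GFP_KERNEL 0u       /* placeholder value */

static int failures_left;   /* how many upcoming allocations the stub will fail */
static int warnings_logged; /* models error messages written to the system log */

static void *dma_alloc_attrs(struct device *dev, size_t size,
                             dma_addr_t *handle, gfp_t gfp, unsigned long attrs)
{
    (void)dev; (void)gfp;
    if (failures_left > 0) {
        failures_left--;
        if (!(attrs & DMA_ATTR_NO_WARN))
            warnings_logged++;
        return NULL;
    }
    void *p = malloc(size);
    if (p)
        *handle = (dma_addr_t)(uintptr_t)p;
    return p;
}

/*
 * Try the allocation up to max_tries times. Failures are expected and
 * handled here, so the failure report is suppressed on every attempt.
 */
static void *alloc_with_retry(struct device *dev, size_t size,
                              dma_addr_t *handle, int max_tries)
{
    for (int i = 0; i < max_tries; i++) {
        void *p = dma_alloc_attrs(dev, size, handle, GFP_KERNEL,
                                  DMA_ATTR_NO_WARN);
        if (p)
            return p;
        /* A real driver would back off or reschedule before retrying. */
    }
    return NULL;
}
```

Passing 0 instead of DMA_ATTR_NO_WARN in the same loop would make the
stub count one warning per failed attempt, which is the log flooding this
attribute is meant to avoid.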

DMA_ATTR_PRIVILEGED
-------------------

Some advanced peripherals such as remote processors and GPUs perform
accesses to DMA buffers in both privileged "supervisor" and unprivileged
"user" modes.  This attribute is used to indicate to the DMA-mapping
subsystem that the buffer is fully accessible at the elevated privilege
level (and ideally inaccessible or at least read-only at the
lesser-privileged levels).