	             Using the Linux Kernel Tracepoints

			    Mathieu Desnoyers


This document introduces Linux Kernel Tracepoints and their use. It
shows how to insert tracepoints in the kernel and connect probe
functions to them, and provides some examples of probe functions.


* Purpose of tracepoints

A tracepoint placed in code provides a hook to call a function (probe)
that you can provide at runtime. A tracepoint can be "on" (a probe is
connected to it) or "off" (no probe is attached). When a tracepoint is
"off" it has no effect, except for adding a tiny time penalty
(checking a condition for a branch) and a small space penalty (a few
bytes for the function call at the end of the instrumented function,
plus a data structure in a separate section). When a tracepoint
is "on", the function you provide is called each time the tracepoint
is executed, in the execution context of the caller. When the function
provided ends its execution, it returns to the caller (continuing from
the tracepoint site).
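
The on/off behaviour described above can be pictured with a small
user-space model. This is only a sketch and not the kernel
implementation: the kernel patches a static key (jump label) instead of
testing a pointer, and the names tp_model_probe, tp_model_hit,
count_probe and instrumented_function are hypothetical.

```c
#include <stddef.h>

/* Hypothetical user-space model; this is not kernel code. */

/* The probe slot: NULL means the tracepoint is "off". */
static void (*tp_model_probe)(int value);

/* Model of a tracepoint site: one branch when off, one call when on. */
static inline void tp_model_hit(int value)
{
	if (tp_model_probe)
		tp_model_probe(value);	/* runs in the caller's context */
}

/* Example probe: remembers how often it ran and the last value seen. */
static int probe_calls;
static int probe_last;

static void count_probe(int value)
{
	probe_calls++;
	probe_last = value;
}

/* Instrumented function: execution continues after the tracepoint site. */
static int instrumented_function(int x)
{
	tp_model_hit(x);
	return x * 2;
}
```

In this model, assigning count_probe to tp_model_probe turns the
tracepoint "on" and resetting it to NULL turns it "off"; the return
value of instrumented_function() is the same in both states.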

You can put tracepoints at important locations in the code. They are
lightweight hooks that can pass an arbitrary number of parameters,
whose prototypes are described in a tracepoint declaration placed in a
header file.

They can be used for tracing and performance accounting.


* Usage

Two elements are required for tracepoints :

- A tracepoint declaration, placed in a header file.
- The tracepoint statement, in C code.

In order to use tracepoints, you should include linux/tracepoint.h.

In include/trace/events/subsys.h :

#undef TRACE_SYSTEM
#define TRACE_SYSTEM subsys

#if !defined(_TRACE_SUBSYS_H) || defined(TRACE_HEADER_MULTI_READ)
#define _TRACE_SUBSYS_H

#include <linux/tracepoint.h>

DECLARE_TRACE(subsys_eventname,
	TP_PROTO(int firstarg, struct task_struct *p),
	TP_ARGS(firstarg, p));

#endif /* _TRACE_SUBSYS_H */

/* This part must be outside protection */
#include <trace/define_trace.h>

In subsys/file.c (where the tracing statement must be added) :

#include <trace/events/subsys.h>

#define CREATE_TRACE_POINTS
DEFINE_TRACE(subsys_eventname);

void somefct(void)
{
	...
	trace_subsys_eventname(arg, task);
	...
}

Where :
- subsys_eventname is an identifier unique to your event
    - subsys is the name of your subsystem.
    - eventname is the name of the event to trace.

- TP_PROTO(int firstarg, struct task_struct *p) is the prototype of the
  function called by this tracepoint.

- TP_ARGS(firstarg, p) are the parameter names, the same as found in the
  prototype.

- If you use the header in multiple source files, #define CREATE_TRACE_POINTS
  should appear in only one source file.

To connect a probe (the function to call) to a specific tracepoint,
register it through register_trace_subsys_eventname(). To remove the
probe, call unregister_trace_subsys_eventname().

tracepoint_synchronize_unregister() must be called before the end of
the module exit function to make sure there is no caller left using
the probe. This, and the fact that preemption is disabled around the
probe call, make sure that probe removal and module unload are safe.

The tracepoint mechanism supports inserting multiple instances of the
same tracepoint, but a given tracepoint name must have a single
definition across the whole kernel so that no type conflict can
occur. Name mangling of the tracepoints is done using the prototypes
to make sure typing is correct. Verification of probe type correctness
is done at the registration site by the compiler. Tracepoints can be
put in inline functions, inlined static functions, and unrolled loops
as well as regular functions.
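
The registration and type-checking behaviour described above can be
sketched in user space. The macro below is a simplified, hypothetical
model (DECLARE_TRACE_MODEL, tp_probe_*, tp_data_* and my_probe are not
kernel names): it keeps a single probe per tracepoint, whereas the
kernel keeps a list, and it does not model
tracepoint_synchronize_unregister(). It assumes the probe receives a
private data pointer followed by the TP_PROTO arguments.

```c
#include <stddef.h>

/* Hypothetical user-space model of what a tracepoint declaration provides. */

struct task_struct { int pid; };	/* stand-in for the kernel type */

#define TP_PROTO(...)	__VA_ARGS__
#define TP_ARGS(...)	__VA_ARGS__

#define DECLARE_TRACE_MODEL(name, proto, args)				\
	static void (*tp_probe_##name)(void *data, proto);		\
	static void *tp_data_##name;					\
	static inline void trace_##name(proto)				\
	{								\
		if (tp_probe_##name)					\
			tp_probe_##name(tp_data_##name, args);		\
	}								\
	static inline int						\
	register_trace_##name(void (*probe)(void *data, proto), void *data) \
	{								\
		if (tp_probe_##name)					\
			return -1;	/* model: one probe only */	\
		tp_data_##name = data;					\
		tp_probe_##name = probe;				\
		return 0;						\
	}								\
	static inline int						\
	unregister_trace_##name(void (*probe)(void *data, proto), void *data) \
	{								\
		if (tp_probe_##name != probe || tp_data_##name != data)	\
			return -1;					\
		tp_probe_##name = NULL;					\
		tp_data_##name = NULL;					\
		return 0;						\
	}

/* No trailing ';' here: the model macro ends with a function body. */
DECLARE_TRACE_MODEL(subsys_eventname,
	TP_PROTO(int firstarg, struct task_struct *p),
	TP_ARGS(firstarg, p))

/* Probe whose prototype matches TP_PROTO, with the data pointer first. */
static void my_probe(void *data, int firstarg, struct task_struct *p)
{
	int *sum = data;

	*sum += firstarg + p->pid;
}
```

Because register_trace_subsys_eventname() is generated with the
tracepoint's own prototype, passing a probe with a different signature
is diagnosed by the compiler at the registration site, which is the
property the paragraph above relies on.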

The naming scheme "subsys_event" is suggested here as a convention
intended to limit collisions. Tracepoint names are global to the
kernel: they are considered as being the same whether they are in the
core kernel image or in modules.

If the tracepoint has to be used in kernel modules,
EXPORT_TRACEPOINT_SYMBOL_GPL() or EXPORT_TRACEPOINT_SYMBOL() can be
used to export the defined tracepoint.

If you need to do a bit of work for a tracepoint parameter, and
that work is only used for the tracepoint, that work can be encapsulated
within an if statement, as follows:

	if (trace_foo_bar_enabled()) {
		int i;
		int tot = 0;

		for (i = 0; i < count; i++)
			tot += calculate_nuggets();

		trace_foo_bar(tot);
	}

All trace_<tracepoint>() calls have a matching trace_<tracepoint>_enabled()
function defined that returns true if the tracepoint is enabled and
false otherwise. The trace_<tracepoint>() should always be within the
block of the if (trace_<tracepoint>_enabled()) to prevent races between
the tracepoint being enabled and the check being seen.

The advantage of using trace_<tracepoint>_enabled() is that it uses
the static_key of the tracepoint to allow the if statement to be implemented
with jump labels and avoid conditional branches.
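
The effect of the guard can be checked with a user-space model. This is
a sketch under stated assumptions, not kernel code: foo_bar_probe,
record_tot, nugget_calls and do_work are hypothetical, and the model
tests a function pointer where the kernel tests the tracepoint's
static_key.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical user-space model; the kernel tests a static key instead. */
static void (*foo_bar_probe)(int tot);

static inline bool trace_foo_bar_enabled(void)
{
	return foo_bar_probe != NULL;
}

static inline void trace_foo_bar(int tot)
{
	if (foo_bar_probe)
		foo_bar_probe(tot);
}

static int nugget_calls;	/* counts how often the costly helper ran */

static int calculate_nuggets(void)
{
	nugget_calls++;
	return 3;
}

static int last_tot;		/* last value seen by the probe */

static void record_tot(int tot)
{
	last_tot = tot;
}

/* The costly loop runs only while the tracepoint is enabled. */
static void do_work(int count)
{
	if (trace_foo_bar_enabled()) {
		int i;
		int tot = 0;

		for (i = 0; i < count; i++)
			tot += calculate_nuggets();

		trace_foo_bar(tot);
	}
}
```

While foo_bar_probe is NULL, do_work() never calls
calculate_nuggets(); once a probe such as record_tot is attached, the
loop runs and its total is handed to the probe.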

Note: The convenience macro TRACE_EVENT provides an alternative way to
      define tracepoints. Check http://lwn.net/Articles/379903,
      http://lwn.net/Articles/381064 and http://lwn.net/Articles/383362
      for a series of articles with more details.