]> git.proxmox.com Git - mirror_ubuntu-bionic-kernel.git/blame - Documentation/trace/kprobetrace.txt
Merge branch 'stable/for-linus-4.10' of git://git.kernel.org/pub/scm/linux/kernel...
[mirror_ubuntu-bionic-kernel.git] / Documentation / trace / kprobetrace.txt
CommitLineData
77b44d1b
MH
1 Kprobe-based Event Tracing
2 ==========================
d8ec9185
MH
3
4 Documentation is written by Masami Hiramatsu
5
6
7Overview
8--------
77b44d1b
MH
9These events are similar to tracepoint based events. Instead of Tracepoint,
10this is based on kprobes (kprobe and kretprobe). So it can probe wherever
11kprobes can probe (this means, all functions body except for __kprobes
12functions). Unlike the Tracepoint based event, this can be added and removed
13dynamically, on the fly.
d8ec9185 14
9c33c512 15To enable this feature, build your kernel with CONFIG_KPROBE_EVENT=y.
d8ec9185 16
77b44d1b
MH
17Similar to the events tracer, this doesn't need to be activated via
18current_tracer. Instead of that, add probe points via
19/sys/kernel/debug/tracing/kprobe_events, and enable it via
20/sys/kernel/debug/tracing/events/kprobes/<EVENT>/enabled.
d8ec9185
MH
21
22
23Synopsis of kprobe_events
24-------------------------
61424318
MH
25 p[:[GRP/]EVENT] [MOD:]SYM[+offs]|MEMADDR [FETCHARGS] : Set a probe
26 r[:[GRP/]EVENT] [MOD:]SYM[+0] [FETCHARGS] : Set a return probe
df3ab708 27 -:[GRP/]EVENT : Clear a probe
d8ec9185 28
f52487e9 29 GRP : Group name. If omitted, use "kprobes" for it.
2fba0c88 30 EVENT : Event name. If omitted, the event name is generated
61424318
MH
31 based on SYM+offs or MEMADDR.
32 MOD : Module name which has given SYM.
33 SYM[+offs] : Symbol+offset where the probe is inserted.
2fba0c88 34 MEMADDR : Address where the probe is inserted.
d8ec9185 35
2fba0c88 36 FETCHARGS : Arguments. Each probe can have up to 128 args.
2e06ff63
MH
37 %REG : Fetch register REG
38 @ADDR : Fetch memory at ADDR (ADDR should be in kernel)
d8ec9185 39 @SYM[+|-offs] : Fetch memory at SYM +|- offs (SYM should be a data symbol)
2e06ff63
MH
40 $stackN : Fetch Nth entry of stack (N >= 0)
41 $stack : Fetch stack address.
14640106 42 $retval : Fetch return value.(*)
35abb67d 43 $comm : Fetch current task comm.
14640106 44 +|-offs(FETCHARG) : Fetch memory at FETCHARG +|- offs address.(**)
93ccae7a
MH
45 NAME=FETCHARG : Set NAME as the argument name of FETCHARG.
46 FETCHARG:TYPE : Set TYPE as the type of FETCHARG. Currently, basic types
17ce3dc7
MH
47 (u8/u16/u32/u64/s8/s16/s32/s64), hexadecimal types
48 (x8/x16/x32/x64), "string" and bitfield are supported.
d8ec9185 49
14640106
MH
50 (*) only for return probe.
51 (**) this is useful for fetching a field of data structures.
d8ec9185 52
1ff511e3
MH
53Types
54-----
55Several types are supported for fetch-args. Kprobe tracer will access memory
56by given type. Prefix 's' and 'u' means those types are signed and unsigned
bdca79c2
MH
57respectively. 'x' prefix implies it is unsigned. Traced arguments are shown
58in decimal ('s' and 'u') or hexadecimal ('x'). Without type casting, 'x32'
59or 'x64' is used depends on the architecture (e.g. x86-32 uses x32, and
60x86-64 uses x64).
1ff511e3
MH
61String type is a special type, which fetches a "null-terminated" string from
62kernel space. This means it will fail and store NULL if the string container
63has been paged out.
64Bitfield is another special type, which takes 3 parameters, bit-width, bit-
65offset, and container-size (usually 32). The syntax is;
66
67 b<bit-width>@<bit-offset>/<container-size>
68
35abb67d
OS
69For $comm, the default type is "string"; any other type is invalid.
70
d8ec9185
MH
71
72Per-Probe Event Filtering
73-------------------------
74 Per-probe event filtering feature allows you to set different filter on each
75probe and gives you what arguments will be shown in trace buffer. If an event
77b44d1b
MH
76name is specified right after 'p:' or 'r:' in kprobe_events, it adds an event
77under tracing/events/kprobes/<EVENT>, at the directory you can see 'id',
78'enabled', 'format' and 'filter'.
d8ec9185
MH
79
80enabled:
81 You can enable/disable the probe by writing 1 or 0 on it.
82
83format:
eca0d916 84 This shows the format of this probe event.
d8ec9185
MH
85
86filter:
eca0d916 87 You can write filtering rules of this event.
d8ec9185 88
e08d1c65
MH
89id:
90 This shows the id of this probe event.
d8ec9185 91
77b44d1b 92
cd7e7bd5
MH
93Event Profiling
94---------------
95 You can check the total number of probe hits and probe miss-hits via
96/sys/kernel/debug/tracing/kprobe_profile.
97 The first column is event name, the second is the number of probe hits,
98the third is the number of probe miss-hits.
99
100
d8ec9185
MH
101Usage examples
102--------------
103To add a probe as a new event, write a new definition to kprobe_events
104as below.
105
580d9e00 106 echo 'p:myprobe do_sys_open dfd=%ax filename=%dx flags=%cx mode=+4($stack)' > /sys/kernel/debug/tracing/kprobe_events
d8ec9185
MH
107
108 This sets a kprobe on the top of do_sys_open() function with recording
14640106
MH
1091st to 4th arguments as "myprobe" event. Note, which register/stack entry is
110assigned to each function argument depends on arch-specific ABI. If you unsure
111the ABI, please try to use probe subcommand of perf-tools (you can find it
112under tools/perf/).
113As this example shows, users can choose more familiar names for each arguments.
d8ec9185 114
580d9e00 115 echo 'r:myretprobe do_sys_open $retval' >> /sys/kernel/debug/tracing/kprobe_events
d8ec9185
MH
116
117 This sets a kretprobe on the return point of do_sys_open() function with
99329c44 118recording return value as "myretprobe" event.
d8ec9185
MH
119 You can see the format of these events via
120/sys/kernel/debug/tracing/events/kprobes/<EVENT>/format.
121
122 cat /sys/kernel/debug/tracing/events/kprobes/myprobe/format
123name: myprobe
ec3a9039 124ID: 780
d8ec9185 125format:
ec3a9039
MH
126 field:unsigned short common_type; offset:0; size:2; signed:0;
127 field:unsigned char common_flags; offset:2; size:1; signed:0;
128 field:unsigned char common_preempt_count; offset:3; size:1;signed:0;
129 field:int common_pid; offset:4; size:4; signed:1;
ec3a9039
MH
130
131 field:unsigned long __probe_ip; offset:12; size:4; signed:0;
132 field:int __probe_nargs; offset:16; size:4; signed:1;
133 field:unsigned long dfd; offset:20; size:4; signed:0;
134 field:unsigned long filename; offset:24; size:4; signed:0;
135 field:unsigned long flags; offset:28; size:4; signed:0;
136 field:unsigned long mode; offset:32; size:4; signed:0;
137
138
139print fmt: "(%lx) dfd=%lx filename=%lx flags=%lx mode=%lx", REC->__probe_ip,
140REC->dfd, REC->filename, REC->flags, REC->mode
d8ec9185 141
eca0d916 142 You can see that the event has 4 arguments as in the expressions you specified.
d8ec9185
MH
143
144 echo > /sys/kernel/debug/tracing/kprobe_events
145
5a0d9050
MH
146 This clears all probe points.
147
df3ab708
MK
148 Or,
149
150 echo -:myprobe >> kprobe_events
151
152 This clears probe points selectively.
153
5a0d9050
MH
154 Right after definition, each event is disabled by default. For tracing these
155events, you need to enable it.
156
157 echo 1 > /sys/kernel/debug/tracing/events/kprobes/myprobe/enable
158 echo 1 > /sys/kernel/debug/tracing/events/kprobes/myretprobe/enable
159
160 And you can see the traced information via /sys/kernel/debug/tracing/trace.
d8ec9185
MH
161
162 cat /sys/kernel/debug/tracing/trace
163# tracer: nop
164#
165# TASK-PID CPU# TIMESTAMP FUNCTION
166# | | | | |
6e9f23d1 167 <...>-1447 [001] 1038282.286875: myprobe: (do_sys_open+0x0/0xd6) dfd=3 filename=7fffd1ec4440 flags=8000 mode=0
2e06ff63 168 <...>-1447 [001] 1038282.286878: myretprobe: (sys_openat+0xc/0xe <- do_sys_open) $retval=fffffffffffffffe
6e9f23d1 169 <...>-1447 [001] 1038282.286885: myprobe: (do_sys_open+0x0/0xd6) dfd=ffffff9c filename=40413c flags=8000 mode=1b6
2e06ff63 170 <...>-1447 [001] 1038282.286915: myretprobe: (sys_open+0x1b/0x1d <- do_sys_open) $retval=3
6e9f23d1 171 <...>-1447 [001] 1038282.286969: myprobe: (do_sys_open+0x0/0xd6) dfd=ffffff9c filename=4041c6 flags=98800 mode=10
2e06ff63 172 <...>-1447 [001] 1038282.286976: myretprobe: (sys_open+0x1b/0x1d <- do_sys_open) $retval=3
d8ec9185
MH
173
174
6e9f23d1 175 Each line shows when the kernel hits an event, and <- SYMBOL means kernel
d8ec9185
MH
176returns from SYMBOL(e.g. "sys_open+0x1b/0x1d <- do_sys_open" means kernel
177returns from do_sys_open to sys_open+0x1b).
178