[rustc.git] / src / librustc / dep_graph / README.md

# Dependency graph for incremental compilation

This module contains the infrastructure for managing the incremental
compilation dependency graph. This README aims to explain how it ought
to be used. In this document, we'll first explain the overall
strategy, and then share some tips for handling specific scenarios.

The high-level idea is that we want to instrument the compiler to
track which parts of the AST and other IR are read/written by what.
This way, when we come back later, we can look at this graph and
determine what work needs to be redone.

### The dependency graph

The nodes of the graph are defined by the enum `DepNode`. They represent
one of three things:

1. HIR nodes (like `Hir(DefId)`) represent the HIR input itself.
2. Data nodes (like `ItemSignature(DefId)`) represent some computed
   information about a particular item.
3. Procedure notes (like `CoherenceCheckImpl(DefId)`) represent some
   procedure that is executing. Usually this procedure is
   performing some kind of check for errors. You can think of them as
   computed values where the value being computed is `()` (and the
   value may fail to be computed, if an error results).

An edge `N1 -> N2` is added between two nodes if either:

- the value of `N1` is used to compute `N2`;
- `N1` is read by the procedure `N2`;
- the procedure `N1` writes the value `N2`.

The latter two conditions are equivalent to the first one if you think
of procedures as values.

### Basic tracking

There is a very general strategy to ensure that you have a correct, if
sometimes overconservative, dependency graph. The two main things you have
to do are (a) identify shared state and (b) identify the current tasks.

### Identifying shared state

Identify "shared state" that will be written by one pass and read by
another. In particular, we need to identify shared state that will be
read "across items" -- that is, anything where changes in one item
could invalidate work done for other items. So, for example:

1. The signature for a function is "shared state".
2. The computed type of some expression in the body of a function is
   not shared state, because if it changes it does not itself
   invalidate other functions (though it may be that it causes new
   monomorphizations to occur, but that's handled independently).

Put another way: if the HIR for an item changes, we are going to
recompile that item for sure. But we need the dep tracking map to tell
us what *else* we have to recompile. Shared state is anything that is
used to communicate results from one item to another.

### Identifying the current task

The dep graph always tracks a current task: this is basically the
`DepNode` that the compiler is computing right now. Typically it would
be a procedure node, but it can also be a data node (as noted above,
the two are kind of equivalent).

You set the current task by calling `dep_graph.in_task(node)`. For example:

```rust
let _task = tcx.dep_graph.in_task(DepNode::Privacy);
```

Now all the code until `_task` goes out of scope will be considered
part of the "privacy task".

The tasks are maintained in a stack, so it is perfectly fine to nest
one task within another. Because pushing a task is considered to be
computing a value, when you nest a task `N2` inside of a task `N1`, we
automatically add an edge `N2 -> N1` (since `N1` presumably needed the
result of `N2` to complete):

```rust
let _n1 = tcx.dep_graph.in_task(DepNode::N1);
let _n2 = tcx.dep_graph.in_task(DepNode::N2);
// this will result in an edge N1 -> n2
```

### Ignore tasks

Although it is rarely needed, you can also push a special "ignore"
task:

```rust
let _ignore = tc.dep_graph.in_ignore();
```

This will cause all read/write edges to be ignored until it goes out
of scope or until something else is pushed. For example, we could
suppress the edge between nested tasks like so:

```rust
let _n1 = tcx.dep_graph.in_task(DepNode::N1);
let _ignore = tcx.dep_graph.in_ignore();
let _n2 = tcx.dep_graph.in_task(DepNode::N2);
// now no edge is added
```

### Tracking reads and writes

We need to identify what shared state is read/written by the current
task as it executes. The most fundamental way of doing that is to invoke
the `read` and `write` methods on `DepGraph`:

```rust
// Adds an edge from DepNode::Hir(some_def_id) to the current task
tcx.dep_graph.read(DepNode::Hir(some_def_id))

// Adds an edge from the current task to DepNode::ItemSignature(some_def_id)
tcx.dep_graph.write(DepNode::ItemSignature(some_def_id))
```

However, you should rarely need to invoke those methods directly.
Instead, the idea is to *encapsulate* shared state into some API that
will invoke `read` and `write` automatically. The most common way to
do this is to use a `DepTrackingMap`, described in the next section,
but any sort of abstraction barrier will do. In general, the strategy
is that getting access to information implicitly adds an appropriate
`read`. So, for example, when you use the
`dep_graph::visit_all_items_in_krate` helper method, it will visit
each item `X`, start a task `Foo(X)` for that item, and automatically
add an edge `Hir(X) -> Foo(X)`. This edge is added because the code is
being given access to the HIR node for `X`, and hence it is expected
to read from it. Similarly, reading from the `tcache` map for item `X`
(which is a `DepTrackingMap`, described below) automatically invokes
`dep_graph.read(ItemSignature(X))`.

**Note:** adding `Hir` nodes requires a bit of caution due to the
"inlining" that old trans and constant evaluation still use. See the
section on inlining below.

To make this strategy work, a certain amount of indirection is
required. For example, modules in the HIR do not have direct pointers
to the items that they contain. Rather, they contain node-ids -- one
can then ask the HIR map for the item with a given node-id. This gives
us an opportunity to add an appropriate read edge.

#### Explicit calls to read and write when starting a new subtask

One time when you *may* need to call `read` and `write` directly is
when you push a new task onto the stack, either by calling `in_task`
as shown above or indirectly, such as with the `memoize` pattern
described below. In that case, any data that the task has access to
from the surrounding environment must be explicitly "read". For
example, in `librustc_typeck`, the collection code visits all items
and, among other things, starts a subtask producing its signature
(what follows is simplified pseudocode, of course):

```rust
fn visit_item(item: &hir::Item) {
    // Here, current subtask is "Collect(X)", and an edge Hir(X) -> Collect(X)
    // has automatically been added by `visit_all_items_in_krate`.
    let sig = signature_of_item(item);
}

fn signature_of_item(item: &hir::Item) {
    let def_id = tcx.map.local_def_id(item.id);
    let task = tcx.dep_graph.in_task(DepNode::ItemSignature(def_id));
    tcx.dep_graph.read(DepNode::Hir(def_id)); // <-- the interesting line
    ...
}
```

Here you can see that, in `signature_of_item`, we started a subtask
corresponding to producing the `ItemSignature`. This subtask will read from
`item` -- but it gained access to `item` implicitly. This means that if it just
reads from `item`, there would be missing edges in the graph:

    Hir(X) --+ // added by the explicit call to `read`
      |      |
      |      +---> ItemSignature(X) -> Collect(X)
      |                                 ^
      |                                 |
      +---------------------------------+ // added by `visit_all_items_in_krate`

In particular, the edge from `Hir(X)` to `ItemSignature(X)` is only
present because we called `read` ourselves when entering the `ItemSignature(X)`
task.

So, the rule of thumb: when entering a new task yourself, register
reads on any shared state that you inherit. (This actually comes up
fairly infrequently though: the main place you need caution is around
memoization.)

#### Dependency tracking map

`DepTrackingMap` is a particularly convenient way to correctly store
shared state. A `DepTrackingMap` is a special hashmap that will add
edges automatically when `get` and `insert` are called. The idea is
that, when you get/insert a value for the key `K`, we will add an edge
from/to the node `DepNode::Variant(K)` (for some variant specific to
the map).

Each `DepTrackingMap` is parameterized by a special type `M` that
implements `DepTrackingMapConfig`; this trait defines the key and value
types of the map, and also defines a fn for converting from the key to
a `DepNode` label. You don't usually have to muck about with this by
hand, there is a macro for creating it. You can see the complete set
of `DepTrackingMap` definitions in `librustc/middle/ty/maps.rs`.

As an example, let's look at the `adt_defs` map. The `adt_defs` map
maps from the def-id of a struct/enum to its `AdtDef`. It is defined
using this macro:

```rust
dep_map_ty! { AdtDefs: ItemSignature(DefId) -> ty::AdtDefMaster<'tcx> }
//            ~~~~~~~  ~~~~~~~~~~~~~ ~~~~~     ~~~~~~~~~~~~~~~~~~~~~~
//               |           |      Key type       Value type
//               |    DepNode variant
//      Name of map id type
```

this indicates that a map id type `AdtDefs` will be created. The key
of the map will be a `DefId` and value will be
`ty::AdtDefMaster<'tcx>`. The `DepNode` will be created by
`DepNode::ItemSignature(K)` for a given key.

Once that is done, you can just use the `DepTrackingMap` like any
other map:

```rust
let mut map: DepTrackingMap<M> = DepTrackingMap::new(dep_graph);
map.insert(key, value); // registers dep_graph.write
map.get(key; // registers dep_graph.read
```

#### Memoization

One particularly interesting case is memoization. If you have some
shared state that you compute in a memoized fashion, the correct thing
to do is to define a `RefCell<DepTrackingMap>` for it and use the
`memoize` helper:

```rust
map.memoize(key, || /* compute value */)
```

This will create a graph that looks like

    ... -> MapVariant(key) -> CurrentTask

where `MapVariant` is the `DepNode` variant that the map is associated with,
and `...` are whatever edges the `/* compute value */` closure creates.

In particular, using the memoize helper is much better than writing
the obvious code yourself:

```
if let Some(result) = map.get(key) {
    return result;
}
let value = /* compute value */;
map.insert(key, value);
```

If you write that code manually, the dependency graph you get will
include artificial edges that are not necessary. For example, imagine that
two tasks, A and B, both invoke the manual memoization code, but A happens
to go first. The resulting graph will be:

    ... -> A -> MapVariant(key) -> B
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~       // caused by A writing to MapVariant(key)
                ~~~~~~~~~~~~~~~~~~~~  // caused by B reading from MapVariant(key)

This graph is not *wrong*, but it encodes a path from A to B that
should not exist.  In contrast, using the memoized helper, you get:

    ... -> MapVariant(key) -> A
                 |
                 +----------> B

which is much cleaner.

**Be aware though that the closure is executed with `MapVariant(key)`
pushed onto the stack as the current task!** That means that you must
add explicit `read` calls for any shared state that it accesses
implicitly from its environment. See the section on "explicit calls to
read and write when starting a new subtask" above for more details.

### How to decide where to introduce a new task

Certainly, you need at least one task on the stack: any attempt to
`read` or `write` shared state will panic if there is no current
task. But where does it make sense to introduce subtasks? The basic
rule is that a subtask makes sense for any discrete unit of work you
may want to skip in the future. Adding a subtask separates out the
reads/writes from *that particular subtask* versus the larger
context. An example: you might have a 'meta' task for all of borrow
checking, and then subtasks for borrow checking individual fns.  (Seen
in this light, memoized computations are just a special case where we
may want to avoid redoing the work even within the context of one
compilation.)

The other case where you might want a subtask is to help with refining
the reads/writes for some later bit of work that needs to be memoized.
For example, we create a subtask for type-checking the body of each
fn.  However, in the initial version of incr. comp. at least, we do
not expect to actually *SKIP* type-checking -- we only expect to skip
trans. However, it's still useful to create subtasks for type-checking
individual items, because, otherwise, if a fn sig changes, we won't
know which callers are affected -- in fact, because the graph would be
so coarse, we'd just have to retrans everything, since we can't
distinguish which fns used which fn sigs.

### Testing the dependency graph

There are various ways to write tests against the dependency graph.
The simplest mechanism are the
`#[rustc_if_this_changed]` and `#[rustc_then_this_would_need]`
annotations. These are used in compile-fail tests to test whether the
expected set of paths exist in the dependency graph. As an example,
see `src/test/compile-fail/dep-graph-caller-callee.rs`.

The idea is that you can annotate a test like:

```rust
#[rustc_if_this_changed]
fn foo() { }

#[rustc_then_this_would_need(TypeckItemBody)] //~ ERROR OK
fn bar() { foo(); }

#[rustc_then_this_would_need(TypeckItemBody)] //~ ERROR no path
fn baz() { }
```

This will check whether there is a path in the dependency graph from
`Hir(foo)` to `TypeckItemBody(bar)`. An error is reported for each
`#[rustc_then_this_would_need]` annotation that indicates whether a
path exists. `//~ ERROR` annotations can then be used to test if a
path is found (as demonstrated above).

### Debugging the dependency graph

The compiler is also capable of dumping the dependency graph for your
debugging pleasure. To do so, pass the `-Z dump-dep-graph` flag. The
graph will be dumped to `dep_graph.{txt,dot}` in the current
directory.  You can override the filename with the `RUST_DEP_GRAPH`
environment variable.

Frequently, though, the full dep graph is quite overwhelming and not
particularly helpful. Therefore, the compiler also allows you to filter
the graph. You can filter in three ways:

1. All edges originating in a particular set of nodes (usually a single node).
2. All edges reaching a particular set of nodes.
3. All edges that lie between given start and end nodes.

To filter, use the `RUST_DEP_GRAPH_FILTER` environment variable, which should
look like one of the following:

```
source_filter     // nodes originating from source_filter
-> target_filter  // nodes that can reach target_filter
source_filter -> target_filter // nodes in between source_filter and target_filter
```

`source_filter` and `target_filter` are a `&`-separated list of strings.
A node is considered to match a filter if all of those strings appear in its
label. So, for example:

```
RUST_DEP_GRAPH_FILTER='-> TypeckItemBody'
```

would select the predecessors of all `TypeckItemBody` nodes. Usually though you
want the `TypeckItemBody` node for some particular fn, so you might write:

```
RUST_DEP_GRAPH_FILTER='-> TypeckItemBody & bar'
```

This will select only the `TypeckItemBody` nodes for fns with `bar` in their name.

Perhaps you are finding that when you change `foo` you need to re-type-check `bar`,
but you don't think you should have to. In that case, you might do:

```
RUST_DEP_GRAPH_FILTER='Hir&foo -> TypeckItemBody & bar'
```

This will dump out all the nodes that lead from `Hir(foo)` to
`TypeckItemBody(bar)`, from which you can (hopefully) see the source
of the erroneous edge.

### Inlining of HIR nodes

For the time being, at least, we still sometimes "inline" HIR nodes
from other crates into the current HIR map. This creates a weird
scenario where the same logical item (let's call it `X`) has two
def-ids: the original def-id `X` and a new, inlined one `X'`. `X'` is
in the current crate, but it's not like other HIR nodes: in
particular, when we restart compilation, it will not be available to
hash. Therefore, we do not want `Hir(X')` nodes appearing in our
graph.  Instead, we want a "read" of `Hir(X')` to be represented as a
read of `MetaData(X)`, since the metadata for `X` is where the inlined
representation originated in the first place.

To achieve this, the HIR map will detect if the def-id originates in
an inlined node and add a dependency to a suitable `MetaData` node
instead. If you are reading a HIR node and are not sure if it may be
inlined or not, you can use `tcx.map.read(node_id)` and it will detect
whether the node is inlined or not and do the right thing.  You can
also use `tcx.map.is_inlined_def_id()` and
`tcx.map.is_inlined_node_id()` to test.
Commit	Line	Data
9cc50fc6 SL	1	# Dependency graph for incremental compilation
	2
	3	This module contains the infrastructure for managing the incremental
	4	compilation dependency graph. This README aims to explain how it ought
	5	to be used. In this document, we'll first explain the overall
	6	strategy, and then share some tips for handling specific scenarios.
	7
	8	The high-level idea is that we want to instrument the compiler to
	9	track which parts of the AST and other IR are read/written by what.
	10	This way, when we come back later, we can look at this graph and
	11	determine what work needs to be redone.
	12
	13	### The dependency graph
	14
	15	The nodes of the graph are defined by the enum `DepNode`. They represent
	16	one of three things:
	17
	18	1. HIR nodes (like `Hir(DefId)`) represent the HIR input itself.
	19	2. Data nodes (like `ItemSignature(DefId)`) represent some computed
	20	information about a particular item.
	21	3. Procedure notes (like `CoherenceCheckImpl(DefId)`) represent some
	22	procedure that is executing. Usually this procedure is
	23	performing some kind of check for errors. You can think of them as
	24	computed values where the value being computed is `()` (and the
	25	value may fail to be computed, if an error results).
	26
	27	An edge `N1 -> N2` is added between two nodes if either:
	28
	29	- the value of `N1` is used to compute `N2`;
	30	- `N1` is read by the procedure `N2`;
	31	- the procedure `N1` writes the value `N2`.
	32
	33	The latter two conditions are equivalent to the first one if you think
	34	of procedures as values.
	35
	36	### Basic tracking
	37
	38	There is a very general strategy to ensure that you have a correct, if
	39	sometimes overconservative, dependency graph. The two main things you have
	40	to do are (a) identify shared state and (b) identify the current tasks.
	41
	42	### Identifying shared state
	43
	44	Identify "shared state" that will be written by one pass and read by
	45	another. In particular, we need to identify shared state that will be
	46	read "across items" -- that is, anything where changes in one item
	47	could invalidate work done for other items. So, for example:
	48
	49	1. The signature for a function is "shared state".
	50	2. The computed type of some expression in the body of a function is
	51	not shared state, because if it changes it does not itself
	52	invalidate other functions (though it may be that it causes new
	53	monomorphizations to occur, but that's handled independently).
54a0048b	54
9cc50fc6 SL	55	Put another way: if the HIR for an item changes, we are going to
	56	recompile that item for sure. But we need the dep tracking map to tell
	57	us what else we have to recompile. Shared state is anything that is
	58	used to communicate results from one item to another.
	59
	60	### Identifying the current task
	61
	62	The dep graph always tracks a current task: this is basically the
	63	`DepNode` that the compiler is computing right now. Typically it would
	64	be a procedure node, but it can also be a data node (as noted above,
	65	the two are kind of equivalent).
	66
	67	You set the current task by calling `dep_graph.in_task(node)`. For example:
	68
	69	```rust
	70	let _task = tcx.dep_graph.in_task(DepNode::Privacy);
	71	```
	72
	73	Now all the code until `_task` goes out of scope will be considered
	74	part of the "privacy task".
	75
	76	The tasks are maintained in a stack, so it is perfectly fine to nest
	77	one task within another. Because pushing a task is considered to be
	78	computing a value, when you nest a task `N2` inside of a task `N1`, we
	79	automatically add an edge `N2 -> N1` (since `N1` presumably needed the
	80	result of `N2` to complete):
	81
	82	```rust
	83	let _n1 = tcx.dep_graph.in_task(DepNode::N1);
	84	let _n2 = tcx.dep_graph.in_task(DepNode::N2);
	85	// this will result in an edge N1 -> n2
	86	```
	87
	88	### Ignore tasks
	89
	90	Although it is rarely needed, you can also push a special "ignore"
	91	task:
	92
	93	```rust
	94	let _ignore = tc.dep_graph.in_ignore();
	95	```
	96
	97	This will cause all read/write edges to be ignored until it goes out
	98	of scope or until something else is pushed. For example, we could
	99	suppress the edge between nested tasks like so:
	100
	101	```rust
	102	let _n1 = tcx.dep_graph.in_task(DepNode::N1);
	103	let _ignore = tcx.dep_graph.in_ignore();
	104	let _n2 = tcx.dep_graph.in_task(DepNode::N2);
	105	// now no edge is added
	106	```
	107
	108	### Tracking reads and writes
	109
	110	We need to identify what shared state is read/written by the current
	111	task as it executes. The most fundamental way of doing that is to invoke
	112	the `read` and `write` methods on `DepGraph`:
	113
	114	```rust
	115	// Adds an edge from DepNode::Hir(some_def_id) to the current task
	116	tcx.dep_graph.read(DepNode::Hir(some_def_id))
	117
	118	// Adds an edge from the current task to DepNode::ItemSignature(some_def_id)
119	tcx.dep_graph.write(DepNode::ItemSignature(some_def_id))
120	```
121
122	However, you should rarely need to invoke those methods directly.
123	Instead, the idea is to encapsulate shared state into some API that
124	will invoke `read` and `write` automatically. The most common way to
125	do this is to use a `DepTrackingMap`, described in the next section,
126	but any sort of abstraction barrier will do. In general, the strategy
127	is that getting access to information implicitly adds an appropriate
128	`read`. So, for example, when you use the
129	`dep_graph::visit_all_items_in_krate` helper method, it will visit
130	each item `X`, start a task `Foo(X)` for that item, and automatically
131	add an edge `Hir(X) -> Foo(X)`. This edge is added because the code is
132	being given access to the HIR node for `X`, and hence it is expected
133	to read from it. Similarly, reading from the `tcache` map for item `X`
134	(which is a `DepTrackingMap`, described below) automatically invokes
135	`dep_graph.read(ItemSignature(X))`.
136
5bcae85e SL	137	Note: adding `Hir` nodes requires a bit of caution due to the
	138	"inlining" that old trans and constant evaluation still use. See the
	139	section on inlining below.
	140
9cc50fc6 SL	141	To make this strategy work, a certain amount of indirection is
	142	required. For example, modules in the HIR do not have direct pointers
	143	to the items that they contain. Rather, they contain node-ids -- one
	144	can then ask the HIR map for the item with a given node-id. This gives
	145	us an opportunity to add an appropriate read edge.
	146
	147	#### Explicit calls to read and write when starting a new subtask
	148
	149	One time when you may need to call `read` and `write` directly is
	150	when you push a new task onto the stack, either by calling `in_task`
	151	as shown above or indirectly, such as with the `memoize` pattern
	152	described below. In that case, any data that the task has access to
	153	from the surrounding environment must be explicitly "read". For
	154	example, in `librustc_typeck`, the collection code visits all items
	155	and, among other things, starts a subtask producing its signature
	156	(what follows is simplified pseudocode, of course):
	157
	158	```rust
	159	fn visit_item(item: &hir::Item) {
	160	// Here, current subtask is "Collect(X)", and an edge Hir(X) -> Collect(X)
	161	// has automatically been added by `visit_all_items_in_krate`.
	162	let sig = signature_of_item(item);
	163	}
	164
	165	fn signature_of_item(item: &hir::Item) {
	166	let def_id = tcx.map.local_def_id(item.id);
	167	let task = tcx.dep_graph.in_task(DepNode::ItemSignature(def_id));
	168	tcx.dep_graph.read(DepNode::Hir(def_id)); // <-- the interesting line
	169	...
	170	}
	171	```
	172
	173	Here you can see that, in `signature_of_item`, we started a subtask
	174	corresponding to producing the `ItemSignature`. This subtask will read from
	175	`item` -- but it gained access to `item` implicitly. This means that if it just
	176	reads from `item`, there would be missing edges in the graph:
	177
	178	Hir(X) --+ // added by the explicit call to `read`
	179	\| \|
	180	\| +---> ItemSignature(X) -> Collect(X)
	181	\| ^
	182	\| \|
	183	+---------------------------------+ // added by `visit_all_items_in_krate`
54a0048b	184
9cc50fc6 SL	185	In particular, the edge from `Hir(X)` to `ItemSignature(X)` is only
	186	present because we called `read` ourselves when entering the `ItemSignature(X)`
	187	task.
	188
	189	So, the rule of thumb: when entering a new task yourself, register
	190	reads on any shared state that you inherit. (This actually comes up
	191	fairly infrequently though: the main place you need caution is around
	192	memoization.)
	193
	194	#### Dependency tracking map
	195
	196	`DepTrackingMap` is a particularly convenient way to correctly store
	197	shared state. A `DepTrackingMap` is a special hashmap that will add
	198	edges automatically when `get` and `insert` are called. The idea is
	199	that, when you get/insert a value for the key `K`, we will add an edge
	200	from/to the node `DepNode::Variant(K)` (for some variant specific to
	201	the map).
	202
	203	Each `DepTrackingMap` is parameterized by a special type `M` that
	204	implements `DepTrackingMapConfig`; this trait defines the key and value
	205	types of the map, and also defines a fn for converting from the key to
	206	a `DepNode` label. You don't usually have to muck about with this by
	207	hand, there is a macro for creating it. You can see the complete set
	208	of `DepTrackingMap` definitions in `librustc/middle/ty/maps.rs`.
	209
	210	As an example, let's look at the `adt_defs` map. The `adt_defs` map
	211	maps from the def-id of a struct/enum to its `AdtDef`. It is defined
	212	using this macro:
	213
	214	```rust
	215	dep_map_ty! { AdtDefs: ItemSignature(DefId) -> ty::AdtDefMaster<'tcx> }
	216	// ~~~~~~~ ~~~~~~~~~~~~~ ~~~~~ ~~~~~~~~~~~~~~~~~~~~~~
	217	// \| \| Key type Value type
	218	// \| DepNode variant
	219	// Name of map id type
	220	```
	221
	222	this indicates that a map id type `AdtDefs` will be created. The key
	223	of the map will be a `DefId` and value will be
	224	`ty::AdtDefMaster<'tcx>`. The `DepNode` will be created by
	225	`DepNode::ItemSignature(K)` for a given key.
	226
	227	Once that is done, you can just use the `DepTrackingMap` like any
	228	other map:
	229
	230	```rust
	231	let mut map: DepTrackingMap<M> = DepTrackingMap::new(dep_graph);
	232	map.insert(key, value); // registers dep_graph.write
	233	map.get(key; // registers dep_graph.read
	234	```
	235
	236	#### Memoization
	237
	238	One particularly interesting case is memoization. If you have some
	239	shared state that you compute in a memoized fashion, the correct thing
	240	to do is to define a `RefCell<DepTrackingMap>` for it and use the
	241	`memoize` helper:
	242
	243	```rust
	244	map.memoize(key, \|\| /* compute value */)
	245	```
	246
	247	This will create a graph that looks like
	248
249	... -> MapVariant(key) -> CurrentTask
250
251	where `MapVariant` is the `DepNode` variant that the map is associated with,
252	and `...` are whatever edges the `/* compute value */` closure creates.
253
254	In particular, using the memoize helper is much better than writing
255	the obvious code yourself:
256
257	```
258	if let Some(result) = map.get(key) {
259	return result;
260	}
261	let value = /* compute value */;
262	map.insert(key, value);
263	```
264
265	If you write that code manually, the dependency graph you get will
266	include artificial edges that are not necessary. For example, imagine that
267	two tasks, A and B, both invoke the manual memoization code, but A happens
268	to go first. The resulting graph will be:
269
270	... -> A -> MapVariant(key) -> B
271	~~~~~~~~~~~~~~~~~~~~~~~~~~~ // caused by A writing to MapVariant(key)
272	~~~~~~~~~~~~~~~~~~~~ // caused by B reading from MapVariant(key)
273
274	This graph is not wrong, but it encodes a path from A to B that
275	should not exist. In contrast, using the memoized helper, you get:
276
277	... -> MapVariant(key) -> A
278	\|
279	+----------> B
54a0048b SL	280
54a0048b SL	281	which is much cleaner.
9cc50fc6 SL	282
	283	**Be aware though that the closure is executed with `MapVariant(key)`
	284	pushed onto the stack as the current task!** That means that you must
	285	add explicit `read` calls for any shared state that it accesses
	286	implicitly from its environment. See the section on "explicit calls to
	287	read and write when starting a new subtask" above for more details.
	288
	289	### How to decide where to introduce a new task
	290
	291	Certainly, you need at least one task on the stack: any attempt to
	292	`read` or `write` shared state will panic if there is no current
	293	task. But where does it make sense to introduce subtasks? The basic
	294	rule is that a subtask makes sense for any discrete unit of work you
	295	may want to skip in the future. Adding a subtask separates out the
	296	reads/writes from that particular subtask versus the larger
	297	context. An example: you might have a 'meta' task for all of borrow
	298	checking, and then subtasks for borrow checking individual fns. (Seen
	299	in this light, memoized computations are just a special case where we
	300	may want to avoid redoing the work even within the context of one
	301	compilation.)
	302
	303	The other case where you might want a subtask is to help with refining
	304	the reads/writes for some later bit of work that needs to be memoized.
	305	For example, we create a subtask for type-checking the body of each
	306	fn. However, in the initial version of incr. comp. at least, we do
	307	not expect to actually SKIP type-checking -- we only expect to skip
	308	trans. However, it's still useful to create subtasks for type-checking
	309	individual items, because, otherwise, if a fn sig changes, we won't
	310	know which callers are affected -- in fact, because the graph would be
	311	so coarse, we'd just have to retrans everything, since we can't
	312	distinguish which fns used which fn sigs.
	313
	314	### Testing the dependency graph
	315
	316	There are various ways to write tests against the dependency graph.
	317	The simplest mechanism are the
	318	`#[rustc_if_this_changed]` and `#[rustc_then_this_would_need]`
	319	annotations. These are used in compile-fail tests to test whether the
	320	expected set of paths exist in the dependency graph. As an example,
	321	see `src/test/compile-fail/dep-graph-caller-callee.rs`.
	322
	323	The idea is that you can annotate a test like:
	324
	325	```rust
	326	#[rustc_if_this_changed]
	327	fn foo() { }
	328
	329	#[rustc_then_this_would_need(TypeckItemBody)] //~ ERROR OK
	330	fn bar() { foo(); }
	331
	332	#[rustc_then_this_would_need(TypeckItemBody)] //~ ERROR no path
	333	fn baz() { }
	334	```
	335
	336	This will check whether there is a path in the dependency graph from
	337	`Hir(foo)` to `TypeckItemBody(bar)`. An error is reported for each
	338	`#[rustc_then_this_would_need]` annotation that indicates whether a
	339	path exists. `//~ ERROR` annotations can then be used to test if a
	340	path is found (as demonstrated above).
	341
	342	### Debugging the dependency graph
	343
	344	The compiler is also capable of dumping the dependency graph for your
	345	debugging pleasure. To do so, pass the `-Z dump-dep-graph` flag. The
346	graph will be dumped to `dep_graph.{txt,dot}` in the current
347	directory. You can override the filename with the `RUST_DEP_GRAPH`
348	environment variable.
349
350	Frequently, though, the full dep graph is quite overwhelming and not
351	particularly helpful. Therefore, the compiler also allows you to filter
352	the graph. You can filter in three ways:
353
354	1. All edges originating in a particular set of nodes (usually a single node).
355	2. All edges reaching a particular set of nodes.
356	3. All edges that lie between given start and end nodes.
357
358	To filter, use the `RUST_DEP_GRAPH_FILTER` environment variable, which should
359	look like one of the following:
360
361	```
362	source_filter // nodes originating from source_filter
363	-> target_filter // nodes that can reach target_filter
364	source_filter -> target_filter // nodes in between source_filter and target_filter
365	```
366
367	`source_filter` and `target_filter` are a `&`-separated list of strings.
368	A node is considered to match a filter if all of those strings appear in its
369	label. So, for example:
370
371	```
372	RUST_DEP_GRAPH_FILTER='-> TypeckItemBody'
373	```
374
375	would select the predecessors of all `TypeckItemBody` nodes. Usually though you
376	want the `TypeckItemBody` node for some particular fn, so you might write:
377
378	```
379	RUST_DEP_GRAPH_FILTER='-> TypeckItemBody & bar'
380	```
381
382	This will select only the `TypeckItemBody` nodes for fns with `bar` in their name.
383
384	Perhaps you are finding that when you change `foo` you need to re-type-check `bar`,
385	but you don't think you should have to. In that case, you might do:
386
387	```
388	RUST_DEP_GRAPH_FILTER='Hir&foo -> TypeckItemBody & bar'
389	```
390
391	This will dump out all the nodes that lead from `Hir(foo)` to
392	`TypeckItemBody(bar)`, from which you can (hopefully) see the source
393	of the erroneous edge.
5bcae85e SL	394
	395	### Inlining of HIR nodes
	396
	397	For the time being, at least, we still sometimes "inline" HIR nodes
	398	from other crates into the current HIR map. This creates a weird
	399	scenario where the same logical item (let's call it `X`) has two
	400	def-ids: the original def-id `X` and a new, inlined one `X'`. `X'` is
	401	in the current crate, but it's not like other HIR nodes: in
	402	particular, when we restart compilation, it will not be available to
	403	hash. Therefore, we do not want `Hir(X')` nodes appearing in our
	404	graph. Instead, we want a "read" of `Hir(X')` to be represented as a
	405	read of `MetaData(X)`, since the metadata for `X` is where the inlined
	406	representation originated in the first place.
	407
	408	To achieve this, the HIR map will detect if the def-id originates in
	409	an inlined node and add a dependency to a suitable `MetaData` node
	410	instead. If you are reading a HIR node and are not sure if it may be
	411	inlined or not, you can use `tcx.map.read(node_id)` and it will detect
	412	whether the node is inlined or not and do the right thing. You can
	413	also use `tcx.map.is_inlined_def_id()` and
	414	`tcx.map.is_inlined_node_id()` to test.