% Static and Dynamic Dispatch

When code involves polymorphism, there needs to be a mechanism to determine
which specific version is actually run. This is called 'dispatch.' There are
two major forms of dispatch: static dispatch and dynamic dispatch. While Rust
favors static dispatch, it also supports dynamic dispatch through a mechanism
called 'trait objects.'

## Background

For the rest of this chapter, we'll need a trait and some implementations.
Let's make a simple one, `Foo`. It has one method that is expected to return a
`String`.

```rust
trait Foo {
    fn method(&self) -> String;
}
```

We'll also implement this trait for `u8` and `String`:

```rust
# trait Foo { fn method(&self) -> String; }
impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

impl Foo for String {
    fn method(&self) -> String { format!("string: {}", *self) }
}
```


## Static dispatch

We can use this trait to perform static dispatch with trait bounds:

```rust
# trait Foo { fn method(&self) -> String; }
# impl Foo for u8 { fn method(&self) -> String { format!("u8: {}", *self) } }
# impl Foo for String { fn method(&self) -> String { format!("string: {}", *self) } }
fn do_something<T: Foo>(x: T) {
    x.method();
}

fn main() {
    let x = 5u8;
    let y = "Hello".to_string();

    do_something(x);
    do_something(y);
}
```

Rust uses 'monomorphization' to perform static dispatch here. This means that
Rust will create a special version of `do_something()` for both `u8` and
`String`, and then replace the call sites with calls to these specialized
functions. In other words, Rust generates something like this:

```rust
# trait Foo { fn method(&self) -> String; }
# impl Foo for u8 { fn method(&self) -> String { format!("u8: {}", *self) } }
# impl Foo for String { fn method(&self) -> String { format!("string: {}", *self) } }
fn do_something_u8(x: u8) {
    x.method();
}

fn do_something_string(x: String) {
    x.method();
}

fn main() {
    let x = 5u8;
    let y = "Hello".to_string();

    do_something_u8(x);
    do_something_string(y);
}
```

This has a great upside: static dispatch allows function calls to be
inlined because the callee is known at compile time, and inlining is
the key to good optimization. Static dispatch is fast, but it comes with
a tradeoff: 'code bloat', due to many copies of the same function
existing in the binary, one for each type.

Furthermore, compilers aren’t perfect and may “optimize” code to become slower.
For example, functions inlined too eagerly will bloat the instruction cache
(cache rules everything around us). This is part of the reason that `#[inline]`
and `#[inline(always)]` should be used carefully, and one reason why using
dynamic dispatch is sometimes more efficient.

However, the common case is that it is more efficient to use static dispatch,
and one can always have a thin statically-dispatched wrapper function that does
a dynamic dispatch, but not vice versa, meaning static calls are more flexible.
The standard library tries to be statically dispatched where possible for this
reason.
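
As a rough sketch of the "thin wrapper" idea, the generic function below is
statically dispatched but simply forwards to a single dynamically dispatched
function. The names `describe` and `describe_dyn` are made up for this
illustration, and the trait object is spelled `&dyn Foo`, which is how current
Rust writes the `&Foo` type used in this chapter.

```rust
trait Foo {
    fn method(&self) -> String;
}

impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

impl Foo for String {
    fn method(&self) -> String { format!("string: {}", *self) }
}

// Dynamically dispatched core: one copy exists in the binary, and the
// call to `method` goes through the vtable.
fn describe_dyn(x: &dyn Foo) -> String {
    x.method()
}

// Thin statically dispatched wrapper: one copy per type `T`, but each
// copy only coerces `&T` to `&dyn Foo` and forwards the call.
fn describe<T: Foo>(x: T) -> String {
    describe_dyn(&x)
}
```

Going the other way is not possible in general: inside `describe_dyn` the
concrete type has been erased, so there is no `T` with which to call a generic
function.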

## Dynamic dispatch

Rust provides dynamic dispatch through a feature called 'trait objects.' Trait
objects, like `&Foo` or `Box<Foo>`, are normal values that store a value of
*any* type that implements the given trait, where the precise type can only be
known at runtime.

A trait object can be obtained from a pointer to a concrete type that
implements the trait by *casting* it (e.g. `&x as &Foo`) or *coercing* it
(e.g. using `&x` as an argument to a function that takes `&Foo`).

These trait object coercions and casts also work for pointers like `&mut T` to
`&mut Foo` and `Box<T>` to `Box<Foo>`, but that's all at the moment. Coercions
and casts are identical.
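
The three pointer forms mentioned above can be sketched side by side. This is
only an illustration: `pointer_forms` is a hypothetical helper, and the trait
object types are written with the `dyn` keyword that current Rust uses in place
of the bare `&Foo` spelling.

```rust
trait Foo {
    fn method(&self) -> String;
}

impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

fn pointer_forms() -> (String, String, String) {
    let a = 1u8;
    let mut b = 2u8;

    // `&T` to `&dyn Foo`, by coercion at the `let` with a type annotation.
    let shared: &dyn Foo = &a;
    let from_shared = shared.method();

    // `&mut T` to `&mut dyn Foo`, also by coercion.
    let unique: &mut dyn Foo = &mut b;
    let from_unique = unique.method();

    // `Box<T>` to `Box<dyn Foo>`, this time with an explicit cast.
    let boxed = Box::new(3u8) as Box<dyn Foo>;
    let from_boxed = boxed.method();

    (from_shared, from_unique, from_boxed)
}
```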

This operation can be seen as "erasing" the compiler's knowledge about the
specific type of the pointer, and hence trait objects are sometimes described
as a form of "type erasure".

Coming back to the example above, we can use the same trait to perform dynamic
dispatch with trait objects by casting:

```rust
# trait Foo { fn method(&self) -> String; }
# impl Foo for u8 { fn method(&self) -> String { format!("u8: {}", *self) } }
# impl Foo for String { fn method(&self) -> String { format!("string: {}", *self) } }

fn do_something(x: &Foo) {
    x.method();
}

fn main() {
    let x = 5u8;
    do_something(&x as &Foo);
}
```

or by coercing:

```rust
# trait Foo { fn method(&self) -> String; }
# impl Foo for u8 { fn method(&self) -> String { format!("u8: {}", *self) } }
# impl Foo for String { fn method(&self) -> String { format!("string: {}", *self) } }

fn do_something(x: &Foo) {
    x.method();
}

fn main() {
    let x = "Hello".to_string();
    do_something(&x);
}
```

A function that takes a trait object is not specialized to each of the types
that implement `Foo`: only one copy is generated, often (but not always)
resulting in less code bloat. However, this comes at the cost of requiring
slower virtual function calls, and effectively inhibiting any chance of
inlining and related optimisations from occurring.
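
One place where the single-copy property pays off is a collection that mixes
types. The sketch below is an illustration rather than part of the original
example: `describe_all` and `mixed_items` are hypothetical names, and
`Box<dyn Foo>` is the current spelling of `Box<Foo>`.

```rust
trait Foo {
    fn method(&self) -> String;
}

impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

impl Foo for String {
    fn method(&self) -> String { format!("string: {}", *self) }
}

// Only one copy of this function is compiled; each `method` call is
// looked up through the element's vtable at runtime.
fn describe_all(items: &[Box<dyn Foo>]) -> Vec<String> {
    items.iter().map(|item| item.method()).collect()
}

// A `u8` and a `String` stored in the same vector, which a generic
// `Vec<T>` with a single `T: Foo` could not hold.
fn mixed_items() -> Vec<Box<dyn Foo>> {
    let items: Vec<Box<dyn Foo>> = vec![
        Box::new(5u8),
        Box::new("Hello".to_string()),
    ];
    items
}
```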

### Why pointers?

Rust does not put things behind a pointer by default, unlike many managed
languages, so types can have different sizes. Knowing the size of the value at
compile time is important for things like passing it as an argument to a
function, moving it about on the stack and allocating (and deallocating) space
on the heap to store it.

For `Foo`, we would need to have a value that could be at least either a
`String` (24 bytes on a 64-bit target) or a `u8` (1 byte), as well as any other
type for which dependent crates may implement `Foo` (any number of bytes at
all). There's no
way to guarantee that this last point can work if the values are stored without
a pointer, because those other types can be arbitrarily large.

Putting the value behind a pointer means the size of the value is not relevant
when we are tossing a trait object around, only the size of the pointer itself.
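
The difference between the size of the pointer and the size of the value behind
it can be checked with `std::mem::size_of_val`. This is a small sketch with a
hypothetical helper named `sizes`; it assumes the two-pointer layout described
in the next section, and it writes the trait object as `&dyn Foo`, the current
spelling of `&Foo`.

```rust
use std::mem::size_of_val;

trait Foo {
    fn method(&self) -> String;
}

impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

impl Foo for String {
    fn method(&self) -> String { format!("string: {}", *self) }
}

// Returns (size of the trait object pointer, size of the erased value).
fn sizes(x: &dyn Foo) -> (usize, usize) {
    // `&x` is a `&&dyn Foo`, so this measures the trait object itself.
    let pointer_size = size_of_val(&x);
    // `x` is a `&dyn Foo`, so this measures the value it points to,
    // which is only known at runtime.
    let value_size = size_of_val(x);
    (pointer_size, value_size)
}
```

Whatever type is passed in, the first number stays the same (two
pointer-sized words), while the second follows the erased type.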

### Representation

The methods of the trait can be called on a trait object via a special record
of function pointers traditionally called a 'vtable' (created and managed by
the compiler).

Trait objects are both simple and complicated: their core representation and
layout are quite straightforward, but there are some curly error messages and
surprising behaviors to discover.

Let's start simple, with the runtime representation of a trait object. The
`std::raw` module contains structs with layouts that are the same as the
complicated built-in types, [including trait objects][stdraw]:

```rust
# mod foo {
pub struct TraitObject {
    pub data: *mut (),
    pub vtable: *mut (),
}
# }
```

[stdraw]: ../std/raw/struct.TraitObject.html

That is, a trait object like `&Foo` consists of a "data" pointer and a "vtable"
pointer.

The data pointer addresses the data (of some unknown type `T`) that the trait
object is storing, and the vtable pointer points to the vtable ("virtual method
table") corresponding to the implementation of `Foo` for `T`.


A vtable is essentially a struct of function pointers, pointing to the concrete
piece of machine code for each method in the implementation. A method call like
`trait_object.method()` will retrieve the correct pointer out of the vtable and
then do a dynamic call of it. For example:

```{rust,ignore}
struct FooVtable {
    destructor: fn(*mut ()),
    size: usize,
    align: usize,
    method: fn(*const ()) -> String,
}

// u8:

fn call_method_on_u8(x: *const ()) -> String {
    // the compiler guarantees that this function is only called
    // with `x` pointing to a u8
    let byte: &u8 = unsafe { &*(x as *const u8) };

    byte.method()
}

static Foo_for_u8_vtable: FooVtable = FooVtable {
    destructor: /* compiler magic */,
    size: 1,
    align: 1,

    // cast to a function pointer
    method: call_method_on_u8 as fn(*const ()) -> String,
};


// String:

fn call_method_on_String(x: *const ()) -> String {
    // the compiler guarantees that this function is only called
    // with `x` pointing to a String
    let string: &String = unsafe { &*(x as *const String) };

    string.method()
}

static Foo_for_String_vtable: FooVtable = FooVtable {
    destructor: /* compiler magic */,
    // values for a 64-bit computer, halve them for 32-bit ones
    size: 24,
    align: 8,

    method: call_method_on_String as fn(*const ()) -> String,
};
```

The `destructor` field in each vtable points to a function that will clean up
any resources of the vtable's type: for `u8` it is trivial, but for `String` it
will free the memory. This is necessary for owning trait objects like
`Box<Foo>`, which need to clean up both the `Box` allocation and the
internal type when they go out of scope. The `size` and `align` fields store
the size of the erased type and its alignment requirements; these are
essentially unused at the moment since the information is embedded in the
destructor, but will be used in the future, as trait objects are progressively
made more flexible.
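
The effect of the destructor slot can be observed from ordinary code. In the
sketch below, `Noisy` and `drop_through_box` are hypothetical names introduced
only for this illustration, and `Box<dyn Foo>` is the current spelling of
`Box<Foo>`. The box knows nothing about `Noisy` at compile time, yet dropping
it still runs `Noisy`'s `Drop` implementation.

```rust
use std::cell::Cell;
use std::rc::Rc;

trait Foo {
    fn method(&self) -> String;
}

// A type whose destructor records that it ran.
struct Noisy {
    dropped: Rc<Cell<bool>>,
}

impl Foo for Noisy {
    fn method(&self) -> String { "noisy".to_string() }
}

impl Drop for Noisy {
    fn drop(&mut self) {
        self.dropped.set(true);
    }
}

// Returns the flag's value before and after the box is dropped.
fn drop_through_box() -> (bool, bool) {
    let flag = Rc::new(Cell::new(false));
    let boxed: Box<dyn Foo> = Box::new(Noisy { dropped: flag.clone() });

    let before = flag.get();
    // The concrete type is erased here, so the destructor has to be
    // found through the vtable.
    drop(boxed);
    let after = flag.get();

    (before, after)
}
```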

Suppose we've got some values that implement `Foo`. The explicit form of
construction and use of `Foo` trait objects might look a bit like this
(ignoring the type mismatches: they're all just pointers anyway):

```{rust,ignore}
let a: String = "foo".to_string();
let x: u8 = 1;

// let b: &Foo = &a;
let b = TraitObject {
    // store the data
    data: &a,
    // store the methods
    vtable: &Foo_for_String_vtable
};

// let y: &Foo = &x;
let y = TraitObject {
    // store the data
    data: &x,
    // store the methods
    vtable: &Foo_for_u8_vtable
};

// b.method();
(b.vtable.method)(b.data);

// y.method();
(y.vtable.method)(y.data);
```

If `b` or `y` were owning trait objects (`Box<Foo>`), there would be a
`(b.vtable.destructor)(b.data)` (respectively `y`) call when they went out of
scope.
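
For readers who want something that compiles, here is a cut-down, hand-written
imitation of the scheme above. It is a sketch under stated assumptions, not
what the compiler actually emits: `ManualTraitObject`, `MethodOnlyVtable`, and
`manual_dispatch` are hypothetical names, and the vtable keeps only the
`method` slot, leaving out the destructor, size, and alignment.

```rust
trait Foo {
    fn method(&self) -> String;
}

impl Foo for u8 {
    fn method(&self) -> String { format!("u8: {}", *self) }
}

impl Foo for String {
    fn method(&self) -> String { format!("string: {}", *self) }
}

// Hypothetical vtable holding just one function pointer.
struct MethodOnlyVtable {
    method: fn(*const ()) -> String,
}

fn call_method_on_u8(x: *const ()) -> String {
    // Sound only because this sketch always pairs this function with a
    // pointer to a live `u8`.
    let byte: &u8 = unsafe { &*(x as *const u8) };
    byte.method()
}

fn call_method_on_string(x: *const ()) -> String {
    // Sound only because this sketch always pairs this function with a
    // pointer to a live `String`.
    let string: &String = unsafe { &*(x as *const String) };
    string.method()
}

static FOO_FOR_U8_VTABLE: MethodOnlyVtable = MethodOnlyVtable {
    method: call_method_on_u8,
};

static FOO_FOR_STRING_VTABLE: MethodOnlyVtable = MethodOnlyVtable {
    method: call_method_on_string,
};

// Hand-rolled stand-in for a `&Foo`: a data pointer plus a vtable pointer.
struct ManualTraitObject {
    data: *const (),
    vtable: &'static MethodOnlyVtable,
}

fn manual_dispatch() -> (String, String) {
    let a: String = "foo".to_string();
    let x: u8 = 1;

    // Erase the concrete types by casting to `*const ()`, and remember
    // which implementation to use via the vtable pointer.
    let b = ManualTraitObject {
        data: &a as *const String as *const (),
        vtable: &FOO_FOR_STRING_VTABLE,
    };
    let y = ManualTraitObject {
        data: &x as *const u8 as *const (),
        vtable: &FOO_FOR_U8_VTABLE,
    };

    // The equivalent of `b.method()` and `y.method()` on real trait objects.
    ((b.vtable.method)(b.data), (y.vtable.method)(y.data))
}
```

Both `a` and `x` outlive the calls through the raw pointers, which is what
keeps the `unsafe` blocks sound in this sketch.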