git.proxmox.com Git - mirror_qemu.git/commit

author	Stefan Hajnoczi <stefanha@redhat.com>
	Sat, 8 Mar 2014 15:00:43 +0000 (16:00 +0100)
committer	Stefan Hajnoczi <stefanha@redhat.com>
	Wed, 12 Mar 2014 09:12:00 +0000 (10:12 +0100)
commit	68e5ec64009812dbaa03ed9cfded9344986f5304
tree	20381c49e3d53ea1afa6f0107ff53cbcd1b602d6	tree
parent	239618707637ec87eba8c452d2b2f75dc5ca20c7	commit \| diff

tap: avoid deadlocking rx

The net subsystem has a control flow mechanism so peer NetClientStates
can tell each other to stop sending packets.  This is used to stop
monitoring the tap file descriptor for incoming packets if the guest rx
ring has no spare buffers.

There is a corner case when tap_can_send() is true at the beginning of
an event loop iteration but becomes false before the tap_send() fd
handler is invoked.

tap_send() will read the packet from the tap file descriptor and attempt
to send it.  The net queue will hold on to the packet and return 0,
indicating that further I/O is not possible.  tap then stops monitoring
the file descriptor for reads.

This is unlike the normal case where tap_can_send() is the same before
and during the event loop iteration.  The event loop would simply not
monitor the file descriptor if tap_can_send() returns true.  Upon next
iteration it would check tap_can_send() again and begin monitoring if we
can send.

The deadlock happens because tap_send() explicitly disabled read_poll.
This is done with the expectation that the peer will call
qemu_net_queue_flush().  But hw/net/virtio-net.c does not monitor
vm_running transitions and issue the flush.  Hence we're left with a
broken tap device.

Cc: qemu-stable@nongnu.org
Reported-by: Neil Skrypuch <neil@tembosocial.com>
Tested-by: Neil Skrypuch <neil@tembosocial.com>
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>