Mark McLoughlin wrote:
But it will increase overhead, since suddenly we aren't queueing anymore. One vmexit per small packet.

Yes in theory, but the packet copies act to mitigate exits, since we don't re-enable notifications until we're sure the ring is empty.

You mean, the guest and the copy proceed in parallel, and while they do, exits are disabled?

With copyless, though, we'd have an unacceptable vmexit rate.

Right.
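
For concreteness, a rough sketch of the drain-and-re-enable pattern described above; the ring helpers and names are hypothetical, modeled on a virtio-style avail/used ring rather than the actual qemu code:

    /* Hypothetical ring API, standing in for the real virtio avail/used ring. */
    struct vring_state;
    void  vring_disable_notify(struct vring_state *vr);
    void  vring_enable_notify(struct vring_state *vr);
    int   vring_has_avail(struct vring_state *vr);
    void *vring_pop_avail(struct vring_state *vr);
    void  vring_push_used(struct vring_state *vr, void *pkt);
    void  copy_and_transmit(void *pkt);

    /* Keep guest->host notifications disabled while draining; copy each
     * packet; only re-enable once the ring really is empty. */
    static void tx_drain(struct vring_state *vr)
    {
        vring_disable_notify(vr);          /* no exits while we work */
        for (;;) {
            while (vring_has_avail(vr)) {
                void *pkt = vring_pop_avail(vr);
                copy_and_transmit(pkt);    /* the copy runs while the guest
                                              keeps queueing, exit-free */
                vring_push_used(vr, pkt);
            }
            vring_enable_notify(vr);       /* re-enable, then re-check to
                                              close the race with the guest */
            if (!vring_has_avail(vr))
                break;
            vring_disable_notify(vr);
        }
    }
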

If the timer affects latency, then something is very wrong. We're lacking an adjustable window.

The way I see it, the notification window should be adjusted according to the current workload. If the link is idle, the window should be one packet -- notify as soon as something is queued. As the workload increases, the window increases to (safety_factor * packet_rate * allowable_latency). The timer is set to allowable_latency to catch changes in workload.

For example:

- allowable_latency 1ms (implies 1K vmexits/sec desired)
- current packet_rate 20K packets/sec
- safety_factor 0.8

So we request notifications every 0.8 * 20K/sec * 1ms = 16 packets, and set the timer to 1ms. Usually we get a notification every 16 packets, just before timer expiration. If the workload increases, we get notifications sooner, so we increase the window. If the workload drops, the timer fires and we decrease the window.

The timer should never fire on an all-out benchmark, or in a ping test.
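
To make the arithmetic concrete, a minimal sketch of the window computation; the constants and names are assumptions for illustration, not an existing interface:

    #include <stdint.h>

    /* Hypothetical constants matching the example above. */
    #define ALLOWABLE_LATENCY_SEC 0.001   /* 1 ms -> ~1K notifications/sec */
    #define SAFETY_FACTOR         0.8

    /* Notification window in packets for the current packet rate,
     * clamped to 1 so an idle link still notifies on the first packet. */
    static uint32_t notify_window(double pkts_per_sec)
    {
        double w = SAFETY_FACTOR * pkts_per_sec * ALLOWABLE_LATENCY_SEC;
        return w < 1.0 ? 1 : (uint32_t)w;
    }

    /* notify_window(20000.0) == 16, as in the example; the timer is armed
     * at allowable_latency and only fires when the workload drops. */
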

Yeah, I do like the sound of this.

However, since it requires a new guest feature and I don't expect it'll
improve the situation over the proposed patch until we have copyless
transmit, I think we should do this as part of the copyless effort.

Hopefully copyless and this can be done in parallel. I think they have value independently.

One thing I'd worry about with this scheme is all-out receive - e.g. any
delay in returning a TCP ACK to the sending side might cause us to hit
the TCP window limit.

Consider a real NIC, which also has ACK latency determined by the queue length. The proposal doesn't change that, except momentarily when transitioning from high throughput to low throughput.

In any case, latency is never more than allowable_latency (not including time spent in the guest network stack queues, but we aren't responsible for that).

(one day we can add a queue for acks and other high priority stuff, but we have enough on our hands now)

We're hurting our cache, and this won't work well with many NICs. At the very least this should be done in a dedicated thread.

A thread per NIC is doable, but it'd be especially tricky on the receive
side without more "short-cut the one producer, one consumer case" work.

We can start with transmit. I'm somewhat worried about further divergence from qemu mainline (just completed a merge...).
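
For the transmit side, something along these lines is what I'd imagine for a dedicated per-NIC thread; this is only a sketch assuming a simple condition-variable kick handoff, and the names (including the tx_drain() helper from the earlier sketch) are hypothetical:

    #include <pthread.h>
    #include <stdbool.h>

    struct vring_state;
    extern void tx_drain(struct vring_state *vr);

    /* Hypothetical per-NIC transmit state. */
    struct nic_tx {
        pthread_t       thread;
        pthread_mutex_t lock;
        pthread_cond_t  kick;
        bool            pending;
        bool            stop;
        struct vring_state *vr;
    };

    static void *nic_tx_thread(void *opaque)
    {
        struct nic_tx *tx = opaque;

        pthread_mutex_lock(&tx->lock);
        while (!tx->stop) {
            while (!tx->pending && !tx->stop)
                pthread_cond_wait(&tx->kick, &tx->lock);
            tx->pending = false;
            pthread_mutex_unlock(&tx->lock);

            tx_drain(tx->vr);          /* off the vcpu thread, so the copies
                                          no longer pollute the vcpu's cache */

            pthread_mutex_lock(&tx->lock);
        }
        pthread_mutex_unlock(&tx->lock);
        return NULL;
    }

    /* Called from the vcpu/io thread when the guest kicks the tx queue. */
    static void nic_tx_kick(struct nic_tx *tx)
    {
        pthread_mutex_lock(&tx->lock);
        tx->pending = true;
        pthread_cond_signal(&tx->kick);
        pthread_mutex_unlock(&tx->lock);
    }
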

