Re: Errors on MMIO read access on VM suspend / resume operations

Avi Kivity Thu, 13 Jan 2011 02:22:45 -0800

On 01/11/2011 06:19 PM, Stefan Berger wrote:

Hi!
I am currently doing some long-term testing of a device model using memory mapped IO (TPM TIS) and am seeing some strange errors when the suspend occurs in the middle of a read operation in the Linux TPM TIS device driver where the driver reads the result packet from the mmio location.
Short background: The TPM response packet is read in 2 chunks. First the first 10 bytes are read containing the response's header. Subsequently the rest of the packet is read using knowledge of the total size of the response packet from the header (bytes 2-5 in big endian format). The corresponding code reading the data from the hardware interface is here:
http://lxr.linux.no/#linux+v2.6.37/drivers/char/tpm/tpm_tis.c#L228
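
For illustration only, the two-chunk read described above depends on decoding the total length from header bytes 2-5. The sketch below is not the driver code at the link; the names tpm_response_size, tpm_remaining and TPM_HEADER_SIZE are hypothetical and chosen just for this example.

```c
#include <stddef.h>
#include <stdint.h>

/* Size of the first chunk (the response header), as described above. */
#define TPM_HEADER_SIZE 10

/* Hypothetical helper: decode the total response length from header
 * bytes 2-5 (big endian). Returns 0 if fewer than 10 bytes were read. */
static uint32_t tpm_response_size(const uint8_t *hdr, size_t len)
{
    if (len < TPM_HEADER_SIZE)
        return 0;
    return ((uint32_t)hdr[2] << 24) | ((uint32_t)hdr[3] << 16) |
           ((uint32_t)hdr[4] << 8)  |  (uint32_t)hdr[5];
}

/* Hypothetical helper: number of bytes left for the second chunk. */
static uint32_t tpm_remaining(const uint8_t *hdr, size_t len)
{
    uint32_t total = tpm_response_size(hdr, len);
    return total > TPM_HEADER_SIZE ? total - TPM_HEADER_SIZE : 0;
}
```

If one header byte is lost, the decoded length no longer matches what the device still has to offer, which is consistent with the off-by-one complaint described further down.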


The test I am running is set up as follows:
- inside the VM keys are permanently generated by sending commands to the TPM; packets read from the interface are dumped to the screen
- on the host a script suspends the VM every 6 seconds and resumes it immediately afterwards (using libvirt)
As it happens, sometimes the VM is suspended in the middle of a read operation on the TPM TIS interface -- see above code reference. I see that because I do dump the state of the TPM TIS when suspending and see that the read offset is pointing to a location somewhere in the middle of the packet - so the TPM TIS Linux driver is in the above loop currently reading the data. I am observing two types of results if this happens:
- either the result read by the Linux TPM TIS driver is ok, so no problem here
- or the problematic case where the TPM TIS driver reads a packet with a byte missing and then at the end gets a zero byte from the TPM TIS interface indicating that it read beyond the available data. If the suspend happened while reading the first chunk of data (header), the TPM TIS driver will also complain that the available data for the 2nd chunk (burst size) is less than what's expected -- it's an off-by-one error
So, I then modified the TPM TIS device model to decrement the read offset pointer by '1' in case it was detected that the suspend happened in the middle of the read operation -- in Qemu I do this in the post-load 'method'. This then leads to the following types of results:
- the problematic(!) case where the read packet was ok
- the expected case where the TPM TIS driver reads the packet and ends up having two same bytes in the result in consecutive array locations; besides that the TPM TIS driver will in this case complain that it has left-over data
So my conclusions from the above tests are:
- for some reason the memory read to the MMIO location happens as the last instruction executed on suspend and again as the very first one executed on resume. This explains to me that the TPM TIS model internal pointer into the packet was advanced by '1' (the packet is read by subsequently reading from the same memory location) and the above problematic cases make sense

Most likely this is qemu-kvm failing to obey this snippet from Documentation/kvm/api.txt:

NOTE: For KVM_EXIT_IO, KVM_EXIT_MMIO and KVM_EXIT_OSI, the corresponding
operations are complete (and guest state is consistent) only after userspace
has re-entered the kernel with KVM_RUN.  The kernel side will first finish
incomplete operations and then check for pending signals.  Userspace
can re-enter the guest with an unmasked signal pending to complete
pending operations.

However, the code appears to be correct. kvm_run() calls handle_mmio(), which returns 0. The following bit


    if (!r) {
        goto again;
    }

at the end of kvm_run() makes it enter the kernel again (delivering a signal to itself in case we want to stop).

- the other instructions in the Linux TPM TIS driver that, for example, advance the buffer location do not execute twice, i.e., size++ in the buf[size++] = ... in the Linux driver.
What puzzles me is that the read operation may be run twice but others don't.

Reads have split execution: kvm emulates the mmio instruction, notices that it cannot satisfy the read request, exits to qemu, then restarts the instruction. If the last step is omitted due to savevm, then kvm will exit back to qemu again and your device will see the read duplicated.
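
To make the effect concrete, here is a minimal toy model, not the actual TPM TIS device model or any qemu code. It assumes a device whose read pointer advances on every MMIO read and returns 0 once the data is exhausted; struct fifo_dev, fifo_mmio_read and guest_read are hypothetical names. The replayed read stands for the case where the device handled the read before the save, but the guest instruction only completed after the restore.

```c
#include <stddef.h>
#include <stdint.h>

/* Toy device: every read returns the next byte and advances r_offset. */
struct fifo_dev {
    const uint8_t *data;
    size_t len;
    size_t r_offset;
};

static uint8_t fifo_mmio_read(struct fifo_dev *d)
{
    if (d->r_offset >= d->len)
        return 0;                       /* read beyond the available data */
    return d->data[d->r_offset++];
}

/* The guest reads n bytes into buf. At index replay_at the device sees
 * the read twice and the guest only receives the second result.
 * Pass replay_at >= n to model an undisturbed run. */
static void guest_read(struct fifo_dev *d, uint8_t *buf, size_t n,
                       size_t replay_at)
{
    for (size_t i = 0; i < n; i++) {
        if (i == replay_at)
            (void)fifo_mmio_read(d);    /* result lost across save/restore */
        buf[i] = fifo_mmio_read(d);
    }
}
```

With a five-byte packet 1 2 3 4 5 and replay_at set to 2, the guest ends up with 1 2 4 5 0: one byte missing and a trailing zero, the same shape as the failure reported above.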

If you have insights as to why the above may be occurring, please let me know. A simple solution to work around this may be to introduce a register holding the index into the result packet where to read the next byte from (rather than advancing an internal pointer to the next byte), though this would deviate the driver from the standard interface the model currently implements.


Most undesirable; I'd like to fix the bug.

Can you sprinkle some printfs() around kvm_run (in qemu-kvm.c) to verify this?


Good pattern:

   ioctl(KVM_RUN)
   -> KVM_EXIT_MMIO
   ioctl(KVM_RUN)
   -> EINTR
   no further KVM_RUNs

or

   ioctl(KVM_RUN)
   -> something other than KVM_EXIT_MMIO
   no further KVM_RUNs

Bad pattern:

   ioctl(KVM_RUN)
   -> KVM_EXIT_MMIO
   no further KVM_RUNs

(in the save portion of your test)
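
If the printf output is collected as a list of KVM_RUN results, the check above reduces to looking at the last entry before the save. A rough sketch, assuming a hypothetical trace encoding (enum run_result and trace_is_bad are made-up names, not real KVM constants or qemu-kvm functions):

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical trace entries, one per return from ioctl(KVM_RUN). */
enum run_result { RUN_EXIT_MMIO, RUN_EINTR, RUN_EXIT_OTHER };

/* Returns true for the bad pattern: the last KVM_RUN before the save
 * returned an MMIO exit and the kernel was never re-entered to
 * complete the operation. */
static bool trace_is_bad(const enum run_result *trace, size_t n)
{
    return n > 0 && trace[n - 1] == RUN_EXIT_MMIO;
}
```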

--
error compiling committee.c: too many arguments to function
