Marcelo Tosatti wrote:

I think we had cases where the spte.pfn contents changed, for example when a large page was replaced by a normal page,

True. And the TLB is not currently flushed on a large->normal replace
when the pte that's faulting is read-only. The local (and remote) TLBs
must be flushed on a large->normal replace.


Can you prepare a patch for that, for -stable?

and also:

       } else if (pfn != spte_to_pfn(*shadow_pte)) {

That one is likely to crash the guest anyway, so I don't see the need
for a flush there.

Did you find out what's causing the errors in the first place (if
zap is not used)?  It worries me greatly.
Yes, the problem is that the rmap code does not handle the qemu
process mappings vanishing while there is a present rmap. If that
happens, and there is a fault for a gfn whose qemu mapping has been
removed, a different physical zero page will be allocated:

     rmap a -> gfn 0 -> physical host page 0
     mapping for gfn 0 gets removed
     guest faults in gfn 0 through the same pte "chain"
     rmap a -> gfn 0 -> physical host page 1

When instantiating the shadow mapping for the second time, the
"is_rmap_pte" check succeeds, so we release the reference grabbed by
gfn_to_page() at mmu_set_spte(). We now have a shadow mapping pointing
to a physical page without having an additional reference on that
page.

The following makes the host not crash under such a condition, but the
condition itself is invalid and leads to inconsistent state in the
guest. So IMHO it shouldn't be allowed to happen in the first place.

And it isn't, with mmu notifiers.

--
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.
