On 07/09/2013 12:44:32 PM, Alexander Graf wrote:
On 07/09/2013 07:13 PM, Scott Wood wrote:
On 07/08/2013 08:39:05 AM, Alexander Graf wrote:

On 28.06.2013, at 11:20, Mihai Caraman wrote:

> lwepx faults need to be handled by KVM, and this implies additional code
> in the DO_KVM macro to identify exceptions that originated from host
> context. This requires checking the Exception Syndrome Register (ESR[EPID])
> and the External PID Load Context Register (EPLC[EGS]) for the DTB_MISS,
> DSI and LRAT exceptions, which is too intrusive for the host.
>
> Get rid of lwepx and acquire the last instruction in kvmppc_handle_exit()
> by searching for the physical address and kmapping it. This fixes an
> infinite loop
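For context, the "searching for the physical address" half of that approach boils down to a tlbsx search of the hardware TLB performed in the guest's address space. A minimal sketch, assuming an e500mc-style MMU: the helper name is invented, and the MAS field macros (MAS5_SGS, MAS6_SPID_SHIFT, MAS1_VALID) are written from memory of the book3e MMU headers rather than taken from the patch.

#include <linux/irqflags.h>
#include <linux/errno.h>
#include <asm/reg.h>
#include <asm/mmu-book3e.h>

/*
 * Sketch only: look up the guest TLB entry that translates geaddr.
 * MAS5[SGS]/[SLPID] and MAS6[SPID]/[SAS] make tlbsx search the guest's
 * logical partition and PID rather than the host's.
 */
static int guest_tlb_search(unsigned long geaddr, unsigned int pid,
			    unsigned int lpid, unsigned int as,
			    u32 *mas1, u64 *mas7_mas3)
{
	unsigned long flags;

	/* The MAS registers are shared state; keep interrupts off */
	local_irq_save(flags);

	mtspr(SPRN_MAS6, (pid << MAS6_SPID_SHIFT) | as);
	mtspr(SPRN_MAS5, MAS5_SGS | lpid);

	asm volatile("tlbsx 0, %[geaddr]" : : [geaddr] "r" (geaddr) : "memory");

	mtspr(SPRN_MAS5, 0);

	*mas1 = mfspr(SPRN_MAS1);
	*mas7_mas3 = ((u64)mfspr(SPRN_MAS7) << 32) | mfspr(SPRN_MAS3);

	local_irq_restore(flags);

	/* No valid match: the guest mapping is gone; the caller has to
	 * let the guest re-execute and fault again. */
	if (!(*mas1 & MAS1_VALID))
		return -EAGAIN;

	return 0;
}

The page size of the matching entry (and hence the psize_shift used in the code quoted below) would come from MAS1[TSIZE].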

What's the difference in speed for this?

Also, could we call lwepx later in host code, when kvmppc_get_last_inst() gets invoked?

Any use of lwepx is problematic unless we want to add overhead to the main Linux TLB miss handler.

What exactly would be missing?

If lwepx faults, it goes to the normal host TLB miss handler. Without adding code to it to recognize that it's an external-PID fault, it will try to search the normal Linux page tables and insert a normal host entry. If it thinks it has succeeded, it will retry the instruction rather than search for an exception handler. The instruction will fault again, and you get a hang.

I'd also still like to see some performance benchmarks on this to make sure we're not walking into a bad direction.

I doubt it'll be significantly different; there's overhead involved in setting up for lwepx as well. It doesn't hurt to test, but since this is a functional correctness issue, I'm not sure what better alternatives we have. I don't want to slow down non-KVM TLB misses for this.

> +    addr = (mas7_mas3 & (~0ULL << psize_shift)) |
> +           (geaddr & ((1ULL << psize_shift) - 1ULL));
> +
> +    /* Map a page and get guest's instruction */
> +    page = pfn_to_page(addr >> PAGE_SHIFT);
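To make the quoted fragment self-contained, here is roughly how the mapped page would then be read: a minimal sketch assuming mas7_mas3, geaddr and psize_shift come from the TLB search, with an invented function name and error handling that is not taken from the patch.

#include <linux/mm.h>
#include <linux/highmem.h>
#include <linux/errno.h>

/* Sketch only: pull the 32-bit guest instruction out of host memory
 * once the real address of the guest mapping is known. */
static int read_guest_inst(u64 mas7_mas3, unsigned long geaddr,
			   unsigned int psize_shift, u32 *instr)
{
	u64 addr;
	struct page *page;
	void *kva;

	/* Real address: RPN bits from MAS7||MAS3 plus the in-page offset */
	addr = (mas7_mas3 & (~0ULL << psize_shift)) |
	       (geaddr & ((1ULL << psize_shift) - 1ULL));

	/* The guest page must be backed by host RAM we can kmap */
	if (!pfn_valid(addr >> PAGE_SHIFT))
		return -EFAULT;

	/* Map the backing page and read the 4-byte instruction word */
	page = pfn_to_page(addr >> PAGE_SHIFT);
	kva = kmap_atomic(page);
	*instr = *(u32 *)(kva + (unsigned long)(addr & ~PAGE_MASK));
	kunmap_atomic(kva);

	return 0;
}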

So it seems to me like you're jumping through a lot of hoops to make sure this works for LRAT and non-LRAT at the same time. Can't we just treat them as the different things they are?

What if we have different MMU backends for LRAT and non-LRAT? The non-LRAT case could then try lwepx and, if that fails, fall back to reading the shadow TLB. For the LRAT case, we'd do lwepx and, if that fails, fall back to this logic.

This isn't about LRAT; it's about hardware threads. It also fixes the handling of execute-only pages on current chips.

On non-LRAT systems we could always check our shadow copy of the guest's TLB, no? I'd really like to know what the performance difference would be for the 2 approaches.

I suspect that tlbsx is faster, or at worst similar. And unlike comparing tlbsx to lwepx (not counting a fix for the threading problem), we don't already have code to search the guest TLB, so testing that approach would be more work.

-Scott
--
To unsubscribe from this list: send the line "unsubscribe kvm-ppc" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
