On Sun, Jul 22, 2018 at 10:45 AM Andy Lutomirski <l...@kernel.org> wrote: > > This patch changes the code to map the percpu TSS into the user page > tables to allow the non-trampoline SYSCALL64 path to work under PTI.
Me likey. However: > This does not add a new direct information leak, since the TSS is > readable by Meltdown from the cpu_entry_area alias regardless. Afaik, it does now potentially expose through meltdown the per-thread entry stack info, which is new. But I don't think that's a show-stopper. > static void __init pti_clone_user_shared(void) > { > + for_each_possible_cpu(cpu) { But this code is pretty disgusting and seems wrong. Do you really want to do all trhe _possible_ cpu's, not just the online ones? I'd rather expose less (think MAXCPU) and then have the CPU hotplug code expose the page as the CPU comes up? > + unsigned long va = (unsigned long)&per_cpu(cpu_tss_rw, cpu); > + phys_addr_t pa = per_cpu_ptr_to_phys((void *)va); > + pte_t *target_pte; > + > + target_pte = pti_user_pagetable_walk_pte(va); This function only exists if CONFIG_X86_VSYSCALL_EMULATION, so it won't even compile under (very unusual) configurations. The "disgusting" part is that I think it could/should share more code with the vsyscall case, and the whole target-pte checking and setting should be shared too. Beause not being shared, I react to this: > + set_pte(target_pte, pfn_pte(pa >> PAGE_SHIFT, PAGE_KERNEL)); Hmm. The vsyscall code just does *target_pte = .. without any set_pte() stuff. Do we want/need the PVOP cases, and if so, why doesn't the vsyscall case need it? Anyway, I love the approach, and how this gets rid of the nasty trampoline, so no real complaints, just "this needs some fixups". Linus