Hi, Tony & all: Recently Xen-IA64 community is considering to add paravirt_ops support to keep sync with X86 and reduce maintenance effort. With pv_ops, sensitive instructions or some high level primitive functionalities (such as MMU ops) are replaced with pv_ops which is a function table call whose exact function pointer is initialized at Linux startup time depending on different hypervisor (or native) runing underlayer.
With this, we can reuse many code with X86 such as irqchip with X86, and similar dma support with X86, similar xenoprof/PMU profiling support etc. While CPU side pv_ops is quit different especially for those ASM code, since IA64 processor doesn;t have memory/stack ready at most IVT handler code. In X86, ASM side pv_ops can save clobber registers to stack and do function call, but IA64 can't due to unavailable of memory access. #define DISABLE_INTERRUPTS(clobbers) \ PARA_SITE(PARA_PATCH(pv_irq_ops, PV_IRQ_irq_disable), clobbers, \ pushl %eax; pushl %ecx; pushl %edx; \ call *%cs:pv_irq_ops+PV_IRQ_irq_disable; \ popl %edx; popl %ecx; popl %eax) \ One of the 1st biggest argument is how to support those ASM IVT handler code. Some ideas discussed include: 1: Dual IVT source code, dual IVT table. This is current Xen did, and probably are not warmly welcomed since it is not in upstream yet and have maintenance effort. 2: Same IVT source code, but dual/mulitple compile to generate dual/multiple IVT table. I.e. we replace those primitive ops (sensitive instructions) with a MACRO which uses compile option for different hypervisor type. The pseudo code of the MACRO could be: (take read CR.IVR as example) AltA: #define ASM_READ_IVR /* read IVR to GR24 */ #ifdef XEN breg1 = return address br xen_readivr #else /* native mov GR24=CR.IVR; #endif Or AltB: #define ASM_READ_IVR /* read IVR to GR24 */ #ifdef XEN in place code of function xen_readivr #else /* native mov GR24=CR.IVR; #endif From maintenance effort point of view, it is minimized, but not exactly what X86 pv_ops look like. Both approach will cause code size issue, but altB is much worse in this area, while AltA need one additional BR clobber register 3: Single IVT table, using indirect function call for pv_ops. This is more like X86 pv_ops, but we need to pay 2 additional BR clobber registers due to indirect function call, like following pseudo code: AltC: breg0 = pv_ops base breg0 += offset for this pv_ops breg1 = return address; br breg0. /* pv_ops clobbered breg0/breg1 */ For both #2 & #3, we need to modify Linux IVT code to get clobber register for those MACROs, #3 need 2 br registers and 1-2 GR registers for the function body. #2A needs least clobber register, just 1-2 GR registers. In X86, there are another enhancement (dynamic patching) base on pv_ops. The purpose is to improve cpu predication by converting indriect function call to direct function call for both C & ASM code. We may take similar approach some time later too. We really need advices from community before we jump into coding. CC some active members that I though may be interested in pv_ops since KVM-IA64 mailinglist doesn;t exist yet. Thanks a lot, Eddie _______________________________________________ Xen-ia64-devel mailing list Xen-ia64-devel@lists.xensource.com http://lists.xensource.com/xen-ia64-devel