On Jun 4, 2013, at 4:50 AM, Haggai Eran <hagg...@mellanox.com> wrote:
>> Does this mean that an MPI implementation still has to register memory upon >> usage, and maintain its own registered memory cache? > Yes. However, since registration doesn't pin memory, you can leave > registered memory regions in the cache for longer periods, and you can > register larger memory regions without needing to back them with > physical memory. Hmm; I'm confused. How does this fix the MPI-needs-to-intercept-freed-memory problem? >> >>> We chose to support only 2 concurrent page faults per QP since this >>> allows us to maintain order between the QP's operations and the >>> user-space code using it. >> >> >> I talked to someone who was at the OpenFabrics workshop and saw the ODP >> presentation in person; he tells me that a fault will be incurred when a >> page is not in the HCA's TLB cache (vs. when a registered page is not in >> memory and must be swapped back in), and that this will trigger an RNR NAK. >> >> Is this correct? > > Our HCAs use their own page tables, in addition to a TLB cache. A miss > in the TLB cache that can be filled from the HCA's page tables will not > cause an RNR NAK, since the HCA can fill it relatively fast without the > help of the operating system. If the page is missing from the HCA's page > table though it will trigger a page fault and ask the OS to bring that > page. Since this might take longer, in these cases we send an RNR NAK. Ok. But the primary use case I care about is fixing the MPI-needs-to-intercept-freed-memory problem, and it doesn't sounds like ODP fixes this. >> He was very concerned about what the size of the TLB on the HCA, and >> therefore what the actual run-time behavior would be for sending around >> large messages via MPI -- i.e., would RDMA'ing 1GB messages now incur this >> HCA-must-reload-its-TLB-and-therefore-incur-RNR-NAKs behavior? >> > We have a mechanism to prefetch the pages needed for a large message > upon the first page fault, which can also help amortizing the cost of > the page fault for larger messages. Ok, thanks. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/ -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html