On Jun 4, 2013, at 4:50 AM, Haggai Eran <hagg...@mellanox.com> wrote:

>> Does this mean that an MPI implementation still has to register memory upon 
>> usage, and maintain its own registered memory cache?
> Yes. However, since registration doesn't pin memory, you can leave
> registered memory regions in the cache for longer periods, and you can
> register larger memory regions without needing to back them with
> physical memory.

Hmm; I'm confused.  How does this fix the MPI-needs-to-intercept-freed-memory 
problem?

>> 
>>> We chose to support only 2 concurrent page faults per QP since this
>>> allows us to maintain order between the QP's operations and the
>>> user-space code using it.
>> 
>> 
>> I talked to someone who was at the OpenFabrics workshop and saw the ODP 
>> presentation in person; he tells me that a fault will be incurred when a 
>> page is not in the HCA's TLB cache (vs. when a registered page is not in 
>> memory and must be swapped back in), and that this will trigger an RNR NAK.
>> 
>> Is this correct?
> 
> Our HCAs use their own page tables, in addition to a TLB cache. A miss
> in the TLB cache that can be filled from the HCA's page tables will not
> cause an RNR NAK, since the HCA can fill it relatively fast without the
> help of the operating system. If the page is missing from the HCA's page
> table though it will trigger a page fault and ask the OS to bring that
> page. Since this might take longer, in these cases we send an RNR NAK.

Ok.

But the primary use case I care about is fixing the 
MPI-needs-to-intercept-freed-memory problem, and it doesn't sounds like ODP 
fixes this.

>> He was very concerned about what the size of the TLB on the HCA, and 
>> therefore what the actual run-time behavior would be for sending around 
>> large messages via MPI -- i.e., would RDMA'ing 1GB messages now incur this 
>> HCA-must-reload-its-TLB-and-therefore-incur-RNR-NAKs behavior?
>> 
> We have a mechanism to prefetch the pages needed for a large message
> upon the first page fault, which can also help amortizing the cost of
> the page fault for larger messages.


Ok, thanks.

-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to