Whoops.  Fixed.

Rolf

>-----Original Message-----
>From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org]
>On Behalf Of Nathan Hjelm
>Sent: Wednesday, June 27, 2012 6:20 PM
>To: Open MPI Developers
>Subject: Re: [OMPI devel] RFC: add asynchronous copies for large GPU
>buffers
>
>Can you make your repository public or add me to the access list?
>
>-Nathan
>
>On Wed, Jun 27, 2012 at 03:12:34PM -0700, Rolf vandeVaart wrote:
>> WHAT: Add support for doing asynchronous copies of GPU memory with
>larger messages.
>> WHY: Improve performance for sending/receiving of larger GPU messages
>> over IB
>> WHERE: ob1, openib, and convertor code.  All is protected by compiler
>directives
>>                so no effect on non-CUDA builds.
>> REFERENCE BRANCH: https://bitbucket.org/rolfv/ompi-trunk-cuda-async
>>
>> DETAILS:
>> When sending/receiving GPU memory through IB, all data first passes into
>host memory.
>> The copy of GPU memory into and out of the host memory can be done
>> asynchronously to improve performance.  This RFC adds that feature for the
>fragments of larger messages.
>>
>> On the sending side, the completion function is essentially broken in
>> two.  The first function is called when the copy completes which then
>> initiates the send.  When the send completes, the second function is called.
>>
>> Likewise, on the receiving side, a callback is called when the
>> fragment arrives which initiates the copy of the data out of the
>> buffer.  When the copy completes, a second function is called which
>> also calls back into the BTL so it can free resources that were being used.
>>
>> M       opal/datatype/opal_datatype_copy.c
>> M       opal/datatype/opal_convertor.c
>> M       opal/datatype/opal_convertor.h
>> M       opal/datatype/opal_datatype_cuda.c
>> M       opal/datatype/opal_datatype_cuda.h
>> M       opal/datatype/opal_datatype_unpack.c
>> M       opal/datatype/opal_datatype_pack.h
>> M       opal/datatype/opal_datatype_unpack.h
>> M       ompi/mca/btl/btl.h
>> M       ompi/mca/btl/openib/btl_openib_component.c
>> M       ompi/mca/btl/openib/btl_openib.c
>> M       ompi/mca/btl/openib/btl_openib.h
>> M       ompi/mca/btl/openib/btl_openib_mca.c
>> M       ompi/mca/pml/ob1/pml_ob1_recvfrag.c
>> M       ompi/mca/pml/ob1/pml_ob1_sendreq.c
>> M       ompi/mca/pml/ob1/pml_ob1_progress.c
>> M       ompi/mca/pml/ob1/pml_ob1_recvreq.c
>> M       ompi/mca/pml/ob1/pml_ob1_cuda.c
>> M       ompi/mca/pml/ob1/pml_ob1_recvreq.h
>> ----------------------------------------------------------------------
>> ------------- This email message is for the sole use of the intended
>> recipient(s) and may contain confidential information.  Any
>> unauthorized review, use, disclosure or distribution is prohibited.
>> If you are not the intended recipient, please contact the sender by
>> reply email and destroy all copies of the original message.
>> ----------------------------------------------------------------------
>> -------------
>>
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>_______________________________________________
>devel mailing list
>de...@open-mpi.org
>http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to