Thanks, George. I think we’re on the same page. I’d love for Nathan to jump
in here, since I’m guessing he has opinions on this subject. Once we reach
consensus, Wei or I will submit a PR to clarify the BTL documentation.
Brian
On 9/29/21, 7:40 AM, "George Bosilca" <bosi...@icl.utk.ed> wrote:
Brian,
My comment was mainly about the BTL code. MPI_Win_fence does not require
remote completion; the call only guarantees that all outbound operations
have been locally completed, and that all inbound operations from other
sources on the process are also complete. I agree with you on the Win_flu
George –
Is your comment about the code path referring to the BTL code or the OSC RDMA
code? The OSC code seems to expect remote completion, at least for the fence
operation. Fence is implemented as a BTL flush followed by a window-wide
barrier. There’s no ordering specified between the RDMA
Based on my high-level understanding of the code path and according to the
UCX implementation of the flush, the required level of completion is local.
George.
On Tue, Sep 28, 2021 at 19:26 Zhang, Wei via devel
wrote:
> Dear All,
>
>
>
> I have a question regarding the completion semantics of