Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-17 Thread Nathan Hjelm
The invalid writes in uGNI are nothing. I suggest adding any GNI_ call to a suppression file. The RB tree invalid write looks like a bug. I will take a look and see what might be causing it. BTW, you can add --with-valgrind(=DIR) to configure. This will suppress some uninitialized value errors

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-17 Thread Joseph Schuchart
Nathan, I am trying to track down some memory corruption that leads to crashes in my application running on the Cray system using Open MPI (git-6093f2d). Valgrind reports quite some invalid reads and writes inside Open MPI when running the benchmark that I sent you earlier. There are plenty

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-11 Thread Joseph Schuchart
Nathan, That is good news! Are the improvements that are scheduled for 4.0.0 already stable enough to be tested? I'd be interested in trying them to see whether and how they affect our use-cases. Also, thanks for pointing me to the RMA-MT benchmark suite, I wasn't aware of that project. I

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-09 Thread Nathan Hjelm
Thanks for confirming that it works for you as well. I have a PR open on v3.1.x that brings osc/rdma up to date with master. I will also be bringing some code that greatly improves the multi-threaded RMA performance on Aries systems (at least with benchmarks— github.com/hpc/rma-mt). That will

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-09 Thread Joseph Schuchart
Nathan, Thank you, I can confirm that it works as expected with master on our system. I will stick to this version then until 3.1.1 is out. Joseph On 05/08/2018 05:34 PM, Nathan Hjelm wrote: Looks like it doesn't fail with master so at some point I fixed this bug. The current plan is to

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-08 Thread Nathan Hjelm
Looks like it doesn't fail with master so at some point I fixed this bug. The current plan is to bring all the master changes into v3.1.1. This includes a number of bug fixes. -Nathan On May 08, 2018, at 08:25 AM, Joseph Schuchart wrote: Nathan, Thanks for looking into

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-08 Thread Joseph Schuchart
Nathan, Thanks for looking into that. My test program is attached. Best Joseph On 05/08/2018 02:56 PM, Nathan Hjelm wrote: I will take a look today. Can you send me your test program? -Nathan On May 8, 2018, at 2:49 AM, Joseph Schuchart wrote: All, I have been

Re: [OMPI users] MPI-3 RMA on Cray XC40

2018-05-08 Thread Nathan Hjelm
I will take a look today. Can you send me your test program? -Nathan > On May 8, 2018, at 2:49 AM, Joseph Schuchart wrote: > > All, > > I have been experimenting with using Open MPI 3.1.0 on our Cray XC40 > (Haswell-based nodes, Aries interconnect) for multi-threaded MPI