The invalid writes in uGNI are nothing. I suggest adding any GNI_ call to a
suppression file. The RB tree invalid write looks like a bug. I will take a
look and see what might be causing it.
BTW, you can add --with-valgrind(=DIR) to configure. This will suppress some
uninitialized value errors
Nathan,
I am trying to track down some memory corruption that leads to crashes
in my application running on the Cray system using Open MPI
(git-6093f2d). Valgrind reports quite some invalid reads and writes
inside Open MPI when running the benchmark that I sent you earlier.
There are plenty
Nathan,
That is good news! Are the improvements that are scheduled for 4.0.0
already stable enough to be tested? I'd be interested in trying them to
see whether and how they affect our use-cases.
Also, thanks for pointing me to the RMA-MT benchmark suite, I wasn't
aware of that project. I
Thanks for confirming that it works for you as well. I have a PR open on v3.1.x
that brings osc/rdma up to date with master. I will also be bringing some code
that greatly improves the multi-threaded RMA performance on Aries systems (at
least with benchmarks— github.com/hpc/rma-mt). That will
Nathan,
Thank you, I can confirm that it works as expected with master on our
system. I will stick to this version then until 3.1.1 is out.
Joseph
On 05/08/2018 05:34 PM, Nathan Hjelm wrote:
Looks like it doesn't fail with master so at some point I fixed this
bug. The current plan is to
Looks like it doesn't fail with master so at some point I fixed this bug. The
current plan is to bring all the master changes into v3.1.1. This includes a
number of bug fixes.
-Nathan
On May 08, 2018, at 08:25 AM, Joseph Schuchart wrote:
Nathan,
Thanks for looking into
Nathan,
Thanks for looking into that. My test program is attached.
Best
Joseph
On 05/08/2018 02:56 PM, Nathan Hjelm wrote:
I will take a look today. Can you send me your test program?
-Nathan
On May 8, 2018, at 2:49 AM, Joseph Schuchart wrote:
All,
I have been
I will take a look today. Can you send me your test program?
-Nathan
> On May 8, 2018, at 2:49 AM, Joseph Schuchart wrote:
>
> All,
>
> I have been experimenting with using Open MPI 3.1.0 on our Cray XC40
> (Haswell-based nodes, Aries interconnect) for multi-threaded MPI