Am 01.09.2015 um 17:43 schrieb Scot Breitenfeld:
> I was also getting the same error with MOAB from ANL when we were
> benchmarking small mesh reads with large number of processors. When I
> ran on 16384 processes the job would terminate with:
> 
> Out of memory in file
> /bgsys/source/srcV1R2M1.17463/comm/lib/dev/mpich2/src/mpi/romio/adio/ad_bg/ad_bg_rdcoll.c,
> line 1073
> 
> A semi-discussion about the problem can be found here:
> 
> http://lists.mpich.org/pipermail/devel/2013-May/000154.html
> 
> We did not have time in the project to look into the problem any further.
> 
> Scot

Thanks for pointing out this discussion, Scot. It seems that not only
you did not have time to investigate the problem further, but neither
IBM nor MPICH did :)

I guess this indicates that it's not an HDF5 problem but an MPICH
problem, at heart, and that there's some memory allocations that scale
with the number of ranks.

Though it seems your team hit the "invisible barrier" much later than we
did.

Cheers,
Wolf

-- 




_______________________________________________
Hdf-forum is for HDF software users discussion.
[email protected]
http://lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
Twitter: https://twitter.com/hdf5

Reply via email to