Am 01.09.2015 um 17:43 schrieb Scot Breitenfeld: > I was also getting the same error with MOAB from ANL when we were > benchmarking small mesh reads with large number of processors. When I > ran on 16384 processes the job would terminate with: > > Out of memory in file > /bgsys/source/srcV1R2M1.17463/comm/lib/dev/mpich2/src/mpi/romio/adio/ad_bg/ad_bg_rdcoll.c, > line 1073 > > A semi-discussion about the problem can be found here: > > http://lists.mpich.org/pipermail/devel/2013-May/000154.html > > We did not have time in the project to look into the problem any further. > > Scot
Thanks for pointing out this discussion, Scot. It seems that not only you did not have time to investigate the problem further, but neither IBM nor MPICH did :) I guess this indicates that it's not an HDF5 problem but an MPICH problem, at heart, and that there's some memory allocations that scale with the number of ranks. Though it seems your team hit the "invisible barrier" much later than we did. Cheers, Wolf -- _______________________________________________ Hdf-forum is for HDF software users discussion. [email protected] http://lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org Twitter: https://twitter.com/hdf5
