Hi,

I am trying to debug a large iterative solver that has been compiled using 
intel fortran 10 and open mpi 1.2.6. that is run in a SLES10 based PC cluster 
using valgrind 3.5.0. To supress the openmpi error messages I've put mpi 
argument "--mca btl tcp, self", otherwise the log file is simply filled by open 
mpi messages, which may amount to 20Mb. Here, I am not trying to debug the open 
mpi.

The problem appears at random in after several iterations which suggest a 
memory related problem. I expect Memcheck to report an error when the offending 
lines is executed for the first time. However, Memcheck does not report up any 
problem prior to the point of failure, i.e. the part of the code that fail has 
been passed by Memcheck as problem free in previous iterations. At the point of 
failure, typically the following error is logged by memcheck

Invalid write of size 8
  at 0x511BF6B: _int_malloc (in 
/opt/openmpi-1.2.6/intel/lib/libopen-pal.so.0.0.0)  
  by 0x511B710: malloc (in /opt/openmpi-1.2.6/intel/lib/libopen-pal.so.0.0.0)
  by 0x907161A: ompi_coll_tuned_allreduce_intra_recursivedoubling (in 
/opt/openmpi-1.2.6/intel/lib/mca_coll_tuned.so)
  by 0x906FFED: ompi_coll_tuned_allreduce_intra_dec_fixed (in 
/opt/openmpi-1.2.6/intel/lib/mca_coll_tuned.so)

  by 0x4DFFEF7: PMPI_Allreduce (in /opt/openmpi-1.2.6/intel/lib/libmpi.so.0.0.0)

  by 0x4C9C612: PMPI_ALLREDUCE (in 
/opt/openmpi-1.2.6/intel/lib/libmpi_f77.so.0.0.0)
  by 0x6E006E: my_mpireduce_call (my_routine.F)
  .
  .
  .
Adress 0x10 is not stack'd, malloc'd or (recently) free'd
 
 
Since memcheck does not detect any problem earlier when the point of failure is 
executed, does it mean the program is fairly sound? I could not see any bug at 
the program line that call the mpi reduce. 

Does that error message suggest the dynamic memory allocation within open-mpi 
allreduce operation is at fault? If this is the case, could I capture the 
problem earlier by removing the mpi suppression ?

Regards


                                          
_________________________________________________________________
View your other email accounts from your Hotmail inbox. Add them now.
http://clk.atdmt.com/UKM/go/186394592/direct/01/
------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Valgrind-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/valgrind-users

Reply via email to