Re: [OMPI users] MPI_Alltoallv function crashes when np > 100

2011-05-21 Thread
my INBOX, sorry. > > Can you be more specific about what exact error is occurring? You just say > that the application crashes...? Please send all the information listed > here: > > http://www.open-mpi.org/community/help/ > > > On Apr 26, 2011, at 10:51 PM, 孟宪军 wrote:

Re: [OMPI users] MPI_Alltoallv function crashes when np > 100

2011-04-26 Thread
It seems that the constant SOMAXCONN, which is used by the listen() system call, causes this problem. Can anybody help me resolve this issue? 2011/4/25 孟宪军 <xjun.m...@gmail.com> > Dear all, > > As I mentioned, when I ran an application with mpirun and the parameter "

[OMPI users] MPI_Alltoallv function crashes when np > 100

2011-04-25 Thread
Dear all, As I mentioned, when I ran an application with mpirun and the parameter "np = 150 (or bigger)", the application, which uses the MPI_Alltoallv function, would crash. The problem recurs no matter how many nodes we use. Open MPI version: 1.4.1 or 1.4.3. OS: Red Hat Linux 2.6.32. BTW,

[OMPI users] MPI_Allgather/MPI_Allgatherv with odd-number nodes

2011-03-07 Thread
Dear all, I have been running some tests with the MPI_Allgatherv function recently. After these tests, I found a weird problem with it. When I used the MPI_Allgatherv function to gather a large amount of data from some processes (for example, 2GB of data per process), if the number of processes was even

Re: [OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread
Dear all, I have figured it out. It was a simple issue: I had not added the "blcr lib" to the $PATH environment variable. Strangely, the checkpoint operation still worked in that state, but the restart operation did not. It was so weird. Best regards Xianjun Meng On December 23, 2010 at 5:35 PM, 孟

Re: [OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread
16:39:47 2010 # Finished Seq: 0* Does anybody know why? Thanks Xianjun Meng 2010/12/23 孟宪军 <xjun.m...@gmail.com> > Dear all, > > I recently had to try the checkpoint/restart feature of Open MPI, and after > several failures and checking a lot of the documentation, I am still very co

[OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread
Dear all, I recently had to try the checkpoint/restart feature of Open MPI, and after several failures and checking a lot of the documentation, I am still very confused about how to configure it. Can anybody give me a $HOME/.openmpi/mca-params.conf file and explain to me what

Re: [OMPI users] MPI_Send doesn't work if the data >= 2GB

2010-12-07 Thread
Hi Bowen, Thanks very much. I checked my writev system call; I think that is what caused all these problems :) Best Regards Xianjun Meng 2010/12/8 Bowen Zhou > On 12/05/2010 10:13 PM, > >> hi, >> >> I ran into a problem recently when I tested the MPI_Send and

Re: [OMPI users] MPI_Send doesn't work if the data >= 2GB

2010-12-07 Thread
Hi Gus Correa, First of all, thanks for your suggestions. 1) The malloc function does return a non-NULL pointer. 2) I haven't tried the MPI_Isend function; actually, the function I really need to use is MPI_Allgatherv(). When I used it, I found this function didn't work when the data >= 2GB,

Re: [OMPI users] MPI_Send doesn't work if the data >= 2GB

2010-12-06 Thread
Hi, Are you running on two processes (mpiexec -n 2)? Yes. Have you tried to print Gsize? Yes, I have checked my code several times, and I think the errors come from Open MPI. :) The command line I used: "mpirun -hostfile ./Serverlist -np 2 ./test". The "Serverlist" file includes several

Re: [OMPI users] MPI_Send doesn't work if the data >= 2GB

2010-12-05 Thread
Open MPI. So my code still doesn't work. :( Further, I found that the collective routines (such as MPI_Allgatherv(...)), which are implemented on top of point-to-point communication, don't work either when the data > 2GB. Thanks Xianjun 2010/12/6 Tim Prince <n...@aol.com> > On 12/5/2010 7:13 P

[OMPI users] MPI_Send doesn't work if the data >= 2GB

2010-12-05 Thread
Hi, I ran into a problem recently when I tested the MPI_Send and MPI_Recv functions. When I ran the following code, the processes hung and I found there was no data transmission on my network at all. BTW: I ran this test on two x86-64 computers with 16GB of memory running Linux. 1