Re: [OMPI users] Multiple Communicators for communication

2012-06-15 Thread Jeff Squyres
On Jun 14, 2012, at 8:43 PM, Ramesh Vinayagam wrote: > I was wondering is there a way to communicate between two processes with two > different communicators simultaneously in MPI? Like having two channels for > communication? I'm not quite sure what you're asking. Are you asking if it's possi

Re: [OMPI users] An idea about a semi-automatic optimized parallel I/O with Open MPI

2012-06-15 Thread Jeff Squyres
There's nothing that says that your daemons have to be MPI processes. They could be proper system-level daemons that live "forever", for example. You might not be able to speak to them via MPI easily (e.g., you may need to use TCP sockets or some other network transport), but this is fairly co

Re: [OMPI users] How to enable progress threads in 1.6?

2012-06-15 Thread Jeff Squyres
On Jun 13, 2012, at 10:35 AM, Filippo Donida wrote: > My idea is to have a thread to be launched just after the MPI_Isend call with > a fake MPI_Test function call, just to make the sending operation > progressing. Sorry for jumping in late (usual disclaimer about my INBOX being a disaster...)

Re: [OMPI users] Myricom MX2G Segmentation fault on OMPI 1.6

2012-06-15 Thread Jeff Squyres
On Jun 11, 2012, at 7:48 PM, Yong Qin wrote: > ah, I guess my original understanding of PML was wrong. Adding "-mca > pml ob1" does help to ease the problem. See the README for a little more discussion about this issue. There can only be 1 PML in use by a given MPI job -- using "--mca pml ob1"

Re: [OMPI users] openib_reg_mr

2012-06-15 Thread Jeff Squyres
Were there any clues in /var/log/messages or dmesg? You might also want to check out this IBM writeup about some Mellanox parameters: http://www.ibm.com/developerworks/wikis/display/hpccentral/Using+RDMA+with+pagepool+larger+than+8GB That IBM server seems to be misbehaving right now (500/intern

Re: [OMPI users] openib_reg_mr

2012-06-15 Thread Daniels, Marcus G
On Jun 15, 2012, at 8:02 AM, Jeff Squyres wrote: > Were there any clues in /var/log/messages or dmesg? > Thanks. I found a suggestion from Nathan Hjelm to add "options mlx4_core log_mtts_per_seg=X" (where X is 5 in my case). Offline suggestions (which also included that) were also add "--mc

Re: [OMPI users] openib_reg_mr

2012-06-15 Thread Ralph Castain
Hi Marcus Sounds like you might be running out of IB resources as opposed to main memory - not much we can suggest there other than trying to set queue sizes, which is a complicated option. You might look at "ompi_info --param btl openib" and see if adjusting some of those helps. Ralph On Ju

Re: [OMPI users] openib_reg_mr

2012-06-15 Thread Jeff Squyres
On Jun 15, 2012, at 11:26 AM, Daniels, Marcus G wrote: >> Were there any clues in /var/log/messages or dmesg? > > Thanks. I found a suggestion from Nathan Hjelm to add "options mlx4_core > log_mtts_per_seg=X" (where X is 5 in my case). > Offline suggestions (which also included that) were als

Re: [OMPI users] Building MPI on Windows

2012-06-15 Thread VimalMathew
Hi Damien, I installed MS Visual Studio 2010 and tried the whole procedure again and it worked! That's the great news. Now the bad news is that I'm trying to run the program again using mpiexec and it won't! I get these error messages: orte_rml_base_select failed orte_ess_set_name fail

Re: [OMPI users] Building MPI on Windows

2012-06-15 Thread Damien
Hmmm. Two things. Can you run helloworldMPI.exe on it's own? It should output "Number of threads = 1, My rank = 0" Also, can you post the output of ompi_info ? I think you might still have some path mixups. A successful OpenMPI build with this simple program should just work. If you sti

Re: [OMPI users] Building MPI on Windows

2012-06-15 Thread VimalMathew
Just figured it out. The only thing different from when it ran yesterday to today was I was connected to a network. So I connected my laptop to a network and it worked again. Thanks for all your help, Damien! I'm sure I'm gonna get stuck more along the way so hoping you can help. -- Vima

Re: [OMPI users] Building MPI on Windows

2012-06-15 Thread Damien
OK, that's what orte_rml_base_select failed means, no TCP connection. But you should be able to make OpenMPI & mpiexec work without a network if you're just running in local memory. There's probably a runtime parameter to set but I don't know what it is. Maybe Jeff or Shiqing can weigh in wi

Re: [OMPI users] Multiple Communicators for communication

2012-06-15 Thread Ramesh Vinayagam
Thanks Jeff, I was looking more into this scenario : if (rank ==0){ MPI_Send(&tmp,2048, MPI_INT, 1,123, myComm); MPI_Recv(&tmp2,2048, MPI_INT, 1,321, MPI_COMM_WORLD, MPI_STATUS_IGNORE); } if (rank == 1){ MPI_Send(&tmp1,2048, MPI_INT, 0,321, MPI_COMM_WO

Re: [OMPI users] Multiple Communicators for communication

2012-06-15 Thread Jeff Squyres
On Jun 15, 2012, at 4:38 PM, Ramesh Vinayagam wrote: > I was looking more into this scenario : > > if (rank ==0){ > MPI_Send(&tmp,2048, MPI_INT, 1,123, myComm); > MPI_Recv(&tmp2,2048, MPI_INT, 1,321, > MPI_COMM_WORLD, MPI_STATUS_IGNORE); > } > if (rank == 1){