Re: [OMPI users] Problems with mpirun in openmpi-1.8.1 and -2.0.0

2016-08-23 Thread Juan A. Cordero Varelaq
Hi Gilles, so if I use rthe option --mca pml ob1, I use infiniband and it will be as fast as normal, right? Thanks On 22/08/16 14:22, Gilles Gouaillardet wrote: Juan, to keep things simple, --mca pml ob1 ensures you are not using mxm (yet an other way to use infiniband) IPoIB is unlikely

Re: [OMPI users] Problems with mpirun in openmpi-1.8.1 and -2.0.0

2016-08-23 Thread Gilles Gouaillardet
Juan, if you want to use infiniband with the openib/btl (i am assuming MXM is not available on your platform, and you to not want to use infiniband via usnic/libfabric), you can mpirun --mca pml ob1 --mca btl ^usnic ... /* i am pretty sure mpirun ... would do the trick too */ if you get th

Re: [OMPI users] Problems with mpirun in openmpi-1.8.1 and -2.0.0

2016-08-23 Thread Juan A. Cordero Varelaq
Hi Gilles, If I run it like this: mpirun --mca btl ^openib,usnic --mca pml ob1 --mca btl_sm_use_knem 0 -np 5 myscript.sh it works fine. Am I using infiniband in this way? However, if I remove *openib*, I get the *librdmacm: Fatal: unable to open RDMA device* error. So what would be the most

Re: [OMPI users] OMPI users] Problems with mpirun in openmpi-1.8.1 and -2.0.0

2016-08-23 Thread Gilles Gouaillardet
Juan, You are not using infiniband with these options. Have your infiniband fabric fixed, and you will be able to mpirun without any options, and with native infiniband. Cheers, Gilles "Juan A. Cordero Varelaq" wrote: >Hi Gilles, > >If I run it like this: > >mpirun --mca btl ^openib,usnic --m

Re: [OMPI users] Problems with mpirun in openmpi-1.8.1 and -2.0.0

2016-08-23 Thread Nathan Hjelm
Might be worth trying with --mca btl_openib_cpc_include udcm   and see if that works. -Nathan On Aug 23, 2016, at 02:41 AM, "Juan A. Cordero Varelaq" wrote: Hi Gilles, If I run it like this: mpirun --mca btl ^openib,usnic --mca pml ob1 --mca btl_sm_use_knem 0 -np 5 myscript.sh it works fine

[OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Richard G French
Hi, all - I'm trying to build the SPH code Gadget2 ( http://wwwmpa.mpa-garching.mpg.de/gadget/) under OS X 10.11.6 and I am getting the following type of error: 222 rfrench@cosmos> make mpicc main.o run.o predict.o begrun.o endrun.o global.o timestep.o init.o restart.o io.o accel.o read_ic.o

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Douglas L Reeder
Richard, It looks like you need to add some -l arguments to the the specific openmpi libraries hat you need (e.g., -lmpi -lmpi_cxx) Doug > On Aug 23, 2016, at 1:43 PM, Richard G French wrote: > > Hi, all - > I'm trying to build the SPH code Gadget2 > (http://wwwmpa.mpa-garching.mpg.de/gad

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Richard G French
Thanks for the suggestion, Doug - but I can't seem to find the missing function ompi_mpi_byte in any of those other libraries. I'll keep looking! I wonder if I failed to configure mpich properly when I built it. Dick On Tue, Aug 23, 2016 at 4:01 PM, Douglas L Reeder wrote: > Richard, > > It loo

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread r...@open-mpi.org
I’m confused - you keep talking about MPICH, but the symbol you are looking for is from OMPI. You cannot mix the two MPI libraries - is that what you are trying to do? > On Aug 23, 2016, at 1:30 PM, Richard G French wrote: > > Thanks for the suggestion, Doug - but I can't seem to find the miss

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Douglas L Reeder
Dick, In the openmpi library directory you can do nm library_file_name|grep ompi_mpi_byte And that wil tell you if that library contains ompi_mpi_byte Doug > On Aug 23, 2016, at 2:30 PM, Richard G French wrote: > > Thanks for the suggestion, Doug - but I can't seem to find the missing > func

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Reuti
Hi, Am 23.08.2016 um 21:43 schrieb Richard G French: > Hi, all - > I'm trying to build the SPH code Gadget2 > (http://wwwmpa.mpa-garching.mpg.de/gadget/) under OS X 10.11.6 and I am > getting the following type of error: > > 222 rfrench@cosmos> make > > mpicc main.o run.o predict.o begrun.o

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Richard G French
It sounds like I need to understand what flavor of MPI I need for this project. I'll do some homework to try to understand the differences. Thanks for the leads. Dick On Tue, Aug 23, 2016 at 4:40 PM, Douglas L Reeder wrote: > Dick, > > In the openmpi library directory you can do > > nm library_

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Richard G French
2415 rfrench@cosmos> mpicc -show gcc -I/usr/local/include -L/usr/local/lib -lmpi 2416 rfrench@cosmos> which mpicc /usr/local/bin/mpicc I'll get rid of the mpich-mp stuff and give it another try! On Tue, Aug 23, 2016 at 4:40 PM, Reuti wrote: > Hi, > > Am 23.08.2016 um 21:43 schrieb Richard

Re: [OMPI users] OS X El Capitan 10.11.6 ld: symbol(s) not found for architecture x86_64

2016-08-23 Thread Richard G French
Problem solved! I had to remove a macports directory from my path that had the improper MPI library, and at last the code is working. Thanks so much to everyone for your friendly and prompt suggestions - I appreciate it very much. Dick On Tue, Aug 23, 2016 at 4:51 PM, Richard G French wrote: >

Re: [OMPI users] stdin issue with openmpi/2.0.0

2016-08-23 Thread Jingchao Zhang
Hi Ralph, I tested v2.0.1rc1 with your code but has the same issue. I also installed v2.0.1rc1 on a different cluster which has Mellanox QDR Infiniband and get the same result. For the tests you have done, how many cores and nodes did you use? I can trigger the problem by using multiple nodes

Re: [OMPI users] stdin issue with openmpi/2.0.0

2016-08-23 Thread r...@open-mpi.org
The IO forwarding messages all flow over the Ethernet, so the type of fabric is irrelevant. The number of procs involved would definitely have an impact, but that might not be due to the IO forwarding subsystem. We know we have flow control issues with collectives like Bcast that don’t have buil

Re: [OMPI users] stdin issue with openmpi/2.0.0

2016-08-23 Thread Jingchao Zhang
Everything stuck at MPI_Init. For a test job with 2 nodes and 10 cores each node, I got the following $ mpirun ./a.out < test.in Rank 2 has cleared MPI_Init Rank 4 has cleared MPI_Init Rank 7 has cleared MPI_Init Rank 8 has cleared MPI_Init Rank 0 has cleared MPI_Init Rank 5 has cleared MPI_Init

[OMPI users] Using Open MPI with PBS Pro

2016-08-23 Thread Andy Riebs
I gleaned from the web that I need to comment out "opal_event_include=epoll" in /etc/openmpi-mca-params.conf in order to use Open MPI with PBS Pro. Can we also disable that in other cases, like Slurm, or is this something specific to PBS Pro? Andy -- Andy Riebs andy.ri...@hpe.com Hewlett-Pa

Re: [OMPI users] Using Open MPI with PBS Pro

2016-08-23 Thread r...@open-mpi.org
I’ve never heard of that, and cannot imagine what it has to do with the resource manager. Can you point to where you heard that one? FWIW: we don’t ship OMPI with anything in the default mca params file, so somebody must have put it in there for you. > On Aug 23, 2016, at 4:48 PM, Andy Riebs

Re: [OMPI users] stdin issue with openmpi/2.0.0

2016-08-23 Thread r...@open-mpi.org
Very strange. I cannot reproduce it as I’m able to run any number of nodes and procs, pushing over 100Mbytes thru without any problem. Which leads me to suspect that the issue here is with the tty interface. Can you tell me what shell and OS you are running? > On Aug 23, 2016, at 3:25 PM, Jing