Although this particular bug should be fixed in 1.6.5 and 1.7.2; which
version of Open MPI are you using?

Brian

On 7/10/13 10:29 AM, "Ralph Castain" <r...@open-mpi.org> wrote:

>Yeah, we discussed taking things from your thread, plus the wiki page on
>cross-compiling OMPI, and creating a new FAQ area. I'll do so - thanks!
>
>On Jul 10, 2013, at 9:14 AM, Tim Carlson <tim.carl...@pnl.gov> wrote:
>
>> I've polluted the previous thread on GPU abilites with so much
>>Intel/Phi bits that I decided a few new threads might be a good idea.
>>First off I think the following could be a FAQ entry.
>> 
>> If you have cluster with Phi cards and are using the SCIF interface
>>with OFED, OpenMPI between two hosts (not two Phi cards) is going to
>>choose the wrong interface at runtime. I'll show this by example.
>> 
>> On a node that has a Phi card and has ofed-mic enabled, you end up with
>>two IB interfaces.
>> 
>> tim@phi001 osu]$ ibv_devices
>>    device                 node GUID
>>    ------              ----------------
>>    scif0               4c79bafffe300005
>>    mlx4_0              003048ffff95f98c
>> 
>> The scif0 interface is not the one you want to use but it is the one
>>that shows up first in the list. By default OpenMPI won't even know what
>>to do with this interface.
>> 
>> $ mpicc osu_bw.c -o osu_bw.openmpi.x
>> 
>> $ mpirun -np 2 -hostfile hosts.nodes osu_bw.openmpi.x
>> 
>>-------------------------------------------------------------------------
>>-
>> WARNING: No preset parameters were found for the device that Open MPI
>> detected:
>> 
>>  Local host:            phi002.local
>>  Device name:           scif0
>>  Device vendor ID:      0x8086
>>  Device vendor part ID: 0
>> 
>> 
>> It completely fails. However if you specify the correct interface
>>(mlx4_0) you get the expected results.
>> 
>> $ mpirun -np 2 -hostfile hosts.nodes --mca btl openib,self,sm -mca
>>btl_openib_if_include mlx4_0 osu_bw.openmpi.x
>> # OSU MPI Bandwidth Test
>> # Size        Bandwidth (MB/s)
>> 1                         3.25
>> 2                         6.40
>> 4                        12.65
>> 8                        25.53
>> 16                       50.42
>> 32                       97.06
>> 64                      187.02
>> 128                     357.88
>> 256                     663.64
>> 512                    1228.23
>> 1024                   2142.46
>> 2048                   3128.06
>> 4096                   4110.78
>> 8192                   4870.81
>> 16384                  5864.45
>> 32768                  6135.67
>> 65536                  6264.35
>> 131072                 6307.70
>> 262144                 6340.24
>> 524288                 6329.59
>> 1048576                6343.71
>> 2097152                6315.45
>> 4194304                6322.65
>> 
>> Tim
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
>_______________________________________________
>users mailing list
>us...@open-mpi.org
>http://www.open-mpi.org/mailman/listinfo.cgi/users
>


--
  Brian W. Barrett
  Scalable System Software Group
  Sandia National Laboratories



Attachment: smime.p7s
Description: S/MIME cryptographic signature

Reply via email to