Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-07 Thread Jeff Squyres
ed in the code > >>>> and it seems that RDMA_CM uses the SL from the SA. > >>>> So if you want to configure a specific SL, you need to do it via the SM. > >>>> > >>>> Doron > >>>> > >>>> -Original Message

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-06 Thread Doron Shoham
MA CM, I've checked in the > code and it seems that RDMA_CM uses the SL from the SA. > >>>> So if you want to configure a specific SL, you need to do it via the > SM. > >>>> > >>>> Doron > >>>> > >>>> -Original Mess

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-02 Thread Jeff Squyres
c SL, you need to do it via the SM. >>>> >>>> Doron >>>> >>>> -Original Message- >>>> From: Jeff Squyres [mailto:jsquy...@cisco.com] >>>> Sent: Thursday, February 24, 2011 3:45 PM >>>> To: Michael Shuey

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-02 Thread Michael Shuey
RDMA_CM uses the SL from the SA. >>> So if you want to configure a specific SL, you need to do it via the SM. >>> >>> Doron >>> >>> -----Original Message- >>> From: Jeff Squyres [mailto:jsquy...@cisco.com] >>> Sent: Thursday, February 24, 20

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-01 Thread Jeff Squyres
>> From: Jeff Squyres [mailto:jsquy...@cisco.com] >> Sent: Thursday, February 24, 2011 3:45 PM >> To: Michael Shuey >> Cc: Open MPI Users , Mike Dubman >> Subject: Re: [OMPI users] RoCE (IBoE) & OpenMPI >> >> On Feb 24, 2011, at 8:00 AM, Michael Shuey wrote

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-03-01 Thread Michael Shuey
squy...@cisco.com] > Sent: Thursday, February 24, 2011 3:45 PM > To: Michael Shuey > Cc: Open MPI Users , Mike Dubman > Subject: Re: [OMPI users] RoCE (IBoE) & OpenMPI > > On Feb 24, 2011, at 8:00 AM, Michael Shuey wrote: > >> Late yesterday I did have a chance to test t

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-24 Thread Jeff Squyres
On Feb 24, 2011, at 8:00 AM, Michael Shuey wrote: > Late yesterday I did have a chance to test the patch Jeff provided > (against 1.4.3 - testing 1.5.x is on the docket for today). While it > works, in that I can specify a gid_index, Great! I'll commit that to the trunk and start the process of

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-24 Thread Michael Shuey
Late yesterday I did have a chance to test the patch Jeff provided (against 1.4.3 - testing 1.5.x is on the docket for today). While it works, in that I can specify a gid_index, it doesn't do everything required - my traffic won't match a lossless CoS on the ethernet switch. Specifying a GID is o

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-23 Thread Jeff Squyres
On Feb 23, 2011, at 3:54 PM, Shamis, Pavel wrote: > I remember that I updated the trunk to select by default RDMACM connection > manager for RoCE ports - https://svn.open-mpi.org/trac/ompi/changeset/22311 > > I'm not sure if the change made its way to any production version. I don't > work on t

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-23 Thread Shamis, Pavel
Here is what OFA says: http://www.openfabrics.org/archives/spring2010sonoma/Wednesday/Liran%20Liss%20RoCE%20in%20OFED/rocee_update_liss.ppt

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-22 Thread Michael Shuey
Could you re-enable the SL param (btl_openib_ib_service_level) for RoCE? Jeff was kind enough to provide a patch to let me specify the gid_index, but that doesn't seem to be working. To get RoCE to work correctly (at least, on Nexus switches) I'll need to specify both a gid_index and an IB servic

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-21 Thread Jeff Squyres
Random thought: is there a check to ensure that the SL MCA param is not set in a RoCE environment? If not, we should probably add a show_help warning if the SL MCA param is set when using RoCE (i.e., that its value will be ignored). On Feb 19, 2011, at 12:22 AM, Shamis, Pavel wrote: > As far

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-19 Thread Shamis, Pavel
As far as I remember, we don't allow the user to specify the SL for RoCE. RoCE is considered kind of an Ethernet device, and the RDMACM connection manager is used to set up the connections. This means that in order to select network X or Y, you may use ip/netmask (btl_openib_ipaddr_include). Pavel (Pasha) Shamis -
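
A rough sketch of what ip/netmask selection like the one mentioned above amounts to. This is not Open MPI's code: the helper name `matches_include` and the sample addresses are made up for illustration, and it assumes the include list is a comma-separated set of address/prefix-length entries.

```python
import ipaddress

def matches_include(addr: str, include_list: str) -> bool:
    """Return True if addr falls inside any entry of a comma-separated
    ip/netmask list (hypothetical helper, for illustration only)."""
    ip = ipaddress.ip_address(addr)
    # strict=False lets an entry such as 192.168.10.7/24 stand for its network.
    networks = [
        ipaddress.ip_network(entry.strip(), strict=False)
        for entry in include_list.split(",")
        if entry.strip()
    ]
    return any(ip in net for net in networks)

# Made-up addresses: only the first interface sits on the selected network.
print(matches_include("192.168.10.5", "192.168.10.0/24"))
print(matches_include("10.0.0.1", "192.168.10.0/24, 172.16.0.0/16"))
```

With a filter like this, choosing "network X or Y" comes down to which subnet each RoCE port's IP interface belongs to, which is why it can stand in for some per-port selection but not for setting an SL.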

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-18 Thread Michael Shuey
Per-node GID & SL settings == bad. Site-wide GID & SL settings == good. If this could be an MCA param (like btl_openib_ib_service_level) that'd be great - we already have a global config file of similar params. We'd definitely want the same N everywhere. -- Mike Shuey On Fri, Feb 18, 2011 at

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-18 Thread Jeff Squyres
On Feb 18, 2011, at 1:39 PM, Michael Shuey wrote: > RoCE HCAs keep a GID table, like normal HCAs. Every time you bring up > a vlan interface, another entry gets automatically added to the table. > If I select one of these other GIDs, packets get a VLAN tag, and that > contains the necessary prior

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-18 Thread Michael Shuey
It's a little different in RoCE. There's no subnet manager, so (as near as I can tell) you don't really have a subnet ID. Instead, the GID = GUID + VLAN tag (more or less). gid[0] has special bits in the VLAN tag section, to indicate that packets relating to this GID don't get a VLAN tag. Unfor

Re: [OMPI users] RoCE (IBoE) & OpenMPI

2011-02-18 Thread Jeff Squyres
Greetings Mike. I'll answer today because Fri-Sat is the weekend in Israel (i.e., the MPI team at Mellanox won't see this until Sunday). I don't have a lot of experience with RoCE; do you need a different GUID or a different subnet ID? At least in IB, the GID = GUID + Subnet ID. The GUID sho
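
For the IB case, the "GID = GUID + Subnet ID" relation can be sketched as below: the 64-bit subnet prefix occupies the upper half of the 128-bit GID and the 64-bit port GUID the lower half. This is only an illustration, not Open MPI or verbs code; the function name `make_ib_gid` and the example GUID are made up, and the prefix shown is the IB default link-local one.

```python
import ipaddress

# Default (link-local) subnet prefix used in IB when none is configured.
IB_DEFAULT_SUBNET_PREFIX = 0xFE80000000000000

def make_ib_gid(subnet_prefix: int, guid: int) -> str:
    """Compose a 128-bit GID: subnet prefix in the upper 64 bits,
    GUID in the lower 64 bits, printed in IPv6-style notation."""
    if not (0 <= subnet_prefix < 2**64 and 0 <= guid < 2**64):
        raise ValueError("subnet prefix and GUID must each fit in 64 bits")
    gid = (subnet_prefix << 64) | guid
    return ipaddress.IPv6Address(gid).exploded

example_guid = 0x0002C90300001234  # made-up GUID for illustration
print(make_ib_gid(IB_DEFAULT_SUBNET_PREFIX, example_guid))
# prints fe80:0000:0000:0000:0002:c903:0000:1234
```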

[OMPI users] RoCE (IBoE) & OpenMPI

2011-02-18 Thread Michael Shuey
I've been looking into OpenMPI's support for RoCE (Mellanox's recent Infiniband-over-Ethernet) lately. While it's promising, I've hit a snag: RoCE requires lossless ethernet, and on my switches the only way to guarantee this is with CoS. RoCE adapters cannot emit CoS priority tags unless the clie