Re: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-16 Thread Edward Mascarenhas

The SGI Altix ICE cluster system supports 2 InfiniBand fabrics:
http://www.sgi.com/products/servers/altix/ice/
Each compute node has 2 HCAs, each connected to a separate
fabric. We recommend that users use one fabric for storage traffic and
the other for MPI, but there is no reason why both fabrics could not
be used for MPI. To use both fabrics for MPI, Open MPI requires a
separate subnet prefix on each fabric, and OpenSM supports setting
the subnet prefix. Other MPIs do not require this.
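
As a rough illustration of why distinct prefixes matter to an MPI library, the sketch below groups a node's ports by the subnet prefix taken from each port's GID (the upper 64 bits). The sysfs path layout and all function names are assumptions made for this example; nothing here is taken from Open MPI or OpenSM. With the default prefix fe80:0000:0000:0000 on both fabrics, every port lands in the same group and the fabrics cannot be told apart.

```python
from collections import defaultdict
from pathlib import Path

# Assumption: the Linux IB stack exposes each port's GID table under
# /sys/class/infiniband/<device>/ports/<port>/gids/<index>.
SYSFS_ROOT = Path("/sys/class/infiniband")


def subnet_prefix_of(gid: str) -> str:
    """Return the upper 64 bits of a colon-separated GID string."""
    groups = gid.strip().lower().split(":")
    if len(groups) != 8:
        raise ValueError(f"unexpected GID format: {gid!r}")
    return ":".join(groups[:4])


def group_ports_by_prefix(port_gids):
    """Map subnet prefix -> sorted list of (device, port) pairs."""
    fabrics = defaultdict(list)
    for port, gid in port_gids.items():
        fabrics[subnet_prefix_of(gid)].append(port)
    return {prefix: sorted(ports) for prefix, ports in fabrics.items()}


def read_local_port_gids(root: Path = SYSFS_ROOT):
    """Collect GID index 0 of every local port (empty dict if none found)."""
    gids = {}
    if not root.is_dir():
        return gids
    for gid_file in root.glob("*/ports/*/gids/0"):
        # Path ends in .../<device>/ports/<port>/gids/0
        device = gid_file.parts[-5]
        port = int(gid_file.parts[-3])
        gids[(device, port)] = gid_file.read_text().strip()
    return gids
```

Calling `group_ports_by_prefix(read_local_port_gids())` on a node whose two fabrics have different prefixes would yield two groups, one per fabric; with identical prefixes it yields a single group.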


Edward


on 04/04/2008 08:08 AM Tang, Changqing said the following:

What I mean by "claim to support" is to have more people test with this config.

--CQ



___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg



RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-04 Thread Tang, Changqing

What I mean by "claim to support" is to have more people test with this config.

--CQ

 -Original Message-
 From: Or Gerlitz [mailto:[EMAIL PROTECTED]
 Sent: Thursday, April 03, 2008 11:18 PM
 To: Tang, Changqing
 Cc: [EMAIL PROTECTED]; ewg@lists.openfabrics.org
 Subject: Re: [ofa-general] Re: [ewg] OFED March 24 meeting
 summary on OFED 1.4 plans

 On Thu, Apr 3, 2008 at 5:40 PM, Tang, Changqing
 [EMAIL PROTECTED] wrote:

   The problem is, from MPI side, (and by default), we don't know which
   port is on which fabric, since the subnet prefix is the same. We rely
   on system admin to config two different subnet prefixes for HP-MPI
   to work.

   No vendor has claimed to support this.

 CQ, not supporting a different subnet prefix per IB subnet is
 against IB nature. I don't think there should be any problem
 configuring a different prefix at each OpenSM instance, and
 the Linux host stack would work perfectly under this config.
 If you are aware of any problem in OpenSM and/or the
 host stack, please let the community know and the maintainers
 will fix it.

 Or.



RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-04 Thread Tang, Changqing
   For example, in MPI, process A knows the HCA GUID on another node.
   After running for some time, the switch is restarted for some
   reason, and the whole fabric is re-configured.


 CQ,

 If by "the whole fabric is re-configured" you refer to a case
 where a subnet prefix changes while a job runs and a process
 is detached/reattached to the job, so now you want to adapt
 your design to handle it, that is over-engineering. Why do you
 want to do that?


I am concerned about the port LID change. It is always best if a process can
figure out the info it needs by itself; an SA query is the right way and is in
the IB spec.

While it is possible to let processes exchange information (port LID) again,
there are difficulties: in the middle of a long job run, it is hard to get two
processes to coordinate such an information exchange, and it requires a second
channel to do so. If the second channel is IPoIB, it is broken as well, and we
need to re-establish it.

I am just asking for the SA functionality. If it is not possible, we have to
use a very complicated way to let HP-MPI survive a network failure.
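
To make the fallback concrete: if processes have to re-exchange port LIDs themselves, each one ends up comparing the table it cached at startup with a refreshed one. The sketch below shows only that comparison. How the refreshed table is obtained (an SA query, or a second channel) is exactly the open question in this thread, so `refreshed` is a hypothetical input, `changed_lids` is a name invented for this example, and it is not HP-MPI code.

```python
def changed_lids(cached, refreshed):
    """Return {guid: (old_lid, new_lid)} for peers whose port LID differs.

    Both arguments map a port GUID (int) to a LID (int). A peer missing
    from `refreshed` is reported with new_lid None (unknown/unreachable).
    """
    changes = {}
    for guid, old_lid in cached.items():
        new_lid = refreshed.get(guid)
        if new_lid != old_lid:
            changes[guid] = (old_lid, new_lid)
    return changes
```

Only the peers returned here would need their connections re-established; peers whose LID is unchanged can be left alone.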


--CQ



 Or.



RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Hal Rosenstock
On Thu, 2008-04-03 at 14:40 +, Tang, Changqing wrote:
 Say you have a system in which all HCAs have two ports: every port 1 is
 connected to the first switch, every port 2 is connected to the second
 switch, and there is NO link between the two switches. We say this system
 has two physically separated fabrics. If you have a bridge link between
 the two switches, then it becomes a single fabric.

 The same applies to nodes with multiple HCAs.

 The problem is that, from the MPI side (and by default), we don't know
 which port is on which fabric, since the subnet prefix is the same. We rely
 on the system admin to configure two different subnet prefixes for HP-MPI
 to work.

Yes, these two IB subnets need two different subnet prefixes. (I think
it's more than just HP MPI which needs this).

-- Hal

 No vendor has claimed to support this.
 
 --CQ
 
  -Original Message-
  From: Hal Rosenstock [mailto:[EMAIL PROTECTED]
  Sent: Thursday, April 03, 2008 9:32 AM
  To: Tang, Changqing
  Cc: Erez Zilber; Tziporet Koren; ewg@lists.openfabrics.org;
  [EMAIL PROTECTED]
  Subject: RE: [ofa-general] Re: [ewg] OFED March 24 meeting
  summary on OFED 1.4 plans
 
  CQ,
 
  On Thu, 2008-04-03 at 14:27 +, Tang, Changqing wrote:
   Can we address multiple-fabrics (physically separated) support ?
 
  Can you elaborate on what you mean by physically separated ?
 
  -- Hal
 
  
  
   --CQ Tang
  
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Erez
Zilber
Sent: Thursday, April 03, 2008 8:51 AM
To: Tziporet Koren
Cc: ewg@lists.openfabrics.org; [EMAIL PROTECTED]
Subject: [ofa-general] Re: [ewg] OFED March 24 meeting summary on
OFED 1.4 plans
   

 *OFED 1.4:*
 1. Kernel base: since we target 1.4 release to Sep we target the
 kernel base to be 2.6.27
 This is a good target, but we may need to stay with 2.6.26 if the
 kernel progress will not be aligned.

 2. Suggestions for new features:

 * NFS-RDMA
 * Verbs: Reliable Multicast (to be presented at Sonoma)
 * SDP - Zero copy (There was a question on IPv6 support - seems no
   one interested for now)
 * IPoIB - continue with performance enhancements
 * Xsigo new virtual NIC
 * New vendor HW support - none was reported so far (IBM and Chelsio
   - do you have something?)
 * OpenSM:
   o Incremental routing
   o Temporary SA DB - to answer queries and a heavy sweep is done
   o APM - disjoint paths (?)
   o MKey manager (?)
   o Sasha to send more management features
 * MPI:
   o Open MPI 1.3
   o APM support in MPI
   o mvapich ???
 * uDAPL
   o Extensions for new APIs (like XRC) - ?
   o uDAPL provider for interop between Windows and Linux
   o 1.2 and 2.0 will stay

As I wrote in an earlier discussion (~2 months ago), we plan to add
tgt (SCSI target) with iSCSI over iSER (and TCP of course) support.
The git tree for tgt already exists on the ofa server.

Erez
   
___
general mailing list
[EMAIL PROTECTED]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
   
To unsubscribe, please visit
http://openib.org/mailman/listinfo/openib-general
   


RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Hal Rosenstock
On Thu, 2008-04-03 at 14:53 +, Tang, Changqing wrote:
 One other thing I hope to discuss is some fabric query functionality for
 the normal user, not just for root. This is at the IB verbs level, not the
 rdma_cm level.

 For example, in MPI, process A knows the HCA GUID on another node. After
 running for some time, the switch is restarted for some reason, and the
 whole fabric is re-configured.

 Now process A wants to know whether the port LID on the other node has
 changed. It knows the HCA GUID; is there any function to query this?

 I know that as root we can use the mad/umad library to do this kind of
 query; I want to do such a query in MPI, which runs as a normal user.

In the IB arch, there are SA registrations and queries for the specific
example you used. However, these are not directly exposed to Linux user
space (for the normal user as opposed to the MAD user; note there are
some difficulties in making this available to the normal user), at
least not yet AFAIK. While these are not (direct) fabric queries (really
SA queries), they serve the same function in a different way.
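
For contrast, what an unprivileged process can typically see is the state of its own ports. As a hedged sketch (the sysfs path is an assumption about the Linux IB stack layout, and the function names are invented for this example), the local port LID could be read as shown below. It says nothing about a remote port, which is the part that needs the SA.

```python
from pathlib import Path


def parse_lid(text: str) -> int:
    """Parse a LID written as a hex string, e.g. '0x1c'."""
    return int(text.strip(), 16)


def read_local_lid(device: str, port: int,
                   root: Path = Path("/sys/class/infiniband")):
    """Return the local port's LID, or None if the sysfs entry is absent."""
    # Assumed layout: <root>/<device>/ports/<port>/lid
    lid_file = root / device / "ports" / str(port) / "lid"
    if not lid_file.is_file():
        return None
    return parse_lid(lid_file.read_text())
```

A process could poll this value (or watch for a port event) to notice that its own LID changed after a fabric reconfiguration, and then re-advertise it to its peers.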

-- Hal

 --CQ Tang, HP-MPI
 
 
 


RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Tang, Changqing

Thanks. When can we have the SA features: very soon, in a long time, or never?


--CQ



RE: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Hal Rosenstock
On Thu, 2008-04-03 at 15:11 +, Tang, Changqing wrote:
 Thanks. When can we have the SA features, very soon, long time, or never ?

I'm unaware of any current plans to implement these but my knowledge is
far from complete...

-- Hal

 --CQ
 


Re: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Or Gerlitz
On Thu, Apr 3, 2008 at 5:40 PM, Tang, Changqing [EMAIL PROTECTED] wrote:

  The problem is, from MPI side, (and by default), we don't know which port is
  on which fabric, since the subnet prefix is the same. We rely on system admin
  to config two different subnet prefixes for HP-MPI to work.

  No vendor has claimed to support this.

CQ, not supporting a different subnet prefix per IB subnet is against
IB nature. I don't think there should be any problem configuring a
different prefix at each OpenSM instance, and the Linux host stack
would work perfectly under this config. If you are aware of any
problem in OpenSM and/or the host stack, please let the community
know and the maintainers will fix it.

Or.


Re: [ofa-general] Re: [ewg] OFED March 24 meeting summary on OFED 1.4 plans

2008-04-03 Thread Or Gerlitz
On Thu, Apr 3, 2008 at 5:53 PM, Tang, Changqing [EMAIL PROTECTED] wrote:
  For example, in MPI, process A knows the HCA GUID on another node. After
  running for some time, the switch is restarted for some reason, and the
  whole fabric is re-configured.


CQ,

If by "the whole fabric is re-configured" you refer to a case where a
subnet prefix changes while a job runs and a process is
detached/reattached to the job, so now you want to adapt your design
to handle it, that is over-engineering. Why do you want to do that?

Or.