[ewg] OFED backport patch error

2011-02-21 Thread Todd Rimmer
FYI, I took the 1.5.3rc4 and attempted to ofed_patch for each of the supported 
backports.

For the 2.6.18-chaos backport, the following failed to apply:

kernel_patches/backport/2.6.18-chaos/core_4_show_res_to_xxx.patch^M
patching file drivers/infiniband/core/uverbs_main.c^M
Hunk #1 FAILED at 870.^M
Hunk #2 FAILED at 935.^M
2 out of 2 hunks FAILED -- saving rejects to file drivers/infiniband/core/uverbs
_main.c.rej^M


I went back and tried it with 1.5.3rc3 and observed the same error.

I do not have a 2.6.18-chaos system handy, so I can't verify if this failure of 
this backport is important to operation or builds for that kernel.

Todd Rimmer
Chief Architect
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com

This message and any attached documents contain information from QLogic 
Corporation or its wholly-owned subsidiaries that may be confidential. If you 
are not the intended recipient, you may not read, copy, distribute, or use this 
information. If you have received this transmission in error, please notify the 
sender immediately by reply e-mail and then delete this message.

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


[ewg] OFED 1.5.2rc4 fails to build on RHEL4 Update 8 x86_64

2010-08-18 Thread Todd Rimmer
I tried an install.pl --all on this distro.

The ibacm module fails to build with undefined references to 
__sync_fetch_and_add and __sync_fetch_and_sub.

I see these exist on newer distros, but they are not available on RHEL4.  ibacm 
is using this in its linux/osd.h file to define atomic_inc and atomic_dec.  

An alternative might be to use /usr/include/asm/atomic.h as this has atomic_inc 
and atomic_dec macros already defined.  I would also be more portable.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


Re: [ewg] OFED 1.5.2rc4 fails to build on RHEL4 Update 8 x86_64

2010-08-18 Thread Todd Rimmer
On my RHEL4 system, rpm -q --whatprovides /usr/include/asm/atomic.h

Indicates:
glibc-kernheaders-2.4-9.1.103.EL

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: Hefty, Sean [mailto:sean.he...@intel.com]
 Sent: Wednesday, August 18, 2010 4:16 PM
 To: Todd Rimmer
 Cc: OpenFabrics EWG
 Subject: RE: OFED 1.5.2rc4 fails to build on RHEL4 Update 8 x86_64
 
  An alternative might be to use /usr/include/asm/atomic.h as this has
  atomic_inc and atomic_dec macros already defined.  I would also be
 more
  portable.
 
 This file doesn't exist on my systems.

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


Re: [ewg] libibumad.so version bump in OFED 1.5.2

2010-07-14 Thread Todd Rimmer
 Is there a way to accomplish this feature without an ABI change?  Or 
 should ABI altering features be only included in Major releases (such 
 as OFED 1.6?)

 I urged about ABI change on the list when this patch was posted. And I don't 
 think that we can change the version back.

Since 1.5.2 is not yet GA, we always have the option of altering this patch and 
moving the version back.

It sounds like you too were concerned about this when the change went in, I 
suspect those on the list missed the ramifications to end users and ISVs of ABI 
changes during maintenance releases.  A situation like this will leave many 
ISVs unable to support OFED 1.5.2.

We've already seen that situation with OFED 1.4 - 1.5.  The various API and 
ABI changes impacted many ISVs, so there are still some ISVs which are stuck on 
OFED 1.4 and have not yet released OFED 1.5 support.  This of course is causing 
issues for customers which need OFED 1.5 features (such as newer kernel levels) 
but require a given ISV's software.

Let's not repeat that same mistake for OFED 1.5.1 - 1.5.2.

In contrast, other aspects of linux provide forward migration.  For example I 
can generally build a binary on RHEL4 and run it on RHEL5.  I can certainly 
build files on RHEL5.0 and run them on RHEL5.5.  Heck, I've got quite a few 
tools whose binaries were built on even earlier redhat versions and still work 
fine on RHEL5.5.   OFED applications should be no different.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: Sasha Khapyorsky [mailto:sashakv...@gmail.com] On Behalf Of Sasha
 Khapyorsky
 Sent: Tuesday, July 13, 2010 11:34 AM
 To: Todd Rimmer
 Cc: e...@openfabrics.org; Yevgeny Kliteynik; Tziporet Koren
 Subject: Re: libibumad.so version bump in OFED 1.5.2
 
 On 12:58 Wed 07 Jul , Todd Rimmer wrote:
  Is there a way to accomplish this feature without an ABI change?  Or
 should ABI altering features be only included in Major releases (such
 as OFED 1.6?)
 
 I urged about ABI change on the list when this patch was posted. And I
 don't think that we can change the version back.
 
  Keep in mind, that ABI changes mean that 3rd parties must rebuild and
 re-release their applications.  While this isn't so bad for OFED itself
 and IB vendors, it's a major issue for other 3rd party apps.
 Introducing a ABI change in a maintenance release just makes things
 that much more confusing.
 
  One alternative would be for OFED 1.5.2 to include both the .2 and .3
 versions of the library.
 
 We discussed such option - to have a package compat-libs or so, but as
 far as I can remember no one wanted to take responsibility to maintain
 this.
 
  Another alternative would be to design the structures so there is
 room for expansion or pointers to opaque datatypes for fields only used
 by the library.
 
 Currently it would mean API redesign and more version changes.
 
 Sasha
 
 
  Todd Rimmer
  Chief Architect
  QLogic Network Systems Group
  Voice: 610-233-4852 Fax: 610-233-4777
  todd.rim...@qlogic.com  www.QLogic.com
 
 
   -Original Message-
   From: Sasha Khapyorsky [mailto:sashakv...@gmail.com] On Behalf Of
 Sasha
   Khapyorsky
   Sent: Wednesday, July 07, 2010 9:34 AM
   To: Todd Rimmer
   Cc: e...@openfabrics.org
   Subject: Re: libibumad.so version bump in OFED 1.5.2
  
   Hi Todd,
  
   On 06:48 Tue 06 Jul , Todd Rimmer wrote:
I recently discovered that 3rd party applications built for OFED
 1.5
   will not run on OFED 1.5.2rc2.
   
The issue I found was that libibumad had changed from version 2
 to
   version 3.  Can you provide insight as to why this change was made?
   
In reviewing the source code and change logs there do not seem to
 be
   any changes which would impact the application API nor application
 ABI.
   
   
   
I diffed the code and the main change seems to be:
diff -r --exclude CVS ./src/src/umad.c-orig ./src/src/umad.c
161a162,166
   if (sys_read_string(port_dir, SYS_PORT_LINK_LAYER,
   port-link_layer, UMAD_CA_NAME_LEN)  0)
   /* assume IB by default */
   sprintf(port-link_layer, IB);

The above change might imply an update to an rpm dependency on
 ib-
   kernel versions, but should not result in a new library version.
 If
   the library version remained at 2, then applications built for OFED
 1.5
   would also run on OFED 1.5.2rc2 without recompile.
  
   This is not an only change. The whole commit is:
  
  
   commit 99d0901f3702e22c178352524af18ec567781d51
   Author: Yevgeny Kliteynik klit...@dev.mellanox.co.il
   Date:   Tue Mar 9 12:39:40 2010 +0200
  
   libibumad: added link_layer for RoCEE support and updated umad
   version
  
   Added link_layer field to umad_port_t.
   The field is implemented as char[].
   If the relevant file doesn't exist in sysfs, the link layer
   is IB. Otherwise

Re: [ewg] libibumad.so version bump in OFED 1.5.2

2010-07-07 Thread Todd Rimmer
Is there a way to accomplish this feature without an ABI change?  Or should ABI 
altering features be only included in Major releases (such as OFED 1.6?)

Keep in mind, that ABI changes mean that 3rd parties must rebuild and 
re-release their applications.  While this isn't so bad for OFED itself and IB 
vendors, it's a major issue for other 3rd party apps.  Introducing a ABI change 
in a maintenance release just makes things that much more confusing.

One alternative would be for OFED 1.5.2 to include both the .2 and .3 versions 
of the library.

Another alternative would be to design the structures so there is room for 
expansion or pointers to opaque datatypes for fields only used by the library.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: Sasha Khapyorsky [mailto:sashakv...@gmail.com] On Behalf Of Sasha
 Khapyorsky
 Sent: Wednesday, July 07, 2010 9:34 AM
 To: Todd Rimmer
 Cc: e...@openfabrics.org
 Subject: Re: libibumad.so version bump in OFED 1.5.2
 
 Hi Todd,
 
 On 06:48 Tue 06 Jul , Todd Rimmer wrote:
  I recently discovered that 3rd party applications built for OFED 1.5
 will not run on OFED 1.5.2rc2.
 
  The issue I found was that libibumad had changed from version 2 to
 version 3.  Can you provide insight as to why this change was made?
 
  In reviewing the source code and change logs there do not seem to be
 any changes which would impact the application API nor application ABI.
 
 
 
  I diffed the code and the main change seems to be:
  diff -r --exclude CVS ./src/src/umad.c-orig ./src/src/umad.c
  161a162,166
 if (sys_read_string(port_dir, SYS_PORT_LINK_LAYER,
 port-link_layer, UMAD_CA_NAME_LEN)  0)
 /* assume IB by default */
 sprintf(port-link_layer, IB);
  
  The above change might imply an update to an rpm dependency on ib-
 kernel versions, but should not result in a new library version.  If
 the library version remained at 2, then applications built for OFED 1.5
 would also run on OFED 1.5.2rc2 without recompile.
 
 This is not an only change. The whole commit is:
 
 
 commit 99d0901f3702e22c178352524af18ec567781d51
 Author: Yevgeny Kliteynik klit...@dev.mellanox.co.il
 Date:   Tue Mar 9 12:39:40 2010 +0200
 
 libibumad: added link_layer for RoCEE support and updated umad
 version
 
 Added link_layer field to umad_port_t.
 The field is implemented as char[].
 If the relevant file doesn't exist in sysfs, the link layer
 is IB. Otherwise, the content of link_layer file is used.
 
 The libibumad version is promoted.
 
 Signed-off-by: Yevgeny Kliteynik klit...@dev.mellanox.co.il
 Signed-off-by: Sasha Khapyorsky sas...@voltaire.com
 
 diff --git a/libibumad/include/infiniband/umad.h
 b/libibumad/include/infiniband/umad.h
 index 1f82183..f9da204 100644
 --- a/libibumad/include/infiniband/umad.h
 +++ b/libibumad/include/infiniband/umad.h
 @@ -116,6 +116,7 @@ typedef struct ib_user_mad {
  #define SYS_PORT_RATErate
  #define SYS_PORT_GUIDport_guid
  #define SYS_PORT_GID gids/0
 +#define SYS_PORT_LINK_LAYER  link_layer
 
  typedef struct umad_port {
   char ca_name[UMAD_CA_NAME_LEN];
 @@ -132,6 +133,7 @@ typedef struct umad_port {
   uint64_t port_guid;
   unsigned pkeys_size;
   uint16_t *pkeys;
 + char link_layer[UMAD_CA_NAME_LEN];
  } umad_port_t;
 
  typedef struct umad_ca {
 diff --git a/libibumad/libibumad.ver b/libibumad/libibumad.ver
 index 57cddbd..d36fbbc 100644
 --- a/libibumad/libibumad.ver
 +++ b/libibumad/libibumad.ver
 @@ -6,4 +6,4 @@
  # API_REV - advance on any added API
  # RUNNING_REV - advance any change to the vendor files
  # AGE - number of backward versions the API still supports
 -LIBVERSION=2:1:0
 +LIBVERSION=2:2:0
 diff --git a/libibumad/src/umad.c b/libibumad/src/umad.c
 index 277ae6b..d16e750 100644
 --- a/libibumad/src/umad.c
 +++ b/libibumad/src/umad.c
 @@ -159,6 +159,11 @@ static int get_port(char *ca_name, char *dir, int
 portnum, umad_port_t * port)
   if (sys_read_uint(port_dir, SYS_PORT_CAPMASK, port-capmask) 
 0)
   goto clean;
 
 + if (sys_read_string(port_dir, SYS_PORT_LINK_LAYER,
 + port-link_layer, UMAD_CA_NAME_LEN)  0)
 + /* assume IB by default */
 + sprintf(port-link_layer, IB);
 +
   port-capmask = htonl(port-capmask);
 
   if (sys_read_gid(port_dir, SYS_PORT_GID, gid)  0)
 
 
 Altering 'struct umad_port' effectively breaks ABI.
 
 Sasha
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


[ewg] libibumad.so version bump in OFED 1.5.2

2010-07-06 Thread Todd Rimmer
I recently discovered that 3rd party applications built for OFED 1.5 will not 
run on OFED 1.5.2rc2.

The issue I found was that libibumad had changed from version 2 to version 3.  
Can you provide insight as to why this change was made?

In reviewing the source code and change logs there do not seem to be any 
changes which would impact the application API nor application ABI.



I diffed the code and the main change seems to be:
diff -r --exclude CVS ./src/src/umad.c-orig ./src/src/umad.c
161a162,166
   if (sys_read_string(port_dir, SYS_PORT_LINK_LAYER,
   port-link_layer, UMAD_CA_NAME_LEN)  0)
   /* assume IB by default */
   sprintf(port-link_layer, IB);

The above change might imply an update to an rpm dependency on ib-kernel 
versions, but should not result in a new library version.  If the library 
version remained at 2, then applications built for OFED 1.5 would also run on 
OFED 1.5.2rc2 without recompile.




The ChangeLog simply says:

** Version: libibumad-1.3.4

Sun Dec 13 19:39:11 2009 +0200 Sasha Khapyorsky
2ebf19e61bb887b226017c0ac5a7f30ea0cd9e3f

* management: package versions bump

Sun Dec 13 19:36:19 2009 +0200 Sasha Khapyorsky
b7f0e58ada74b4da01a39a9c5fd2f5517d70d26a

* management: update library versions

Tue Nov 17 14:39:37 2009 -0800 Ira Weiny
f25da9f189638c1772d7c3cc5e1942b8d1e8abf8

* libibmad: libibumad: Print warnings and errors to stderr not stdout




Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


Re: [ewg] [PATCH] pkey fix for ipoib - resubmission

2010-06-18 Thread Todd Rimmer
Jason wrote:

 What happens if the SM reprograms the pkey table later?
 
 I'm not sure this is the right approach.. See the recent dicussion
 about using pkey indexes at all.
 
 Each ipoib interface is assigned a pkey, by number, not index. If the
 SM has not supplied that pkey to the port then the ipoib should not
 come up. If the SM adds it later then it should come up, if it the SM
 removes it later then it should go down.
 
 The default ipoib interface uses the default pkey. If your network
 need ipoib to use a different pkey then you have to change the pkey
 number assignment when the ipoib interface is created, not magically
 attempt to autoconfigure.
 
 I believe you cannot safely change the broadcast GID once the ipoib
 interface has started.

The goal for IPoIB should be a mechanism which can mimic the ease of setting up 
VLANs in Ethernet.  In Ethernet a default VLAN can be programmed into a switch 
in which case no configuration of the server is needed.  Ethernet switch 
vendors often provide centralized tools for setting up VLANS in all the 
switches.  It is only when you want a given server to participate in multiple 
VLANs that you need to setup specific configuration on the server to do so.

For IB, the centralized tool is the SM and a VLAN is an IB Partition.  The 
present capability of using the 1st PKey table entry is a nice simple and 
powerful way to mimic the ease of use of Ethernet VLANs.  Only when a user 
wants IPoIB to run on two different partitions (which is rare) is a server 
specific configuration needed.

When IPoIB sees a PKey table change, it will reregister with the broadcast GID. 
 PKey table changes typically also involve a restart of the SM and hence a 
reregistration request is necessary anyway.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


Re: [ewg] [PATCH] pkey fix for ipoib - resubmission

2010-06-18 Thread Todd Rimmer
Jason,

 Fixing the maddr table, and dealing with race issues with ongoing
 joins my not be straightforward, but, IMHO, necessary for this patch
 to be acceptable.
 

So you are pointing out an existing bug, which is no worse with this patch than 
without it.  Namely that pkey changes with IPv6 may have issues.  IPv4 pkey 
changes should be ok.

In any case, this patch still successfully addresses the most important 
requirement of having a rebooted server come up with the correct MGID.

In at least the HPC world, PKey changes are extremely rare, however having 
server's reliably boot is important.

 ipoib bonding had much the same problem with invalid maddrs, and a
 patch was put in that flushed the maddr table in certain bond
 scenarios. Perhaps something like that is a straightforward solution
 here as well?
Perhaps that could be a second patch which would build on top of this initial 
fix for the most important requirement.  It would seem that using the PKEY 
change and/or re-registration HCA events to trigger a maddr flush in IPoIB 
would be straightforward.  If that approach worked well for bonding, it should 
work equally well in these cases.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ewg] RE: OFED libraries - on the web at last

2009-06-17 Thread Todd Rimmer
 From: Sasha Khapyorsky [mailto:sashakv...@gmail.com] On Behalf Of Sasha
 Khapyorsky
 Sent: Wednesday, June 17, 2009 11:32 AM
 To: Todd Rimmer
 Cc: Tziporet Koren; ewg@lists.openfabrics.org
 Subject: Re: [ewg] RE: OFED libraries - on the web at last
 
 On 08:14 Wed 17 Jun , Todd Rimmer wrote:
 
  The issue isn't with the management tree.  It's with the mvapich
 included in OFED 1.4.1 and previous versions.
 
 Then it should be fixed.
[Todd Rimmer] Agreed, but customers will need a few releases of notice that 
they should recompile their apps.

 
  It links with libibcommon and as a result,
 
 Do you know why exactly was it linked (and how)? I searched over
 mvapich
 source and didn't find any references in sources and makefiles. It is
 only mentioned in spec files. So is there any issue really?
[Todd Rimmer] Yes, the .spec file controls what libraries mvapich uses, that 
same list of libraries is then part of mpicc and mpif77 and used to link all 
applications built with mvapich.

 
  ALL end user and ISV apps linked with mvapich also link with
 libibcommon.
 
  This means that in moving to OFED 1.5 end users and ISVs would have
 to recompile all their apps.  Such a change should not be taken
 lightly.  Instead we should provide what is needed in OFED 1.5 such
 that existing app binaries will continue to work (might be as simple as
 stubbed library).  Then we can issue notice to end users and ISVs that
 their apps will need to be rebuilt and allow them at least 1 release to
 accomplish that.
 
 
 AFAIR libibcommon removal was discussed on the list couple of times
 before, during and after OFED-1.4.
[Todd Rimmer] Yes, but unfortunately it didn't come into practice in mvapich in 
OFED 1.4.1 or previous.

ABI compatibility is an important requirement.  In this case the ABI is that of 
applications built with mvapich.  Is it ideal, NO.  But we can't penalize 
customers.  I suspect a dummy library would satisfy the mvapich need, but that 
would need to be confirmed (FYI, OFED 1.4.1 libibumad also references ibcommon).

 
 Sasha
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ewg] RE: OFED libraries - on the web at last

2009-06-11 Thread Todd Rimmer
 You are correct and this is actually old link. MPI will not use it in
 1.5, since libibcommon was removed from the new management tree.

I'm not sure what you mean by this statement.  It's good that MPI in 1.5 will 
not use it.

However, if we remove libibcommon from the library path in OFED 1.5 that would 
break existing applications.  The fact that user apps have ended up being 
linked with it implies we must be very careful how we handle libibcommon.  A 
case in point will be ISVs which have built against an older version of MPI and 
hence have this dependency in their application.

Someone should see what routines in libibcommon (if any) MPI is using.  A 
simple answer in OFED 1.5 might be to include a subset/dummy/empty libibcommon 
library to satisfy the application dependency.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: Tziporet Koren [mailto:tzipo...@dev.mellanox.co.il]
 Sent: Thursday, June 11, 2009 5:15 AM
 To: Todd Rimmer
 Cc: ewg@lists.openfabrics.org
 Subject: Re: [ewg] RE: OFED libraries - on the web at last
 
 Todd Rimmer wrote:
 
  What does this category mean:
 
  Private but used by some apps:
 
 
 
 This means that although  these libraries were designed as private some
 apps/libs used them.
 We found this when we did this list.
 The apps should be changed not to use these libraries but this is the
 situation in 1.4.x
 This is the reason we publish such a list - so new apps will use only
 public libraries and will not count on any private library
 
  I see that libibcommon is in this category, however I have observed
  that mvapich and mvapich2, link with this library.  Which in turn
  means that all MPI user applications built with mvapich or mvapich2
  also link with it.
 
 You are correct and this is actually old link. MPI will not use it in
 1.5, since libibcommon was removed from the new management tree.
 
 Tziporet
 

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ewg] RE: OFED libraries - on the web at last

2009-06-11 Thread Todd Rimmer
 libibcommon was removed from the new management package that will be
 part of OFED 1.5
 The link in MPI was old and actually MPI does not use this library any
 more for a long time

I beg to differ.  I downloaded OFED 1.4.1 and it includes 
mvapich-1.1.0-3355.src.rpm

Looking in that rpm, the mvapich.spec file requires ibcommon and uses it.
If I build any apps with that version of mvapich, the apps end up with ibcommon 
in their ldd dependencies.

At a more generic level, the golden rule should be to not require end users (or 
ISVs) to rebuild their apps.  In this case it will be necessary for OFED 1.5 to 
include a libibcommon file which meets whatever requirements mvapich had.  I 
see the use of ibcommon in mvapich.spec is mentioned as a 2007 addition to work 
on SLES10.  Not sure what APIs in libibcommon (if any) were needed.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: Tziporet Koren [mailto:tzipo...@dev.mellanox.co.il]
 Sent: Thursday, June 11, 2009 11:09 AM
 To: Todd Rimmer
 Cc: tzipo...@dev.mellanox.co.il; ewg@lists.openfabrics.org
 Subject: Re: [ewg] RE: OFED libraries - on the web at last
 
 Todd Rimmer wrote:
  You are correct and this is actually old link. MPI will not use it
 in
  1.5, since libibcommon was removed from the new management tree.
 
 
  I'm not sure what you mean by this statement.  It's good that MPI in
 1.5 will not use it.
 
 libibcommon was removed from the new management package that will be
 part of OFED 1.5
 The link in MPI was old and actually MPI does not use this library any
 more for a long time
 
 Tziporet
 

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ofa-general] Re: [ewg] RFC: Do we wish to take MPI out of OFED?

2009-06-08 Thread Todd Rimmer
I agree with DK from OSU.  There are clear advantages to having MPI included 
with OFED.  Not only will it make testing of a complete solution easier by both 
OFED and MPI suppliers, but it will also improve ease of use for end users.  As 
DK points out there are continual improvements in MPIs which may depend on bug 
fixes and/or new features in newer versions of OFED.  Identifying a known good 
combination will be important to most end users, etc.

I would point out that the OFED API seems to be holding up quite well.  
Applications which we built against OFED 1.3 have worked in binary form against 
OFED 1.4 and 1.4.1 without issue, in fact there are many cases where we build 
binaries for applications against OFED 1.3 and ship the same binary for use on 
both old and new OFED versions.

However MPI has some differences in this area.  In the constant search for 
better latency and bandwidth, there will be ongoing development and tuning.  So 
in many cases the best performance for MPI will require the latest MPI and the 
latest OFED.  This does not necessarily mean the API is broken, but rather it 
means a combination has been well tuned and tested.  In most if not all cases, 
other combinations will work well, but may not achieve the same performance 
level.  This is simply the nature of the HPC marketplace.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: general-boun...@lists.openfabrics.org [mailto:general-
 boun...@lists.openfabrics.org] On Behalf Of Dhabaleswar Panda
 Sent: Thursday, June 04, 2009 2:02 PM
 To: Tziporet Koren
 Cc: EWG; OpenFabrics General
 Subject: [ofa-general] Re: [ewg] RFC: Do we wish to take MPI out of
 OFED?
 
 Tziporet,
 
 Main reasons to keep MPI in OFED:
 - All participants test with the same MPI versions, and when
 installing
 OFED it is ensured that MPI will work fine with this version.
 - Customers convenience in install (no need to go to more sites to get
 MPI)
 - MPI is an important RDMA ULP and although it is not developed in OFA
 it is widely used by OFED customers
 
 I support keeping MPI packages in the OFED because of the above
 positive
 points you have mentioned.
 
 I would also like to mention that keeping MPI packages in OFED helps to
 test out various new features and functionalities (such as APM and XRC
 in
 the past and the new memory registration scheme being discussed now) as
 they get introduced. Such an integrated approach helps to test out
 these
 features at the lower layers as well as at the MPI layer. This process
 helps to resolve out any bugs with the new features during the testing
 process itself. It also accelerates the deployment and use of these new
 features in the community.
 
 However, to make the complete OFED release process work smoothly for
 everybody (vendors, distros, users, etc.) without affecting the release
 schedule, it is essential that stable MPI packages are added to OFED.
 This
 is what we have been doing wrt MVAPICH and MVAPICH2 for the last
 several
 years.
 
 If the developers of any MPI package do not want it to be a part of the
 OFED due to any constraints, it should be allowed. However, such an
 action
 should not force to remove all MPI packages.
 
 From the point of view of MVAPICH and MVAPICH2 packages in OFED, we
 have
 been providing stable packages to OFED for the last several years
 helping
 the OFED community and would like to continue with this process.
 
 Thanks,
 
 DK
 
 
 
 ___
 general mailing list
 gene...@lists.openfabrics.org
 http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
 
 To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-
 general
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ewg] shipping firmware with ofed

2009-03-03 Thread Todd Rimmer
I have encountered many customers whose clusters are purposely not on the 
internet and hence could not perform a download.

It would be best if the FW was included in the release.  I think there is 
precedence with other Linux drivers which have included binary images of ASIC 
FW without having to release the source code.

Todd Rimmer
Chief Architect 
QLogic Network Systems Group
Voice: 610-233-4852 Fax: 610-233-4777
todd.rim...@qlogic.com  www.QLogic.com
 

 -Original Message-
 From: ewg-boun...@lists.openfabrics.org [mailto:ewg-
 boun...@lists.openfabrics.org] On Behalf Of John Russo
 Sent: Tuesday, March 03, 2009 1:19 PM
 To: Woodruff, Robert J; Steve Wise; tzipo...@dev.mellanox.co.il
 Cc: OpenFabrics EWG
 Subject: RE: [ewg] shipping firmware with ofed
 
 How about something along the lines of a firmware update tool built into
 OFED that checks the HCA for the proper firmware and gives the user to
 download the current version.  It could automatically pull from Mellanox's
 site and therefore avoid the licensing issue while still providing a
 customer service.
 
 -Original Message-
 From: ewg-boun...@lists.openfabrics.org [mailto:ewg-
 boun...@lists.openfabrics.org] On Behalf Of Woodruff, Robert J
 Sent: Tuesday, March 03, 2009 1:17 PM
 To: Steve Wise; tzipo...@dev.mellanox.co.il
 Cc: OpenFabrics EWG
 Subject: RE: [ewg] shipping firmware with ofed
 
 Steve Wrote,
 
 I request that we do add this.  Forcing customers to download firmware
 isn't very user friendly.
 
 It should be fairly easy to do, yes?  I would take on the work involved
 to add the infrastructure needed...
 
 All that would be needed, I think, is to make a src rpm that allows
 generating an rpm that will install the firmware in /lib/firmware.  Each
 provider could manage their own.  Or we could have a firmware src rpm
 that holds all the providers' firmware images...
 
 Thoughts?
 
 Steve.
 
 
 If people submit firmware to ofed, then would it have to be submitted
 under the dual BSD/GPL license as agreed to in the OFA bylaws. If so,
 then you would probably have to submit the firmware source code as well,
 and I am not sure if you are willing to do that...
 
 woody
 ___
 ewg mailing list
 ewg@lists.openfabrics.org
 http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
 ___
 ewg mailing list
 ewg@lists.openfabrics.org
 http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


RE: [ewg] SLES10 X86_64 build problem found with 1.3rc4

2008-02-20 Thread Todd Rimmer


Todd Rimmer
Chief Architect 
QLogic System Interconnect Group
Voice: 610-233-4852 Fax: 610-233-4777
[EMAIL PROTECTED]  www.QLogic.com
 

 From: Sasha Khapyorsky [mailto:[EMAIL PROTECTED]
 
 Currently I cannot reproduce it with RHEL. Will try to find SLES10...

[Todd Rimmer] It seems to only occur on SLES10, it worked fine on RHEL5
x86_64.  I have not yet tried SLES10 ppc64

 
 
  FYI, how do I go about getting a bugzilla account so I can enter
  problems like this directly into bugzilla?
 
 https://bugs.openfabrics.org/createaccount.cgi
[Todd Rimmer] Thanks I have created an account and entered PR 942 to
track this problem.

Todd Rimmer
___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg