[openib-general] [Bug 17] [CHECKER] NULL deref in drivers/infiniband/ulp/srp/ib_srp.c

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=17 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #2 from [EMAIL

[openib-general] [Bug 12] [CHECKER] drivers/infiniband/ulp/ipoib/ipoib_main.c: confusion over NULL pointer

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=12 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #2 from [EMAIL

[openib-general] [Bug 11] [CHECKER] Return value of idr_find not checked for NULL

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=11 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #3 from [EMAIL

[openib-general] [Bug 8] [CHECKER] Leak in drivers/infiniband/core/sysfs.c:alloc_group_attrs

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=8 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #3 from [EMAIL P

[openib-general] [Bug 10] [CHECKER] NULL deref in drivers/infiniband/core/ucm.c:ib_ucm_event_process

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=10 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #2 from [EMAIL

[openib-general] [Bug 9] [CHECKER] Fencepost error in drivers/infiniband/core/sysfs.c:state_show

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=9 [EMAIL PROTECTED] changed: What|Removed |Added Status|RESOLVED|CLOSED --- Comment #2 from [EMAIL P

[openib-general] OFED 1.0-rc6 tarball available with working ipath driver

2006-06-09 Thread Bryan O'Sullivan
Due to unfortunate timing, the ipath driver in OFED 1.0-rc6 does not work correctly. You can download an updated tarball from here, for which the ipath driver works fine: http://openib.red-bean.com/OFED-1.0-rc6+ipath.tar.bz2 Alternatively, pull the necessary patches from SVN. http://ope

Re: [openib-general] [openfabrics-ewg] OFED-1.0-rc6 is available

2006-06-09 Thread Woodruff, Robert J
Betzy wrote, >Woody - The short answer is yes - Bryan has created patches in >the subversion tree, which will install on top of what Tziporet >pulled from Roland's tree. These will be in the 1.0 release (and, >we will be testing an early version of that on Monday). We've >tested the ipath driver c

Re: [openib-general] [openfabrics-ewg] OFED-1.0-rc6 is available

2006-06-09 Thread Betsy Zeller
Woody - The short answer is yes - Bryan has created patches in the subversion tree, which will install on top of what Tziporet pulled from Roland's tree. These will be in the 1.0 release (and, we will be testing an early version of that on Monday). We've tested the ipath driver code pretty thorough

[openib-general] [PATCH] uDAPL openib_cma, cleanup reported CM error events, add TIMEOUT

2006-06-09 Thread Arlin Davis
James, I cleaned up the connection error events to report the proper events during address resolution errors and timeouts. It was returning incorrect DAT event codes. -arlin Signed-off by: Arlin Davis <[EMAIL PROTECTED]> Index: dapl_ib_cm.c ===

Re: [openib-general] [PATCH 0/5] multicast abstraction

2006-06-09 Thread Sean Hefty
Sean Hefty wrote: > This patch series enhances support for joining and leaving multicast groups, > providing the following functionality: > > 1. Users identify a multicast group by a multicast IP address. > 2. A user binds to a local RDMA device based on resolving the IP address. > 3. A new multic

[openib-general] [PATCH 5/5] ucma: export multicast suport to userspace

2006-06-09 Thread Sean Hefty
Expose multicast abstraction through the CMA to userspace. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> --- --- svn3/gen2/trunk/src/linux-kernel/infiniband/include/rdma/rdma_user_cm.h 2006-06-06 16:53:46.0 -0700 +++ svn/gen2/trunk/src/linux-kernel/infiniband/include/rdma/rdma_user_cm

[openib-general] [PATCH 4/5] rdma cm: add support to join / leave multicast groups

2006-06-09 Thread Sean Hefty
Add IB multicast abstraction to the CMA. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> --- --- svn3/gen2/trunk/src/linux-kernel/infiniband/include/rdma/rdma_cm.h 2006-06-06 16:53:56.0 -0700 +++ svn/gen2/trunk/src/linux-kernel/infiniband/include/rdma/rdma_cm.h 2006-06-02 10:22:29.0

Re: [openib-general] [PATCH] uDAPL openib-cma provider - add support for IB_CM_REQ_OPTIONS

2006-06-09 Thread James Lentini
On Fri, 9 Jun 2006, Arlin Davis wrote: > James Lentini wrote: > > > On Thu, 8 Jun 2006, Jack Morgenstein wrote: > > > > > > > On Wednesday 07 June 2006 18:26, James Lentini wrote: > > > > > > > On Wed, 7 Jun 2006, Jack Morgenstein wrote: > > > > > > > > > This (bug fix) can still b

Re: [openib-general] Failed multicast join withnew multicast module

2006-06-09 Thread Hal Rosenstock
On Fri, 2006-06-09 at 17:18, Sean Hefty wrote: > Hal Rosenstock wrote: > > What does mesh mean in this instance ? How do you know the multicast > > routing tables are indeed valid and that the SM didn't corrupt them ? > > (Why did the SM need restarting ?) > > I meant that the values agree with ea

Re: [openib-general] [PATCH] uDAPL openib-cma provider - add support for IB_CM_REQ_OPTIONS

2006-06-09 Thread Arlin Davis
James Lentini wrote: >On Thu, 8 Jun 2006, Jack Morgenstein wrote: > > > >>On Wednesday 07 June 2006 18:26, James Lentini wrote: >> >> >>>On Wed, 7 Jun 2006, Jack Morgenstein wrote: >>> >>> This (bug fix) can still be included in next-week's release, if you think it is important

[openib-general] [PATCH 3/5] sa_query: add call to initialize ah_attr from an mcmember record

2006-06-09 Thread Sean Hefty
Export a call to initialize an ib_ah_attr structure based on an MCMemberRecord returned from a multicast join request. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> --- --- svn3/gen2/trunk/src/linux-kernel/infiniband/include/rdma/ib_sa.h 2006-06-06 15:21:05.0 -0700 +++ svn/gen2/trunk/s

[openib-general] [PATCH 2/5] multicast: allow retrieving an MCMemberRecord based on MGID

2006-06-09 Thread Sean Hefty
Add an API to allow retrieving an MCMemberRecord from the local cache based on an MGID. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> --- This allows an existing MCMemberRecord to be used as a template for creating other multicast groups. --- svn3/gen2/trunk/src/linux-kernel/infiniband/include/rd

[openib-general] [PATCH 1/5] ib_addr: retrieve MGID from device address

2006-06-09 Thread Sean Hefty
Extract the MGID used by ipoib for broadcast traffic from the device address. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> --- This will be used to get the MCMemberRecord for the ipoib broadcast group. --- svn3/gen2/trunk/src/linux-kernel/infiniband/include/rdma/ib_addr.h 2006-05-25 11:18:47.0

[openib-general] [PATCH 0/5] multicast abstraction

2006-06-09 Thread Sean Hefty
This patch series enhances support for joining and leaving multicast groups, providing the following functionality: 1. Users identify a multicast group by a multicast IP address. 2. A user binds to a local RDMA device based on resolving the IP address. 3. A new multicast group is created. The par

Re: [openib-general] Failed multicast join withnew multicast module

2006-06-09 Thread Sean Hefty
Hal Rosenstock wrote: > What does mesh mean in this instance ? How do you know the multicast > routing tables are indeed valid and that the SM didn't corrupt them ? > (Why did the SM need restarting ?) I meant that the values agree with each other, and there are no conflicts. > The MLID is suppli

Re: [openib-general] Failed multicast join withnew multicast module

2006-06-09 Thread Hal Rosenstock
On Fri, 2006-06-09 at 16:35, Sean Hefty wrote: > Hal Rosenstock wrote: > > The other issue is whether you trust the state of the network or not > > when the SM comes up. That's sometimes a dangerous proposition. > > I considered this, but I think there's a difference between trusting one of > the

[openib-general] [Bug 122] mad layer problem

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=122 [EMAIL PROTECTED] changed: What|Removed |Added Status|NEW |ASSIGNED --- You are receiving

[openib-general] [Bug 126] New: RDMA_CM and UCM not loaded on boot

2006-06-09 Thread bugzilla-daemon
http://openib.org/bugzilla/show_bug.cgi?id=126 Summary: RDMA_CM and UCM not loaded on boot Product: OpenFabrics Linux Version: 1.0rc6 Platform: Other OS/Version: Other Status: NEW Severity: normal Priority: P2

[openib-general] Re: Failed multicast join withnew multicast module

2006-06-09 Thread Sean Hefty
Hal Rosenstock wrote: The other issue is whether you trust the state of the network or not when the SM comes up. That's sometimes a dangerous proposition. I considered this, but I think there's a difference between trusting one of the systems on the network, versus the network as a whole. For

[openib-general] RE: [openfabrics-ewg] OFED-1.0-rc6 is available

2006-06-09 Thread Woodruff, Robert J
Is there any plan to release an RC6 package (or an RC7) that has a Pathscale driver that compiles on RHEL4 - U3 that we can test before the release ?   woody   From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Tziporet KorenSent: Wednesday, June 07, 2006 7:59 AMTo: Tziporet K

[openib-general] Re: Failed multicast join withnew multicast module

2006-06-09 Thread Hal Rosenstock
On Fri, 2006-06-09 at 12:46, Sean Hefty wrote: > Hal Rosenstock wrote: > > Note the MGRPs are MGIDs and switches are programmed with MLIDs and > > these can be 1:1 or many:1 depending on the implementation. Most do not > > do the many:1 but this is allowed by the spec. Also, note that switches > >

Re: [openib-general] [PATCH] rping: Erroneous check for minumum ping buffer size

2006-06-09 Thread Tom Tucker
Well it's almost a puzzle at this point. just hard coding 10 with a comment is probably easier to read. But ... for the curious, this will do what you want ... but may cause you to lose your breakfast. #define _stringify( _x ) # _x #define stringify( _x ) _stringify( _x ) Then printf("

Re: [openib-general] IB MTU tunable for uDAPL and/or Intel MPI?

2006-06-09 Thread Arlin Davis
Scott Weitzenkamp (sweitzen) wrote: While we're talking about MTUs, is the IB MTU tunable in uDAPL and/or Intel MPI via env var or config file? Looks like Intel MPI 2.0.1 uses 2K for IB MTU like MVAPICH does in OFED 1.0 rc4 and rc6, I'd like to try 1K with Intel MPI. Scott There is no me

Re: [openib-general] [PATCH 2/2] ipoib: handle multicast group reset notification

2006-06-09 Thread Sean Hefty
Sean Hefty wrote: Ipoib already checks for events that require rejoining multicast groups. We just need to add code to handle (i.e. ignore) multicast group reset notifications. Roland, Any issue committing this? - Sean ___ openib-general mailing li

Re: [openib-general] Re: [PATCH 1/2] multicast: notify users on membership errors

2006-06-09 Thread Sean Hefty
Michael S. Tsirkin wrote: These should eliminate any races with ipoib leaving, then quickly re-joining a group as a result of an event. Is there a chance this will fix the crashes me and Or were seeing? It shouldn't. The race that I was referring to only involved whether or not a MAD is sen

[openib-general] IB MTU tunable for uDAPL and/or Intel MPI?

2006-06-09 Thread Scott Weitzenkamp (sweitzen)
While we're talking about MTUs, is the IB MTU tunable in uDAPL and/or Intel MPI via env var or config file?   Looks like Intel MPI 2.0.1 uses 2K for IB MTU like MVAPICH does in OFED 1.0 rc4 and rc6, I'd like to try 1K with Intel MPI.   Scott From: [EMAIL PROTECTED] [mailto:[EMAIL

[openib-general] Re: Failed multicast join withnew multicast module

2006-06-09 Thread Sean Hefty
Hal Rosenstock wrote: Note the MGRPs are MGIDs and switches are programmed with MLIDs and these can be 1:1 or many:1 depending on the implementation. Most do not do the many:1 but this is allowed by the spec. Also, note that switches know nothing about the groups themselves (only MLIDs and which

[openib-general] [PATCH] ibnetdiscover: Indicate SP0 type

2006-06-09 Thread Hal Rosenstock
ibnetdiscover: Indicate SP0 type Signed-off-by: Hal Rosenstock <[EMAIL PROTECTED]> Index: diags/src/ibnetdiscover.c === --- diags/src/ibnetdiscover.c (revision 7842) +++ diags/src/ibnetdiscover.c (working copy) @@ -126,7 +126,9 @

[openib-general] [PATCH] ibnetdiscover: Add LMC display to switch port 0

2006-06-09 Thread Hal Rosenstock
ibnetdiscover: Add LMC display to switch port 0 Signed-off-by: Hal Rosenstock <[EMAIL PROTECTED]> Index: src/ibnetdiscover.c === --- src/ibnetdiscover.c (revision 7841) +++ src/ibnetdiscover.c (working copy) @@ -158,6 +158,7 @@ get_n

Re: [openib-general] Re: Mellanox HCAs: outstanding RDMAs

2006-06-09 Thread Michael Krause
Whether iWARP or IB, there is a fixed number of RDMA Requests allowed to be outstanding at any given time.  If one posts more RDMA Read requests than the fixed number, the transmit queue is stalled.  This is documented in both technology specifications.  It is something that all ULP should be aw

Re: [openib-general] Re: Failed multicast join withnew multicast module

2006-06-09 Thread Hal Rosenstock
On Fri, 2006-06-09 at 06:43, Hal Rosenstock wrote: > On Thu, 2006-06-08 at 18:00, Sean Hefty wrote: > > Hal Rosenstock wrote: > > > 2. There is lazy deletion of MC groups allowed so the reclamation may be > > > difficult. > > > > I'm not familiar with the switch programming. > > Note the MGRPs ar

[openib-general] [PATCH] osmtest: Support LMC > 0

2006-06-09 Thread Hal Rosenstock
osmtest: Support LMC > 0 Signed-off-by: Hal Rosenstock <[EMAIL PROTECTED]> Index: osmtest/osmtest.c === --- osmtest/osmtest.c (revision 7839) +++ osmtest/osmtest.c (working copy) @@ -1609,6 +1609,74 @@ osmtest_stress_port_recs_sm

[openib-general] Re: [PATCH 1/2] multicast: notify users on membership errors

2006-06-09 Thread Hal Rosenstock
On Fri, 2006-06-09 at 00:38, Sean Hefty wrote: > Modify ib_multicast module to detect events that require clients to rejoin > multicast groups. Add tracking of clients which are members of any groups, > and provide notification to those clients when such an event occurs. > > This patch tracks all

[openib-general] [PATCH] ibroute: When multiple paths, indicate port GUID on alternate paths

2006-06-09 Thread Hal Rosenstock
ibroute: When multiple paths, indicate port GUID on alternate paths Signed-off-by: Hal Rosenstock <[EMAIL PROTECTED]> Index: diags/src/ibroute.c === --- diags/src/ibroute.c (revision 7646) +++ diags/src/ibroute.c (working copy) @@ -2

[openib-general] [Bug 122] New: mad layer problem

2006-06-09 Thread Eli Cohen
Hi, Here is some info: Attached are the SysRq messages. The relation of MADs to ARP is that after ARP resolves a hardware address it is required to use an SM query to resolve the path to the host bearing the hardware address. How to invoke the tests: Attached

[openib-general] Re: Failed multicast join withnew multicast module

2006-06-09 Thread Hal Rosenstock
On Thu, 2006-06-08 at 18:00, Sean Hefty wrote: > Hal Rosenstock wrote: > > 2. There is lazy deletion of MC groups allowed so the reclamation may be > > difficult. > > I'm not familiar with the switch programming. Note the MGRPs are MGIDs and switches are programmed with MLIDs and these can be 1:1

[openib-general] [PATCH] mthca: send opcode in error CQE for debug

2006-06-09 Thread Michael S. Tsirkin
I find the following helpful for debug. Pls consider for 2.6.18 -- While IP spec does not require opcode to be valid in error CQEs, Mellanox HCAs differentiate between send/receive errors, which is useful for debugging purposes. Signed-off-by: Michael S. Tsirkin <[EMAIL PROTECTED]> Index: last_

Re: [openib-general] [PATCH] rping: Erroneous check for minumum ping buffer size

2006-06-09 Thread Pradipta Kumar Banerjee
Pradipta Kumar Banerjee wrote: rping didn't checked correctly for the minimum size of the ping buffer resulting in the following error from glibc "*** glibc detected *** free(): invalid next size (fast)" Signed-off-by: Pradipta Kumar Banerjee <[EMAIL PROTECTED]> --- Index: rping.c