Re: [ewg] Re: Possible process deadlock in RMPP flow

2009-10-20 Thread Tziporet Koren
Sean Hefty wrote: I can't find anything off in the code for this. Eventually it was a FW issue that is fixed in our new 2.7.0 release Tziporet

Re: [ewg] Re: Possible process deadlock in RMPP flow

2009-10-20 Thread Eli Cohen
On Mon, Oct 19, 2009 at 01:30:47PM -0700, Sean Hefty wrote: I can't find anything off in the code for this. It's odd, since unregister_mad_agent() does: flush_workqueue(port_priv->wq); ib_cancel_rmpp_recvs(mad_agent_priv); and ib_cancel_rmpp_recvs() does:

RE: [ewg] Re: Possible process deadlock in RMPP flow

2009-10-19 Thread Sean Hefty
Thanks Or. This one is already in OFED 1.4.2 but apparently this is a different problem. Once I have information whether the patch Roland posted fixed it I will update the list. Eli, did you find a commit that fixes the problem you reported on? Or. Not yet :-( I can't find anything off in

Re: [ewg] Re: Possible process deadlock in RMPP flow

2009-10-04 Thread Or Gerlitz
Eli Cohen wrote: Thanks Or. This one is already in OFED 1.4.2 but apparently this is a different problem. Once I have information whether the patch Roland posted fixed it I will update the list. Eli, did you find a commit that fixes the problem you reported on? Or.

Re: [ewg] Re: Possible process deadlock in RMPP flow

2009-10-04 Thread Tziporet Koren
Or Gerlitz wrote: Eli Cohen wrote: Thanks Or. This one is already in OFED 1.4.2 but apparently this is a different problem. Once I have information whether the patch Roland posted fixed it I will update the list. Eli, did you find a commit that fixes the problem you reported on? Or. Not

[ewg] Re: Possible process deadlock in RMPP flow

2009-09-27 Thread Eli Cohen
On Thu, Sep 24, 2009 at 08:53:24AM -0700, Sean Hefty wrote: Thanks Or. This one is already in OFED 1.4.2 but apparently this is a different problem. Once I have information whether the patch Roland posted fixed it I will update the list. If ibnetdiscover doesn't use RMPP as Hal indicated, I

[ewg] Re: Possible process deadlock in RMPP flow

2009-09-24 Thread Or Gerlitz
Eli Cohen wrote: On Wed, Sep 23, 2009 at 09:08:28AM -0700, Sean Hefty wrote: What kernel does 1.4.2 map to? I think OFED 1.4.2 is based on kernel 2.6.27 but they're using RHEL 5.3 Yes, the usual mess: ofed X is based on kernel Y1 but with some additions from kernel Y2 plus plenty of unreviewed

[ewg] Re: Possible process deadlock in RMPP flow

2009-09-24 Thread Eli Cohen
On Thu, Sep 24, 2009 at 09:38:43AM +0300, Or Gerlitz wrote: commit b61d92d8ae6aa13b17d1c31e69d123879cec2ee2 Author: Sean Hefty sean.he...@intel.com Date: Fri Nov 30 17:30:18 2007 -0800 IB/mad: Fix incorrect access to items on local_list Thanks Or. This one is already in OFED 1.4.2

[ewg] RE: Possible process deadlock in RMPP flow

2009-09-24 Thread Sean Hefty
Thanks Or. This one is already in OFED 1.4.2 but apparently this is a different problem. Once I have information whether the patch Roland posted fixed it I will update the list. If ibnetdiscover doesn't use RMPP as Hal indicated, I don't think Roland's patch will help.

[ewg] RE: Possible process deadlock in RMPP flow

2009-09-23 Thread Sean Hefty
ibnetdiscover D 80149b8d 0 26968 26544 (L-TLB) 8102c900bd88 0046 81037e8e 81037e8e02e8 8102c900bd78 000a 8102c5b50820 81038a929820 011837bf6105 0ede 8102c5b50a08 0001 Call Trace: [80064207]

[ewg] Re: Possible process deadlock in RMPP flow

2009-09-23 Thread Hal Rosenstock
On Wed, Sep 23, 2009 at 12:08 PM, Sean Hefty sean.he...@intel.com wrote: ibnetdiscover D 80149b8d 0 26968 26544 (L-TLB) 8102c900bd88 0046 81037e8e 81037e8e02e8 8102c900bd78 000a 8102c5b50820 81038a929820 011837bf6105

[ewg] Re: Possible process deadlock in RMPP flow

2009-09-23 Thread Eli Cohen
On Wed, Sep 23, 2009 at 09:08:28AM -0700, Sean Hefty wrote: Roland just submitted a patch in this area yesterday. I don't know if the patch would fix their issue, but it may be worth trying. What kernel does 1.4.2 map to? I think OFED 1.4.2 is based on kernel 2.6.27 but they're using RHEL