Re: NFS over RDMA crashing

2013-02-07 Thread Tom Tucker
On 2/6/13 3:28 PM, Steve Wise wrote: On 2/6/2013 4:24 PM, J. Bruce Fields wrote: On Wed, Feb 06, 2013 at 05:48:15PM +0200, Yan Burman wrote: When killing mount command that got stuck: --- BUG: unable to handle kernel paging request at 880324dc7ff8 IP

[PATCH 1/2] ibsim: Fix PerformanceSet parsing corner case

2013-02-07 Thread Albert Chu
Parse of attribute did not properly remove whitespace before it. So PerformanceSet "H-0002c90300325280" PortCounters.SymbolErrorCounter=3 would work but PerformanceSet "H-0002c90300325280" PortCounters.SymbolErrorCounter=3\ would not. Signed-off-by: Albert Chu --- ibsim/sim_cmd.c |4 ++
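The whitespace issue described in this patch can be illustrated with a minimal sketch. It is not the ibsim code: the helper names skip_ws() and attr_matches() are hypothetical, and the sketch only assumes that the attribute name is compared against the text that follows the quoted node name.

```c
#include <ctype.h>
#include <string.h>

/* Hypothetical helper (not from ibsim): advance past leading whitespace. */
static const char *skip_ws(const char *s)
{
    while (*s && isspace((unsigned char)*s))
        s++;
    return s;
}

/*
 * Hypothetical check: does the text after the node name begin with the
 * given attribute name, once any leading blanks or tabs are skipped?
 * Without the skip_ws() call, input with extra whitespace before the
 * attribute would fail the prefix comparison.
 */
static int attr_matches(const char *rest, const char *attr)
{
    rest = skip_ws(rest);
    return strncmp(rest, attr, strlen(attr)) == 0;
}
```

Skipping whitespace before the comparison makes the match independent of how many blanks separate the node name from the attribute.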

[PATCH 2/2] ibsim: Output error on bad input to PerformanceSet

2013-02-07 Thread Albert Chu
Signed-off-by: Albert Chu --- ibsim/sim_cmd.c |1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/ibsim/sim_cmd.c b/ibsim/sim_cmd.c index 83ad52d..4822a0b 100644 --- a/ibsim/sim_cmd.c +++ b/ibsim/sim_cmd.c @@ -1086,6 +1086,7 @@ static int do_perf_counter_set(FILE *f, char *li

Re: [LSF/MM TOPIC] Reducing the SRP initiator failover time

2013-02-07 Thread Vu Pham
It is known that it takes about two to three minutes before the upstream SRP initiator fails over from a failed path to a working path. This is not only considered longer than acceptable but is also longer than other Linux SCSI initiators (e.g. iSCSI and FC). Progress so far with improving

[PATCH] IB/qib: fix qp locate remove race

2013-02-07 Thread Mike Marciniszyn
remove_qp() can execute concurrently with a qib_lookup_qpn() on another CPU, which in and of itself is OK, given the RCU locking. The issue is that remove_qp() NULLs out the qp->next field so that a qib_lookup_qpn() might fail to find a qp if it occurs after the one that is being deleted. This is a
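The race described here can be sketched in plain user-space C. This is not the qib driver code: struct qp, unlink_qp() and lookup_from() are hypothetical stand-ins, ordinary pointers replace the RCU primitives, and the concurrent reader is simulated by a saved pointer into the list. Leaving the next pointer intact is shown as one possible remedy; that is an assumption, since the patch text is cut off above.

```c
#include <stddef.h>

/* Hypothetical, simplified stand-in for a queue pair on a hash chain. */
struct qp {
    unsigned qpn;
    struct qp *next;
};

/*
 * Unlink victim from the singly linked list at *head.
 * clear_next != 0 mimics the behaviour described in the message:
 * the removed node's next pointer is set to NULL.
 */
static void unlink_qp(struct qp **head, struct qp *victim, int clear_next)
{
    struct qp **pp = head;

    while (*pp && *pp != victim)
        pp = &(*pp)->next;
    if (!*pp)
        return;                 /* victim is not on the list */
    *pp = victim->next;         /* new readers now bypass victim */
    if (clear_next)
        victim->next = NULL;    /* strands readers already at victim */
}

/* Continue a lookup from pos, as a reader paused mid-traversal would. */
static struct qp *lookup_from(struct qp *pos, unsigned qpn)
{
    for (; pos; pos = pos->next)
        if (pos->qpn == qpn)
            return pos;
    return NULL;
}
```

A reader that has already reached the node being removed can still find later entries only if that node's next pointer is left untouched; in real RCU code the node would also have to stay allocated until a grace period has elapsed.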

Re: [PATCH for 3.8 v3, resend 0/3] IB/SRP patches for kernel 3.8

2013-02-07 Thread Vu Pham
Bart Van Assche wrote: On 02/06/13 22:42, Vu Pham wrote: Conclusion: 1. disable the port/path long enough >35 minutes, we have dangling scsi host. 2. enable the port within 30 minute, scsi host re-establish connection, path re-instate and then scsi_host was removed (no entry in sysfs) I attache

Re: OFA Management maintainership

2013-02-07 Thread Ira Weiny
Indeed, Thank you Alex, Ira On Thu, 7 Feb 2013 07:05:58 + Hal Rosenstock wrote: > Hi, > > I think we all owe a debt of gratitude for Alex's excellent 2+ years of > OpenSM, libibumad, and ibsim maintainership. I hope I can live up to the high > standard Alex set. Thanks for all you've done

Re: NFS over RDMA crashing

2013-02-07 Thread J. Bruce Fields
On Wed, Feb 06, 2013 at 05:24:35PM -0500, J. Bruce Fields wrote: > On Wed, Feb 06, 2013 at 05:48:15PM +0200, Yan Burman wrote: > > When killing mount command that got stuck: > > --- > > > > BUG: unable to handle kernel paging request at 880324dc7ff8 > >

RE: NFS over RDMA crashing

2013-02-07 Thread Yan Burman
> -----Original Message----- > From: Jeff Becker [mailto:jeffrey.c.bec...@nasa.gov] > Sent: Wednesday, February 06, 2013 19:07 > To: Steve Wise > Cc: Yan Burman; bfie...@fieldses.org; linux-...@vger.kernel.org; linux- > r...@vger.kernel.org; Or Gerlitz; Tom Tucker > Subject: Re: NFS over RDMA crash

Re: [PATCH for 3.8 v3, resend 0/3] IB/SRP patches for kernel 3.8

2013-02-07 Thread Bart Van Assche
On 02/07/13 10:41, Or Gerlitz wrote: (BTW - if the fourth patch that Vu used "save & restore host_scribble during error handling" is also needed, maybe you add it to this series, so they are reviewed/accepted together). Hello Or, The three patches I posted guarantee timely host removal even w

Re: [PATCH for 3.8 v3, resend 0/3] IB/SRP patches for kernel 3.8

2013-02-07 Thread Or Gerlitz
On 07/02/2013 11:05, Bart Van Assche wrote: On 02/06/13 22:42, Vu Pham wrote: Conclusion: 1. disable the port/path long enough >35 minutes, we have dangling scsi host. 2. enable the port within 30 minute, scsi host re-establish connection, path re-instate and then scsi_host was removed (no entry

Re: [PATCH for 3.8 v3, resend 0/3] IB/SRP patches for kernel 3.8

2013-02-07 Thread Bart Van Assche
On 02/06/13 22:42, Vu Pham wrote: Conclusion: 1. disable the port/path long enough >35 minutes, we have dangling scsi host. 2. enable the port within 30 minute, scsi host re-establish connection, path re-instate and then scsi_host was removed (no entry in sysfs) I attached a log here to show wha