Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-11-03 Thread Meelis Roos
On 03-11-2014 12:08, Christoph Hellwig wrote: Meelis, can you give the patch below a try? This only tries to lock the door on devices that were actually reset. Given that on a reset device we fail all commands before resuming operations, it should work fine there, as all tags should be released.

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-11-03 Thread Christoph Hellwig
Meelis, can you give the patch below a try? This only tries to lock the door on devices that were actually reset. Given that on a reset device we fail all commands before resuming operations, it should work fine there, as all tags should be released. diff --git a/drivers/scsi/scsi_error.c b/driv

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-11-02 Thread Jens Axboe
On 2014-10-30 11:45, Christoph Hellwig wrote: On Thu, Oct 30, 2014 at 07:32:52PM +0200, Meelis Roos wrote: can you try the patch below? It's a hack and not a proper fix, but it addresses what seems to be your culprit, given that it is the only place allocating a request from the error handler.

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-30 Thread Christoph Hellwig
On Thu, Oct 30, 2014 at 07:32:52PM +0200, Meelis Roos wrote: > > can you try the patch below? It's a hack and not a proper fix, but it > > addresses what seems to be your culprit, given that it is the only > > place allocating a request from the error handler. > > Applied it on top of 3.18-rc2, b

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-30 Thread Meelis Roos
> can you try the patch below? It's a hack and not a proper fix, but it > addresses what seems to be your culprit, given that it is the only > place allocating a request from the error handler. Applied it on top of 3.18-rc2, booted with scsi_mod.use_blk_mq=1 and it booted up fine. > diff --git

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-30 Thread Christoph Hellwig
Meelis, can you try the patch below? It's a hack and not a proper fix, but it addresses what seems to be your culprit, given that it is the only place allocating a request from the error handler. diff --git a/drivers/scsi/scsi_error.c b/drivers/scsi/scsi_error.c index fa7b5ec..5804ea0 100644 --

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-30 Thread Meelis Roos
> >> On Wed, Oct 29, 2014 at 09:08:46AM -0600, Jens Axboe wrote: > Another test server with MPT SCSI RAID has a similar problem, > scsi_mod.use_blk_mq=0 cures it but I cannot get a good trace (no serial > console). 3.18.0-rc2-00043-gf7e87a4 was tested there. > >>> > >>> The first issue

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Meelis Roos
> >> On Wed, Oct 29, 2014 at 09:08:46AM -0600, Jens Axboe wrote: > Another test server with MPT SCSI RAID has a similar problem, > scsi_mod.use_blk_mq=0 cures it but I cannot get a good trace (no serial > console). 3.18.0-rc2-00043-gf7e87a4 was tested there. > >>> > >>> The first issue

RE: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Elliott, Robert (Server Storage)
> -----Original Message----- > From: linux-scsi-ow...@vger.kernel.org [mailto:linux-scsi- > ow...@vger.kernel.org] On Behalf Of Meelis Roos > Sent: Wednesday, 29 October, 2014 10:38 AM > To: Jens Axboe > Cc: linux-scsi@vger.kernel.org; Christoph Hellwig > Subject: Re: blk-m

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Jens Axboe
On 10/29/2014 02:06 PM, Meelis Roos wrote: >> On Wed, Oct 29, 2014 at 09:08:46AM -0600, Jens Axboe wrote: Another test server with MPT SCSI RAID has a similar problem, scsi_mod.use_blk_mq=0 cures it but I cannot get a good trace (no serial console). 3.18.0-rc2-00043-gf7e87a4 was tested

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Meelis Roos
> On Wed, Oct 29, 2014 at 09:08:46AM -0600, Jens Axboe wrote: > > >Another test server with MPT SCSI RAID has a similar problem, > > >scsi_mod.use_blk_mq=0 cures it but I cannot get a good trace (no serial > > >console). 3.18.0-rc2-00043-gf7e87a4 was tested there. > > > > The first issue looks like

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Christoph Hellwig
On Wed, Oct 29, 2014 at 09:08:46AM -0600, Jens Axboe wrote: > >Another test server with MPT SCSI RAID has a similar problem, > >scsi_mod.use_blk_mq=0 cures it but I cannot get a good trace (no serial > >console). 3.18.0-rc2-00043-gf7e87a4 was tested there. > > The first issue looks like scsi cdrom a

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Meelis Roos
> On 2014-10-29 05:46, Meelis Roos wrote: > > > I tried 3.18-rc2 with blk-mq on by default on an HP ProLiant DL380 G3 (with HP > > > CCISS RAID controller). It fails late in the bootup with "task > > > scsi_eh_1:720 blocked for more than 120 seconds." messages. > > > > > > Booting with scsi_mod.use_blk_m

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Jens Axboe
On 2014-10-29 05:46, Meelis Roos wrote: I tried 3.18-rc2 with blk-mq on by default on an HP ProLiant DL380 G3 (with HP CCISS RAID controller). It fails late in the bootup with "task scsi_eh_1:720 blocked for more than 120 seconds." messages. Booting with scsi_mod.use_blk_mq=0 fixes the problem. Anot

Re: blk-mq problem on proliant DL380 G3 (cciss)

2014-10-29 Thread Meelis Roos
> I tried 3.18-rc2 with blk-mq on by default on an HP ProLiant DL380 G3 (with HP > CCISS RAID controller). It fails late in the bootup with "task > scsi_eh_1:720 blocked for more than 120 seconds." messages. > > Booting with scsi_mod.use_blk_mq=0 fixes the problem. Another test server with MPT SCSI R