Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Wed, Feb 06, 2008 at 07:27:56AM +1100, Neil Brown wrote: > On Tuesday February 5, [EMAIL PROTECTED] wrote: > > > > I was able to solve the problem, however, like so: > > > > 132c133 > > < # CONFIG_PREEMPT_NONE is not set > > --- > > > CONFIG_PREEMPT_NONE=y > > 134,135c135,136 > > <

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Neil Brown
On Tuesday February 5, [EMAIL PROTECTED] wrote: > > I was able to solve the problem, however, like so: > > 132c133 > < # CONFIG_PREEMPT_NONE is not set > --- > > CONFIG_PREEMPT_NONE=y > 134,135c135,136 > < CONFIG_PREEMPT=y > < CONFIG_PREEMPT_BKL=y > --- > > # CONFIG_PREEMPT is not set > > #

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Tue, Feb 05, 2008 at 01:55:17PM -0500, Lennart Sorensen wrote: > I tend to adjust the max disk speed raid is allowed to use, since > the default of 200MB/s makes the system close to unusable while it > is taking place. Could having slow disk access be causing things > to lock up? I don't know

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Lennart Sorensen
On Tue, Feb 05, 2008 at 09:10:05AM -0800, Robin Lee Powell wrote: > On Mon, Feb 04, 2008 at 09:40:55PM +1100, Nick Piggin wrote: > > On Monday 04 February 2008 08:21, Robin Lee Powell wrote: > > > I've got a machine with a 4 disk SATA raid10 configuration using > > > md. The entire disk is

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Mon, Feb 04, 2008 at 09:40:55PM +1100, Nick Piggin wrote: > On Monday 04 February 2008 08:21, Robin Lee Powell wrote: > > I've got a machine with a 4 disk SATA raid10 configuration using > > md. The entire disk is loop-AES encrypted, but that shouldn't > > matter here. > > > > Once a month,

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Mon, Feb 04, 2008 at 09:40:55PM +1100, Nick Piggin wrote: On Monday 04 February 2008 08:21, Robin Lee Powell wrote: I've got a machine with a 4 disk SATA raid10 configuration using md. The entire disk is loop-AES encrypted, but that shouldn't matter here. Once a month, Debian runs:

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Lennart Sorensen
On Tue, Feb 05, 2008 at 09:10:05AM -0800, Robin Lee Powell wrote: On Mon, Feb 04, 2008 at 09:40:55PM +1100, Nick Piggin wrote: On Monday 04 February 2008 08:21, Robin Lee Powell wrote: I've got a machine with a 4 disk SATA raid10 configuration using md. The entire disk is loop-AES

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Tue, Feb 05, 2008 at 01:55:17PM -0500, Lennart Sorensen wrote: I tend to adjust the max disk speed raid is allowed to use, since the default of 200MB/s makes the system close to unusable while it is taking place. Could having slow disk access be causing things to lock up? I don't know if

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Neil Brown
On Tuesday February 5, [EMAIL PROTECTED] wrote: I was able to solve the problem, however, like so: 132c133 # CONFIG_PREEMPT_NONE is not set --- CONFIG_PREEMPT_NONE=y 134,135c135,136 CONFIG_PREEMPT=y CONFIG_PREEMPT_BKL=y --- # CONFIG_PREEMPT is not set # CONFIG_PREEMPT_BKL is

Re: Monthly md check == hung machine; how do I debug?

2008-02-05 Thread Robin Lee Powell
On Wed, Feb 06, 2008 at 07:27:56AM +1100, Neil Brown wrote: On Tuesday February 5, [EMAIL PROTECTED] wrote: I was able to solve the problem, however, like so: 132c133 # CONFIG_PREEMPT_NONE is not set --- CONFIG_PREEMPT_NONE=y 134,135c135,136 CONFIG_PREEMPT=y

Re: Monthly md check == hung machine; how do I debug?

2008-02-04 Thread Nick Piggin
On Monday 04 February 2008 08:21, Robin Lee Powell wrote: > I've got a machine with a 4 disk SATA raid10 configuration using md. > The entire disk is loop-AES encrypted, but that shouldn't matter > here. > > Once a month, Debian runs: > > /usr/share/mdadm/checkarray --cron --all --quiet > >

Re: Monthly md check == hung machine; how do I debug?

2008-02-04 Thread Nick Piggin
On Monday 04 February 2008 08:21, Robin Lee Powell wrote: I've got a machine with a 4 disk SATA raid10 configuration using md. The entire disk is loop-AES encrypted, but that shouldn't matter here. Once a month, Debian runs: /usr/share/mdadm/checkarray --cron --all --quiet and the

Re: Monthly md check == hung machine; how do I debug?

2008-02-03 Thread Robin Lee Powell
On Mon, Feb 04, 2008 at 06:37:02PM +1300, martin f krafft wrote: > also sprach Robin Lee Powell <[EMAIL PROTECTED]> [2008.02.04.1021 +1300]: > > /usr/share/mdadm/checkarray --cron --all --quiet > > FYI: > http://git.debian.org/?p=pkg-mdadm/mdadm.git;a=blob;f=debian/checkarray > > It

Re: Monthly md check == hung machine; how do I debug?

2008-02-03 Thread martin f krafft
also sprach Robin Lee Powell <[EMAIL PROTECTED]> [2008.02.04.1021 +1300]: > /usr/share/mdadm/checkarray --cron --all --quiet FYI: http://git.debian.org/?p=pkg-mdadm/mdadm.git;a=blob;f=debian/checkarray It basically does echo check > /sys/block/$array/md/sync_action for all arrays. --

Monthly md check == hung machine; how do I debug?

2008-02-03 Thread Robin Lee Powell
I've got a machine with a 4 disk SATA raid10 configuration using md. The entire disk is loop-AES encrypted, but that shouldn't matter here. Once a month, Debian runs: /usr/share/mdadm/checkarray --cron --all --quiet and the machine hangs within 30 minutes of that starting. It seems that I

Monthly md check == hung machine; how do I debug?

2008-02-03 Thread Robin Lee Powell
I've got a machine with a 4 disk SATA raid10 configuration using md. The entire disk is loop-AES encrypted, but that shouldn't matter here. Once a month, Debian runs: /usr/share/mdadm/checkarray --cron --all --quiet and the machine hangs within 30 minutes of that starting. It seems that I

Re: Monthly md check == hung machine; how do I debug?

2008-02-03 Thread martin f krafft
also sprach Robin Lee Powell [EMAIL PROTECTED] [2008.02.04.1021 +1300]: /usr/share/mdadm/checkarray --cron --all --quiet FYI: http://git.debian.org/?p=pkg-mdadm/mdadm.git;a=blob;f=debian/checkarray It basically does echo check /sys/block/$array/md/sync_action for all arrays. --

Re: Monthly md check == hung machine; how do I debug?

2008-02-03 Thread Robin Lee Powell
On Mon, Feb 04, 2008 at 06:37:02PM +1300, martin f krafft wrote: also sprach Robin Lee Powell [EMAIL PROTECTED] [2008.02.04.1021 +1300]: /usr/share/mdadm/checkarray --cron --all --quiet FYI: http://git.debian.org/?p=pkg-mdadm/mdadm.git;a=blob;f=debian/checkarray It basically does