[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-09-11 Thread Colin King
SRU justification: Impact: mdadm, Raid5 get stuck in uninterruptable sleep under heavy I/O load. Copying data to a Raid 5 XFS partition results in a permanent lock on several processes related to it, which get stuck in the D(+) state. Occurs when large quantities of data (10-40 GB) are copied,
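
For anyone trying to confirm the symptom described above, here is a minimal sketch (not part of the original report) that lists processes currently in the D state, i.e. uninterruptible sleep, by reading /proc/<pid>/stat on Linux. The helper names parse_state and blocked_processes are made up for illustration. The "+" that ps shows after the state letter only marks a foreground process group, so the sketch checks the state letter alone.

```python
import os


def parse_state(stat_line):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so split on the LAST closing parenthesis.
    head, _, tail = stat_line.rpartition(")")
    pid_text, _, comm = head.partition("(")
    return int(pid_text), comm, tail.split()[0]


def blocked_processes(proc_root="/proc"):
    """Return a list of (pid, comm) for processes whose state is 'D'."""
    found = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return found  # no procfs available at this path
    for entry in entries:
        if not entry.isdigit():
            continue  # skip non-process entries such as 'meminfo'
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, comm, state = parse_state(handle.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited while we were reading, or odd format
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in blocked_processes():
        print(pid, comm)
```

Run it a few times during a large copy: processes that stay in the list across runs are the ones worth reporting. A process that shows up only briefly is normal, since any task waiting on disk I/O passes through the D state.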

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-09-09 Thread Andrew Cholakian
I've tested Colin's patch and it's live on 2 production 64bit servers. Seems to work just fine. -- mdadm, Raid5 and XFS stuck in uninterruptable sleep https://bugs.launchpad.net/bugs/208551 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu.

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-09-02 Thread Colin King
Hi Andrew, If you can try out my kernel in the PPA, just to verify this kernel with the single patch to fix this bug, it would give us a clear indication that this issue is fixed against the current Hardy kernel sources. This allows us to OK it for inclusion into the Hardy kernel for the

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-28 Thread Colin King
Since we don't have DesktopMan now to test this fix, marking it as Won't Fix, unless anyone has the same hardware and is willing to test this for Hardy. ** Changed in: linux (Ubuntu Hardy) Status: In Progress => Won't Fix

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-28 Thread Andrew Cholakian
I used to have this bug, and I'd be willing to test out your kernel. Right now I'm using my own kernel w/ patches applied. Would that work, Colin? Have you done any testing yourself? I'm running a 6x1TB raid5 array with XFS on top on a Dell PowerEdge 1800 (an older 64-bit Xeon).

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-21 Thread Colin King
Hi, I've applied commit 6ed3003c19a96fe18edf8179c4be6fe14abbebbc and built a linux 2.6.24-20.39cking4 package for testing. You can download the package from my PPA at: https://launchpad.net/~colin-king/+archive Please can you test this fix and let me know if it works so that we can add it to

Re: [Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-21 Thread DesktopMan
I had to swap to Debian as this bug made the server useless. I haven't had any deadlocks here yet, but it might still apply for all I know. Or is Debian using different code? I'm running testing, on 2.6.25-2. I am happy someone eventually identified the cause, though. Christian Colin King

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-20 Thread Colin King
** Changed in: linux (Ubuntu Hardy) Assignee: Ubuntu Kernel Team (ubuntu-kernel-team) => Colin King (colin-king) Status: Triaged => In Progress

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-19 Thread Leann Ogasawara
Hi Guys, I just wanted to let you know the latest Alpha for the upcoming Intrepid Ibex 8.10 is available. The kernel for Intrepid is based on a 2.6.26 kernel at the moment. This 2.6.26 kernel has the patch which was referenced to have fixed this issue in 2.6.25. For more information regarding

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-08-07 Thread Andrew Cholakian
I managed to patch the stock Ubuntu kernel (2.6.24-18) with the patches I found on the second link on the above post on the LKML. Seems stable; I've been running it in production on two large raid5 arrays without issue. The patches didn't apply perfectly but they do work.

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-06-16 Thread Carl Streeter
I'm having the same issue pointed to in the thread mentioned above: http://www.issociate.de/board/post/471929/2.6.24-rc6_reproducible_raid5_hang.html It seems that this was fixed in kernel version 2.6.25. Would it be possible to backport this to Ubuntu kernels? It's basically impossible to use

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-05-06 Thread Twigathy
I'm not certain if I'm having the same trouble as you, but mdadm fell over pretty hard for me on 2.6.24-16-server, mdadm - v2.6.3 - 20th August 2007 when expanding 5x500GB -> 6x500GB. I lost all the data on the raid (oops). Possibly this is a bug in sata_sil with lots of disk writes? 5 of the 6

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-05-06 Thread DesktopMan
Not sure if it is related, might be. I honestly gave up on it after concluding that the problem was too erratic and virtually impossible for me to debug. If I remember correctly I also got messages about the SATA link going down, then reset and back up.

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-05-06 Thread Twigathy
Hm, so what did you do instead? Buy new controller cards or give up on raid? ;)

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-05-06 Thread Twigathy
Hi, I googled a little further; looks like this is a bug in sata_sil after all. Check out http://www.ussg.iu.edu/hypermail/linux/kernel/0707.1/0024.html Doesn't seem to be a fix for it! This isn't too good for me - I have 3 of these cards :-(

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-03-30 Thread DesktopMan
I was copying from a file set up with losetup + cryptsetup on a raid5 array (the one above) to a raid6 array, both with XFS. During this copy I ran mdadm --examine --scan, and the raid5 crashed (the one I was reading from), giving me input/output errors. md device is fine on the other hand,

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-03-28 Thread DesktopMan
Sigh. Spoke too soon. Ran mdadm -D while it was being copied to, and it hung again. 2TB transferred. Guess it's directly related to the number of processes that access the device. Won't be able to restart it until tomorrow, but I can try any suggestions on the hung system.

[Bug 208551] Re: mdadm, Raid5 and XFS stuck in uninterruptable sleep

2008-03-28 Thread DesktopMan
mdadm -D returned after a couple of minutes, at which point it started writing again. During the period it was running nothing was written.