Re: btrfs-transaction blocked for more than 120 seconds

Chris Murphy Sun, 05 Jan 2014 15:49:13 -0800

On Jan 5, 2014, at 2:17 PM, Sulla <su...@gmx.at> wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Dear Chris!
> 
> Certainly: I have 3 HDDs, all of which WD20EARS.


These drives don't have a configurable SCT ERC, so you need to modify the SCSI 
block layer timeout:

echo 120 >/sys/block/sdX/device/timeout

You also need to schedule regular scrubs at the md level as well.

echo check > /sys/block/mdX/md/sync_action
cat /sys/block/mdX/mismatch_cnt

More info about this is in man 4 md, and on the linux-raid list.

> 
>      3904907520 blocks super 1.2 level 5, 8k chunk, algorithm 2 [3/3] [UUU]

OK so 8KB chunk, 16KB full stripe, so that doesn't apply to what I was thinking 
might be the case. The workload is presumably small file sizes, like a mail 
server?


> any other information I can supply?

I'm not a developer, I don't know if this problem is known or maybe fixed in a 
newer kernel than 3.11.0 - which has been around for 5-6 months. I think the 
main suggestion is to try a newer kernel, granted with the configuration of md, 
lvm, and btrfs you have three layers that will likely have kernel changes. I'd 
make sure you have backups. While this layout is valid and should work, it's 
also probably less common and therefore less tested.

Usually in case of blocking devs want to see sysrq+w issued. The setup is dmesg 
-n7, and enable sysrq functions. Then reproduce the block, and during the block 
issue w to the sysrq trigger, then capture dmesg contents and post the block 
and any other nearby btrfs messages.

https://www.kernel.org/doc/Documentation/sysrq.txt


Chris Murphy--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: btrfs-transaction blocked for more than 120 seconds

Reply via email to