Re: kernel hangs during balance

2017-12-23 Thread Rich Rauenzahn
I finally got a full stack trace via sysrq. A lot of stacks seem to end in a page fault -- I wonder if it's because my page file is a loopback on btrfs.
https://pastebin.com/GyWAu1EP
$ cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-4.14.8-1.el7.elrepo.x86_64 root=UUID=35f0ce3f-0902-47a3-8ad8-86179d1f3e3a

Re: kernel hangs during balance

2017-12-20 Thread Duncan
Holger Hoffstätte posted on Wed, 20 Dec 2017 20:58:14 +0100 as excerpted:
> On 12/20/17 20:02, Chris Murphy wrote:
>> I don't know if it's the sending MUA or the list server, but the line
>> wrapping makes this much harder to follow. I suggest putting it in a
>> text file and attaching the text

Re: kernel hangs during balance

2017-12-20 Thread Rich Rauenzahn
I switched to the LT kernel because of this issue. I was running mainline and thought that LT would get me stability. I can switch back to LT while we RCA. At the risk of changing two things, I could add that (scsi_mod.use_blk_mq=n) to my boot and also switch back to ML. I do notice that disk
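
Before and after a change like this, it can help to confirm which parameters the running kernel actually booted with. The following is a minimal sketch, not anything posted in the thread: `parse_cmdline` is a hypothetical helper, and the sample string is the `/proc/cmdline` output quoted elsewhere in this thread. On a live system the string would come from reading `/proc/cmdline`.

```python
import shlex


def parse_cmdline(cmdline: str) -> dict:
    """Split a kernel command line into {name: value}; bare flags map to None."""
    params = {}
    for token in shlex.split(cmdline):
        # Split on the first '=' only, so "root=UUID=..." keeps its full value.
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


# Sample taken from the /proc/cmdline output quoted in this thread.
sample = (
    "BOOT_IMAGE=/vmlinuz-4.14.8-1.el7.elrepo.x86_64 "
    "root=UUID=35f0ce3f-0902-47a3-8ad8-86179d1f3e3a"
)

params = parse_cmdline(sample)
# Report whether the parameter discussed above is present on the command line.
print(params.get("scsi_mod.use_blk_mq", "<not set on command line>"))
```

A parameter missing from the command line only means the kernel is using its built-in default for it, which this sketch does not try to determine.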

Re: kernel hangs during balance

2017-12-20 Thread Holger Hoffstätte
On 12/20/17 20:02, Chris Murphy wrote:
> I don't know if it's the sending MUA or the list server, but the line
> wrapping makes this much harder to follow. I suggest putting it in a
> text file and attaching the text file. It's definitely not on the
> receiving side, I see it here also:
>

Re: kernel hangs during balance

2017-12-20 Thread Chris Murphy
I don't know if it's the sending MUA or the list server, but the line wrapping makes this much harder to follow. I suggest putting it in a text file and attaching the text file. It's definitely not on the receiving side, I see it here also: https://www.spinics.net/lists/linux-btrfs/msg72872.html

Re: kernel hangs during balance

2017-12-20 Thread Rich Rauenzahn
Ok, caught the hung tasks last night. I don't *think* this is related, because I'm pretty sure this isn't happening on the same filesystem, but I do have a loopback swap on one btrfs drive. The hang might have occurred after the btrfs balance was finished, which is confusing. I'm adding
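
To rule the loopback swap in or out, one starting point is listing which active swap areas sit on loop devices. This is a rough sketch under stated assumptions: `loop_swaps` is a hypothetical helper, the sample text is made up to mimic the layout of `/proc/swaps` (a header line followed by one row per swap area), and matching on the `/dev/loop` prefix is a simplification.

```python
def loop_swaps(swaps_text: str) -> list:
    """Return the swap areas whose device path starts with /dev/loop."""
    found = []
    for line in swaps_text.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if fields and fields[0].startswith("/dev/loop"):
            found.append(fields[0])
    return found


# Made-up sample in the layout of /proc/swaps; not data from this thread.
sample_swaps = (
    "Filename\tType\tSize\tUsed\tPriority\n"
    "/dev/sda2\tpartition\t8388604\t0\t-2\n"
    "/dev/loop0\tpartition\t4194300\t1024\t-3\n"
)

print(loop_swaps(sample_swaps))
```

On a live system the input would be the contents of `/proc/swaps`. Mapping a loop device back to its backing file on btrfs is a separate step not covered here.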

Re: kernel hangs during balance

2017-12-19 Thread Rich Rauenzahn
On Tue, Dec 19, 2017 at 9:14 AM, Hans van Kranenburg wrote:
> Just wild first guess... are you also using btrfs send/receive
> functionality where the system having problems is the sending part?

No.

>>> Every night I'm getting a kernel hang, sometimes caught by

Re: kernel hangs during balance

2017-12-19 Thread Hans van Kranenburg
On 12/19/2017 06:08 PM, Rich Rauenzahn wrote:
> What's also confusing is I just ran a manual balance on the fs using
> defaults (which are aggressive) and it completed with no problems.
> It smells more like a race condition than a particular corruption.

Just wild first guess... are you also

Re: kernel hangs during balance

2017-12-19 Thread Rich Rauenzahn
What's also confusing is I just ran a manual balance on the fs using defaults (which are aggressive) and it completed with no problems. It smells more like a race condition than a particular corruption.

On Tue, Dec 19, 2017 at 8:09 AM, Rich Rauenzahn wrote:
> I'm running

kernel hangs during balance

2017-12-19 Thread Rich Rauenzahn
I'm running 4.4.106-1.el7.elrepo.x86_64 and I do a btrfs balance every night. Every night I'm getting a kernel hang, sometimes caught by my watchdog, sometimes not. Last night's hang was on the balance of DATA on / at 70. I'm not sure how to further trace this down to help you -- the console by