On 07/19/2015 07:54 PM, Dave Chinner wrote:
On Fri, Jul 17, 2015 at 05:10:50PM +0530, Chandan Rajendra wrote:
On Friday 17 Jul 2015 06:16:02 Brian Foster wrote:
On Fri, Jul 17, 2015 at 12:56:43AM -0400, Chandan Rajendra wrote:
When running generic/311 on Btrfs' subpagesize-blocksize patchset (on
ppc64
with 4k sectorsize and 16k node/leaf size) I noticed the following call
trace,
BTRFS (device dm-0): parent transid verify failed on 29720576 wanted 160
found 158 BTRFS (device dm-0): parent transid verify failed on 29720576
wanted 160 found 158 BTRFS: Transaction aborted (error -5)
WARNING: at /root/repos/linux/fs/btrfs/super.c:260
Modules linked in:
CPU: 3 PID: 30769 Comm: umount Tainted: G W L
4.0.0-rc5-11671-g8b82e73e #63 task: c000000079aaddb0 ti: c000000079a48000
task.ti: c000000079a48000 NIP: c000000000499aa0 LR: c000000000499a9c CTR:
c000000000779630
REGS: c000000079a4b480 TRAP: 0700 Tainted: G W L
(4.0.0-rc5-11671-g8b82e73e) MSR: 8000000100029032 <SF,EE,ME,IR,DR,RI>
CR: 28008828 XER: 20000000 CFAR: c000000000a23914 SOFTE: 1
GPR00: c000000000499a9c c000000079a4b700 c00000000103bdf8 0000000000000025
GPR04: 0000000000000001 0000000000000502 c00000000107e918 0000000000000cda
GPR08: 0000000000000007 0000000000000007 0000000000000001 c0000000010f5044
GPR12: 0000000028008822 c00000000fdc0d80 0000000020000000 0000000010152e00
GPR16: 0000010002979380 0000000010140724 0000000000000000 0000000000000000
GPR20: ffffffffffffffff 0000000000000000 0000000000000000 0000000000000000
GPR24: c0000000151f61a8 0000000000000000 c000000055e5e800 c000000000aac270
GPR28: 00000000000004a4 fffffffffffffffb c000000055e5e800 c0000000679204d0
NIP [c000000000499aa0] .__btrfs_abort_transaction+0x180/0x190
LR [c000000000499a9c] .__btrfs_abort_transaction+0x17c/0x190
Call Trace:
[c000000079a4b700] [c000000000499a9c]
.__btrfs_abort_transaction+0x17c/0x190 (unreliable) [c000000079a4b7a0]
[c000000000541678] .__btrfs_run_delayed_items+0xe8/0x220
[c000000079a4b850] [c0000000004d5b3c]
.btrfs_commit_transaction+0x37c/0xca0 [c000000079a4b960]
[c00000000049824c] .btrfs_sync_fs+0x6c/0x1a0
[c000000079a4ba00] [c000000000255270] .sync_filesystem+0xd0/0x100
[c000000079a4ba80] [c000000000218070] .generic_shutdown_super+0x40/0x170
[c000000079a4bb10] [c000000000218598] .kill_anon_super+0x18/0x30
[c000000079a4bb90] [c000000000498418] .btrfs_kill_super+0x18/0xc0
[c000000079a4bc10] [c000000000218ac8] .deactivate_locked_super+0x98/0xe0
[c000000079a4bc90] [c00000000023e744] .cleanup_mnt+0x54/0xa0
[c000000079a4bd10] [c0000000000b7d14] .task_work_run+0x114/0x150
[c000000079a4bdb0] [c000000000015f84] .do_notify_resume+0x74/0x80
[c000000079a4be30] [c000000000009838] .ret_from_except_lite+0x64/0x68
Instruction dump:
ebc1fff0 ebe1fff8 4bfffb28 60000000 3ce2ffcd 38e7e818 4bffffbc 3c62ffd2
7fa4eb78 3863b808 48589e1d 60000000 <0fe00000> 4bfffedc 60000000 60000000
BTRFS: error (device dm-0) in __btrfs_run_delayed_items:1188: errno=-5 IO
failure
The call trace is seen when executing _run_test() for the 8th time.
The above trace is actually a false-positive failure as indicated below,
fsync-tester
fsync(fd)
Write delayed inode item to fs tree
(assume transid to be 160)
(assume tree block to start at logical address 29720576)
md5sum $testfile
This causes a delayed inode to be added
Load flakey table
i.e. drop writes that are initiated from now onwards
Unmount filesystem
btrfs_sync_fs is invoked
Write 29720576 metadata block to disk
free_extent_buffer(29720576)
release_extent_buffer(29720576)
Start writing delayed inode
Traverse the fs tree
(assume the parent tree block of 29720576 is still in memory)
When reading 29720576 from disk, parent's blkptr will have
generation
set to 160. But the on-disk tree block will have an older
generation (say, 158). Transid verification fails and hence the
transaction gets aborted
The test only cares about the FS instance before the unmount
operation (i.e. the synced FS). Hence to get the test to pass, ignore the
false-positive trace that could be generated.
Signed-off-by: Chandan Rajendra <chan...@linux.vnet.ibm.com>
---
tests/generic/311 | 2 ++
1 file changed, 2 insertions(+)
diff --git a/tests/generic/311 b/tests/generic/311
index d21b6eb..cd6391d 100755
--- a/tests/generic/311
+++ b/tests/generic/311
@@ -64,6 +64,8 @@ _require_xfs_io_command "falloc"
[ -x $here/src/fsync-tester ] || _notrun "fsync-tester not built"
+_disable_dmesg_check
+
Hmm, I don't think this is something we'd want to do unconditionally.
E.g., if something hits the logs for xfs or ext4, we probably want to
hear about it.
Sorry, I forgot that I was dealing with generic tests. I agree with what you
say.
Perhaps check that the fs is btrfs and possibly the fs params are such
that the known warning occurs..? I'd defer to the btrfs folks on how
best to check that, so long as it doesn't affect other fs'.
The above trace is independent of any Btrfs parameters. Its just that it was
easily reproducible on ppc64 with the subpagesize-blocksize patchset
applied. So IMHO disabling call trace check when testing Btrfs filesystem is
the way to go. But again, as you said, we can wait for comments from other
Btrfs devs.
As a general principle, we don't disable warnings/errors in tests
just to make tests pass. The bug in the code being tested should be
fixed so the warning/error is not generated. If you do not want
to see the error, then expung the test from your
subpagesize-blocksize test runs.
The same problem would happen if you did echo 3>
/proc/sys/vm/drop_caches right after dropping writes before doing the
unmount. The sub page blocksize stuff is just exposing the race and
thus causing this warning to happen. The warning has nothing to do with
the test, it happens after drop writes so I think it's fine to ignore
it. Maybe a different mechanism could be put into place to say "hey
ignore dmesg for this part" or something like that.
That all being said, it is a little weird that the sub page blocksize
stuff hits this so consistently, I wonder why things are getting evicted
that quickly. Thanks,
Josef
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html