Re: kernel BUG at fs/jbd2/journal.c:1951!

2014-06-03 Thread Jan Kara
On Sun 25-05-14 22:50:53, Meelis Roos wrote:
> Tried yesterdays git v3.15-rc6-190-g1ee1cea on Sun Fire T2000 (32-thread 
> sparc64) and got reproducibly stuck on boot with ext journal replay like 
> below. Booting Debian 3.14.2-1 replayed the log and mounted FS fine. 
> After that, my test kernel works fine too. SO I can not reproduce it any 
> more but it seems like I hit a bug.
> 
> [   62.064329] EXT4-fs (sda1): INFO: recovery required on readonly filesystem
> [   62.064395] EXT4-fs (sda1): write access will be enabled during recovery
> [   62.077043] EXT4-fs (sda1): orphan cleanup on readonly fs
> [   62.156585] EXT4-fs (sda1): 14 orphan inodes deleted
> [   62.156645] EXT4-fs (sda1): recovery complete
> [   62.156702] kernel BUG at fs/jbd2/journal.c:1951!
  So if I'm right this is assertion:
J_ASSERT(!journal->j_running_transaction);

  But I don't really see how that could happen especially during mount when
there are really just two processes operating in the filesystem - mount and
jbd2 process. Hum, after some more thought, maybe this is some strange
interaction between writeback completion starting a reserved handle and
journal_flush()... There is some potential for problems. Let me think about
it for some time.

Honza

> [   62.156752]   \|/  \|/
> [   62.156752]   "@'/ .. \`@"
> [   62.156752]   /_| \__/ |_\
> [   62.156752]  \__U_/
> [   62.156893] swapper/0(1): Kernel bad sw trap 5 [#1]
> [   62.156943] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc6+ #6
> [   62.157004] task: ff8008677640 ti: ff800870c000 task.ti: 
> ff800870c000
> [   62.157079] TSTATE: 004480001605 TPC: 005f3c00 TNPC: 
> 005f3c04 Y: Not tainted
> [   62.157180] TPC: 
> [   62.157225] g0: 009528c8 g1:  g2: 00a17f88 
> g3: 00a17f80
> [   62.157301] g4: ff8008677640 g5: ff83f6a4a000 g6: ff800870c000 
> g7: ff8008633fa0
> [   62.157375] o0: 0025 o1: 00961ed8 o2: 079f 
> o3: ff83f3aab888
> [   62.157451] o4: ff83ffb72000 o5: 00961ed8 sp: ff800870ed11 
> ret_pc: 005f3bf8
> [   62.157530] RPC: 
> [   62.157575] l0:  l1: 000e l2: 10018001 
> l3: 0088e438
> [   62.157650] l4: 009528c8 l5: ff00 l6: 00ff 
> l7: 0095ee30
> [   62.157726] i0: ff83f3aab800 i1: ff83f3aab824 i2: ff83f3aabb98 
> i3: ff83f3aab8e8
> [   62.157801] i4: ff83f3aab824 i5: ff83f3aab8e8 i6: ff800870edc1 
> i7: 005bd640
> [   62.157887] I7: 
> [   62.157942] Call Trace:
> [   62.157990]  [005bd640] 
> ext4_mark_recovery_complete.isra.232+0x4c/0xb0
> [   62.158068]  [005c3860] ext4_fill_super+0x303c/0x33e8
> [   62.158132]  [00538fc4] mount_bdev+0x124/0x18c
> [   62.158192]  [00539a4c] mount_fs+0x60/0x17c
> [   62.158246]  [0055087c] vfs_kern_mount+0x48/0xe4
> [   62.158309]  [00552e24] do_mount+0x808/0x974
> [   62.158370]  [005531b0] SyS_mount+0x6c/0xa4
> [   62.158427]  [00a74cb8] mount_block_root+0x108/0x2ac
> [   62.158490]  [00a75180] prepare_namespace+0x180/0x1bc
> [   62.158552]  [00a74a80] kernel_init_freeable+0x208/0x218
> [   62.158619]  [00864724] kernel_init+0x4/0xf4
> [   62.158684]  [00406084] ret_from_fork+0x1c/0x2c
> [   62.158738]  []   (null)
> [   62.158780] Disabling lock debugging due to kernel taint
> [   62.158847] Caller[005bd640]: 
> ext4_mark_recovery_complete.isra.232+0x4c/0xb0
> [   62.158926] Caller[005c3860]: ext4_fill_super+0x303c/0x33e8
> [   62.158990] Caller[00538fc4]: mount_bdev+0x124/0x18c
> [   62.159051] Caller[00539a4c]: mount_fs+0x60/0x17c
> [   62.159143] Caller[0055087c]: vfs_kern_mount+0x48/0xe4
> [   62.159215] Caller[00552e24]: do_mount+0x808/0x974
> [   62.159279] Caller[005531b0]: SyS_mount+0x6c/0xa4
> [   62.159345] Caller[00a74cb8]: mount_block_root+0x108/0x2ac
> [   62.159411] Caller[00a75180]: prepare_namespace+0x180/0x1bc
> [   62.159475] Caller[00a74a80]: kernel_init_freeable+0x208/0x218
> [   62.159540] Caller[00864724]: kernel_init+0x4/0xf4
> [   62.159604] Caller[00406084]: ret_from_fork+0x1c/0x2c
> [   62.159660] Caller[]:   (null)
> [   62.166652] Instruction DUMP: 901222d8  7ff8d20a  0100 <91d02005> 
> 11002587  921027a1  0af07ffb  901222d8  c45e2310
> [   62.207443] Kernel panic - not syncing: Attempted to

Re: kernel BUG at fs/jbd2/journal.c:1951!

2014-06-03 Thread Jan Kara
On Sun 25-05-14 22:50:53, Meelis Roos wrote:
 Tried yesterdays git v3.15-rc6-190-g1ee1cea on Sun Fire T2000 (32-thread 
 sparc64) and got reproducibly stuck on boot with ext journal replay like 
 below. Booting Debian 3.14.2-1 replayed the log and mounted FS fine. 
 After that, my test kernel works fine too. SO I can not reproduce it any 
 more but it seems like I hit a bug.
 
 [   62.064329] EXT4-fs (sda1): INFO: recovery required on readonly filesystem
 [   62.064395] EXT4-fs (sda1): write access will be enabled during recovery
 [   62.077043] EXT4-fs (sda1): orphan cleanup on readonly fs
 [   62.156585] EXT4-fs (sda1): 14 orphan inodes deleted
 [   62.156645] EXT4-fs (sda1): recovery complete
 [   62.156702] kernel BUG at fs/jbd2/journal.c:1951!
  So if I'm right this is assertion:
J_ASSERT(!journal-j_running_transaction);

  But I don't really see how that could happen especially during mount when
there are really just two processes operating in the filesystem - mount and
jbd2 process. Hum, after some more thought, maybe this is some strange
interaction between writeback completion starting a reserved handle and
journal_flush()... There is some potential for problems. Let me think about
it for some time.

Honza

 [   62.156752]   \|/  \|/
 [   62.156752]   @'/ .. \`@
 [   62.156752]   /_| \__/ |_\
 [   62.156752]  \__U_/
 [   62.156893] swapper/0(1): Kernel bad sw trap 5 [#1]
 [   62.156943] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc6+ #6
 [   62.157004] task: ff8008677640 ti: ff800870c000 task.ti: 
 ff800870c000
 [   62.157079] TSTATE: 004480001605 TPC: 005f3c00 TNPC: 
 005f3c04 Y: Not tainted
 [   62.157180] TPC: jbd2_journal_flush+0x11c/0x190
 [   62.157225] g0: 009528c8 g1:  g2: 00a17f88 
 g3: 00a17f80
 [   62.157301] g4: ff8008677640 g5: ff83f6a4a000 g6: ff800870c000 
 g7: ff8008633fa0
 [   62.157375] o0: 0025 o1: 00961ed8 o2: 079f 
 o3: ff83f3aab888
 [   62.157451] o4: ff83ffb72000 o5: 00961ed8 sp: ff800870ed11 
 ret_pc: 005f3bf8
 [   62.157530] RPC: jbd2_journal_flush+0x114/0x190
 [   62.157575] l0:  l1: 000e l2: 10018001 
 l3: 0088e438
 [   62.157650] l4: 009528c8 l5: ff00 l6: 00ff 
 l7: 0095ee30
 [   62.157726] i0: ff83f3aab800 i1: ff83f3aab824 i2: ff83f3aabb98 
 i3: ff83f3aab8e8
 [   62.157801] i4: ff83f3aab824 i5: ff83f3aab8e8 i6: ff800870edc1 
 i7: 005bd640
 [   62.157887] I7: ext4_mark_recovery_complete.isra.232+0x4c/0xb0
 [   62.157942] Call Trace:
 [   62.157990]  [005bd640] 
 ext4_mark_recovery_complete.isra.232+0x4c/0xb0
 [   62.158068]  [005c3860] ext4_fill_super+0x303c/0x33e8
 [   62.158132]  [00538fc4] mount_bdev+0x124/0x18c
 [   62.158192]  [00539a4c] mount_fs+0x60/0x17c
 [   62.158246]  [0055087c] vfs_kern_mount+0x48/0xe4
 [   62.158309]  [00552e24] do_mount+0x808/0x974
 [   62.158370]  [005531b0] SyS_mount+0x6c/0xa4
 [   62.158427]  [00a74cb8] mount_block_root+0x108/0x2ac
 [   62.158490]  [00a75180] prepare_namespace+0x180/0x1bc
 [   62.158552]  [00a74a80] kernel_init_freeable+0x208/0x218
 [   62.158619]  [00864724] kernel_init+0x4/0xf4
 [   62.158684]  [00406084] ret_from_fork+0x1c/0x2c
 [   62.158738]  []   (null)
 [   62.158780] Disabling lock debugging due to kernel taint
 [   62.158847] Caller[005bd640]: 
 ext4_mark_recovery_complete.isra.232+0x4c/0xb0
 [   62.158926] Caller[005c3860]: ext4_fill_super+0x303c/0x33e8
 [   62.158990] Caller[00538fc4]: mount_bdev+0x124/0x18c
 [   62.159051] Caller[00539a4c]: mount_fs+0x60/0x17c
 [   62.159143] Caller[0055087c]: vfs_kern_mount+0x48/0xe4
 [   62.159215] Caller[00552e24]: do_mount+0x808/0x974
 [   62.159279] Caller[005531b0]: SyS_mount+0x6c/0xa4
 [   62.159345] Caller[00a74cb8]: mount_block_root+0x108/0x2ac
 [   62.159411] Caller[00a75180]: prepare_namespace+0x180/0x1bc
 [   62.159475] Caller[00a74a80]: kernel_init_freeable+0x208/0x218
 [   62.159540] Caller[00864724]: kernel_init+0x4/0xf4
 [   62.159604] Caller[00406084]: ret_from_fork+0x1c/0x2c
 [   62.159660] Caller[]:   (null)
 [   62.166652] Instruction DUMP: 901222d8  7ff8d20a  0100 91d02005 
 11002587  921027a1  0af07ffb  901222d8  c45e2310
 [   62.207443] Kernel panic - not syncing: Attempted to kill init! 
 exitcode=0x0009
 
 
 -- 
 Meelis Roos (mr...@linux.ee)
 --
 To unsubscribe from this list: send the line unsubscribe linux-ext4 in
 the body of a message to majord...@vger.kernel.org
 More majordomo info

kernel BUG at fs/jbd2/journal.c:1951!

2014-05-25 Thread Meelis Roos
Tried yesterdays git v3.15-rc6-190-g1ee1cea on Sun Fire T2000 (32-thread 
sparc64) and got reproducibly stuck on boot with ext journal replay like 
below. Booting Debian 3.14.2-1 replayed the log and mounted FS fine. 
After that, my test kernel works fine too. SO I can not reproduce it any 
more but it seems like I hit a bug.

[   62.064329] EXT4-fs (sda1): INFO: recovery required on readonly filesystem
[   62.064395] EXT4-fs (sda1): write access will be enabled during recovery
[   62.077043] EXT4-fs (sda1): orphan cleanup on readonly fs
[   62.156585] EXT4-fs (sda1): 14 orphan inodes deleted
[   62.156645] EXT4-fs (sda1): recovery complete
[   62.156702] kernel BUG at fs/jbd2/journal.c:1951!
[   62.156752]   \|/  \|/
[   62.156752]   "@'/ .. \`@"
[   62.156752]   /_| \__/ |_\
[   62.156752]  \__U_/
[   62.156893] swapper/0(1): Kernel bad sw trap 5 [#1]
[   62.156943] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc6+ #6
[   62.157004] task: ff8008677640 ti: ff800870c000 task.ti: 
ff800870c000
[   62.157079] TSTATE: 004480001605 TPC: 005f3c00 TNPC: 
005f3c04 Y: Not tainted
[   62.157180] TPC: 
[   62.157225] g0: 009528c8 g1:  g2: 00a17f88 
g3: 00a17f80
[   62.157301] g4: ff8008677640 g5: ff83f6a4a000 g6: ff800870c000 
g7: ff8008633fa0
[   62.157375] o0: 0025 o1: 00961ed8 o2: 079f 
o3: ff83f3aab888
[   62.157451] o4: ff83ffb72000 o5: 00961ed8 sp: ff800870ed11 
ret_pc: 005f3bf8
[   62.157530] RPC: 
[   62.157575] l0:  l1: 000e l2: 10018001 
l3: 0088e438
[   62.157650] l4: 009528c8 l5: ff00 l6: 00ff 
l7: 0095ee30
[   62.157726] i0: ff83f3aab800 i1: ff83f3aab824 i2: ff83f3aabb98 
i3: ff83f3aab8e8
[   62.157801] i4: ff83f3aab824 i5: ff83f3aab8e8 i6: ff800870edc1 
i7: 005bd640
[   62.157887] I7: 
[   62.157942] Call Trace:
[   62.157990]  [005bd640] 
ext4_mark_recovery_complete.isra.232+0x4c/0xb0
[   62.158068]  [005c3860] ext4_fill_super+0x303c/0x33e8
[   62.158132]  [00538fc4] mount_bdev+0x124/0x18c
[   62.158192]  [00539a4c] mount_fs+0x60/0x17c
[   62.158246]  [0055087c] vfs_kern_mount+0x48/0xe4
[   62.158309]  [00552e24] do_mount+0x808/0x974
[   62.158370]  [005531b0] SyS_mount+0x6c/0xa4
[   62.158427]  [00a74cb8] mount_block_root+0x108/0x2ac
[   62.158490]  [00a75180] prepare_namespace+0x180/0x1bc
[   62.158552]  [00a74a80] kernel_init_freeable+0x208/0x218
[   62.158619]  [00864724] kernel_init+0x4/0xf4
[   62.158684]  [00406084] ret_from_fork+0x1c/0x2c
[   62.158738]  []   (null)
[   62.158780] Disabling lock debugging due to kernel taint
[   62.158847] Caller[005bd640]: 
ext4_mark_recovery_complete.isra.232+0x4c/0xb0
[   62.158926] Caller[005c3860]: ext4_fill_super+0x303c/0x33e8
[   62.158990] Caller[00538fc4]: mount_bdev+0x124/0x18c
[   62.159051] Caller[00539a4c]: mount_fs+0x60/0x17c
[   62.159143] Caller[0055087c]: vfs_kern_mount+0x48/0xe4
[   62.159215] Caller[00552e24]: do_mount+0x808/0x974
[   62.159279] Caller[005531b0]: SyS_mount+0x6c/0xa4
[   62.159345] Caller[00a74cb8]: mount_block_root+0x108/0x2ac
[   62.159411] Caller[00a75180]: prepare_namespace+0x180/0x1bc
[   62.159475] Caller[00a74a80]: kernel_init_freeable+0x208/0x218
[   62.159540] Caller[00864724]: kernel_init+0x4/0xf4
[   62.159604] Caller[00406084]: ret_from_fork+0x1c/0x2c
[   62.159660] Caller[]:   (null)
[   62.166652] Instruction DUMP: 901222d8  7ff8d20a  0100 <91d02005> 
11002587  921027a1  0af07ffb  901222d8  c45e2310
[   62.207443] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x0009


-- 
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


kernel BUG at fs/jbd2/journal.c:1951!

2014-05-25 Thread Meelis Roos
Tried yesterdays git v3.15-rc6-190-g1ee1cea on Sun Fire T2000 (32-thread 
sparc64) and got reproducibly stuck on boot with ext journal replay like 
below. Booting Debian 3.14.2-1 replayed the log and mounted FS fine. 
After that, my test kernel works fine too. SO I can not reproduce it any 
more but it seems like I hit a bug.

[   62.064329] EXT4-fs (sda1): INFO: recovery required on readonly filesystem
[   62.064395] EXT4-fs (sda1): write access will be enabled during recovery
[   62.077043] EXT4-fs (sda1): orphan cleanup on readonly fs
[   62.156585] EXT4-fs (sda1): 14 orphan inodes deleted
[   62.156645] EXT4-fs (sda1): recovery complete
[   62.156702] kernel BUG at fs/jbd2/journal.c:1951!
[   62.156752]   \|/  \|/
[   62.156752]   @'/ .. \`@
[   62.156752]   /_| \__/ |_\
[   62.156752]  \__U_/
[   62.156893] swapper/0(1): Kernel bad sw trap 5 [#1]
[   62.156943] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc6+ #6
[   62.157004] task: ff8008677640 ti: ff800870c000 task.ti: 
ff800870c000
[   62.157079] TSTATE: 004480001605 TPC: 005f3c00 TNPC: 
005f3c04 Y: Not tainted
[   62.157180] TPC: jbd2_journal_flush+0x11c/0x190
[   62.157225] g0: 009528c8 g1:  g2: 00a17f88 
g3: 00a17f80
[   62.157301] g4: ff8008677640 g5: ff83f6a4a000 g6: ff800870c000 
g7: ff8008633fa0
[   62.157375] o0: 0025 o1: 00961ed8 o2: 079f 
o3: ff83f3aab888
[   62.157451] o4: ff83ffb72000 o5: 00961ed8 sp: ff800870ed11 
ret_pc: 005f3bf8
[   62.157530] RPC: jbd2_journal_flush+0x114/0x190
[   62.157575] l0:  l1: 000e l2: 10018001 
l3: 0088e438
[   62.157650] l4: 009528c8 l5: ff00 l6: 00ff 
l7: 0095ee30
[   62.157726] i0: ff83f3aab800 i1: ff83f3aab824 i2: ff83f3aabb98 
i3: ff83f3aab8e8
[   62.157801] i4: ff83f3aab824 i5: ff83f3aab8e8 i6: ff800870edc1 
i7: 005bd640
[   62.157887] I7: ext4_mark_recovery_complete.isra.232+0x4c/0xb0
[   62.157942] Call Trace:
[   62.157990]  [005bd640] 
ext4_mark_recovery_complete.isra.232+0x4c/0xb0
[   62.158068]  [005c3860] ext4_fill_super+0x303c/0x33e8
[   62.158132]  [00538fc4] mount_bdev+0x124/0x18c
[   62.158192]  [00539a4c] mount_fs+0x60/0x17c
[   62.158246]  [0055087c] vfs_kern_mount+0x48/0xe4
[   62.158309]  [00552e24] do_mount+0x808/0x974
[   62.158370]  [005531b0] SyS_mount+0x6c/0xa4
[   62.158427]  [00a74cb8] mount_block_root+0x108/0x2ac
[   62.158490]  [00a75180] prepare_namespace+0x180/0x1bc
[   62.158552]  [00a74a80] kernel_init_freeable+0x208/0x218
[   62.158619]  [00864724] kernel_init+0x4/0xf4
[   62.158684]  [00406084] ret_from_fork+0x1c/0x2c
[   62.158738]  []   (null)
[   62.158780] Disabling lock debugging due to kernel taint
[   62.158847] Caller[005bd640]: 
ext4_mark_recovery_complete.isra.232+0x4c/0xb0
[   62.158926] Caller[005c3860]: ext4_fill_super+0x303c/0x33e8
[   62.158990] Caller[00538fc4]: mount_bdev+0x124/0x18c
[   62.159051] Caller[00539a4c]: mount_fs+0x60/0x17c
[   62.159143] Caller[0055087c]: vfs_kern_mount+0x48/0xe4
[   62.159215] Caller[00552e24]: do_mount+0x808/0x974
[   62.159279] Caller[005531b0]: SyS_mount+0x6c/0xa4
[   62.159345] Caller[00a74cb8]: mount_block_root+0x108/0x2ac
[   62.159411] Caller[00a75180]: prepare_namespace+0x180/0x1bc
[   62.159475] Caller[00a74a80]: kernel_init_freeable+0x208/0x218
[   62.159540] Caller[00864724]: kernel_init+0x4/0xf4
[   62.159604] Caller[00406084]: ret_from_fork+0x1c/0x2c
[   62.159660] Caller[]:   (null)
[   62.166652] Instruction DUMP: 901222d8  7ff8d20a  0100 91d02005 
11002587  921027a1  0af07ffb  901222d8  c45e2310
[   62.207443] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x0009


-- 
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/