Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-02-01 Thread Zan Lynx
On Thu, 2007-02-01 at 18:54 +0300, Edward Shishkin wrote:
[snip]
> Thanks for the dump.
> 
> >[ 3138.456588]  [] current_atom_finish_all_fq+0x12e/0x280
> >[ 3138.456661]  [] autoremove_wake_function+0x0/0x30
> >[ 3138.456674]  [] submit_wb_list+0x11c/0x130
> >[ 3138.456690]  [] reiser4_txn_end+0x349/0x530
> >[ 3138.456710]  [] reiser4_txn_restart+0x9/0x20
> >[ 3138.456781]  [] force_commit_atom+0x50/0x60
> >[ 3138.456793]  [] writepages_unix_file+0x671/0x780
> >[ 3138.456824]  [] do_writepages+0x43/0x80
> >[ 3138.456838]  [] __filemap_fdatawrite_range+0x58/0x70
> >[ 3138.456914]  [] do_fsync+0x3d/0xe0
> >[ 3138.456930]  [] sys_msync+0x143/0x1f0
> >[ 3138.456945]  [] system_call+0x7e/0x83
> >  
> >
> 
> This is waiting for IO completion, and no success because of new plugging
> policy introduced by block layer folks. The attached patch should help.
> Andrew, please apply.

OK, I have been using it with your patch for many hours and it has not
frozen up yet.  I believe that the patch did indeed fix it.

Thank you.
-- 
Zan Lynx <[EMAIL PROTECTED]>


signature.asc
Description: This is a digitally signed message part


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-02-01 Thread Edward Shishkin

Zan Lynx wrote:


On Sat, 2007-01-20 at 03:34 +0300, Vladimir V. Saveliev wrote:
 


Hello

On Friday 19 January 2007 20:58, Zan Lynx wrote:
   


I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).
 


[snip]
 


yes, please provide more information. Full kernel output at time of freeze is 
very desirable.
   



Here comes a full sized bug report, as best as I can do it.  This is
kernel 2.6.20-rc6-mm3 instead of rc4-mm1.  Still has the problem.
 



Thanks for the dump.


[ 3138.456588]  [] current_atom_finish_all_fq+0x12e/0x280
[ 3138.456661]  [] autoremove_wake_function+0x0/0x30
[ 3138.456674]  [] submit_wb_list+0x11c/0x130
[ 3138.456690]  [] reiser4_txn_end+0x349/0x530
[ 3138.456710]  [] reiser4_txn_restart+0x9/0x20
[ 3138.456781]  [] force_commit_atom+0x50/0x60
[ 3138.456793]  [] writepages_unix_file+0x671/0x780
[ 3138.456824]  [] do_writepages+0x43/0x80
[ 3138.456838]  [] __filemap_fdatawrite_range+0x58/0x70
[ 3138.456914]  [] do_fsync+0x3d/0xe0
[ 3138.456930]  [] sys_msync+0x143/0x1f0
[ 3138.456945]  [] system_call+0x7e/0x83
 



This is waiting for IO completion, and no success because of new plugging
policy introduced by block layer folks. The attached patch should help.
Andrew, please apply.

Thanks,
Edward.

Signed-off-by: Edward Shishkin <[EMAIL PROTECTED]>
---
 linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c |2 ++
 linux-2.6.20-rc6-mm3/fs/reiser4/wander.c   |   18 +++---
 2 files changed, 13 insertions(+), 7 deletions(-)

--- linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c.orig
+++ linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c
@@ -63,6 +63,7 @@
 	}
 	lock_page(page);
 	submit_bio(READ, bio);
+	blk_replug_current_nested();
 	wait_on_page_locked(page);
 	if (!PageUptodate(page)) {
 		warning("green-2007",
@@ -157,6 +158,7 @@
 	lock_page(get_super_private(sb)->status_page);	// Safe as nobody should touch our page.
 	/* We can block now, but we have no other choice anyway */
 	submit_bio(WRITE, bio);
+	blk_replug_current_nested();
 	return 0;		// We do not wait for io to finish.
 }
 
--- linux-2.6.20-rc6-mm3/fs/reiser4/wander.c.orig
+++ linux-2.6.20-rc6-mm3/fs/reiser4/wander.c
@@ -718,6 +718,7 @@
 	jnode *first, int nr, const reiser4_block_nr *block_p,
 	flush_queue_t *fq, int flags)
 {
+	int ret = 0;
 	struct super_block *super = reiser4_get_current_sb();
 	int write_op = ( flags & WRITEOUT_BARRIER ) ? WRITE_BARRIER : WRITE;
 	int max_blocks;
@@ -738,9 +739,10 @@
 		int nr_used;
 
 		bio = bio_alloc(GFP_NOIO, nr_blocks);
-		if (!bio)
-			return RETERR(-ENOMEM);
-
+		if (!bio) {
+			ret = RETERR(-ENOMEM);
+			break;
+		}
 		bio->bi_bdev = super->s_bdev;
 		bio->bi_sector = block * (super->s_blocksize >> 9);
 		for (nr_used = 0, i = 0; i < nr_blocks; i++) {
@@ -843,8 +845,10 @@
 reiser4_submit_bio(write_op, bio);
 not_supported = bio_flagged(bio, BIO_EOPNOTSUPP);
 bio_put(bio);
-if (not_supported)
-	return -EOPNOTSUPP;
+if (not_supported) {
+	ret = -EOPNOTSUPP;
+	break;
+}
 			}
 
 			block += nr_used - 1;
@@ -855,8 +859,8 @@
 		}
 		nr -= nr_used;
 	}
-
-	return 0;
+	blk_replug_current_nested();
+	return ret;
 }
 
 /* This is a procedure which recovers a contiguous sequences of disk block


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-02-01 Thread Edward Shishkin

Zan Lynx wrote:


On Sat, 2007-01-20 at 03:34 +0300, Vladimir V. Saveliev wrote:
 


Hello

On Friday 19 January 2007 20:58, Zan Lynx wrote:
   


I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).
 


[snip]
 


yes, please provide more information. Full kernel output at time of freeze is 
very desirable.
   



Here comes a full sized bug report, as best as I can do it.  This is
kernel 2.6.20-rc6-mm3 instead of rc4-mm1.  Still has the problem.
 



Thanks for the dump.


[ 3138.456588]  [8033f5de] current_atom_finish_all_fq+0x12e/0x280
[ 3138.456661]  [80296510] autoremove_wake_function+0x0/0x30
[ 3138.456674]  [803350ac] submit_wb_list+0x11c/0x130
[ 3138.456690]  [80335409] reiser4_txn_end+0x349/0x530
[ 3138.456710]  [803355f9] reiser4_txn_restart+0x9/0x20
[ 3138.456781]  [80335680] force_commit_atom+0x50/0x60
[ 3138.456793]  [8034cfb1] writepages_unix_file+0x671/0x780
[ 3138.456824]  [802590b3] do_writepages+0x43/0x80
[ 3138.456838]  [8024dbf8] __filemap_fdatawrite_range+0x58/0x70
[ 3138.456914]  [8024e19d] do_fsync+0x3d/0xe0
[ 3138.456930]  [802c2473] sys_msync+0x143/0x1f0
[ 3138.456945]  [8025c11e] system_call+0x7e/0x83
 



This is waiting for IO completion, and no success because of new plugging
policy introduced by block layer folks. The attached patch should help.
Andrew, please apply.

Thanks,
Edward.

Signed-off-by: Edward Shishkin [EMAIL PROTECTED]
---
 linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c |2 ++
 linux-2.6.20-rc6-mm3/fs/reiser4/wander.c   |   18 +++---
 2 files changed, 13 insertions(+), 7 deletions(-)

--- linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c.orig
+++ linux-2.6.20-rc6-mm3/fs/reiser4/status_flags.c
@@ -63,6 +63,7 @@
 	}
 	lock_page(page);
 	submit_bio(READ, bio);
+	blk_replug_current_nested();
 	wait_on_page_locked(page);
 	if (!PageUptodate(page)) {
 		warning(green-2007,
@@ -157,6 +158,7 @@
 	lock_page(get_super_private(sb)-status_page);	// Safe as nobody should touch our page.
 	/* We can block now, but we have no other choice anyway */
 	submit_bio(WRITE, bio);
+	blk_replug_current_nested();
 	return 0;		// We do not wait for io to finish.
 }
 
--- linux-2.6.20-rc6-mm3/fs/reiser4/wander.c.orig
+++ linux-2.6.20-rc6-mm3/fs/reiser4/wander.c
@@ -718,6 +718,7 @@
 	jnode *first, int nr, const reiser4_block_nr *block_p,
 	flush_queue_t *fq, int flags)
 {
+	int ret = 0;
 	struct super_block *super = reiser4_get_current_sb();
 	int write_op = ( flags  WRITEOUT_BARRIER ) ? WRITE_BARRIER : WRITE;
 	int max_blocks;
@@ -738,9 +739,10 @@
 		int nr_used;
 
 		bio = bio_alloc(GFP_NOIO, nr_blocks);
-		if (!bio)
-			return RETERR(-ENOMEM);
-
+		if (!bio) {
+			ret = RETERR(-ENOMEM);
+			break;
+		}
 		bio-bi_bdev = super-s_bdev;
 		bio-bi_sector = block * (super-s_blocksize  9);
 		for (nr_used = 0, i = 0; i  nr_blocks; i++) {
@@ -843,8 +845,10 @@
 reiser4_submit_bio(write_op, bio);
 not_supported = bio_flagged(bio, BIO_EOPNOTSUPP);
 bio_put(bio);
-if (not_supported)
-	return -EOPNOTSUPP;
+if (not_supported) {
+	ret = -EOPNOTSUPP;
+	break;
+}
 			}
 
 			block += nr_used - 1;
@@ -855,8 +859,8 @@
 		}
 		nr -= nr_used;
 	}
-
-	return 0;
+	blk_replug_current_nested();
+	return ret;
 }
 
 /* This is a procedure which recovers a contiguous sequences of disk block


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-02-01 Thread Zan Lynx
On Thu, 2007-02-01 at 18:54 +0300, Edward Shishkin wrote:
[snip]
 Thanks for the dump.
 
 [ 3138.456588]  [8033f5de] current_atom_finish_all_fq+0x12e/0x280
 [ 3138.456661]  [80296510] autoremove_wake_function+0x0/0x30
 [ 3138.456674]  [803350ac] submit_wb_list+0x11c/0x130
 [ 3138.456690]  [80335409] reiser4_txn_end+0x349/0x530
 [ 3138.456710]  [803355f9] reiser4_txn_restart+0x9/0x20
 [ 3138.456781]  [80335680] force_commit_atom+0x50/0x60
 [ 3138.456793]  [8034cfb1] writepages_unix_file+0x671/0x780
 [ 3138.456824]  [802590b3] do_writepages+0x43/0x80
 [ 3138.456838]  [8024dbf8] __filemap_fdatawrite_range+0x58/0x70
 [ 3138.456914]  [8024e19d] do_fsync+0x3d/0xe0
 [ 3138.456930]  [802c2473] sys_msync+0x143/0x1f0
 [ 3138.456945]  [8025c11e] system_call+0x7e/0x83
   
 
 
 This is waiting for IO completion, and no success because of new plugging
 policy introduced by block layer folks. The attached patch should help.
 Andrew, please apply.

OK, I have been using it with your patch for many hours and it has not
frozen up yet.  I believe that the patch did indeed fix it.

Thank you.
-- 
Zan Lynx [EMAIL PROTECTED]


signature.asc
Description: This is a digitally signed message part


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-24 Thread Vince

Vladimir V. Saveliev wrote:

Hello

On Tuesday 23 January 2007 10:38, Vince wrote:

[...]
I don't know if it is related, but I've had the following BUG on 
2.6.20-rc4-mm1 (+ hot-fixes patches applied) :


---
kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!


This is another problem than Zan's one. The attached patch should fix it.

Andrew, please apply.


From: Vladimir Saveliev <[EMAIL PROTECTED]>

remove_suid may open a transaction in reiser4 which is to be restarted
before entering into main write loop.

Signed-off-by: Vladimir Saveliev <[EMAIL PROTECTED]>


I'm pleased to confirm I wasn't able to reproduce the bug with your 
patch applied.


Regards,

Vince
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-24 Thread Vince

Vladimir V. Saveliev wrote:

Hello

On Tuesday 23 January 2007 10:38, Vince wrote:

[...]
I don't know if it is related, but I've had the following BUG on 
2.6.20-rc4-mm1 (+ hot-fixes patches applied) :


---
kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!


This is another problem than Zan's one. The attached patch should fix it.

Andrew, please apply.


From: Vladimir Saveliev [EMAIL PROTECTED]

remove_suid may open a transaction in reiser4 which is to be restarted
before entering into main write loop.

Signed-off-by: Vladimir Saveliev [EMAIL PROTECTED]


I'm pleased to confirm I wasn't able to reproduce the bug with your 
patch applied.


Regards,

Vince
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-23 Thread Vladimir V. Saveliev
Hello

On Tuesday 23 January 2007 10:38, Vince wrote:
> Zan Lynx wrote:
> > I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
> > and rc4-mm1 have been giving me these freezes.  They were happening
> > inside X and without external console it was impossible to get anything,
> > plus I was reluctant to test it since the freeze sometimes requires a
> > full fsck.reiser4 --build-fs to recover the filesystem.
>  > [...]
> 
> Hi,
> 
> I don't know if it is related, but I've had the following BUG on 
> 2.6.20-rc4-mm1 (+ hot-fixes patches applied) :
> 
> ---
> kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!

This is another problem than Zan's one. The attached patch should fix it.

Andrew, please apply.


From: Vladimir Saveliev <[EMAIL PROTECTED]>

remove_suid may open a transaction in reiser4 which is to be restarted
before entering into main write loop.

Signed-off-by: Vladimir Saveliev <[EMAIL PROTECTED]>




diff -puN 
fs/reiser4/plugin/file/file.c~reiser4-restart-transaction-after-remove_suid 
fs/reiser4/plugin/file/file.c
--- 
linux-2.6.20-rc3-mm1/fs/reiser4/plugin/file/file.c~reiser4-restart-transaction-after-remove_suid
2007-01-23 18:59:14.0 +0300
+++ linux-2.6.20-rc3-mm1-vs/fs/reiser4/plugin/file/file.c   2007-01-23 
19:00:37.0 +0300
@@ -2175,6 +2175,8 @@ ssize_t write_unix_file(struct file *fil
reiser4_exit_context(ctx);
return result;
}
+   /* remove_suid might create a transaction */
+   reiser4_txn_restart(ctx);
 
uf_info = unix_file_inode_data(inode);
 

_


> invalid opcode:  [#1]
> PREEMPT
> last sysfs file: /devices/pci:00/:00:13.0/eth0/statistics/collisions
> Modules linked in: binfmt_misc nfs lockd sunrpc radeon drm reiser4 
> ati_remote fuse usbhid snd_via82xx snd_ac97_codec ac97_bus snd_pcm_oss 
> snd_mixer_oss snd_pcm snd_page_alloc snd_mpu401_uart snd_seq_oss 
> snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer 
> snd_seq_device ohci1394 ieee1394 psmouse sr_mod cdrom sg ehci_hcd 
> via_agp agpgart uhci_hcd usbcore i2c_viapro snd soundcore
> CPU:0
> EIP:0060:[]Not tainted VLI
> EFLAGS: 00010282   (2.6.20-rc4-mm1 #1)
> EIP is at reiser4_write_extent+0xd5/0x626 [reiser4]
> eax: ccca139c   ebx: 0200   ecx: f5bec400   edx: ffe4
> esi:    edi: f5bec414   ebp: da6ff274   esp: e17d7e34
> ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> Process sstrip (pid: 23858, ti=e17d6000 task=d8ffc570 task.ti=e17d6000)
> Stack:  00100100 00200200 00100100 0034 bf826a50 e083ff00 
> 
> c000 da6ff2c8 dccba4c0 0005 01ff 021e  
> 
>   0004 f9b6cdad  0004 0004 
> 0001
> Call Trace:
>   [] reiser4_update_sd+0x22/0x28 [reiser4]
>   [] notify_change+0x200/0x20f
>   [] vsscanf+0x1e2/0x3ff
>   [] write_unix_file+0x0/0x495 [reiser4]
>   [] __remove_suid+0x10/0x14
>   [] mark_page_accessed+0x1c/0x2e
>   [] reiser4_txn_begin+0x1c/0x2e [reiser4]
>   [] reiser4_write_extent+0x0/0x626 [reiser4]
>   [] write_unix_file+0x25a/0x495 [reiser4]
>   [] __handle_mm_fault+0x2bd/0x79b
>   [] write_unix_file+0x0/0x495 [reiser4]
>   [] vfs_write+0x8a/0x136
>   [] sys_write+0x41/0x67
>   [] sysenter_past_esp+0x5f/0x85
>   ===
> Code: 04 89 0c 24 31 c9 89 5c 24 04 e8 52 fc ff ff 31 d2 e9 59 05 00 00 
> 64 a1 08 00 00 00 8b 80 b4 04 00 00 8b 40 38 83 78 08 00 74 04 <0f> 0b 
> eb fe 8b 8c 24 e0 00 00 00 31 db 8b 01 8b 51 04 89 c1 0f
> EIP: [] reiser4_write_extent+0xd5/0x626 [reiser4] SS:ESP 
> 0068:e17d7e34
>   <4>reiser4[sstrip(23858)]: release_unix_file 
> (fs/reiser4/plugin/file/file.c:2417)[vs-44]:
> WARNING: out of memory?
> reiser4[sstrip(23858)]: release_unix_file 
> (fs/reiser4/plugin/file/file.c:2417)[vs-44]:
> WARNING: out of memory?
> 
> 
> 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-23 Thread Vladimir V. Saveliev
Hello

On Tuesday 23 January 2007 10:38, Vince wrote:
 Zan Lynx wrote:
  I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
  and rc4-mm1 have been giving me these freezes.  They were happening
  inside X and without external console it was impossible to get anything,
  plus I was reluctant to test it since the freeze sometimes requires a
  full fsck.reiser4 --build-fs to recover the filesystem.
   [...]
 
 Hi,
 
 I don't know if it is related, but I've had the following BUG on 
 2.6.20-rc4-mm1 (+ hot-fixes patches applied) :
 
 ---
 kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!

This is another problem than Zan's one. The attached patch should fix it.

Andrew, please apply.


From: Vladimir Saveliev [EMAIL PROTECTED]

remove_suid may open a transaction in reiser4 which is to be restarted
before entering into main write loop.

Signed-off-by: Vladimir Saveliev [EMAIL PROTECTED]




diff -puN 
fs/reiser4/plugin/file/file.c~reiser4-restart-transaction-after-remove_suid 
fs/reiser4/plugin/file/file.c
--- 
linux-2.6.20-rc3-mm1/fs/reiser4/plugin/file/file.c~reiser4-restart-transaction-after-remove_suid
2007-01-23 18:59:14.0 +0300
+++ linux-2.6.20-rc3-mm1-vs/fs/reiser4/plugin/file/file.c   2007-01-23 
19:00:37.0 +0300
@@ -2175,6 +2175,8 @@ ssize_t write_unix_file(struct file *fil
reiser4_exit_context(ctx);
return result;
}
+   /* remove_suid might create a transaction */
+   reiser4_txn_restart(ctx);
 
uf_info = unix_file_inode_data(inode);
 

_


 invalid opcode:  [#1]
 PREEMPT
 last sysfs file: /devices/pci:00/:00:13.0/eth0/statistics/collisions
 Modules linked in: binfmt_misc nfs lockd sunrpc radeon drm reiser4 
 ati_remote fuse usbhid snd_via82xx snd_ac97_codec ac97_bus snd_pcm_oss 
 snd_mixer_oss snd_pcm snd_page_alloc snd_mpu401_uart snd_seq_oss 
 snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer 
 snd_seq_device ohci1394 ieee1394 psmouse sr_mod cdrom sg ehci_hcd 
 via_agp agpgart uhci_hcd usbcore i2c_viapro snd soundcore
 CPU:0
 EIP:0060:[f9b8a2e0]Not tainted VLI
 EFLAGS: 00010282   (2.6.20-rc4-mm1 #1)
 EIP is at reiser4_write_extent+0xd5/0x626 [reiser4]
 eax: ccca139c   ebx: 0200   ecx: f5bec400   edx: ffe4
 esi:    edi: f5bec414   ebp: da6ff274   esp: e17d7e34
 ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
 Process sstrip (pid: 23858, ti=e17d6000 task=d8ffc570 task.ti=e17d6000)
 Stack:  00100100 00200200 00100100 0034 bf826a50 e083ff00 
 
 c000 da6ff2c8 dccba4c0 0005 01ff 021e  
 
   0004 f9b6cdad  0004 0004 
 0001
 Call Trace:
   [f9b6cdad] reiser4_update_sd+0x22/0x28 [reiser4]
   [c0162459] notify_change+0x200/0x20f
   [c01b89ed] vsscanf+0x1e2/0x3ff
   [f9b75c80] write_unix_file+0x0/0x495 [reiser4]
   [c013630d] __remove_suid+0x10/0x14
   [c013d847] mark_page_accessed+0x1c/0x2e
   [f9b5fbc2] reiser4_txn_begin+0x1c/0x2e [reiser4]
   [f9b8a20b] reiser4_write_extent+0x0/0x626 [reiser4]
   [f9b75eda] write_unix_file+0x25a/0x495 [reiser4]
   [c0142601] __handle_mm_fault+0x2bd/0x79b
   [f9b75c80] write_unix_file+0x0/0x495 [reiser4]
   [c01514e9] vfs_write+0x8a/0x136
   [c0151a27] sys_write+0x41/0x67
   [c0103c86] sysenter_past_esp+0x5f/0x85
   ===
 Code: 04 89 0c 24 31 c9 89 5c 24 04 e8 52 fc ff ff 31 d2 e9 59 05 00 00 
 64 a1 08 00 00 00 8b 80 b4 04 00 00 8b 40 38 83 78 08 00 74 04 0f 0b 
 eb fe 8b 8c 24 e0 00 00 00 31 db 8b 01 8b 51 04 89 c1 0f
 EIP: [f9b8a2e0] reiser4_write_extent+0xd5/0x626 [reiser4] SS:ESP 
 0068:e17d7e34
   4reiser4[sstrip(23858)]: release_unix_file 
 (fs/reiser4/plugin/file/file.c:2417)[vs-44]:
 WARNING: out of memory?
 reiser4[sstrip(23858)]: release_unix_file 
 (fs/reiser4/plugin/file/file.c:2417)[vs-44]:
 WARNING: out of memory?
 
 
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-22 Thread Vince

Zan Lynx wrote:

I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

> [...]

Hi,

I don't know if it is related, but I've had the following BUG on 
2.6.20-rc4-mm1 (+ hot-fixes patches applied) :


---
kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!
invalid opcode:  [#1]
PREEMPT
last sysfs file: /devices/pci:00/:00:13.0/eth0/statistics/collisions
Modules linked in: binfmt_misc nfs lockd sunrpc radeon drm reiser4 
ati_remote fuse usbhid snd_via82xx snd_ac97_codec ac97_bus snd_pcm_oss 
snd_mixer_oss snd_pcm snd_page_alloc snd_mpu401_uart snd_seq_oss 
snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer 
snd_seq_device ohci1394 ieee1394 psmouse sr_mod cdrom sg ehci_hcd 
via_agp agpgart uhci_hcd usbcore i2c_viapro snd soundcore

CPU:0
EIP:0060:[]Not tainted VLI
EFLAGS: 00010282   (2.6.20-rc4-mm1 #1)
EIP is at reiser4_write_extent+0xd5/0x626 [reiser4]
eax: ccca139c   ebx: 0200   ecx: f5bec400   edx: ffe4
esi:    edi: f5bec414   ebp: da6ff274   esp: e17d7e34
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process sstrip (pid: 23858, ti=e17d6000 task=d8ffc570 task.ti=e17d6000)
Stack:  00100100 00200200 00100100 0034 bf826a50 e083ff00 

   c000 da6ff2c8 dccba4c0 0005 01ff 021e  

     0004 f9b6cdad  0004 0004 
0001

Call Trace:
 [] reiser4_update_sd+0x22/0x28 [reiser4]
 [] notify_change+0x200/0x20f
 [] vsscanf+0x1e2/0x3ff
 [] write_unix_file+0x0/0x495 [reiser4]
 [] __remove_suid+0x10/0x14
 [] mark_page_accessed+0x1c/0x2e
 [] reiser4_txn_begin+0x1c/0x2e [reiser4]
 [] reiser4_write_extent+0x0/0x626 [reiser4]
 [] write_unix_file+0x25a/0x495 [reiser4]
 [] __handle_mm_fault+0x2bd/0x79b
 [] write_unix_file+0x0/0x495 [reiser4]
 [] vfs_write+0x8a/0x136
 [] sys_write+0x41/0x67
 [] sysenter_past_esp+0x5f/0x85
 ===
Code: 04 89 0c 24 31 c9 89 5c 24 04 e8 52 fc ff ff 31 d2 e9 59 05 00 00 
64 a1 08 00 00 00 8b 80 b4 04 00 00 8b 40 38 83 78 08 00 74 04 <0f> 0b 
eb fe 8b 8c 24 e0 00 00 00 31 db 8b 01 8b 51 04 89 c1 0f
EIP: [] reiser4_write_extent+0xd5/0x626 [reiser4] SS:ESP 
0068:e17d7e34
 <4>reiser4[sstrip(23858)]: release_unix_file 
(fs/reiser4/plugin/file/file.c:2417)[vs-44]:

WARNING: out of memory?
reiser4[sstrip(23858)]: release_unix_file 
(fs/reiser4/plugin/file/file.c:2417)[vs-44]:

WARNING: out of memory?

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-22 Thread Vince

Zan Lynx wrote:

I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

 [...]

Hi,

I don't know if it is related, but I've had the following BUG on 
2.6.20-rc4-mm1 (+ hot-fixes patches applied) :


---
kernel BUG at fs/reiser4/plugin/item/extent_file_ops.c:973!
invalid opcode:  [#1]
PREEMPT
last sysfs file: /devices/pci:00/:00:13.0/eth0/statistics/collisions
Modules linked in: binfmt_misc nfs lockd sunrpc radeon drm reiser4 
ati_remote fuse usbhid snd_via82xx snd_ac97_codec ac97_bus snd_pcm_oss 
snd_mixer_oss snd_pcm snd_page_alloc snd_mpu401_uart snd_seq_oss 
snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer 
snd_seq_device ohci1394 ieee1394 psmouse sr_mod cdrom sg ehci_hcd 
via_agp agpgart uhci_hcd usbcore i2c_viapro snd soundcore

CPU:0
EIP:0060:[f9b8a2e0]Not tainted VLI
EFLAGS: 00010282   (2.6.20-rc4-mm1 #1)
EIP is at reiser4_write_extent+0xd5/0x626 [reiser4]
eax: ccca139c   ebx: 0200   ecx: f5bec400   edx: ffe4
esi:    edi: f5bec414   ebp: da6ff274   esp: e17d7e34
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process sstrip (pid: 23858, ti=e17d6000 task=d8ffc570 task.ti=e17d6000)
Stack:  00100100 00200200 00100100 0034 bf826a50 e083ff00 

   c000 da6ff2c8 dccba4c0 0005 01ff 021e  

     0004 f9b6cdad  0004 0004 
0001

Call Trace:
 [f9b6cdad] reiser4_update_sd+0x22/0x28 [reiser4]
 [c0162459] notify_change+0x200/0x20f
 [c01b89ed] vsscanf+0x1e2/0x3ff
 [f9b75c80] write_unix_file+0x0/0x495 [reiser4]
 [c013630d] __remove_suid+0x10/0x14
 [c013d847] mark_page_accessed+0x1c/0x2e
 [f9b5fbc2] reiser4_txn_begin+0x1c/0x2e [reiser4]
 [f9b8a20b] reiser4_write_extent+0x0/0x626 [reiser4]
 [f9b75eda] write_unix_file+0x25a/0x495 [reiser4]
 [c0142601] __handle_mm_fault+0x2bd/0x79b
 [f9b75c80] write_unix_file+0x0/0x495 [reiser4]
 [c01514e9] vfs_write+0x8a/0x136
 [c0151a27] sys_write+0x41/0x67
 [c0103c86] sysenter_past_esp+0x5f/0x85
 ===
Code: 04 89 0c 24 31 c9 89 5c 24 04 e8 52 fc ff ff 31 d2 e9 59 05 00 00 
64 a1 08 00 00 00 8b 80 b4 04 00 00 8b 40 38 83 78 08 00 74 04 0f 0b 
eb fe 8b 8c 24 e0 00 00 00 31 db 8b 01 8b 51 04 89 c1 0f
EIP: [f9b8a2e0] reiser4_write_extent+0xd5/0x626 [reiser4] SS:ESP 
0068:e17d7e34
 4reiser4[sstrip(23858)]: release_unix_file 
(fs/reiser4/plugin/file/file.c:2417)[vs-44]:

WARNING: out of memory?
reiser4[sstrip(23858)]: release_unix_file 
(fs/reiser4/plugin/file/file.c:2417)[vs-44]:

WARNING: out of memory?

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Vladimir V. Saveliev
Hello

On Friday 19 January 2007 20:58, Zan Lynx wrote:
> I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
> and rc4-mm1 have been giving me these freezes.  They were happening
> inside X and without external console it was impossible to get anything,
> plus I was reluctant to test it since the freeze sometimes requires a
> full fsck.reiser4 --build-fs to recover the filesystem.
> 
> But I finally got some output in a console session.  I wasn't able to
> get it all, I made some notes of what I think the problem is.  I may try
> again later once I get netconsole working (netconsole fails as a
> built-in, I'll try it as a module next).
> 
> 1 lock held by pdflush/185:
> #0: (>s_umount_key#15) ... writeback_inodes+0x89
> 
> 3 locks held by realsync/12942:
> #0: (>s_umount_key#15) at ... __sync_inodes+0x78
> #1: (>commit_mutex) ... reiser4_txn_end+0x37a
> #2: (>mutex) ... synchronize_qrcu+0x19
> 
> So, I *think* the problem is two locks on s_umount_key#15.  Does that
> sound likely?  I also noticed QRCU may be involved.
> 
> Perhaps someone will look at this and instantly know what the problem
> is.
> 
> If not, I'll be following up with more details like .config and perhaps
> a full sysrq-T dump as soon as that fsck finishes.
> 
yes, please provide more information. Full kernel output at time of freeze is 
very desirable.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Edward Shishkin

Zan Lynx wrote:


I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes. 



I didn't investigate it in details yet, other file systems also freeze 
for me:

http://marc.theaimsgroup.com/?l=linux-kernel=116809282829254=2


They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.
 



Why did you decide to recover? Got oops after mount, or?


But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).

1 lock held by pdflush/185:
#0: (>s_umount_key#15) ... writeback_inodes+0x89

3 locks held by realsync/12942:
#0: (>s_umount_key#15) at ... __sync_inodes+0x78
#1: (>commit_mutex) ... reiser4_txn_end+0x37a
#2: (>mutex) ... synchronize_qrcu+0x19

So, I *think* the problem is two locks on s_umount_key#15.  Does that
sound likely?  I also noticed QRCU may be involved.

Perhaps someone will look at this and instantly know what the problem
is.

If not, I'll be following up with more details like .config and perhaps
a full sysrq-T dump as soon as that fsck finishes.
 



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Zan Lynx
I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).

1 lock held by pdflush/185:
#0: (>s_umount_key#15) ... writeback_inodes+0x89

3 locks held by realsync/12942:
#0: (>s_umount_key#15) at ... __sync_inodes+0x78
#1: (>commit_mutex) ... reiser4_txn_end+0x37a
#2: (>mutex) ... synchronize_qrcu+0x19

So, I *think* the problem is two locks on s_umount_key#15.  Does that
sound likely?  I also noticed QRCU may be involved.

Perhaps someone will look at this and instantly know what the problem
is.

If not, I'll be following up with more details like .config and perhaps
a full sysrq-T dump as soon as that fsck finishes.


signature.asc
Description: This is a digitally signed message part


linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Zan Lynx
I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes.  They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.

But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).

1 lock held by pdflush/185:
#0: (type-s_umount_key#15) ... writeback_inodes+0x89

3 locks held by realsync/12942:
#0: (type-s_umount_key#15) at ... __sync_inodes+0x78
#1: (mgr-commit_mutex) ... reiser4_txn_end+0x37a
#2: (qp-mutex) ... synchronize_qrcu+0x19

So, I *think* the problem is two locks on s_umount_key#15.  Does that
sound likely?  I also noticed QRCU may be involved.

Perhaps someone will look at this and instantly know what the problem
is.

If not, I'll be following up with more details like .config and perhaps
a full sysrq-T dump as soon as that fsck finishes.


signature.asc
Description: This is a digitally signed message part


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Edward Shishkin

Zan Lynx wrote:


I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
and rc4-mm1 have been giving me these freezes. 



I didn't investigate it in details yet, other file systems also freeze 
for me:

http://marc.theaimsgroup.com/?l=linux-kernelm=116809282829254w=2


They were happening
inside X and without external console it was impossible to get anything,
plus I was reluctant to test it since the freeze sometimes requires a
full fsck.reiser4 --build-fs to recover the filesystem.
 



Why did you decide to recover? Got oops after mount, or?


But I finally got some output in a console session.  I wasn't able to
get it all, I made some notes of what I think the problem is.  I may try
again later once I get netconsole working (netconsole fails as a
built-in, I'll try it as a module next).

1 lock held by pdflush/185:
#0: (type-s_umount_key#15) ... writeback_inodes+0x89

3 locks held by realsync/12942:
#0: (type-s_umount_key#15) at ... __sync_inodes+0x78
#1: (mgr-commit_mutex) ... reiser4_txn_end+0x37a
#2: (qp-mutex) ... synchronize_qrcu+0x19

So, I *think* the problem is two locks on s_umount_key#15.  Does that
sound likely?  I also noticed QRCU may be involved.

Perhaps someone will look at this and instantly know what the problem
is.

If not, I'll be following up with more details like .config and perhaps
a full sysrq-T dump as soon as that fsck finishes.
 



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: linux-2.6.20-rc4-mm1 Reiser4 filesystem freeze and corruption

2007-01-19 Thread Vladimir V. Saveliev
Hello

On Friday 19 January 2007 20:58, Zan Lynx wrote:
 I have been running 2.6.20-rc2-mm1 without problems, but both rc3-mm1
 and rc4-mm1 have been giving me these freezes.  They were happening
 inside X and without external console it was impossible to get anything,
 plus I was reluctant to test it since the freeze sometimes requires a
 full fsck.reiser4 --build-fs to recover the filesystem.
 
 But I finally got some output in a console session.  I wasn't able to
 get it all, I made some notes of what I think the problem is.  I may try
 again later once I get netconsole working (netconsole fails as a
 built-in, I'll try it as a module next).
 
 1 lock held by pdflush/185:
 #0: (type-s_umount_key#15) ... writeback_inodes+0x89
 
 3 locks held by realsync/12942:
 #0: (type-s_umount_key#15) at ... __sync_inodes+0x78
 #1: (mgr-commit_mutex) ... reiser4_txn_end+0x37a
 #2: (qp-mutex) ... synchronize_qrcu+0x19
 
 So, I *think* the problem is two locks on s_umount_key#15.  Does that
 sound likely?  I also noticed QRCU may be involved.
 
 Perhaps someone will look at this and instantly know what the problem
 is.
 
 If not, I'll be following up with more details like .config and perhaps
 a full sysrq-T dump as soon as that fsck finishes.
 
yes, please provide more information. Full kernel output at time of freeze is 
very desirable.
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/