Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-21 Thread Oliver Pahl
Is this Problem still investigated? I got the same kernel bug a few
minutes after booting my system with 2.6.20-mm2. As a student i am not
yet as familiar with kernel programming as i want to be, but i will try,
i promise. In the meantime, i always try to test the newest mm patches,
and i have to say, this one is the first which doesent work right on my
sys since 2.6.15-mm1. Thanks, keep up the good work an please help me
with this Problem

Greetz

Oli


On 18/02/07, Andrew Morton <[EMAIL PROTECTED]> wrote:
> On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]>
wrote:
>
> > On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> > > Le 18.02.2007 06:51, Andrew Morton a écrit :
> > > >Temporarily at
> > > >
> > > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> > > >
> > > >Will appear later at
> > > >
> > > >
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> > >
> > > Hello, I've got a fully reproducible Oops. I just have to boot to
> > > runlevel 2 and wait less than one minute.
> >
> > Maybe this oops is related too?
>
> Looks that way.
>
> > Can't yet tell if it's easily reproducible, this is on a JFS filesystem.
> >
> > BUG: unable to handle kernel NULL pointer dereference at virtual
address 0004
> >  printing eip:
> > c02206aa
> > *pde = 
> > Oops:  [#1]
> > PREEMPT
> > last sysfs file: /devices/pci:00/:00:00.0/class
> > CPU:0
> > EIP:0060:[]Not tainted VLI
> > EFLAGS: 00010297   (2.6.20-mm2-1 #1)
> > EIP is at __make_request+0x13a/0x390
> > eax:    ebx:    ecx: 042b0dd8   edx: 00010485
> > esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
> > ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> > Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
> > Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8
c5f8cea0 0008
> >c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0 
0008 0100
> >00011250 cfedaf60 c653db10 0206  c653db18
c014a31e c653db48
> > Call Trace:
> >  [] show_trace_log_lvl+0x1a/0x30
> >  [] show_stack_log_lvl+0xa9/0xd0
> >  [] show_registers+0x206/0x350
> >  [] die+0x10e/0x210
> >  [] do_page_fault+0x2d2/0x600
> >  [] error_code+0x74/0x7c
> >  [] generic_make_request+0x15a/0x200
> >  [] submit_bio+0x58/0xe0
> >  [] metapage_writepage+0x18f/0x1b0
> >  [] __writepage+0xb/0x30
> >  [] write_cache_pages+0x22f/0x300
> >  [] generic_writepages+0x23/0x30
> >  [] do_writepages+0x5c/0x60
> >  [] __filemap_fdatawrite_range+0x67/0x80
> >  [] filemap_flush+0x25/0x30
> >  [] lmLogSync+0x19d/0x230
> >  [] lmLog+0x5e/0x190
> >  [] txCommit+0x8c0/0x1010
> >  [] jfs_create+0x324/0x370
> >  [] vfs_create+0xaf/0x100
> >  [] open_namei+0x58c/0x640
> >  [] do_filp_open+0x2c/0x50
> >  [] do_sys_open+0x47/0xe0
> >  [] sys_open+0x1c/0x20
> >  [] syscall_call+0x7/0xb
>
> Michal Piotrowski is hitting it too, and has bisected it down to
> git-block.patch.
>

> I have bisected it down to this patches

> revert-md-avoid-possible-bug_on-in-md-bitmap-handling-for-git-block.patch
> git-block.patch
> git-block-fixup.patch
> git-block-dupe-definitions.patch
> git-block-xfs-barriers-broke.patch

> Regards,
> Michal

> --
> Michal K. K. Piotrowski
> LTG - Linux Testers Group (PL)
> (http://www.stardust.webpages.pl/ltg/)
> LTG - Linux Testers Group (EN)
> (http://www.stardust.webpages.pl/linux_testers_group_en/)
> -
> To unsubscribe from this list: send the line "unsubscribe
linux-kernel" > in
> the body of a message to [EMAIL PROTECTED]
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-21 Thread Oliver Pahl
Is this Problem still investigated? I got the same kernel bug a few
minutes after booting my system with 2.6.20-mm2. As a student i am not
yet as familiar with kernel programming as i want to be, but i will try,
i promise. In the meantime, i always try to test the newest mm patches,
and i have to say, this one is the first which doesent work right on my
sys since 2.6.15-mm1. Thanks, keep up the good work an please help me
with this Problem

Greetz

Oli


On 18/02/07, Andrew Morton [EMAIL PROTECTED] wrote:
 On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED]
wrote:

  On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
   Le 18.02.2007 06:51, Andrew Morton a écrit :
   Temporarily at
   
 http://userweb.kernel.org/~akpm/2.6.20-mm2/
   
   Will appear later at
   
   
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
  
   Hello, I've got a fully reproducible Oops. I just have to boot to
   runlevel 2 and wait less than one minute.
 
  Maybe this oops is related too?

 Looks that way.

  Can't yet tell if it's easily reproducible, this is on a JFS filesystem.
 
  BUG: unable to handle kernel NULL pointer dereference at virtual
address 0004
   printing eip:
  c02206aa
  *pde = 
  Oops:  [#1]
  PREEMPT
  last sysfs file: /devices/pci:00/:00:00.0/class
  CPU:0
  EIP:0060:[c02206aa]Not tainted VLI
  EFLAGS: 00010297   (2.6.20-mm2-1 #1)
  EIP is at __make_request+0x13a/0x390
  eax:    ebx:    ecx: 042b0dd8   edx: 00010485
  esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
  ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
  Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
  Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8
c5f8cea0 0008
 c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0 
0008 0100
 00011250 cfedaf60 c653db10 0206  c653db18
c014a31e c653db48
  Call Trace:
   [c01048ba] show_trace_log_lvl+0x1a/0x30
   [c0104979] show_stack_log_lvl+0xa9/0xd0
   [c0104ba6] show_registers+0x206/0x350
   [c0104dfe] die+0x10e/0x210
   [c0114652] do_page_fault+0x2d2/0x600
   [c0351f84] error_code+0x74/0x7c
   [c021e0ca] generic_make_request+0x15a/0x200
   [c0220318] submit_bio+0x58/0xe0
   [c0205d0f] metapage_writepage+0x18f/0x1b0
   [c014cc2b] __writepage+0xb/0x30
   [c014d20f] write_cache_pages+0x22f/0x300
   [c014d303] generic_writepages+0x23/0x30
   [c014d36c] do_writepages+0x5c/0x60
   [c0148307] __filemap_fdatawrite_range+0x67/0x80
   [c0149785] filemap_flush+0x25/0x30
   [c02080cd] lmLogSync+0x19d/0x230
   [c020871e] lmLog+0x5e/0x190
   [c020a360] txCommit+0x8c0/0x1010
   [c01eecf4] jfs_create+0x324/0x370
   [c016f38f] vfs_create+0xaf/0x100
   [c017269c] open_namei+0x58c/0x640
   [c0165b7c] do_filp_open+0x2c/0x50
   [c0165be7] do_sys_open+0x47/0xe0
   [c0165cbc] sys_open+0x1c/0x20
   [c01041c0] syscall_call+0x7/0xb

 Michal Piotrowski is hitting it too, and has bisected it down to
 git-block.patch.


 I have bisected it down to this patches

 revert-md-avoid-possible-bug_on-in-md-bitmap-handling-for-git-block.patch
 git-block.patch
 git-block-fixup.patch
 git-block-dupe-definitions.patch
 git-block-xfs-barriers-broke.patch

 Regards,
 Michal

 --
 Michal K. K. Piotrowski
 LTG - Linux Testers Group (PL)
 (http://www.stardust.webpages.pl/ltg/)
 LTG - Linux Testers Group (EN)
 (http://www.stardust.webpages.pl/linux_testers_group_en/)
 -
 To unsubscribe from this list: send the line unsubscribe
linux-kernel  in
 the body of a message to [EMAIL PROTECTED]
 More majordomo info at  http://vger.kernel.org/majordomo-info.html
 Please read the FAQ at  http://www.tux.org/lkml/
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Mon, Feb 19, 2007 at 03:08:03PM +0100, Michal Piotrowski wrote:
> Michal Piotrowski napisał(a):
> > Hi Frederik,
> > 
> > On 20/02/07, Frederik Deweerdt <[EMAIL PROTECTED]> wrote:
> >> Hi Michal,
> >>
> >> This seems to be a locking problem in __make_request, check_plug_merge()
> >> should be called with the q->queue_lock held.
> >> Could you try the following patch? It silenced the oops for me.
> > 
> > For me too, but Jens dislikes this patch.
> 
> Now I know why...
> 
> Code:  Bad EIP value.
> EIP: [<6b6b6b6b>] 0x6b6b6b6b SS:ESP 0068:f45f9bf8

:) It may not be related though. I think that my patch avoids the
BUG_ON(!rq_mergeable(req)); in ll_rw_blk.c:2782. This looks like another
beast to me.

Regards,
Frederik
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Mon, Feb 19, 2007 at 02:34:53PM +0100, Jens Axboe wrote:
> On Tue, Feb 20 2007, Frederik Deweerdt wrote:
> > On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
> > > On 18/02/07, Andrew Morton <[EMAIL PROTECTED]> wrote:
> > > >On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]> 
> > > >wrote:
> > > >
> > > >> On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> > > >> > Le 18.02.2007 06:51, Andrew Morton a ?crit :
> > > >> > >Temporarily at
> > > >> > >
> > > >> > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> > > >> > >
> > > >> > >Will appear later at
> > > >> > >
> > > >> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> > > >> >
> > > >> > Hello, I've got a fully reproducible Oops. I just have to boot to
> > > >> > runlevel 2 and wait less than one minute.
> > > >>
> > > >> Maybe this oops is related too?
> > > >
> > > >Looks that way.
> > > >
> > Hi Michal,
> > 
> > This seems to be a locking problem in __make_request, check_plug_merge()
> > should be called with the q->queue_lock held.
> > Could you try the following patch? It silenced the oops for me.
> 
> Just back from travel... That is not correct, why do you think the queue
> lock needs to be held there?
The unprotected:
check_plug_merge
-> bio_attempt_back_merge
   -> ll_back_merge_fn
  -> q->last_merge = NULL
made me think that. I don't know the code enough though.

Regards,
Frederik
> 
> -- 
> Jens Axboe
> 
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [EMAIL PROTECTED]
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
> 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Michal Piotrowski
Michal Piotrowski napisał(a):
> Hi Frederik,
> 
> On 20/02/07, Frederik Deweerdt <[EMAIL PROTECTED]> wrote:
>> Hi Michal,
>>
>> This seems to be a locking problem in __make_request, check_plug_merge()
>> should be called with the q->queue_lock held.
>> Could you try the following patch? It silenced the oops for me.
> 
> For me too, but Jens dislikes this patch.

Now I know why...

Code:  Bad EIP value.
EIP: [<6b6b6b6b>] 0x6b6b6b6b SS:ESP 0068:f45f9bf8
note: ksmserver[20993] exited with preempt_count 2
BUG: sleeping function called from invalid context at 
/mnt/md0/devel/linux-mm/kernel/rwsem.c:20
in_atomic():1, irqs_disabled():1
no locks held by ksmserver/20993.
irq event stamp: 0
hardirqs last  enabled at (0): [<>] 0x0
hardirqs last disabled at (0): [] copy_process+0x539/0x137d
softirqs last  enabled at (0): [] copy_process+0x557/0x137d


BUG: unable to handle kernel NULL pointer dereference at virtual address 
0118
 printing eip:
c01f3f12
*pde = 
Oops:  [#1]
PREEMPT SMP
last sysfs file: /devices/platform/i2c-9191/9191-0290/temp2_input
Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nfsd exportfs lockd 
nfs_acl autofs4 sunrpc af_packet nf_conntrack_netbios_ns ipt_REJECT 
nf_conntrack_ipv4 xt_state nf_conntrack nfnetlink xt_tcpudp iptable_filter 
ip_tables x_tables ipv6 binfmt_misc thermal processor fan container nvram 
snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy snd_seq_oss 
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss evdev 
snd_pcm skge intel_agp snd_timer agpgart snd 8139too soundcore sk98lin i2c_i801 
mii snd_page_alloc ide_cd cdrom rtc unix
CPU:0
EIP:0060:[]Not tainted VLI
EFLAGS: 00210297   (2.6.20-mm2 #19)
EIP is at blk_unplug_current+0x60/0x156
eax: f3f97298   ebx: 0001   ecx: c043db74   edx: 
esi: f3f97284   edi:    ebp: dda39d94   esp: dda39d58
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process swriter.bin (pid: 20846, ti=dda38000 task=dda79510 task.ti=dda38000)
Stack: dda79510 c03369fd  dda79510 c03369fd  dda39d90 f3f97298
   c011e710 c7403fac 00200046 dda39d90 0001 c04abda0 06f75d80 dda39da8
   c03339bb dda79f38 dda79f30 c7403f98 dda39db0 c0333a2f dda39db8 c015e380
Call Trace:
 [] show_trace_log_lvl+0x1a/0x2f
 [] show_stack_log_lvl+0x9d/0xac
 [] show_registers+0x1ed/0x34c
 [] die+0x11d/0x234
 [] do_page_fault+0x47c/0x55b
 [] error_code+0x7c/0x84
 [] io_schedule+0x3d/0x9a
 [] io_wait_schedule+0x17/0x1d
 [] sleep_on_page+0xa/0xc
 [] __wait_on_bit_lock+0x34/0x66
 [] lock_page_blocking+0x2c/0x30
 [] do_generic_mapping_read+0x29f/0x51a
 [] generic_file_aio_read+0x19a/0x1c8
 [] do_sync_read+0xd7/0x114
 [] vfs_read+0xcf/0x158
 [] sys_read+0x3d/0x72
 [] syscall_call+0x7/0xb
 ===
Code: 0b eb fe 8b 46 0c 48 89 46 0c 85 c0 0f 85 07 01 00 00 8b 7e 1c c7 46 1c 
00 00 00 00 8d 46 14 89 45 e0 39 46 14 0f 84 d4 00 00 00 <8b> 87 18 01 00 00 e8 
c0 26 14 00 c7 45 e4 00 00 00 00 8b 5e 14
EIP: [] blk_unplug_current+0x60/0x156 SS:ESP 0068:dda39d58
BUG: unable to handle kernel paging request at virtual address 6b6b6b6b
 printing eip:
6b6b6b6b
*pde = 
Oops:  [#2]
PREEMPT SMP
last sysfs file: /devices/platform/i2c-9191/9191-0290/temp2_input
Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nfsd exportfs lockd 
nfs_acl autofs4 sunrpc af_packet nf_conntrack_netbios_ns ipt_REJECT 
nf_conntrack_ipv4 xt_state nf_conntrack nfnetlink xt_tcpudp iptable_filter 
ip_tables x_tables ipv6 binfmt_misc thermal processor fan container nvram 
snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy snd_seq_oss 
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss evdev 
snd_pcm skge intel_agp snd_timer agpgart snd 8139too soundcore sk98lin i2c_i801 
mii snd_page_alloc ide_cd cdrom rtc unix
CPU:0
EIP:0060:[<6b6b6b6b>]Not tainted VLI
EFLAGS: 00210012   (2.6.20-mm2 #19)
EIP is at 0x6b6b6b6b
eax: dda79f38   ebx: dda79f38   ecx:    edx: 0003
esi: 6b6b6b6b   edi: 0001   ebp: f45f9c18   esp: f45f9bf8
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process ksmserver (pid: 20993, ti=f45f8000 task=f335b510 task.ti=f45f8000)
Stack: c011b5e0 f45f9c44 0003 c7403f98 6b6b6b6b c7403f98 0001 00200292
   f45f9c38 c011c3fb  f45f9c44 0003 c7403f98 f7d71f4c f7d71f4c
   f45f9c50 c01353c3 f45f9c44 f7d71f4c 0002 0002 f45f9c60 c01353e0
Call Trace:
 [] show_trace_log_lvl+0x1a/0x2f
 [] show_stack_log_lvl+0x9d/0xac
 [] show_registers+0x1ed/0x34c
 [] die+0x11d/0x234
 [] do_page_fault+0x47c/0x55b
 [] error_code+0x7c/0x84
 [] __wake_up+0x31/0x42
 [] __wake_up_bit+0x2e/0x34
 [] wake_up_bit+0x17/0x1b
 [] unlock_buffer+0x12/0x14
 [] do_get_write_access+0x185/0x657
 [] journal_get_write_access+0x1b/0x2a
 [] __ext3_journal_get_write_access+0x19/0x3f
 [] ext3_reserve_inode_write+0x34/0x68
 [] ext3_mark_inode_dirty+0x2a/0x41
 [] ext3_dirty_inode+0x6a/0x7d
 [] __mark_inode_dirty+0x2a/0x16d
 [] touch_atime+0xc1/0xcb
 [] 

Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Michal Piotrowski

Hi Frederik,

On 20/02/07, Frederik Deweerdt <[EMAIL PROTECTED]> wrote:

Hi Michal,

This seems to be a locking problem in __make_request, check_plug_merge()
should be called with the q->queue_lock held.
Could you try the following patch? It silenced the oops for me.


For me too, but Jens dislikes this patch.



Regards,
Frederik


Regards,
Michal

--
Michal K. K. Piotrowski
LTG - Linux Testers Group (PL)
(http://www.stardust.webpages.pl/ltg/)
LTG - Linux Testers Group (EN)
(http://www.stardust.webpages.pl/linux_testers_group_en/)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Jens Axboe
On Tue, Feb 20 2007, Frederik Deweerdt wrote:
> On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
> > On 18/02/07, Andrew Morton <[EMAIL PROTECTED]> wrote:
> > >On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]> 
> > >wrote:
> > >
> > >> On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> > >> > Le 18.02.2007 06:51, Andrew Morton a ?crit :
> > >> > >Temporarily at
> > >> > >
> > >> > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> > >> > >
> > >> > >Will appear later at
> > >> > >
> > >> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> > >> >
> > >> > Hello, I've got a fully reproducible Oops. I just have to boot to
> > >> > runlevel 2 and wait less than one minute.
> > >>
> > >> Maybe this oops is related too?
> > >
> > >Looks that way.
> > >
> Hi Michal,
> 
> This seems to be a locking problem in __make_request, check_plug_merge()
> should be called with the q->queue_lock held.
> Could you try the following patch? It silenced the oops for me.

Just back from travel... That is not correct, why do you think the queue
lock needs to be held there? I'll look into this tomorrow.

-- 
Jens Axboe

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
> On 18/02/07, Andrew Morton <[EMAIL PROTECTED]> wrote:
> >On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]> wrote:
> >
> >> On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> >> > Le 18.02.2007 06:51, Andrew Morton a �crit :
> >> > >Temporarily at
> >> > >
> >> > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> >> > >
> >> > >Will appear later at
> >> > >
> >> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> >> >
> >> > Hello, I've got a fully reproducible Oops. I just have to boot to
> >> > runlevel 2 and wait less than one minute.
> >>
> >> Maybe this oops is related too?
> >
> >Looks that way.
> >
Hi Michal,

This seems to be a locking problem in __make_request, check_plug_merge()
should be called with the q->queue_lock held.
Could you try the following patch? It silenced the oops for me.

Regards,
Frederik

Signed-off-by: Frederik Deweerdt <[EMAIL PROTECTED]>

diff --git a/block/ll_rw_blk.c b/block/ll_rw_blk.c
index 577f448..666f34e 100644
--- a/block/ll_rw_blk.c
+++ b/block/ll_rw_blk.c
@@ -2919,14 +2919,14 @@ static int __make_request(request_queue_
 */
blk_queue_bounce(q, );
 
+   spin_lock_irq(q->queue_lock);
/*
 * Check if we can merge with the plugged list before grabbing
 * any locks.
 */
if (!check_plug_merge(q, ioc, bio))
-   goto out;
+   goto out_unlock;
 
-   spin_lock_irq(q->queue_lock);
el_ret = elv_merge(q, , bio);
if (el_ret == ELEVATOR_BACK_MERGE) {
if (bio_attempt_back_merge(q, req, bio)) {
@@ -2984,7 +2984,6 @@ out_unlock:
list_add_tail(>queuelist, >plugged_list);
}
 
-out:
return 0;
 
 end_io_eopnotsupp:
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
 On 18/02/07, Andrew Morton [EMAIL PROTECTED] wrote:
 On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED] wrote:
 
  On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
   Le 18.02.2007 06:51, Andrew Morton a �crit :
   Temporarily at
   
 http://userweb.kernel.org/~akpm/2.6.20-mm2/
   
   Will appear later at
   
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
  
   Hello, I've got a fully reproducible Oops. I just have to boot to
   runlevel 2 and wait less than one minute.
 
  Maybe this oops is related too?
 
 Looks that way.
 
Hi Michal,

This seems to be a locking problem in __make_request, check_plug_merge()
should be called with the q-queue_lock held.
Could you try the following patch? It silenced the oops for me.

Regards,
Frederik

Signed-off-by: Frederik Deweerdt [EMAIL PROTECTED]

diff --git a/block/ll_rw_blk.c b/block/ll_rw_blk.c
index 577f448..666f34e 100644
--- a/block/ll_rw_blk.c
+++ b/block/ll_rw_blk.c
@@ -2919,14 +2919,14 @@ static int __make_request(request_queue_
 */
blk_queue_bounce(q, bio);
 
+   spin_lock_irq(q-queue_lock);
/*
 * Check if we can merge with the plugged list before grabbing
 * any locks.
 */
if (!check_plug_merge(q, ioc, bio))
-   goto out;
+   goto out_unlock;
 
-   spin_lock_irq(q-queue_lock);
el_ret = elv_merge(q, req, bio);
if (el_ret == ELEVATOR_BACK_MERGE) {
if (bio_attempt_back_merge(q, req, bio)) {
@@ -2984,7 +2984,6 @@ out_unlock:
list_add_tail(req-queuelist, ioc-plugged_list);
}
 
-out:
return 0;
 
 end_io_eopnotsupp:
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Jens Axboe
On Tue, Feb 20 2007, Frederik Deweerdt wrote:
 On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
  On 18/02/07, Andrew Morton [EMAIL PROTECTED] wrote:
  On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED] 
  wrote:
  
   On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
Le 18.02.2007 06:51, Andrew Morton a ?crit :
Temporarily at

  http://userweb.kernel.org/~akpm/2.6.20-mm2/

Will appear later at

 ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
   
Hello, I've got a fully reproducible Oops. I just have to boot to
runlevel 2 and wait less than one minute.
  
   Maybe this oops is related too?
  
  Looks that way.
  
 Hi Michal,
 
 This seems to be a locking problem in __make_request, check_plug_merge()
 should be called with the q-queue_lock held.
 Could you try the following patch? It silenced the oops for me.

Just back from travel... That is not correct, why do you think the queue
lock needs to be held there? I'll look into this tomorrow.

-- 
Jens Axboe

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Michal Piotrowski

Hi Frederik,

On 20/02/07, Frederik Deweerdt [EMAIL PROTECTED] wrote:

Hi Michal,

This seems to be a locking problem in __make_request, check_plug_merge()
should be called with the q-queue_lock held.
Could you try the following patch? It silenced the oops for me.


For me too, but Jens dislikes this patch.



Regards,
Frederik


Regards,
Michal

--
Michal K. K. Piotrowski
LTG - Linux Testers Group (PL)
(http://www.stardust.webpages.pl/ltg/)
LTG - Linux Testers Group (EN)
(http://www.stardust.webpages.pl/linux_testers_group_en/)
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Michal Piotrowski
Michal Piotrowski napisał(a):
 Hi Frederik,
 
 On 20/02/07, Frederik Deweerdt [EMAIL PROTECTED] wrote:
 Hi Michal,

 This seems to be a locking problem in __make_request, check_plug_merge()
 should be called with the q-queue_lock held.
 Could you try the following patch? It silenced the oops for me.
 
 For me too, but Jens dislikes this patch.

Now I know why...

Code:  Bad EIP value.
EIP: [6b6b6b6b] 0x6b6b6b6b SS:ESP 0068:f45f9bf8
note: ksmserver[20993] exited with preempt_count 2
BUG: sleeping function called from invalid context at 
/mnt/md0/devel/linux-mm/kernel/rwsem.c:20
in_atomic():1, irqs_disabled():1
no locks held by ksmserver/20993.
irq event stamp: 0
hardirqs last  enabled at (0): [] 0x0
hardirqs last disabled at (0): [c0121bc6] copy_process+0x539/0x137d
softirqs last  enabled at (0): [c0121be4] copy_process+0x557/0x137d


BUG: unable to handle kernel NULL pointer dereference at virtual address 
0118
 printing eip:
c01f3f12
*pde = 
Oops:  [#1]
PREEMPT SMP
last sysfs file: /devices/platform/i2c-9191/9191-0290/temp2_input
Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nfsd exportfs lockd 
nfs_acl autofs4 sunrpc af_packet nf_conntrack_netbios_ns ipt_REJECT 
nf_conntrack_ipv4 xt_state nf_conntrack nfnetlink xt_tcpudp iptable_filter 
ip_tables x_tables ipv6 binfmt_misc thermal processor fan container nvram 
snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy snd_seq_oss 
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss evdev 
snd_pcm skge intel_agp snd_timer agpgart snd 8139too soundcore sk98lin i2c_i801 
mii snd_page_alloc ide_cd cdrom rtc unix
CPU:0
EIP:0060:[c01f3f12]Not tainted VLI
EFLAGS: 00210297   (2.6.20-mm2 #19)
EIP is at blk_unplug_current+0x60/0x156
eax: f3f97298   ebx: 0001   ecx: c043db74   edx: 
esi: f3f97284   edi:    ebp: dda39d94   esp: dda39d58
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process swriter.bin (pid: 20846, ti=dda38000 task=dda79510 task.ti=dda38000)
Stack: dda79510 c03369fd  dda79510 c03369fd  dda39d90 f3f97298
   c011e710 c7403fac 00200046 dda39d90 0001 c04abda0 06f75d80 dda39da8
   c03339bb dda79f38 dda79f30 c7403f98 dda39db0 c0333a2f dda39db8 c015e380
Call Trace:
 [c0105312] show_trace_log_lvl+0x1a/0x2f
 [c01053c4] show_stack_log_lvl+0x9d/0xac
 [c01055c0] show_registers+0x1ed/0x34c
 [c010583c] die+0x11d/0x234
 [c011a8e1] do_page_fault+0x47c/0x55b
 [c0336d54] error_code+0x7c/0x84
 [c03339bb] io_schedule+0x3d/0x9a
 [c0333a2f] io_wait_schedule+0x17/0x1d
 [c015e380] sleep_on_page+0xa/0xc
 [c0334217] __wait_on_bit_lock+0x34/0x66
 [c015e372] lock_page_blocking+0x2c/0x30
 [c015ebe2] do_generic_mapping_read+0x29f/0x51a
 [c0160daa] generic_file_aio_read+0x19a/0x1c8
 [c017f300] do_sync_read+0xd7/0x114
 [c017fc2a] vfs_read+0xcf/0x158
 [c0180090] sys_read+0x3d/0x72
 [c010432c] syscall_call+0x7/0xb
 ===
Code: 0b eb fe 8b 46 0c 48 89 46 0c 85 c0 0f 85 07 01 00 00 8b 7e 1c c7 46 1c 
00 00 00 00 8d 46 14 89 45 e0 39 46 14 0f 84 d4 00 00 00 8b 87 18 01 00 00 e8 
c0 26 14 00 c7 45 e4 00 00 00 00 8b 5e 14
EIP: [c01f3f12] blk_unplug_current+0x60/0x156 SS:ESP 0068:dda39d58
BUG: unable to handle kernel paging request at virtual address 6b6b6b6b
 printing eip:
6b6b6b6b
*pde = 
Oops:  [#2]
PREEMPT SMP
last sysfs file: /devices/platform/i2c-9191/9191-0290/temp2_input
Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nfsd exportfs lockd 
nfs_acl autofs4 sunrpc af_packet nf_conntrack_netbios_ns ipt_REJECT 
nf_conntrack_ipv4 xt_state nf_conntrack nfnetlink xt_tcpudp iptable_filter 
ip_tables x_tables ipv6 binfmt_misc thermal processor fan container nvram 
snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy snd_seq_oss 
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss evdev 
snd_pcm skge intel_agp snd_timer agpgart snd 8139too soundcore sk98lin i2c_i801 
mii snd_page_alloc ide_cd cdrom rtc unix
CPU:0
EIP:0060:[6b6b6b6b]Not tainted VLI
EFLAGS: 00210012   (2.6.20-mm2 #19)
EIP is at 0x6b6b6b6b
eax: dda79f38   ebx: dda79f38   ecx:    edx: 0003
esi: 6b6b6b6b   edi: 0001   ebp: f45f9c18   esp: f45f9bf8
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process ksmserver (pid: 20993, ti=f45f8000 task=f335b510 task.ti=f45f8000)
Stack: c011b5e0 f45f9c44 0003 c7403f98 6b6b6b6b c7403f98 0001 00200292
   f45f9c38 c011c3fb  f45f9c44 0003 c7403f98 f7d71f4c f7d71f4c
   f45f9c50 c01353c3 f45f9c44 f7d71f4c 0002 0002 f45f9c60 c01353e0
Call Trace:
 [c0105312] show_trace_log_lvl+0x1a/0x2f
 [c01053c4] show_stack_log_lvl+0x9d/0xac
 [c01055c0] show_registers+0x1ed/0x34c
 [c010583c] die+0x11d/0x234
 [c011a8e1] do_page_fault+0x47c/0x55b
 [c0336d54] error_code+0x7c/0x84
 [c011c3fb] __wake_up+0x31/0x42
 [c01353c3] __wake_up_bit+0x2e/0x34
 [c01353e0] wake_up_bit+0x17/0x1b
 [c019ecbf] unlock_buffer+0x12/0x14
 [c01ccad4] do_get_write_access+0x185/0x657
 [c01ccfc1] 

Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Mon, Feb 19, 2007 at 02:34:53PM +0100, Jens Axboe wrote:
 On Tue, Feb 20 2007, Frederik Deweerdt wrote:
  On Sun, Feb 18, 2007 at 09:05:33PM +0100, Michal Piotrowski wrote:
   On 18/02/07, Andrew Morton [EMAIL PROTECTED] wrote:
   On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED] 
   wrote:
   
On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
 Le 18.02.2007 06:51, Andrew Morton a ?crit :
 Temporarily at
 
   http://userweb.kernel.org/~akpm/2.6.20-mm2/
 
 Will appear later at
 
  ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/

 Hello, I've got a fully reproducible Oops. I just have to boot to
 runlevel 2 and wait less than one minute.
   
Maybe this oops is related too?
   
   Looks that way.
   
  Hi Michal,
  
  This seems to be a locking problem in __make_request, check_plug_merge()
  should be called with the q-queue_lock held.
  Could you try the following patch? It silenced the oops for me.
 
 Just back from travel... That is not correct, why do you think the queue
 lock needs to be held there?
The unprotected:
check_plug_merge
- bio_attempt_back_merge
   - ll_back_merge_fn
  - q-last_merge = NULL
made me think that. I don't know the code enough though.

Regards,
Frederik
 
 -- 
 Jens Axboe
 
 -
 To unsubscribe from this list: send the line unsubscribe linux-kernel in
 the body of a message to [EMAIL PROTECTED]
 More majordomo info at  http://vger.kernel.org/majordomo-info.html
 Please read the FAQ at  http://www.tux.org/lkml/
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [-mm patch] fix locking in __make_request (was Re: 2.6.20-mm2: Oops in generic_make_request)

2007-02-19 Thread Frederik Deweerdt
On Mon, Feb 19, 2007 at 03:08:03PM +0100, Michal Piotrowski wrote:
 Michal Piotrowski napisał(a):
  Hi Frederik,
  
  On 20/02/07, Frederik Deweerdt [EMAIL PROTECTED] wrote:
  Hi Michal,
 
  This seems to be a locking problem in __make_request, check_plug_merge()
  should be called with the q-queue_lock held.
  Could you try the following patch? It silenced the oops for me.
  
  For me too, but Jens dislikes this patch.
 
 Now I know why...
 
 Code:  Bad EIP value.
 EIP: [6b6b6b6b] 0x6b6b6b6b SS:ESP 0068:f45f9bf8

:) It may not be related though. I think that my patch avoids the
BUG_ON(!rq_mergeable(req)); in ll_rw_blk.c:2782. This looks like another
beast to me.

Regards,
Frederik
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Michal Piotrowski

On 18/02/07, Andrew Morton <[EMAIL PROTECTED]> wrote:

On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]> wrote:

> On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> > Le 18.02.2007 06:51, Andrew Morton a écrit :
> > >Temporarily at
> > >
> > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> > >
> > >Will appear later at
> > >
> > > 
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> >
> > Hello, I've got a fully reproducible Oops. I just have to boot to
> > runlevel 2 and wait less than one minute.
>
> Maybe this oops is related too?

Looks that way.

> Can't yet tell if it's easily reproducible, this is on a JFS filesystem.
>
> BUG: unable to handle kernel NULL pointer dereference at virtual address 
0004
>  printing eip:
> c02206aa
> *pde = 
> Oops:  [#1]
> PREEMPT
> last sysfs file: /devices/pci:00/:00:00.0/class
> CPU:0
> EIP:0060:[]Not tainted VLI
> EFLAGS: 00010297   (2.6.20-mm2-1 #1)
> EIP is at __make_request+0x13a/0x390
> eax:    ebx:    ecx: 042b0dd8   edx: 00010485
> esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
> ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
> Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 0008
>c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 0100
>00011250 cfedaf60 c653db10 0206  c653db18 c014a31e c653db48
> Call Trace:
>  [] show_trace_log_lvl+0x1a/0x30
>  [] show_stack_log_lvl+0xa9/0xd0
>  [] show_registers+0x206/0x350
>  [] die+0x10e/0x210
>  [] do_page_fault+0x2d2/0x600
>  [] error_code+0x74/0x7c
>  [] generic_make_request+0x15a/0x200
>  [] submit_bio+0x58/0xe0
>  [] metapage_writepage+0x18f/0x1b0
>  [] __writepage+0xb/0x30
>  [] write_cache_pages+0x22f/0x300
>  [] generic_writepages+0x23/0x30
>  [] do_writepages+0x5c/0x60
>  [] __filemap_fdatawrite_range+0x67/0x80
>  [] filemap_flush+0x25/0x30
>  [] lmLogSync+0x19d/0x230
>  [] lmLog+0x5e/0x190
>  [] txCommit+0x8c0/0x1010
>  [] jfs_create+0x324/0x370
>  [] vfs_create+0xaf/0x100
>  [] open_namei+0x58c/0x640
>  [] do_filp_open+0x2c/0x50
>  [] do_sys_open+0x47/0xe0
>  [] sys_open+0x1c/0x20
>  [] syscall_call+0x7/0xb

Michal Piotrowski is hitting it too, and has bisected it down to
git-block.patch.



I have bisected it down to this patches

revert-md-avoid-possible-bug_on-in-md-bitmap-handling-for-git-block.patch
git-block.patch
git-block-fixup.patch
git-block-dupe-definitions.patch
git-block-xfs-barriers-broke.patch

Regards,
Michal

--
Michal K. K. Piotrowski
LTG - Linux Testers Group (PL)
(http://www.stardust.webpages.pl/ltg/)
LTG - Linux Testers Group (EN)
(http://www.stardust.webpages.pl/linux_testers_group_en/)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Andrew Morton
On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili <[EMAIL PROTECTED]> wrote:

> On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> > Le 18.02.2007 06:51, Andrew Morton a écrit :
> > >Temporarily at
> > >
> > >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> > >
> > >Will appear later at
> > >
> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> > 
> > Hello, I've got a fully reproducible Oops. I just have to boot to 
> > runlevel 2 and wait less than one minute.
> 
> Maybe this oops is related too?

Looks that way.

> Can't yet tell if it's easily reproducible, this is on a JFS filesystem.
> 
> BUG: unable to handle kernel NULL pointer dereference at virtual address 
> 0004
>  printing eip:
> c02206aa
> *pde = 
> Oops:  [#1]
> PREEMPT 
> last sysfs file: /devices/pci:00/:00:00.0/class
> CPU:0
> EIP:0060:[]Not tainted VLI
> EFLAGS: 00010297   (2.6.20-mm2-1 #1)
> EIP is at __make_request+0x13a/0x390
> eax:    ebx:    ecx: 042b0dd8   edx: 00010485
> esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
> ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
> Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 
> 0008 
>c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 
> 0100 
>00011250 cfedaf60 c653db10 0206  c653db18 c014a31e 
> c653db48 
> Call Trace:
>  [] show_trace_log_lvl+0x1a/0x30
>  [] show_stack_log_lvl+0xa9/0xd0
>  [] show_registers+0x206/0x350
>  [] die+0x10e/0x210
>  [] do_page_fault+0x2d2/0x600
>  [] error_code+0x74/0x7c
>  [] generic_make_request+0x15a/0x200
>  [] submit_bio+0x58/0xe0
>  [] metapage_writepage+0x18f/0x1b0
>  [] __writepage+0xb/0x30
>  [] write_cache_pages+0x22f/0x300
>  [] generic_writepages+0x23/0x30
>  [] do_writepages+0x5c/0x60
>  [] __filemap_fdatawrite_range+0x67/0x80
>  [] filemap_flush+0x25/0x30
>  [] lmLogSync+0x19d/0x230
>  [] lmLog+0x5e/0x190
>  [] txCommit+0x8c0/0x1010
>  [] jfs_create+0x324/0x370
>  [] vfs_create+0xaf/0x100
>  [] open_namei+0x58c/0x640
>  [] do_filp_open+0x2c/0x50
>  [] do_sys_open+0x47/0xe0
>  [] sys_open+0x1c/0x20
>  [] syscall_call+0x7/0xb

Michal Piotrowski is hitting it too, and has bisected it down to
git-block.patch.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Mattia Dongili
On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
> Le 18.02.2007 06:51, Andrew Morton a écrit :
> >Temporarily at
> >
> >  http://userweb.kernel.org/~akpm/2.6.20-mm2/
> >
> >Will appear later at
> >
> > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
> 
> Hello, I've got a fully reproducible Oops. I just have to boot to 
> runlevel 2 and wait less than one minute.

Maybe this oops is related too?
Can't yet tell if it's easily reproducible, this is on a JFS filesystem.

BUG: unable to handle kernel NULL pointer dereference at virtual address 
0004
 printing eip:
c02206aa
*pde = 
Oops:  [#1]
PREEMPT 
last sysfs file: /devices/pci:00/:00:00.0/class
CPU:0
EIP:0060:[]Not tainted VLI
EFLAGS: 00010297   (2.6.20-mm2-1 #1)
EIP is at __make_request+0x13a/0x390
eax:    ebx:    ecx: 042b0dd8   edx: 00010485
esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 0008 
   c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 0100 
   00011250 cfedaf60 c653db10 0206  c653db18 c014a31e c653db48 
Call Trace:
 [] show_trace_log_lvl+0x1a/0x30
 [] show_stack_log_lvl+0xa9/0xd0
 [] show_registers+0x206/0x350
 [] die+0x10e/0x210
 [] do_page_fault+0x2d2/0x600
 [] error_code+0x74/0x7c
 [] generic_make_request+0x15a/0x200
 [] submit_bio+0x58/0xe0
 [] metapage_writepage+0x18f/0x1b0
 [] __writepage+0xb/0x30
 [] write_cache_pages+0x22f/0x300
 [] generic_writepages+0x23/0x30
 [] do_writepages+0x5c/0x60
 [] __filemap_fdatawrite_range+0x67/0x80
 [] filemap_flush+0x25/0x30
 [] lmLogSync+0x19d/0x230
 [] lmLog+0x5e/0x190
 [] txCommit+0x8c0/0x1010
 [] jfs_create+0x324/0x370
 [] vfs_create+0xaf/0x100
 [] open_namei+0x58c/0x640
 [] do_filp_open+0x2c/0x50
 [] do_sys_open+0x47/0xe0
 [] sys_open+0x1c/0x20
 [] syscall_call+0x7/0xb
 ===
Code: 5d c3 8b 58 18 8b 43 04 0f 18 00 90 8b 7d e8 83 c7 14 39 fb 75 2
4 e9 2d ff ff ff 8d b6 00 00 00 00 48 0f 84 d9 00 00 00 8b 5b 04 <8b> 43 04 0f 
18 00 90 39 fb 0f 84
0e ff ff ff 89 f2 89 d8 e8 2e 
EIP: [] __make_request+0x13a/0x390 SS:ESP 0068:c653dabc

-- 
mattia
:wq!
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Mattia Dongili
On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
 Le 18.02.2007 06:51, Andrew Morton a écrit :
 Temporarily at
 
   http://userweb.kernel.org/~akpm/2.6.20-mm2/
 
 Will appear later at
 
  ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
 
 Hello, I've got a fully reproducible Oops. I just have to boot to 
 runlevel 2 and wait less than one minute.

Maybe this oops is related too?
Can't yet tell if it's easily reproducible, this is on a JFS filesystem.

BUG: unable to handle kernel NULL pointer dereference at virtual address 
0004
 printing eip:
c02206aa
*pde = 
Oops:  [#1]
PREEMPT 
last sysfs file: /devices/pci:00/:00:00.0/class
CPU:0
EIP:0060:[c02206aa]Not tainted VLI
EFLAGS: 00010297   (2.6.20-mm2-1 #1)
EIP is at __make_request+0x13a/0x390
eax:    ebx:    ecx: 042b0dd8   edx: 00010485
esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 0008 
   c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 0100 
   00011250 cfedaf60 c653db10 0206  c653db18 c014a31e c653db48 
Call Trace:
 [c01048ba] show_trace_log_lvl+0x1a/0x30
 [c0104979] show_stack_log_lvl+0xa9/0xd0
 [c0104ba6] show_registers+0x206/0x350
 [c0104dfe] die+0x10e/0x210
 [c0114652] do_page_fault+0x2d2/0x600
 [c0351f84] error_code+0x74/0x7c
 [c021e0ca] generic_make_request+0x15a/0x200
 [c0220318] submit_bio+0x58/0xe0
 [c0205d0f] metapage_writepage+0x18f/0x1b0
 [c014cc2b] __writepage+0xb/0x30
 [c014d20f] write_cache_pages+0x22f/0x300
 [c014d303] generic_writepages+0x23/0x30
 [c014d36c] do_writepages+0x5c/0x60
 [c0148307] __filemap_fdatawrite_range+0x67/0x80
 [c0149785] filemap_flush+0x25/0x30
 [c02080cd] lmLogSync+0x19d/0x230
 [c020871e] lmLog+0x5e/0x190
 [c020a360] txCommit+0x8c0/0x1010
 [c01eecf4] jfs_create+0x324/0x370
 [c016f38f] vfs_create+0xaf/0x100
 [c017269c] open_namei+0x58c/0x640
 [c0165b7c] do_filp_open+0x2c/0x50
 [c0165be7] do_sys_open+0x47/0xe0
 [c0165cbc] sys_open+0x1c/0x20
 [c01041c0] syscall_call+0x7/0xb
 ===
Code: 5d c3 8b 58 18 8b 43 04 0f 18 00 90 8b 7d e8 83 c7 14 39 fb 75 2
4 e9 2d ff ff ff 8d b6 00 00 00 00 48 0f 84 d9 00 00 00 8b 5b 04 8b 43 04 0f 
18 00 90 39 fb 0f 84
0e ff ff ff 89 f2 89 d8 e8 2e 
EIP: [c02206aa] __make_request+0x13a/0x390 SS:ESP 0068:c653dabc

-- 
mattia
:wq!
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Andrew Morton
On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED] wrote:

 On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
  Le 18.02.2007 06:51, Andrew Morton a écrit :
  Temporarily at
  
http://userweb.kernel.org/~akpm/2.6.20-mm2/
  
  Will appear later at
  
   ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
  
  Hello, I've got a fully reproducible Oops. I just have to boot to 
  runlevel 2 and wait less than one minute.
 
 Maybe this oops is related too?

Looks that way.

 Can't yet tell if it's easily reproducible, this is on a JFS filesystem.
 
 BUG: unable to handle kernel NULL pointer dereference at virtual address 
 0004
  printing eip:
 c02206aa
 *pde = 
 Oops:  [#1]
 PREEMPT 
 last sysfs file: /devices/pci:00/:00:00.0/class
 CPU:0
 EIP:0060:[c02206aa]Not tainted VLI
 EFLAGS: 00010297   (2.6.20-mm2-1 #1)
 EIP is at __make_request+0x13a/0x390
 eax:    ebx:    ecx: 042b0dd8   edx: 00010485
 esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
 ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
 Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
 Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 
 0008 
c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 
 0100 
00011250 cfedaf60 c653db10 0206  c653db18 c014a31e 
 c653db48 
 Call Trace:
  [c01048ba] show_trace_log_lvl+0x1a/0x30
  [c0104979] show_stack_log_lvl+0xa9/0xd0
  [c0104ba6] show_registers+0x206/0x350
  [c0104dfe] die+0x10e/0x210
  [c0114652] do_page_fault+0x2d2/0x600
  [c0351f84] error_code+0x74/0x7c
  [c021e0ca] generic_make_request+0x15a/0x200
  [c0220318] submit_bio+0x58/0xe0
  [c0205d0f] metapage_writepage+0x18f/0x1b0
  [c014cc2b] __writepage+0xb/0x30
  [c014d20f] write_cache_pages+0x22f/0x300
  [c014d303] generic_writepages+0x23/0x30
  [c014d36c] do_writepages+0x5c/0x60
  [c0148307] __filemap_fdatawrite_range+0x67/0x80
  [c0149785] filemap_flush+0x25/0x30
  [c02080cd] lmLogSync+0x19d/0x230
  [c020871e] lmLog+0x5e/0x190
  [c020a360] txCommit+0x8c0/0x1010
  [c01eecf4] jfs_create+0x324/0x370
  [c016f38f] vfs_create+0xaf/0x100
  [c017269c] open_namei+0x58c/0x640
  [c0165b7c] do_filp_open+0x2c/0x50
  [c0165be7] do_sys_open+0x47/0xe0
  [c0165cbc] sys_open+0x1c/0x20
  [c01041c0] syscall_call+0x7/0xb

Michal Piotrowski is hitting it too, and has bisected it down to
git-block.patch.

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20-mm2: Oops in generic_make_request

2007-02-18 Thread Michal Piotrowski

On 18/02/07, Andrew Morton [EMAIL PROTECTED] wrote:

On Sun, 18 Feb 2007 18:58:05 +0100 Mattia Dongili [EMAIL PROTECTED] wrote:

 On Sun, Feb 18, 2007 at 02:06:59PM +0100, Laurent Riffard wrote:
  Le 18.02.2007 06:51, Andrew Morton a écrit :
  Temporarily at
  
http://userweb.kernel.org/~akpm/2.6.20-mm2/
  
  Will appear later at
  
   
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.20/2.6.20-mm2/
 
  Hello, I've got a fully reproducible Oops. I just have to boot to
  runlevel 2 and wait less than one minute.

 Maybe this oops is related too?

Looks that way.

 Can't yet tell if it's easily reproducible, this is on a JFS filesystem.

 BUG: unable to handle kernel NULL pointer dereference at virtual address 
0004
  printing eip:
 c02206aa
 *pde = 
 Oops:  [#1]
 PREEMPT
 last sysfs file: /devices/pci:00/:00:00.0/class
 CPU:0
 EIP:0060:[c02206aa]Not tainted VLI
 EFLAGS: 00010297   (2.6.20-mm2-1 #1)
 EIP is at __make_request+0x13a/0x390
 eax:    ebx:    ecx: 042b0dd8   edx: 00010485
 esi: c5f8cea0   edi: cfd587f8   ebp: c653dadc   esp: c653dabc
 ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
 Process sh (pid: 4549, ti=c653c000 task=ccca2090 task.ti=c653c000)
 Stack: c137eae8 0001 cfd587e4 c5f8cea0 c5f893e4 c137eae8 c5f8cea0 0008
c653db28 c021e0ca cfa3ffa0 cee6a2e0 cfa3ffa0  0008 0100
00011250 cfedaf60 c653db10 0206  c653db18 c014a31e c653db48
 Call Trace:
  [c01048ba] show_trace_log_lvl+0x1a/0x30
  [c0104979] show_stack_log_lvl+0xa9/0xd0
  [c0104ba6] show_registers+0x206/0x350
  [c0104dfe] die+0x10e/0x210
  [c0114652] do_page_fault+0x2d2/0x600
  [c0351f84] error_code+0x74/0x7c
  [c021e0ca] generic_make_request+0x15a/0x200
  [c0220318] submit_bio+0x58/0xe0
  [c0205d0f] metapage_writepage+0x18f/0x1b0
  [c014cc2b] __writepage+0xb/0x30
  [c014d20f] write_cache_pages+0x22f/0x300
  [c014d303] generic_writepages+0x23/0x30
  [c014d36c] do_writepages+0x5c/0x60
  [c0148307] __filemap_fdatawrite_range+0x67/0x80
  [c0149785] filemap_flush+0x25/0x30
  [c02080cd] lmLogSync+0x19d/0x230
  [c020871e] lmLog+0x5e/0x190
  [c020a360] txCommit+0x8c0/0x1010
  [c01eecf4] jfs_create+0x324/0x370
  [c016f38f] vfs_create+0xaf/0x100
  [c017269c] open_namei+0x58c/0x640
  [c0165b7c] do_filp_open+0x2c/0x50
  [c0165be7] do_sys_open+0x47/0xe0
  [c0165cbc] sys_open+0x1c/0x20
  [c01041c0] syscall_call+0x7/0xb

Michal Piotrowski is hitting it too, and has bisected it down to
git-block.patch.



I have bisected it down to this patches

revert-md-avoid-possible-bug_on-in-md-bitmap-handling-for-git-block.patch
git-block.patch
git-block-fixup.patch
git-block-dupe-definitions.patch
git-block-xfs-barriers-broke.patch

Regards,
Michal

--
Michal K. K. Piotrowski
LTG - Linux Testers Group (PL)
(http://www.stardust.webpages.pl/ltg/)
LTG - Linux Testers Group (EN)
(http://www.stardust.webpages.pl/linux_testers_group_en/)
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/