BUG: unable to handle kernel paging request at virtual address

2008-01-04 Thread Alexander Shaduri

I got the following message, shortly followed by a system hang.
BUG: unable to handle kernel paging request at virtual address 48464443

(see the oops below).

I've been getting kernel oopses for quite some time now, ever since I
got a new computer (several months now).

The problem manifests when I use mplayer/mencoder to capture from
TV card (Avermedia 203, BT878 chipset). Doesn't matter if it's viewing
or encoding. Usually happens within first 2 hours of capturing, quite randomly.
Note that other system-intensive tasks (video encoding from file, 3D games,
heavy fs activity) don't cause any instabilities.

Reproduced with kernels 2.6.21, 2.6.22 and 2.6.23.12 (the first two
being on openSUSE 10.2, and the 2.6.23(.1 and .12) on openSUSE 10.3).
Unfortunately, I only have the oops output from 2.6.23.12.

Doesn't seem to be the memory problem (I tested it with memtest86),
and that other commercial OS seems to be working fine (well, at least
no hangs during capturing and no suspicious freezes).

The messages below indicate that I have a proprietary driver (nvidia)
loaded, but the kernel hangs without it too - using pure vesa framebuffer,
without running an X server.


mplayer command:
gmplayer -sws 2 tv:// -tv driver=v4l2:input=1:width=768:height=576\
:forceaudio:amode=1:adevice=/dev/dsp -mc 0


Hardware:
CPU: Athlon X2 5200+ (dual-core).
MB: Asus M2N-E SLI (nforce 500 chipset, latest BIOS update).
TV card: Avermedia 203 (BT878 chipset).
Ram: 2G Kingston
Video card: NVidia 8600GT (MSI)


The oops itself (from dmesg - I had its output logged every second, so it 
managed
to get saved before the real freeze several seconds later):


BUG: unable to handle kernel paging request at virtual address 48464443
 printing eip:
c011d6f8
*pde = 
Oops:  [#1]
PREEMPT SMP
Modules linked in: cdc_acm usb_storage libusual ppp_generic slhc iptable_filter 
ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative 
cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table 
snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth 
snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability 
commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 
hwmon_vid eeprom bt878 tuner tvaudio nvidia(P) bttv snd_emu10k1 snd_rawmidi 
video_buf firmware_class snd_ac97_codec ir_common ac97_bus snd_pcm 
snd_seq_device snd_timer snd_page_alloc compat_ioctl32 snd_util_mem 
i2c_algo_bit ide_cd ohci1394 emu10k1_gp snd_hwdep snd ieee1394 gameport k8temp 
cdrom thermal btcx_risc tveeprom videodev hwmon agpgart button rtc_cmos 
rtc_core rtc_lib soundcore v4l2_common v4l1_compat i2c_nforce2 i2c_core 
forcedeth sg ohci_hcd ehci_hcd usbcore edd fan processor pata_amd
CPU:1
EIP:0060:[]Tainted: PVLI
EFLAGS: 00010202   (2.6.23.12 #1)
EIP is at pgd_alloc+0x28/0x80
eax: c0564101   ebx: 48464443   ecx: f1403bc0   edx: c2019148
esi: 0001   edi: fff4   ebp: edb16bc0   esp: cf517f5c
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process sh (pid: 18310, ti=cf516000 task=f0b83aa0 task.ti=cf516000)
Stack: f7fc7240 e100c000 c0126ec6 d1063480 c01859df bf978390 0011 d1063480
   e100c000 edb16bc0 d1063480 c0186f36 8009f7f0 800a0ef0 e100c000 e100c000
   b7e6fff4 cf516000 c010260f cf517fb8 800a0f98 800a0f98 c0104362 800a0f98
Call Trace:
 [] mm_init+0xb6/0xe0
 [] bprm_mm_init+0x1f/0x170
 [] do_execve+0x66/0x1e0
 [] sys_execve+0x2f/0x80
 [] sysenter_past_esp+0x5f/0x85
 ===
Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 
41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 
00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff
EIP: [] pgd_alloc+0x28/0x80 SS:ESP 0068:cf517f5c
note: sh[18310] exited with preempt_count 1
BUG: scheduling while atomic: sh/0x1002/18310
 [] schedule+0x5c2/0x6c0
 [] __wake_up+0x38/0x50
 [] pipe_release+0x86/0xb0
 [] dput+0x1c/0x100
 [] __fput+0x10f/0x170
 [] __cond_resched+0x18/0x30
 [] cond_resched+0x2a/0x40
 [] put_files_struct+0x9e/0xb0
 [] do_exit+0x170/0x840
 [] die+0x23d/0x250
 [] do_page_fault+0x29f/0x670
 [] do_path_lookup+0x7c/0x210
 [] __next_cpu+0x12/0x30
 [] do_page_fault+0x0/0x670
 [] error_code+0x72/0x78
 [] pgd_alloc+0x28/0x80
 [] mm_init+0xb6/0xe0
 [] bprm_mm_init+0x1f/0x170
 [] do_execve+0x66/0x1e0
 [] sys_execve+0x2f/0x80
 [] sysenter_past_esp+0x5f/0x85
 ===
BUG: unable to handle kernel paging request at virtual address 48464443
 printing eip:
c011d6f8
*pde = 
Oops:  [#2]
PREEMPT SMP
Modules linked in: cdc_acm usb_storage libusual ppp_generic slhc iptable_filter 
ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative 
cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table 
snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth 
snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability 
commoncap fuse nls_koi8_r nls_cp866 loop dm_mod bin

Re: BUG: unable to handle kernel paging request at virtual address

2008-01-05 Thread Alexander Shaduri
On Sat, 5 Jan 2008 09:10:12 +
Al Viro <[EMAIL PROTECTED]> wrote:

> and we have q->page == 0x48464443.  Seeing how we assign that sucker, that
> smells like we've got a page on quicklist with {0x43, 0x44, 0x46, 0x48}
> in its first 4 bytes.  Instead of having address of the next page stored
> in there...
> 
> Do other oopsen of the same kind give the same value?

I've got another oops here with a different value. This time a bttv message
preceded it. Note that the oops happened shortly *after* I stopped capturing
(watching the tv through mplayer).

Output of dmesg:

bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FDSR OCERR*
(two pages of the same message here)

bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FDSR OCERR*
bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FBUS FDSR OCERR*
BUG: unable to handle kernel paging request at virtual address 23232323
 printing eip:
c011d6f8
*pde = 
Oops:  [#1]
PREEMPT SMP
Modules linked in: ppp_generic slhc iptable_filter ip_tables ip6table_filter 
ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand 
cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss 
snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse 
nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom 
nvidia(P) tuner tvaudio snd_emu10k1 bttv snd_rawmidi snd_ac97_codec video_buf 
firmware_class ir_common ac97_bus snd_pcm snd_seq_device compat_ioctl32 
i2c_algo_bit snd_timer snd_page_alloc emu10k1_gp snd_util_mem btcx_risc 
tveeprom videodev gameport ohci1394 ieee1394 ide_cd snd_hwdep snd v4l2_common 
v4l1_compat agpgart soundcore i2c_nforce2 thermal button rtc_cmos rtc_core 
rtc_lib forcedeth k8temp i2c_core hwmon cdrom sg ohci_hcd ehci_hcd usbcore edd 
fan processor pata_amd
CPU:0
EIP:0060:[]Tainted: PVLI
EFLAGS: 00010202   (2.6.23.12 #1)
EIP is at pgd_alloc+0x28/0x80
eax: c0564101   ebx: 23232323   ecx: f79a5bc0   edx: c2011148
esi: 0001   edi: ec539544   ebp: 01200011   esp: f5cd9efc
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process amdpwrmon (pid: 5578, ti=f5cd8000 task=f7a14aa0 task.ti=f5cd8000)
Stack: ec539380 f79a5d84 c0126ec6 f5891200 c01277c2 f73ca080 c1dfc40c 4636c065
   f5cd9f1c 0286 f6edb640 f5cd9fb8 bfb59498  f6edb550 f6edb550
   ec539380 f04b9264 c04f4c30 60ae f79a5bc0 ec539380 f5cd9fb8 01200011
Call Trace:
 [] mm_init+0xb6/0xe0
 [] copy_process+0x762/0x1240
 [] do_fork+0x76/0x230
 [] do_pipe+0x7c/0xf0
 [] sys_clone+0x32/0x40
 [] sysenter_past_esp+0x5f/0x85
 ===
Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 
41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 
00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff
EIP: [] pgd_alloc+0x28/0x80 SS:ESP 0068:f5cd9efc
note: amdpwrmon[5578] exited with preempt_count 1
BUG: unable to handle kernel paging request at virtual address 23232323
 printing eip:
c011d6f8
*pde = 
Oops:  [#2]
PREEMPT SMP
Modules linked in: ppp_generic slhc iptable_filter ip_tables ip6table_filter 
ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand 
cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss 
snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse 
nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom 
nvidia(P) tuner tvaudio snd_emu10k1 bttv snd_rawmidi snd_ac97_codec video_buf 
firmware_class ir_common ac97_bus snd_pcm snd_seq_device compat_ioctl32 
i2c_algo_bit snd_timer snd_page_alloc emu10k1_gp snd_util_mem btcx_risc 
tveeprom videodev gameport ohci1394 ieee1394 ide_cd snd_hwdep snd v4l2_common 
v4l1_compat agpgart soundcore i2c_nforce2 thermal button rtc_cmos rtc_core 
rtc_lib forcedeth k8temp i2c_core hwmon cdrom sg ohci_hcd ehci_hcd usbcore edd 
fan processor pata_amd
CPU:0
EIP:0060:[]Tainted: P  D VLI
EFLAGS: 00010202   (2.6.23.12 #1)
EIP is at pgd_alloc+0x28/0x80
eax: c0564101   ebx: 23232323   ecx: f03a32c0   edx: c2011148
esi: 0001   edi: fff4   ebp: f739c6c0   esp: dace3f5c
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process master (pid: 24800, ti=dace2000 task=ee93aaa0 task.ti=dace2000)
Stack: f03a36c0 db9e c0126ec6 efdb25c0 c01859df f7e6a000 0011 efdb25c0
   db9e f739c6c0 efdb25c0 c0186f36 8000e310 80010c80 db9e db9e
   b7bf0ff4 dace2000 c010260f dace3fb8 80010be0  c0104362 80010be0
Call Trace:
 [] mm_init+0xb6/0xe0
 [] bprm_mm_init+0x1f/0x170
 [] do_execve+0x66/0x1e0
 [] sys_execve+0x2f/0x80
 [] sysenter_past_esp+0x5f/0x85
 ===
Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 
41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 
00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff
EIP: [] pgd_alloc+0x28/0x80 S

Re: BUG: unable to handle kernel paging request at virtual address

2008-01-05 Thread Alexander Shaduri
On Sat, 5 Jan 2008 15:34:25 +0100
Mikael Pettersson <[EMAIL PROTECTED]> wrote:

> This kernel is tainted by the nvidia module...

I know that, but as I wrote in the original message, the hangs occur
without it too.
I just tested it today. Had to leave it running
(in vesa framebuffer mode), without any nvidia modules.
Crash occurred after 4 hours of tv capturing, and didn't leave
any traces in dmesg logs (logged every second).
Unfortunately, I couldn't save any messages.

All I'm saying is that the problem is reproduceable without
nvidia module and without any X server running.

Thanks,
Alexander
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: BUG: unable to handle kernel paging request at virtual address

2008-01-05 Thread Alexander Shaduri

> Get a serial console?  Take another box, plug e.g. pl2303-based
> usb-to-serial (several bucks these days) into it, stick null-modem
> convertor (ditto) on its serial end and attach to ttyS0 on the
> victim.  console=ttyS0 on victim, something like minicom on watcher
> and tell it to capture log into file...

I will try that (as soon as I acquire the necessary parts).

Thanks,
Alexander
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: BUG: unable to handle kernel paging request at virtual address

2008-01-05 Thread Alexander Shaduri
On Sun, 6 Jan 2008 00:20:50 +0300
Alexey Dobriyan <[EMAIL PROTECTED]> wrote:

> netconsole should be more quick:

Thanks a lot for the tip, I'll try that.

Alexander
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: BUG: unable to handle kernel paging request at virtual address

2008-01-22 Thread Alexander Shaduri
On Sun, 6 Jan 2008 00:20:50 +0300
Alexey Dobriyan <[EMAIL PROTECTED]> wrote:

> On Sun, Jan 06, 2008 at 12:30:34AM +0400, Alexander Shaduri wrote:
> > > Get a serial console?  Take another box, plug e.g. pl2303-based
> > > usb-to-serial (several bucks these days) into it, stick null-modem
> > > convertor (ditto) on its serial end and attach to ttyS0 on the
> > > victim.  console=ttyS0 on victim, something like minicom on watcher
> > > and tell it to capture log into file...
> > 
> > I will try that (as soon as I acquire the necessary parts).
> 
> netconsole should be more quick:

Sorry for late reply, but all my attempts to log via network failed - every time
only ~700 bytes were transferred, so the logs were non-informative.

Finally, I got the logging working with a serial console, using
console=tty0 console=ttyS0,9600 ignore_loglevel

Anyway, see the crash log below.

Just a few additional notes - I was trying to get rid of the problem by
trying various things, like:
* Plugging the bttv card to an alternative PCI slot.
* Swapping the bttv card with another card: I had an Avermedia 203
tv tuner card before and I tried using Avermedia EZCapture (same bttv
chipset, without a tv tuner).
* loading bttv module with no_overlay=1.
* "maxcpus=1" kernel option
* Compiling kernel git snapshot (2008-01-09) from
http://git.kernel.org/?p=linux/kernel/git/mchehab/v4l-dvb.git
in case it was fixed in the development tree.

None of the above helped anything.

Here's a log (2.6.23.12, no X running, Avermedia EZCapture card;
capturing for several hours, then stopping, then capturing again - crash!):

--

BUG: unable to handle kernel paging request at virtual address 968e8787
 printing eip:
c017dc67
*pde = 
Oops: 0002 [#1]
PREEMPT SMP
Modules linked in: bttv video_buf ir_common compat_ioctl32 i2c_algo_bit 
btcx_risc tveeprom videodev v4l2_common v4l1_compat netconsole iptable_filter 
ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative 
cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table 
snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth 
snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability 
commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 
hwmon_vid eeprom snd_emu10k1 snd_rawmidi firmware_class snd_ac97_codec ac97_bus 
snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep rtc_cmos 
snd rtc_core forcedeth parport_pc ide_cd ohci1394 agpgart emu10k1_gp soundcore 
gameport i2c_nforce2 k8temp rtc_lib parport ieee1394 hwmon cdrom thermal button 
sg i2c_core ohci_hcd ehci_hcd usbcore edd fan processor pata_amd
CPU:1
EIP:0060:[]Not tainted VLI
EFLAGS: 00210046   (2.6.23.12 #3)
EIP is at cache_alloc_refill+0x147/0x520
eax: c20eec80   ebx:    ecx: 003c   edx: 968e8783
esi: 0022   edi: f52f   ebp: c2131a00   esp: f6a7fc1c
ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
Process mencoder (pid: 5465, ti=f6a7e000 task=c2206550 task.ti=f6a7e000)
Stack: 0050  c20eec88 c20eec90 c20eeca4 c03fca90 8050 c20fc7c0
   c20e9800 c20eec80 f785bc00 1000 000c 00060007 f52f001c c1b279a0
   8050 00200286 c20fc7c0 c017dafb c1b279a0   1000
Call Trace:
 [] kmem_cache_alloc+0x6b/0x90
 [] alloc_buffer_head+0x10/0x60
 [] alloc_page_buffers+0x78/0xc0
 [] create_empty_buffers+0x11/0x90
 [] __block_prepare_write+0x41e/0x4f0
 [] start_this_handle+0x368/0x4d0
 [] journal_start+0xb3/0xe0
 [] block_prepare_write+0x28/0x40
 [] ext3_get_block+0x0/0x100
 [] ext3_prepare_write+0x4a/0x190
 [] ext3_get_block+0x0/0x100
 [] generic_file_buffered_write+0x289/0x6b0
 [] schedule+0x1d6/0x6c0
 [] check_preempt_curr_fair+0x57/0x90
 [] __generic_file_aio_write_nolock+0x2b4/0x530
 [] futex_wait+0x23e/0x310
 [] generic_file_aio_write+0x65/0xe0
 [] get_futex_key+0x51/0x150
 [] ext3_file_write+0x30/0xc0
 [] do_sync_write+0xd5/0x120
 [] do_futex+0x239/0xb20
 [] autoremove_wake_function+0x0/0x40
 [] __atomic_notifier_call_chain+0x26/0x50
 [] do_sync_write+0x0/0x120
 [] vfs_write+0xbf/0x140
 [] sys_write+0x41/0x70
 [] sysenter_past_esp+0x5f/0x85
 ===
Code: 77 10 8b 77 14 8b 44 24 38 03 57 0c 8b 34 b0 89 77 14 89 54 8d 18 41 89 
4d 00 8b 44 24 1c 8b 77 10 3b 70 38 72 c4 8b 17 8b 47 04 <89> 42 04 89 10 83 7f 
14 ff c7 07 00 01 10 00 c7 47 04 00 02 20
EIP: [] cache_alloc_refill+0x147/0x520 SS:ESP 0068:f6a7fc1c
note: mencoder[5465] exited with preempt_count 1
bttv0: timeout: drop=3 irq=1002400/1002400, risc=350dc3c4, bits: HSYNC OFLOW
bttv0: reset, reinitialize
bttv0: PLL: 28636363 => 35468950 . ok
bttv0: timeout: drop=12 irq=1002413/1002413, risc=35a10a8c, bits: OFLOW
bttv0: reset, reinitialize
bttv0: PLL: 28636363 => 35468950 . ok
bttv0: timeout: drop=22 irq=1002425/1002425, risc=35a111a4, bits: OFLOW
bttv0: reset, reinitialize
bttv0: PLL: 2863