BUG: unable to handle kernel paging request at virtual address
I got the following message, shortly followed by a system hang. BUG: unable to handle kernel paging request at virtual address 48464443 (see the oops below). I've been getting kernel oopses for quite some time now, ever since I got a new computer (several months now). The problem manifests when I use mplayer/mencoder to capture from TV card (Avermedia 203, BT878 chipset). Doesn't matter if it's viewing or encoding. Usually happens within first 2 hours of capturing, quite randomly. Note that other system-intensive tasks (video encoding from file, 3D games, heavy fs activity) don't cause any instabilities. Reproduced with kernels 2.6.21, 2.6.22 and 2.6.23.12 (the first two being on openSUSE 10.2, and the 2.6.23(.1 and .12) on openSUSE 10.3). Unfortunately, I only have the oops output from 2.6.23.12. Doesn't seem to be the memory problem (I tested it with memtest86), and that other commercial OS seems to be working fine (well, at least no hangs during capturing and no suspicious freezes). The messages below indicate that I have a proprietary driver (nvidia) loaded, but the kernel hangs without it too - using pure vesa framebuffer, without running an X server. mplayer command: gmplayer -sws 2 tv:// -tv driver=v4l2:input=1:width=768:height=576\ :forceaudio:amode=1:adevice=/dev/dsp -mc 0 Hardware: CPU: Athlon X2 5200+ (dual-core). MB: Asus M2N-E SLI (nforce 500 chipset, latest BIOS update). TV card: Avermedia 203 (BT878 chipset). Ram: 2G Kingston Video card: NVidia 8600GT (MSI) The oops itself (from dmesg - I had its output logged every second, so it managed to get saved before the real freeze several seconds later): BUG: unable to handle kernel paging request at virtual address 48464443 printing eip: c011d6f8 *pde = Oops: [#1] PREEMPT SMP Modules linked in: cdc_acm usb_storage libusual ppp_generic slhc iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom bt878 tuner tvaudio nvidia(P) bttv snd_emu10k1 snd_rawmidi video_buf firmware_class snd_ac97_codec ir_common ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc compat_ioctl32 snd_util_mem i2c_algo_bit ide_cd ohci1394 emu10k1_gp snd_hwdep snd ieee1394 gameport k8temp cdrom thermal btcx_risc tveeprom videodev hwmon agpgart button rtc_cmos rtc_core rtc_lib soundcore v4l2_common v4l1_compat i2c_nforce2 i2c_core forcedeth sg ohci_hcd ehci_hcd usbcore edd fan processor pata_amd CPU:1 EIP:0060:[]Tainted: PVLI EFLAGS: 00010202 (2.6.23.12 #1) EIP is at pgd_alloc+0x28/0x80 eax: c0564101 ebx: 48464443 ecx: f1403bc0 edx: c2019148 esi: 0001 edi: fff4 ebp: edb16bc0 esp: cf517f5c ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 Process sh (pid: 18310, ti=cf516000 task=f0b83aa0 task.ti=cf516000) Stack: f7fc7240 e100c000 c0126ec6 d1063480 c01859df bf978390 0011 d1063480 e100c000 edb16bc0 d1063480 c0186f36 8009f7f0 800a0ef0 e100c000 e100c000 b7e6fff4 cf516000 c010260f cf517fb8 800a0f98 800a0f98 c0104362 800a0f98 Call Trace: [] mm_init+0xb6/0xe0 [] bprm_mm_init+0x1f/0x170 [] do_execve+0x66/0x1e0 [] sys_execve+0x2f/0x80 [] sysenter_past_esp+0x5f/0x85 === Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff EIP: [] pgd_alloc+0x28/0x80 SS:ESP 0068:cf517f5c note: sh[18310] exited with preempt_count 1 BUG: scheduling while atomic: sh/0x1002/18310 [] schedule+0x5c2/0x6c0 [] __wake_up+0x38/0x50 [] pipe_release+0x86/0xb0 [] dput+0x1c/0x100 [] __fput+0x10f/0x170 [] __cond_resched+0x18/0x30 [] cond_resched+0x2a/0x40 [] put_files_struct+0x9e/0xb0 [] do_exit+0x170/0x840 [] die+0x23d/0x250 [] do_page_fault+0x29f/0x670 [] do_path_lookup+0x7c/0x210 [] __next_cpu+0x12/0x30 [] do_page_fault+0x0/0x670 [] error_code+0x72/0x78 [] pgd_alloc+0x28/0x80 [] mm_init+0xb6/0xe0 [] bprm_mm_init+0x1f/0x170 [] do_execve+0x66/0x1e0 [] sys_execve+0x2f/0x80 [] sysenter_past_esp+0x5f/0x85 === BUG: unable to handle kernel paging request at virtual address 48464443 printing eip: c011d6f8 *pde = Oops: [#2] PREEMPT SMP Modules linked in: cdc_acm usb_storage libusual ppp_generic slhc iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse nls_koi8_r nls_cp866 loop dm_mod bin
Re: BUG: unable to handle kernel paging request at virtual address
On Sat, 5 Jan 2008 09:10:12 + Al Viro <[EMAIL PROTECTED]> wrote: > and we have q->page == 0x48464443. Seeing how we assign that sucker, that > smells like we've got a page on quicklist with {0x43, 0x44, 0x46, 0x48} > in its first 4 bytes. Instead of having address of the next page stored > in there... > > Do other oopsen of the same kind give the same value? I've got another oops here with a different value. This time a bttv message preceded it. Note that the oops happened shortly *after* I stopped capturing (watching the tv through mplayer). Output of dmesg: bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FDSR OCERR* (two pages of the same message here) bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FDSR OCERR* bttv0: OCERR @ 375e2014,bits: HSYNC OFLOW FBUS FDSR OCERR* BUG: unable to handle kernel paging request at virtual address 23232323 printing eip: c011d6f8 *pde = Oops: [#1] PREEMPT SMP Modules linked in: ppp_generic slhc iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom nvidia(P) tuner tvaudio snd_emu10k1 bttv snd_rawmidi snd_ac97_codec video_buf firmware_class ir_common ac97_bus snd_pcm snd_seq_device compat_ioctl32 i2c_algo_bit snd_timer snd_page_alloc emu10k1_gp snd_util_mem btcx_risc tveeprom videodev gameport ohci1394 ieee1394 ide_cd snd_hwdep snd v4l2_common v4l1_compat agpgart soundcore i2c_nforce2 thermal button rtc_cmos rtc_core rtc_lib forcedeth k8temp i2c_core hwmon cdrom sg ohci_hcd ehci_hcd usbcore edd fan processor pata_amd CPU:0 EIP:0060:[]Tainted: PVLI EFLAGS: 00010202 (2.6.23.12 #1) EIP is at pgd_alloc+0x28/0x80 eax: c0564101 ebx: 23232323 ecx: f79a5bc0 edx: c2011148 esi: 0001 edi: ec539544 ebp: 01200011 esp: f5cd9efc ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 Process amdpwrmon (pid: 5578, ti=f5cd8000 task=f7a14aa0 task.ti=f5cd8000) Stack: ec539380 f79a5d84 c0126ec6 f5891200 c01277c2 f73ca080 c1dfc40c 4636c065 f5cd9f1c 0286 f6edb640 f5cd9fb8 bfb59498 f6edb550 f6edb550 ec539380 f04b9264 c04f4c30 60ae f79a5bc0 ec539380 f5cd9fb8 01200011 Call Trace: [] mm_init+0xb6/0xe0 [] copy_process+0x762/0x1240 [] do_fork+0x76/0x230 [] do_pipe+0x7c/0xf0 [] sys_clone+0x32/0x40 [] sysenter_past_esp+0x5f/0x85 === Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff EIP: [] pgd_alloc+0x28/0x80 SS:ESP 0068:f5cd9efc note: amdpwrmon[5578] exited with preempt_count 1 BUG: unable to handle kernel paging request at virtual address 23232323 printing eip: c011d6f8 *pde = Oops: [#2] PREEMPT SMP Modules linked in: ppp_generic slhc iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom nvidia(P) tuner tvaudio snd_emu10k1 bttv snd_rawmidi snd_ac97_codec video_buf firmware_class ir_common ac97_bus snd_pcm snd_seq_device compat_ioctl32 i2c_algo_bit snd_timer snd_page_alloc emu10k1_gp snd_util_mem btcx_risc tveeprom videodev gameport ohci1394 ieee1394 ide_cd snd_hwdep snd v4l2_common v4l1_compat agpgart soundcore i2c_nforce2 thermal button rtc_cmos rtc_core rtc_lib forcedeth k8temp i2c_core hwmon cdrom sg ohci_hcd ehci_hcd usbcore edd fan processor pata_amd CPU:0 EIP:0060:[]Tainted: P D VLI EFLAGS: 00010202 (2.6.23.12 #1) EIP is at pgd_alloc+0x28/0x80 eax: c0564101 ebx: 23232323 ecx: f03a32c0 edx: c2011148 esi: 0001 edi: fff4 ebp: f739c6c0 esp: dace3f5c ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 Process master (pid: 24800, ti=dace2000 task=ee93aaa0 task.ti=dace2000) Stack: f03a36c0 db9e c0126ec6 efdb25c0 c01859df f7e6a000 0011 efdb25c0 db9e f739c6c0 efdb25c0 c0186f36 8000e310 80010c80 db9e db9e b7bf0ff4 dace2000 c010260f dace3fb8 80010be0 c0104362 80010be0 Call Trace: [] mm_init+0xb6/0xe0 [] bprm_mm_init+0x1f/0x170 [] do_execve+0x66/0x1e0 [] sys_execve+0x2f/0x80 [] sysenter_past_esp+0x5f/0x85 === Code: 74 26 00 56 53 89 e0 25 00 e0 ff ff ff 40 14 64 8b 15 84 30 56 c0 b8 48 41 56 c0 01 c2 8b 1a 85 db 0f 95 c0 0f b6 f0 85 f6 74 0d <8b> 03 89 02 c7 03 00 00 00 00 ff 4a 04 89 e0 25 00 e0 ff ff ff EIP: [] pgd_alloc+0x28/0x80 S
Re: BUG: unable to handle kernel paging request at virtual address
On Sat, 5 Jan 2008 15:34:25 +0100 Mikael Pettersson <[EMAIL PROTECTED]> wrote: > This kernel is tainted by the nvidia module... I know that, but as I wrote in the original message, the hangs occur without it too. I just tested it today. Had to leave it running (in vesa framebuffer mode), without any nvidia modules. Crash occurred after 4 hours of tv capturing, and didn't leave any traces in dmesg logs (logged every second). Unfortunately, I couldn't save any messages. All I'm saying is that the problem is reproduceable without nvidia module and without any X server running. Thanks, Alexander -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: BUG: unable to handle kernel paging request at virtual address
> Get a serial console? Take another box, plug e.g. pl2303-based > usb-to-serial (several bucks these days) into it, stick null-modem > convertor (ditto) on its serial end and attach to ttyS0 on the > victim. console=ttyS0 on victim, something like minicom on watcher > and tell it to capture log into file... I will try that (as soon as I acquire the necessary parts). Thanks, Alexander -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: BUG: unable to handle kernel paging request at virtual address
On Sun, 6 Jan 2008 00:20:50 +0300 Alexey Dobriyan <[EMAIL PROTECTED]> wrote: > netconsole should be more quick: Thanks a lot for the tip, I'll try that. Alexander -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: BUG: unable to handle kernel paging request at virtual address
On Sun, 6 Jan 2008 00:20:50 +0300 Alexey Dobriyan <[EMAIL PROTECTED]> wrote: > On Sun, Jan 06, 2008 at 12:30:34AM +0400, Alexander Shaduri wrote: > > > Get a serial console? Take another box, plug e.g. pl2303-based > > > usb-to-serial (several bucks these days) into it, stick null-modem > > > convertor (ditto) on its serial end and attach to ttyS0 on the > > > victim. console=ttyS0 on victim, something like minicom on watcher > > > and tell it to capture log into file... > > > > I will try that (as soon as I acquire the necessary parts). > > netconsole should be more quick: Sorry for late reply, but all my attempts to log via network failed - every time only ~700 bytes were transferred, so the logs were non-informative. Finally, I got the logging working with a serial console, using console=tty0 console=ttyS0,9600 ignore_loglevel Anyway, see the crash log below. Just a few additional notes - I was trying to get rid of the problem by trying various things, like: * Plugging the bttv card to an alternative PCI slot. * Swapping the bttv card with another card: I had an Avermedia 203 tv tuner card before and I tried using Avermedia EZCapture (same bttv chipset, without a tv tuner). * loading bttv module with no_overlay=1. * "maxcpus=1" kernel option * Compiling kernel git snapshot (2008-01-09) from http://git.kernel.org/?p=linux/kernel/git/mchehab/v4l-dvb.git in case it was fixed in the development tree. None of the above helped anything. Here's a log (2.6.23.12, no X running, Avermedia EZCapture card; capturing for several hours, then stopping, then capturing again - crash!): -- BUG: unable to handle kernel paging request at virtual address 968e8787 printing eip: c017dc67 *pde = Oops: 0002 [#1] PREEMPT SMP Modules linked in: bttv video_buf ir_common compat_ioctl32 i2c_algo_bit btcx_risc tveeprom videodev v4l2_common v4l1_compat netconsole iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_powersave powernow_k8 freq_table snd_pcm_oss snd_mixer_oss snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq capability commoncap fuse nls_koi8_r nls_cp866 loop dm_mod binfmt_misc uhci_hcd it87 hwmon_vid eeprom snd_emu10k1 snd_rawmidi firmware_class snd_ac97_codec ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep rtc_cmos snd rtc_core forcedeth parport_pc ide_cd ohci1394 agpgart emu10k1_gp soundcore gameport i2c_nforce2 k8temp rtc_lib parport ieee1394 hwmon cdrom thermal button sg i2c_core ohci_hcd ehci_hcd usbcore edd fan processor pata_amd CPU:1 EIP:0060:[]Not tainted VLI EFLAGS: 00210046 (2.6.23.12 #3) EIP is at cache_alloc_refill+0x147/0x520 eax: c20eec80 ebx: ecx: 003c edx: 968e8783 esi: 0022 edi: f52f ebp: c2131a00 esp: f6a7fc1c ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 Process mencoder (pid: 5465, ti=f6a7e000 task=c2206550 task.ti=f6a7e000) Stack: 0050 c20eec88 c20eec90 c20eeca4 c03fca90 8050 c20fc7c0 c20e9800 c20eec80 f785bc00 1000 000c 00060007 f52f001c c1b279a0 8050 00200286 c20fc7c0 c017dafb c1b279a0 1000 Call Trace: [] kmem_cache_alloc+0x6b/0x90 [] alloc_buffer_head+0x10/0x60 [] alloc_page_buffers+0x78/0xc0 [] create_empty_buffers+0x11/0x90 [] __block_prepare_write+0x41e/0x4f0 [] start_this_handle+0x368/0x4d0 [] journal_start+0xb3/0xe0 [] block_prepare_write+0x28/0x40 [] ext3_get_block+0x0/0x100 [] ext3_prepare_write+0x4a/0x190 [] ext3_get_block+0x0/0x100 [] generic_file_buffered_write+0x289/0x6b0 [] schedule+0x1d6/0x6c0 [] check_preempt_curr_fair+0x57/0x90 [] __generic_file_aio_write_nolock+0x2b4/0x530 [] futex_wait+0x23e/0x310 [] generic_file_aio_write+0x65/0xe0 [] get_futex_key+0x51/0x150 [] ext3_file_write+0x30/0xc0 [] do_sync_write+0xd5/0x120 [] do_futex+0x239/0xb20 [] autoremove_wake_function+0x0/0x40 [] __atomic_notifier_call_chain+0x26/0x50 [] do_sync_write+0x0/0x120 [] vfs_write+0xbf/0x140 [] sys_write+0x41/0x70 [] sysenter_past_esp+0x5f/0x85 === Code: 77 10 8b 77 14 8b 44 24 38 03 57 0c 8b 34 b0 89 77 14 89 54 8d 18 41 89 4d 00 8b 44 24 1c 8b 77 10 3b 70 38 72 c4 8b 17 8b 47 04 <89> 42 04 89 10 83 7f 14 ff c7 07 00 01 10 00 c7 47 04 00 02 20 EIP: [] cache_alloc_refill+0x147/0x520 SS:ESP 0068:f6a7fc1c note: mencoder[5465] exited with preempt_count 1 bttv0: timeout: drop=3 irq=1002400/1002400, risc=350dc3c4, bits: HSYNC OFLOW bttv0: reset, reinitialize bttv0: PLL: 28636363 => 35468950 . ok bttv0: timeout: drop=12 irq=1002413/1002413, risc=35a10a8c, bits: OFLOW bttv0: reset, reinitialize bttv0: PLL: 28636363 => 35468950 . ok bttv0: timeout: drop=22 irq=1002425/1002425, risc=35a111a4, bits: OFLOW bttv0: reset, reinitialize bttv0: PLL: 2863