Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-16 Thread Stefan Richter
Ismail Dönmez wrote:
> On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
> [...]
>> Ismail, if you have the opportunity, the next thing you could test would
>> be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
>> would _not_ oops according to my observation.
> 
> On a clean reboot it works as expected ;
> 
> southpark cartman # rmmod eth1394
> southpark cartman # rmmod ohci1394
> southpark cartman #
> 
> No oops.

I now tested 2.6.20-rc4 with the following two commits reverted:

43cb76d91ee85f579a69d42bc8efc08bac560278
"Network: convert network devices to use struct device instead of
class_device"

40cf67c5fcc513406558c01b91129280208e57bf
"Driver core: remove class_device_rename"

I can now unload ohci1394 again while eth1394 is loaded. The reverting
patch is available at
http://me.in-berlin.de/~s5r6/linux1394/work-in-progress/revert-network-convert-network-devices-to-use-struct-device-instead-of-class_device.patch
(The server may be briefly down tonight and sometime during tomorrow.)

Next thing to do: Find a minimal fix which keeps Greg's net conversions.
-- 
Stefan Richter
-=-=-=== --== =
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-16 Thread Stefan Richter
Ismail Dönmez wrote:
 On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
 [...]
 Ismail, if you have the opportunity, the next thing you could test would
 be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
 would _not_ oops according to my observation.
 
 On a clean reboot it works as expected ;
 
 southpark cartman # rmmod eth1394
 southpark cartman # rmmod ohci1394
 southpark cartman #
 
 No oops.

I now tested 2.6.20-rc4 with the following two commits reverted:

43cb76d91ee85f579a69d42bc8efc08bac560278
Network: convert network devices to use struct device instead of
class_device

40cf67c5fcc513406558c01b91129280208e57bf
Driver core: remove class_device_rename

I can now unload ohci1394 again while eth1394 is loaded. The reverting
patch is available at
http://me.in-berlin.de/~s5r6/linux1394/work-in-progress/revert-network-convert-network-devices-to-use-struct-device-instead-of-class_device.patch
(The server may be briefly down tonight and sometime during tomorrow.)

Next thing to do: Find a minimal fix which keeps Greg's net conversions.
-- 
Stefan Richter
-=-=-=== --== =
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
[...]
>
> Ismail, if you have the opportunity, the next thing you could test would
> be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
> would _not_ oops according to my observation.

On a clean reboot it works as expected ;

southpark cartman # rmmod eth1394
southpark cartman # rmmod ohci1394
southpark cartman #

No oops.

Thanks.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
[...]
> Ismail, if you have the opportunity, the next thing you could test would
> be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
> would _not_ oops according to my observation.

rmmod eth1394 and modprobe -r eth1394 both hangs here no oops nothing.

Regards.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
I wrote:
> according to a quick test I made right now it is a regression post 2.6.20.
> # modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
> # modprobe -r eth1394
> # modprobe -r ohci1394
> works.
> # modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
> # modprobe -r ohci1394
> oopses with the same trace as Ismael posted. And indeed, looking at his
> trace once more I now also spot eth1394 among his linked-in modules.

To avoid any misunderstandings: Both the former and the latter sequence
work under 2.6.20 and earlier.
-- 
Stefan Richter
-=-=-=== --== -
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
Ismail Dönmez wrote:
> On Wednesday 14 March 2007 20:25:24 Stefan Richter wrote:
>> Ismail Dönmez wrote:
>> > Are you able to rmmod it?
>>
>> Yes, but on 2.6.20 and earlier kernels, most of the time with
>> development versions of the 1394 drivers. I still haven't tried
>> 2.6.21-rc, will hopefully get to it tonight.
> 
> Ok then that explains a bit, without suspend if I rmmod ohci1394 module I got 
> the exact oops.

Elsewhere, Adrian Bunk wrote:
| Is this an old problem, or what was the last kernel that worked
| for you?

Adrian,

according to a quick test I made right now it is a regression post 2.6.20.
# modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
# modprobe -r eth1394
# modprobe -r ohci1394
works.
# modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
# modprobe -r ohci1394
oopses with the same trace as Ismael posted. And indeed, looking at his
trace once more I now also spot eth1394 among his linked-in modules.

Ismail, if you have the opportunity, the next thing you could test would
be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
would _not_ oops according to my observation.

Thanks to Ismail's link to the similar report on 2.6.19-rc5-mm2 we
already have a hot candidate to be the trigger (not necessarily to be
the actual bug):
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=43cb76d91ee85f579a69d42bc8efc08bac560278
"Network: convert network devices to use struct device instead of
class_device"

Alas I didn't remember that older 2.6.19-rc5-mm2 discussion when I saw
Greg's pull request with this conversion patch (February 7) and didn't
react and test Linus' newest.

Advice would be appreciated...
-- 
Stefan Richter
-=-=-=== --== -
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Wednesday 14 March 2007 20:25:24 Stefan Richter wrote:
> Ismail Dönmez wrote:
> > On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
> >> Do you have a script or config which marks the
> >> ohci1394 module to be unloaded before suspend?
> >
> > I used kpowersave to suspend, I failed to find anything related to
> > ohci1394 in its config but rmmod ohci1394 gives exact oops so it must be
> > rmmoding it.
>
> [...]
>
> > Are you able to rmmod it?
>
> Yes, but on 2.6.20 and earlier kernels, most of the time with
> development versions of the 1394 drivers. I still haven't tried
> 2.6.21-rc, will hopefully get to it tonight.

Ok then that explains a bit, without suspend if I rmmod ohci1394 module I got 
the exact oops.

Regards.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
Ismail Dönmez wrote:
> On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
>> Do you have a script or config which marks the
>> ohci1394 module to be unloaded before suspend?  
> 
> I used kpowersave to suspend, I failed to find anything related to ohci1394 
> in 
> its config but rmmod ohci1394 gives exact oops so it must be rmmoding it. 
[...]
> Are you able to rmmod it?

Yes, but on 2.6.20 and earlier kernels, most of the time with
development versions of the 1394 drivers. I still haven't tried
2.6.21-rc, will hopefully get to it tonight.

[...]
> I'll give -mm a try sometime this weekend then.

It will most certainly behave the same, unless you use the new
alternative FireWire drivers which are a whole different game. I hate to
say it but they don't have suspend + resume support yet. Like the
vanilla Linux drivers before 2.6.20/21, they have to be unloaded before
suspend or at least reloaded after resume.
-- 
Stefan Richter
-=-=-=== --== -===-
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
> (Cc'ing Greg KH and linux1394-devel)
>
> Ismail Dönmez wrote at lkml:
> > With latest GIT tree I am getting the following oops when I try to
> > suspend to RAM:
> >
> > BUG: unable to handle kernel NULL pointer dereference at virtual address
> > 0094
> >  printing eip:
> > c0222af4
> > *pde = 
> > Oops:  [#1]
> > PREEMPT
> > Modules linked in: i915 drm snd_pcm_oss snd_mixer_oss snd_seq_dummy
> > snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device usbhid eth1394
> > ipw2200 ieee80211 ieee80211_crypt snd_hda_intel snd_hda_codec snd_pcm
> > snd_timer snd snd_page_alloc tifm_7xx1 tifm_core i2c_i801 i2c_core
> > ehci_hcd uhci_hcd ohci1394 ieee1394 pcmcia usbcore yenta_socket
> > rsrc_nonstatic pcmcia_core sony_laptop backlight
> > CPU:0
> > EIP:0060:[]Not tainted VLI
> > EFLAGS: 00010246   (2.6.21-rc3 #12)
> > EIP is at class_device_remove_attrs+0xa/0x30
> > eax: f7cb5b18   ebx:    ecx: f8bde010   edx: 
> > esi:    edi: f7cb5b18   ebp:    esp: d93e7e1c
> > ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> > Process modprobe (pid: 12200, ti=d93e6000 task=e5770a50 task.ti=d93e6000)
> > Stack: f7cb5b18 f7cb5b20  c0222bc3 f7cb5990  f7cb5b18
> > f7cb59c4 f8bcdc0f  c0222bfb f7cb5990 f8bcdbf6 f8bd3275 04e2c100
> > 000f 03c3 f8dcf05f  f7e3e000  f8bcdc17 c0220567
> > f7e3e0a4 Call Trace:
> >  [] class_device_del+0xa9/0xd9
> >  [] __nodemgr_remove_host_dev+0x0/0xb [ieee1394]
> >  [] class_device_unregister+0x8/0x10
> >  [] nodemgr_remove_ne+0x61/0x7a [ieee1394]
> >  [] ether1394_mac_addr+0x0/0x12 [eth1394]
> >  [] __nodemgr_remove_host_dev+0x8/0xb [ieee1394]
> >  [] device_for_each_child+0x1a/0x3c
> >  [] nodemgr_remove_host+0x30/0x90 [ieee1394]
> >  [] __unregister_host+0x1a/0xac [ieee1394]
> >  [] flush_cpu_workqueue+0x98/0xb7
> >  [] highlevel_remove_host+0x21/0x42 [ieee1394]
> >  [] hpsb_remove_host+0x37/0x58 [ieee1394]
> >  [] ohci1394_pci_remove+0x47/0x1ec [ohci1394]
> >  [] sysfs_hash_and_remove+0xfa/0x111
> >  [] pci_device_remove+0x16/0x35
> >  [] __device_release_driver+0x6e/0x8b
> >  [] driver_detach+0x99/0xda
> >  [] bus_remove_driver+0x57/0x75
> >  [] driver_unregister+0x8/0x13
> >  [] pci_unregister_driver+0xc/0x67
> >  [] sys_delete_module+0x15c/0x19d
> >  [] remove_vma+0x31/0x36
> >  [] do_munmap+0x19b/0x1b4
> >  [] sysenter_past_esp+0x5f/0x85
> >  [] packet_notifier+0xf3/0x157
> >  ===
> > Code: ff c3 85 c0 74 08 83 c0 08 e9 83 6d f6 ff b8 ea ff ff ff c3 85 c0
> > 74 08 83 c0 08 e9 4c 51 f6 ff c3 57 89 c7 56 53 8b 70 44 31 db <83> be 94
> > 00 00 00 00 75 09 eb 17 89 f8 e8 d7 ff ff ff 89 da 83
> > EIP: [] class_device_remove_attrs+0xa/0x30 SS:ESP 0068:d93e7e1c
> >
> >
> > Checking Google I see a similar oops was reported long ago:
> > http://lkml.org/lkml/2006/11/16/147 .
> >
> > Any ideas/patches to test? Please CC me in your replies.
>
> Thanks for the report.  Do you have a script or config which marks the
> ohci1394 module to be unloaded before suspend?  


I used kpowersave to suspend, I failed to find anything related to ohci1394 in 
its config but rmmod ohci1394 gives exact oops so it must be rmmoding it. 
Also doing echo mem > /sys/power/state from konsole tries to suspend but 
freezes at Suspending console(s) but gives no oops.

> This should not be 
> necessary since 2.6.21-rc1 anymore.  (Although I tested this only with
> APM suspend to RAM and only with the sbp2 driver as IEEE 1394
> application-layer software, and only with current 1394 drivers on top of
> 2.6.20-rcX instead of 2.6.21-rcX.  I heard that raw1394 survives
> suspend/resume thanks to the ohci1394 updates already in 2.6.20.)

Are you able to rmmod it?

> But back to your problem.  The older report which you pointed to was a
> hickup caused by the ongoing conversion away from class_device.  Further
> down that discussion, a 2.6.19-rcX-mmY patch was discovered to trigger
> this:  http://lkml.org/lkml/2006/11/19/53
>
> |  the winner is... gregkh-driver-network-device.patch
>
> By "trigger" I mean that I don't know where the bug was, i.e. in the
> then partial driver core conversion or in the ieee1394 nodemgr.
>
> *However*, this time it's different --- you don't have eth1394 present.

Right, no firewire devices are attached.

> I will boot 2.6.21-rc3 on a spare machine and see how it goes.

Thanks.

> As a side note, the IEEE 1394 subsystem features quite a fat usage of
> the driver core.  We have (in order of parent devices to child devices)
> the host adapter's PCI device's device, the 1394 host device
> (hpsb_host), the node entry devices, the unit directory devices.  And
> all of them have respective class devices.  But really important outside
> of the ieee1394 core are only the first and the last, i.e. PCI device
> and unit directories.  Maybe we should redesign nodemgr to work without
> host devices and node entry devices.
>
> Side 

Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
 (Cc'ing Greg KH and linux1394-devel)

 Ismail Dönmez wrote at lkml:
  With latest GIT tree I am getting the following oops when I try to
  suspend to RAM:
 
  BUG: unable to handle kernel NULL pointer dereference at virtual address
  0094
   printing eip:
  c0222af4
  *pde = 
  Oops:  [#1]
  PREEMPT
  Modules linked in: i915 drm snd_pcm_oss snd_mixer_oss snd_seq_dummy
  snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device usbhid eth1394
  ipw2200 ieee80211 ieee80211_crypt snd_hda_intel snd_hda_codec snd_pcm
  snd_timer snd snd_page_alloc tifm_7xx1 tifm_core i2c_i801 i2c_core
  ehci_hcd uhci_hcd ohci1394 ieee1394 pcmcia usbcore yenta_socket
  rsrc_nonstatic pcmcia_core sony_laptop backlight
  CPU:0
  EIP:0060:[c0222af4]Not tainted VLI
  EFLAGS: 00010246   (2.6.21-rc3 #12)
  EIP is at class_device_remove_attrs+0xa/0x30
  eax: f7cb5b18   ebx:    ecx: f8bde010   edx: 
  esi:    edi: f7cb5b18   ebp:    esp: d93e7e1c
  ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
  Process modprobe (pid: 12200, ti=d93e6000 task=e5770a50 task.ti=d93e6000)
  Stack: f7cb5b18 f7cb5b20  c0222bc3 f7cb5990  f7cb5b18
  f7cb59c4 f8bcdc0f  c0222bfb f7cb5990 f8bcdbf6 f8bd3275 04e2c100
  000f 03c3 f8dcf05f  f7e3e000  f8bcdc17 c0220567
  f7e3e0a4 Call Trace:
   [c0222bc3] class_device_del+0xa9/0xd9
   [f8bcdc0f] __nodemgr_remove_host_dev+0x0/0xb [ieee1394]
   [c0222bfb] class_device_unregister+0x8/0x10
   [f8bcdbf6] nodemgr_remove_ne+0x61/0x7a [ieee1394]
   [f8dcf05f] ether1394_mac_addr+0x0/0x12 [eth1394]
   [f8bcdc17] __nodemgr_remove_host_dev+0x8/0xb [ieee1394]
   [c0220567] device_for_each_child+0x1a/0x3c
   [f8bcdf34] nodemgr_remove_host+0x30/0x90 [ieee1394]
   [f8bcb4b1] __unregister_host+0x1a/0xac [ieee1394]
   [c0125e1c] flush_cpu_workqueue+0x98/0xb7
   [f8bcb6da] highlevel_remove_host+0x21/0x42 [ieee1394]
   [f8bcb247] hpsb_remove_host+0x37/0x58 [ieee1394]
   [f8be1229] ohci1394_pci_remove+0x47/0x1ec [ohci1394]
   [c01877b9] sysfs_hash_and_remove+0xfa/0x111
   [c01ccc9c] pci_device_remove+0x16/0x35
   [c0222321] __device_release_driver+0x6e/0x8b
   [c022279b] driver_detach+0x99/0xda
   [c0221fa2] bus_remove_driver+0x57/0x75
   [c02227fd] driver_unregister+0x8/0x13
   [c01ccdfd] pci_unregister_driver+0xc/0x67
   [c0134133] sys_delete_module+0x15c/0x19d
   [c0149fc0] remove_vma+0x31/0x36
   [c014a946] do_munmap+0x19b/0x1b4
   [c0104cca] sysenter_past_esp+0x5f/0x85
   [c030] packet_notifier+0xf3/0x157
   ===
  Code: ff c3 85 c0 74 08 83 c0 08 e9 83 6d f6 ff b8 ea ff ff ff c3 85 c0
  74 08 83 c0 08 e9 4c 51 f6 ff c3 57 89 c7 56 53 8b 70 44 31 db 83 be 94
  00 00 00 00 75 09 eb 17 89 f8 e8 d7 ff ff ff 89 da 83
  EIP: [c0222af4] class_device_remove_attrs+0xa/0x30 SS:ESP 0068:d93e7e1c
 
 
  Checking Google I see a similar oops was reported long ago:
  http://lkml.org/lkml/2006/11/16/147 .
 
  Any ideas/patches to test? Please CC me in your replies.

 Thanks for the report.  Do you have a script or config which marks the
 ohci1394 module to be unloaded before suspend?  


I used kpowersave to suspend, I failed to find anything related to ohci1394 in 
its config but rmmod ohci1394 gives exact oops so it must be rmmoding it. 
Also doing echo mem  /sys/power/state from konsole tries to suspend but 
freezes at Suspending console(s) but gives no oops.

 This should not be 
 necessary since 2.6.21-rc1 anymore.  (Although I tested this only with
 APM suspend to RAM and only with the sbp2 driver as IEEE 1394
 application-layer software, and only with current 1394 drivers on top of
 2.6.20-rcX instead of 2.6.21-rcX.  I heard that raw1394 survives
 suspend/resume thanks to the ohci1394 updates already in 2.6.20.)

Are you able to rmmod it?

 But back to your problem.  The older report which you pointed to was a
 hickup caused by the ongoing conversion away from class_device.  Further
 down that discussion, a 2.6.19-rcX-mmY patch was discovered to trigger
 this:  http://lkml.org/lkml/2006/11/19/53

 |  the winner is... gregkh-driver-network-device.patch

 By trigger I mean that I don't know where the bug was, i.e. in the
 then partial driver core conversion or in the ieee1394 nodemgr.

 *However*, this time it's different --- you don't have eth1394 present.

Right, no firewire devices are attached.

 I will boot 2.6.21-rc3 on a spare machine and see how it goes.

Thanks.

 As a side note, the IEEE 1394 subsystem features quite a fat usage of
 the driver core.  We have (in order of parent devices to child devices)
 the host adapter's PCI device's device, the 1394 host device
 (hpsb_host), the node entry devices, the unit directory devices.  And
 all of them have respective class devices.  But really important outside
 of the ieee1394 core are only the first and the last, i.e. PCI device
 and unit directories.  Maybe we should redesign nodemgr to work without

Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
Ismail Dönmez wrote:
 On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
 Do you have a script or config which marks the
 ohci1394 module to be unloaded before suspend?  
 
 I used kpowersave to suspend, I failed to find anything related to ohci1394 
 in 
 its config but rmmod ohci1394 gives exact oops so it must be rmmoding it. 
[...]
 Are you able to rmmod it?

Yes, but on 2.6.20 and earlier kernels, most of the time with
development versions of the 1394 drivers. I still haven't tried
2.6.21-rc, will hopefully get to it tonight.

[...]
 I'll give -mm a try sometime this weekend then.

It will most certainly behave the same, unless you use the new
alternative FireWire drivers which are a whole different game. I hate to
say it but they don't have suspend + resume support yet. Like the
vanilla Linux drivers before 2.6.20/21, they have to be unloaded before
suspend or at least reloaded after resume.
-- 
Stefan Richter
-=-=-=== --== -===-
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Wednesday 14 March 2007 20:25:24 Stefan Richter wrote:
 Ismail Dönmez wrote:
  On Wednesday 14 March 2007 13:14:42 Stefan Richter wrote:
  Do you have a script or config which marks the
  ohci1394 module to be unloaded before suspend?
 
  I used kpowersave to suspend, I failed to find anything related to
  ohci1394 in its config but rmmod ohci1394 gives exact oops so it must be
  rmmoding it.

 [...]

  Are you able to rmmod it?

 Yes, but on 2.6.20 and earlier kernels, most of the time with
 development versions of the 1394 drivers. I still haven't tried
 2.6.21-rc, will hopefully get to it tonight.

Ok then that explains a bit, without suspend if I rmmod ohci1394 module I got 
the exact oops.

Regards.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
Ismail Dönmez wrote:
 On Wednesday 14 March 2007 20:25:24 Stefan Richter wrote:
 Ismail Dönmez wrote:
  Are you able to rmmod it?

 Yes, but on 2.6.20 and earlier kernels, most of the time with
 development versions of the 1394 drivers. I still haven't tried
 2.6.21-rc, will hopefully get to it tonight.
 
 Ok then that explains a bit, without suspend if I rmmod ohci1394 module I got 
 the exact oops.

Elsewhere, Adrian Bunk wrote:
| Is this an old problem, or what was the last kernel that worked
| for you?

Adrian,

according to a quick test I made right now it is a regression post 2.6.20.
# modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
# modprobe -r eth1394
# modprobe -r ohci1394
works.
# modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
# modprobe -r ohci1394
oopses with the same trace as Ismael posted. And indeed, looking at his
trace once more I now also spot eth1394 among his linked-in modules.

Ismail, if you have the opportunity, the next thing you could test would
be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
would _not_ oops according to my observation.

Thanks to Ismail's link to the similar report on 2.6.19-rc5-mm2 we
already have a hot candidate to be the trigger (not necessarily to be
the actual bug):
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=43cb76d91ee85f579a69d42bc8efc08bac560278
Network: convert network devices to use struct device instead of
class_device

Alas I didn't remember that older 2.6.19-rc5-mm2 discussion when I saw
Greg's pull request with this conversion patch (February 7) and didn't
react and test Linus' newest.

Advice would be appreciated...
-- 
Stefan Richter
-=-=-=== --== -
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Stefan Richter
I wrote:
 according to a quick test I made right now it is a regression post 2.6.20.
 # modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
 # modprobe -r eth1394
 # modprobe -r ohci1394
 works.
 # modprobe ohci1394   # wait a bit, eth1394 is auto-loaded
 # modprobe -r ohci1394
 oopses with the same trace as Ismael posted. And indeed, looking at his
 trace once more I now also spot eth1394 among his linked-in modules.

To avoid any misunderstandings: Both the former and the latter sequence
work under 2.6.20 and earlier.
-- 
Stefan Richter
-=-=-=== --== -
http://arcgraph.de/sr/
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
[...]
 Ismail, if you have the opportunity, the next thing you could test would
 be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
 would _not_ oops according to my observation.

rmmod eth1394 and modprobe -r eth1394 both hangs here no oops nothing.

Regards.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: oops in __nodemgr_remove_host_dev (was Re: Ooops with suspend to RAM)

2007-03-14 Thread Ismail Dönmez
On Thursday 15 March 2007 02:08:43 Stefan Richter wrote:
[...]

 Ismail, if you have the opportunity, the next thing you could test would
 be to unload eth1394 explicitly before ohci1394 on 2.6.21-rc3. This
 would _not_ oops according to my observation.

On a clean reboot it works as expected ;

southpark cartman # rmmod eth1394
southpark cartman # rmmod ohci1394
southpark cartman #

No oops.

Thanks.

-- 
Happiness in intelligent people is the rarest thing I know. (Ernest Hemingway)

Ismail Donmez ismail (at) pardus.org.tr
GPG Fingerprint: 7ACD 5836 7827 5598 D721 DF0D 1A9D 257A 5B88 F54C
Pardus Linux / KDE developer
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/