Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Sun, 13.01.2008 at 17:25:24 -0500, J.R. Oldroyd wrote: David's suggestion re powerd may be relevant. I'd noticed that the problem seems to happen when the system is idle. I posted earlier that it seems like I can do all sorts of work without a problem then I stop for a phone call and when I resume it hangs. I tend to notice a lot of hangs when typing an email. Try with running/looping some MP3 or WAV files. My system never, ever froze during sound playback. Only when idle. But since I'm running multiple wmdocklets that update periodically idle is not really true. Cheers, Ulrich Spoerlein -- It is better to remain silent and be thought a fool, than to speak, and remove all doubt. ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
Here is another ktr dump. This freeze was a longer one, getting on for two minutes: http://opal.com/jr/freebsd/releng_7-freeze/200801141259-ktr.out This one again shows some post-freeze activity, and yet again the only activity during the freeze is that shared ath0/pcm irq and the ath0 taskq. -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
J.R. Oldroyd wrote: On Sat, 12 Jan 2008 13:40:34 -0500, I wrote: Ah! Just experienced a short freeze ... And another! This one about 3 or 4 seconds. The profile from the last minute is here: http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt This one shows a long hold_avg (I meant hold_avg in the previous message, too) for: /usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs) -jr Thanks. Those particular things you mention are both normal, I will look at the traces in more detail later. Kris ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
J.R. Oldroyd wrote: On Sat, 12 Jan 2008 13:40:34 -0500, I wrote: Ah! Just experienced a short freeze ... And another! This one about 3 or 4 seconds. The profile from the last minute is here: http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt This one shows a long hold_avg (I meant hold_avg in the previous message, too) for: /usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs) -jr In both of these cases there is a non-mpsafe interrupt (probably your mouse) blocked on a sysctl call (both of these require Giant). That would be fixed once sysctl no longer requires Giant (hopefully before 7.1, I have a WIP), or once someone makes the psm driver mpsafe. I don't think it accounts for a multi-second pause though. Probably a sched_graph trace will also be required to diagnose that better. Kris ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Sat, 12 Jan 2008 18:30:53 +0100, J.R. Oldroyd [EMAIL PROTECTED] wrote: On Fri, 11 Jan 2008 12:49:29 -0500, I wrote: I have yet to experience a random freeze not directly attributable to a softupdate while running the lock profiling. I am running with lock profiling on, and resetting the profiling counters once a minute. Yesterday and this morning, I've run for quite a while now with lock profiling on but without a random freeze. I'll wait some more, but I'm hoping that enabling the lock profiling hasn't masked the freeze. I'll post again when I see one.. It is looking more likely to me that enabling lock profiling does mask the freeze. I ran for more than 10 hours yesterday with lock profiling enabled and did not observe a single freeze. After about 7 hours, I stopped the lock profiling and within 20 mins or so, I experienced a NINE MINUTE freeze!! On re-enabling the lock profiling, I ran for about 3 more hours with no further freezes. At the time of that long freeze, all I was doing was typing an email message. The load average was almost 0. Mail client is claws-email. Also running but idle were firefox, ical, several xterms, fvwm its children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock. And xorg which uses the xf86-video-intel driver. Daemons running were wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd, sendmail, cron, moused and xdm. That is all. I had only one browser window open, the last page I'd viewed there was one containing some flash, I have linux-flash7 installed and nspluginwrapper to make that work in the native browser. All code (kernel, world and ports) have been compiled locally - there are no packages installed. Kernel is RELENG_7 from two days ago, world and ports are about one month old. If I get time today I may have a go at getting schedgraph info instead of continuing with lock profiling. -jr What version of xf86-video-intel? I'm having trouble since I upgraded xf86-video-i810, which is an older version of the intel driver, from 1.6.x to 1.7.x and a colleague had trouble with the 2.x version of the xf86-video-intel driver. My trouble is a hanging system (I never waited for it to come back) and then ctrl-alt-F1 (to console) gives a panic a lot of times. I just prepared my machinde for a crash dump. So maybe next panic I have one. Maybe downgrading to 1.6.x helps me again. After the next panic I'll try that. But maybe it helps you also. I upgrade my RELENG_7 about every week, so it's fairly recent. Ronald. -- Ronald Klop Amsterdam, The Netherlands ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Sun, 13 Jan 2008 22:29:34 +0100, Ronald Klop [EMAIL PROTECTED] wrote: What version of xf86-video-intel? I'm having trouble since I upgraded xf86-video-i810, which is an older version of the intel driver, from 1.6.x to 1.7.x and a colleague had trouble with the 2.x version of the xf86-video-intel driver. xf86-video-intel-2.1.1 My trouble is a hanging system (I never waited for it to come back) and then ctrl-alt-F1 (to console) gives a panic a lot of times. I just prepared my machinde for a crash dump. So maybe next panic I have one. I have not seen any panics switching between vtys. Your hang might be the same as my long freeze. Next time it happens, try waiting for a while. I've had some freezes many minutes long. A 9 min one yesterday, and I've had a 12 min one before. Just move the mouse and then sit back until the cursor actually moves! Maybe downgrading to 1.6.x helps me again. After the next panic I'll try that. But maybe it helps you also. When I started my 7.0BETA testing the installation process gave me xf86-video-i810 and I had several other problems which resolved on switching to the xf86-video-intel driver. I'd be hesitant to go back. I upgrade my RELENG_7 about every week, so it's fairly recent. Ronald. Mine is now also up-to-date RELENG_7. David's suggestion re powerd may be relevant. I'd noticed that the problem seems to happen when the system is idle. I posted earlier that it seems like I can do all sorts of work without a problem then I stop for a phone call and when I resume it hangs. I tend to notice a lot of hangs when typing an email. I have disabled powerd for now; don't really need it anyway, that was a left-over from something quite some time back. No freezes yet, but it'll take several more hours to know if this might be relevant. -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
Well, after running the ktr_sched-enabled kernel for about 4h50 now, I did just see a short freeze. Abt 2-3 seconds. And I got a ktr dump right after it came back. It can be downloaded here (I guess rt-click and save the link): http://opal.com/jr/freebsd/releng_7-freeze/200801132250-ktr.out When I run schedgraph, all it shows for the whole period is just: CPU 0 irq 17: pcm0 ath0 ath0 taskq that's it. I have a shared irq between ath and the sound. Funnily enough, within a few mins before the freeze, I was just listening to a voicemail (i.e., I ran mplayer on a .wav file), but that was done easily a minute or more before the freeze. I tried repeating that, listening to the file and waiting a bit, but no more freezes. So not sure if this is a possible indication of the cause, or not. It's annoying not to be able to find a way of triggering this problem on demand, though. Anyway, I looked at the other system where I've had long freezes. It has a shared [irq9: pcm0 cbb0++*] and on cbb0 I have an ath card! So there, too, is an irq shared between pcm and ath. On this system, if I find I'm in a long freeze and don't want to be, I've found that pulling the ath card causes an immediate un-freeze. Admittedly, based on the recent days' discussion in this thread, I was more expecting to see moused or powerd or an xorg problem. Oh! Another freeze, right then! This dump shows pcm0/ath0 too, but also a bit more activity just after the return. http://opal.com/jr/freebsd/releng_7-freeze/200801132337-ktr.out This time I was not listening to sound. I was typing this email. Wasn't there a thread about shared irqs here (or maybe on current) recently? -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
Yet another: http://opal.com/jr/freebsd/releng_7-freeze/200801132359-ktr.out Shows just the same as the first, just: CPU 0 irq 17: pcm0 ath0 ath0 taskq -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Mon, Jan 14, 2008 at 12:03:54AM -0500, J.R. Oldroyd wrote: Yet another: http://opal.com/jr/freebsd/releng_7-freeze/200801132359-ktr.out Shows just the same as the first, just: CPU 0 irq 17: pcm0 ath0 ath0 taskq BTW, I am experiencing hard hang on my laptop when ath0 is brought up and powerd is running. It takes several seconds when this happens with no AP nearby. It may take up to the hour when machine is able associate to the AP. Reliable workaround for me was to stop powerd before turning on ath0 on the laptop. On the other hand, one of my desktop workstation has atheros card, and runs powerd without problem, lowering the frequency of the CPU. pgpyq2DqBvrmq.pgp Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Fri, 11 Jan 2008 12:49:29 -0500, I wrote: I have yet to experience a random freeze not directly attributable to a softupdate while running the lock profiling. I am running with lock profiling on, and resetting the profiling counters once a minute. Yesterday and this morning, I've run for quite a while now with lock profiling on but without a random freeze. I'll wait some more, but I'm hoping that enabling the lock profiling hasn't masked the freeze. I'll post again when I see one.. It is looking more likely to me that enabling lock profiling does mask the freeze. I ran for more than 10 hours yesterday with lock profiling enabled and did not observe a single freeze. After about 7 hours, I stopped the lock profiling and within 20 mins or so, I experienced a NINE MINUTE freeze!! On re-enabling the lock profiling, I ran for about 3 more hours with no further freezes. At the time of that long freeze, all I was doing was typing an email message. The load average was almost 0. Mail client is claws-email. Also running but idle were firefox, ical, several xterms, fvwm its children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock. And xorg which uses the xf86-video-intel driver. Daemons running were wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd, sendmail, cron, moused and xdm. That is all. I had only one browser window open, the last page I'd viewed there was one containing some flash, I have linux-flash7 installed and nspluginwrapper to make that work in the native browser. All code (kernel, world and ports) have been compiled locally - there are no packages installed. Kernel is RELENG_7 from two days ago, world and ports are about one month old. If I get time today I may have a go at getting schedgraph info instead of continuing with lock profiling. -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Sat, 12 Jan 2008 12:30:53 -0500, I wrote: On Fri, 11 Jan 2008 12:49:29 -0500, I wrote: I have yet to experience a random freeze not directly attributable It is looking more likely to me that enabling lock profiling does mask ... At the time of that long freeze, all I was doing was typing an email message. The load average was almost 0. Mail client is claws-email. Also running but idle were firefox, ical, several xterms, fvwm its children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock. And xorg which uses the xf86-video-intel driver. Daemons running were wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd, sendmail, cron, moused and xdm. That is all. I had only one browser window open, the last page I'd viewed there was one containing some flash, I have linux-flash7 installed and nspluginwrapper to make that work in the native browser. Ah! Just experienced a short freeze (it seemed like about 5 secs) while lock profiling was enabled. The profile from the last minute is here: http://opal.com/jr/freebsd/releng_7-freeze/200801121331-unknown.txt There's an almost 4 second wait_avg for: /usr/src/sys/sys/buf.h:280 (lockmgr:bufwait) I had exactly the same mix of programs open as shown above. -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Sat, 12 Jan 2008 13:40:34 -0500, I wrote: Ah! Just experienced a short freeze ... And another! This one about 3 or 4 seconds. The profile from the last minute is here: http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt This one shows a long hold_avg (I meant hold_avg in the previous message, too) for: /usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs) -jr signature.asc Description: PGP signature
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Fri, 11 Jan 2008 12:49:29 -0500, I wrote: I have yet to experience a random freeze not directly attributable to a softupdate while running the lock profiling. I am running with lock profiling on, and resetting the profiling counters once a minute. Yesterday and this morning, I've run for quite a while now with lock profiling on but without a random freeze. I'll wait some more, but I'm hoping that enabling the lock profiling hasn't masked the freeze. I'll post again when I see one.. It is looking more likely to me that enabling lock profiling does mask the freeze. I ran for more than 10 hours yesterday with lock profiling enabled and did not observe a single freeze. After about 7 hours, I stopped the lock profiling and within 20 mins or so, I experienced a NINE MINUTE freeze!! On re-enabling the lock profiling, I ran for about 3 more hours with no further freezes. At the time of that long freeze, all I was doing was typing an email message. The load average was almost 0. Mail client is claws-email. Also running but idle were firefox, ical, several xterms, fvwm its children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock. And xorg which uses the xf86-video-intel driver. Daemons running were wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd, sendmail, cron, moused and xdm. That is all. You might want to try disabling powerd and see if that mitigates the problem. powerd is going to be messing with the CPU clock when it is near idle. Your system would be less idle with lock profiling enabled, which might explain why the problem seems to happen less often in that case. -DG David G. Lawrence President Download Technologies, Inc. - http://www.downloadtech.com - (866) 399 8500 The FreeBSD Project - http://www.freebsd.org Pave the road of life with opportunities. ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: RELENG_7 2008/01/10 desktop system also periodically freezes
On Thu, 10 Jan 2008 08:32:12 +0100, Kris Kennaway [EMAIL PROTECTED] wrote: J.R. Oldroyd wrote: On Wed, 09 Jan 2008 20:38:29 +0100, Kris Kennaway [EMAIL PROTECTED] wrote: OK, same requests as to the others then. I presume you mean hwpmc... LOCK_PROFILING, sched_graph, hwpmc. Quick update on this so that folk know what's going on. Firstly, I've updated the code here to RELENG_7 csup'd yesterday and if I run a normal kernel, without debugging, I do see the same freezes with that, too. Secondly, I am unable to process hwpcm dumps using pmcstat because it core dumps when using the -R option to decode a dump. I will talk to jkoshy@ about that. I am now running a kernel with LOCK_PROFILING. I am able to create a freeze due to the softupdate activity by removing a large file; a lock profile captured surrounding such a freeze can be seen here: http://opal.com/jr/freebsd/releng_7-freeze/20080135-softupdate.txt This is easily repeatable. The sequence is: create 1.5GB file sysctl debug.lock.prof.enable=1 rm file ... wait, moving mouse until it freezes then unfreezes ... sysctl debug.lock.prof.enable=0 sysctl debug.lock.prof.stats Kostik, is this of any help to you? I have yet to experience a random freeze not directly attributable to a softupdate while running the lock profiling. I am running with lock profiling on, and resetting the profiling counters once a minute. Yesterday and this morning, I've run for quite a while now with lock profiling on but without a random freeze. I'll wait some more, but I'm hoping that enabling the lock profiling hasn't masked the freeze. I'll post again when I see one.. -jr signature.asc Description: PGP signature