Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-15 Thread Ulrich Spoerlein
On Sun, 13.01.2008 at 17:25:24 -0500, J.R. Oldroyd wrote:
 David's suggestion re powerd may be relevant.  I'd noticed that the
 problem seems to happen when the system is idle.  I posted earlier that
 it seems like I can do all sorts of work without a problem then I stop for
 a phone call and when I resume it hangs.  I tend to notice a lot of hangs
 when typing an email.

Try with running/looping some MP3 or WAV files. My system never, ever
froze during sound playback. Only when idle. But since I'm running
multiple wmdocklets that update periodically idle is not really true.


Cheers,
Ulrich Spoerlein
-- 
It is better to remain silent and be thought a fool,
than to speak, and remove all doubt.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-14 Thread J.R. Oldroyd
Here is another ktr dump.  This freeze was a longer one, getting on for
two minutes:
http://opal.com/jr/freebsd/releng_7-freeze/200801141259-ktr.out

This one again shows some post-freeze activity, and yet again the only
activity during the freeze is that shared ath0/pcm irq and the ath0 taskq.

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread Kris Kennaway

J.R. Oldroyd wrote:

On Sat, 12 Jan 2008 13:40:34 -0500, I wrote:

Ah!  Just experienced a short freeze ...


And another!  This one about 3 or 4 seconds.

The profile from the last minute is here:
http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt

This one shows a long hold_avg (I meant hold_avg in the previous
message, too) for:
/usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs)

-jr


Thanks.  Those particular things you mention are both normal, I will 
look at the traces in more detail later.


Kris
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread Kris Kennaway

J.R. Oldroyd wrote:

On Sat, 12 Jan 2008 13:40:34 -0500, I wrote:

Ah!  Just experienced a short freeze ...


And another!  This one about 3 or 4 seconds.

The profile from the last minute is here:
http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt

This one shows a long hold_avg (I meant hold_avg in the previous
message, too) for:
/usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs)

-jr


In both of these cases there is a non-mpsafe interrupt (probably your 
mouse) blocked on a sysctl call (both of these require Giant).  That 
would be fixed once sysctl no longer requires Giant (hopefully before 
7.1, I have a WIP), or once someone makes the psm driver mpsafe.  I 
don't think it accounts for a multi-second pause though.  Probably a 
sched_graph trace will also be required to diagnose that better.


Kris
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread Ronald Klop

On Sat, 12 Jan 2008 18:30:53 +0100, J.R. Oldroyd [EMAIL PROTECTED] wrote:


On Fri, 11 Jan 2008 12:49:29 -0500, I wrote:


I have yet to experience a random freeze not directly attributable
to a softupdate while running the lock profiling.  I am running with
lock profiling on, and resetting the profiling counters once a minute.
Yesterday and this morning, I've run for quite a while now with lock
profiling on but without a random freeze.  I'll wait some more, but
I'm hoping that enabling the lock profiling hasn't masked the freeze.
I'll post again when I see one..



It is looking more likely to me that enabling lock profiling does mask
the freeze.  I ran for more than 10 hours yesterday with lock profiling
enabled and did not observe a single freeze.  After about 7 hours, I
stopped the lock profiling and within 20 mins or so, I experienced a
NINE MINUTE freeze!!  On re-enabling the lock profiling, I ran for about
3 more hours with no further freezes.

At the time of that long freeze, all I was doing was typing an email
message.  The load average was almost 0.  Mail client is claws-email.
Also running but idle were firefox, ical, several xterms, fvwm  its
children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock.  And
xorg which uses the xf86-video-intel driver.  Daemons running were
wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd,
sendmail, cron, moused and xdm.  That is all.

I had only one browser window open, the last page I'd viewed there
was one containing some flash, I have linux-flash7 installed and
nspluginwrapper to make that work in the native browser.

All code (kernel, world and ports) have been compiled locally - there
are no packages installed.  Kernel is RELENG_7 from two days ago,
world and ports are about one month old.

If I get time today I may have a go at getting schedgraph info
instead of continuing with lock profiling.

-jr


What version of xf86-video-intel? I'm having trouble since I upgraded  
xf86-video-i810, which is an older version of the intel driver, from 1.6.x  
to 1.7.x and a colleague had trouble with the 2.x version of the  
xf86-video-intel driver.
My trouble is a hanging system (I never waited for it to come back) and  
then ctrl-alt-F1 (to console) gives a panic a lot of times. I just  
prepared my machinde for a crash dump. So maybe next panic I have one.


Maybe downgrading to 1.6.x helps me again. After the next panic I'll try  
that. But maybe it helps you also.


I upgrade my RELENG_7 about every week, so it's fairly recent.

Ronald.

--
 Ronald Klop
 Amsterdam, The Netherlands
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread J.R. Oldroyd
On Sun, 13 Jan 2008 22:29:34 +0100, Ronald Klop [EMAIL PROTECTED] wrote:

 What version of xf86-video-intel? I'm having trouble since I upgraded  
 xf86-video-i810, which is an older version of the intel driver, from 1.6.x  
 to 1.7.x and a colleague had trouble with the 2.x version of the  
 xf86-video-intel driver.

xf86-video-intel-2.1.1

 My trouble is a hanging system (I never waited for it to come back) and  
 then ctrl-alt-F1 (to console) gives a panic a lot of times. I just  
 prepared my machinde for a crash dump. So maybe next panic I have one.
 

I have not seen any panics switching between vtys.  Your hang might
be the same as my long freeze.  Next time it happens, try waiting for
a while.  I've had some freezes many minutes long.  A 9 min one yesterday,
and I've had a 12 min one before.  Just move the mouse and then sit back
until the cursor actually moves!

 Maybe downgrading to 1.6.x helps me again. After the next panic I'll try  
 that. But maybe it helps you also.

When I started my 7.0BETA testing the installation process gave me
xf86-video-i810 and I had several other problems which resolved on
switching to the xf86-video-intel driver.  I'd be hesitant to go
back.
 
 I upgrade my RELENG_7 about every week, so it's fairly recent.
 
 Ronald.
 

Mine is now also up-to-date RELENG_7.

David's suggestion re powerd may be relevant.  I'd noticed that the
problem seems to happen when the system is idle.  I posted earlier that
it seems like I can do all sorts of work without a problem then I stop for
a phone call and when I resume it hangs.  I tend to notice a lot of hangs
when typing an email.  I have disabled powerd for now; don't really need
it anyway, that was a left-over from something quite some time back.  No
freezes yet, but it'll take several more hours to know if this might be
relevant.

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread J.R. Oldroyd
Well, after running the ktr_sched-enabled kernel for about 4h50 now, I
did just see a short freeze.  Abt 2-3 seconds.  And I got a ktr dump
right after it came back.  It can be downloaded here (I guess rt-click
and save the link):
http://opal.com/jr/freebsd/releng_7-freeze/200801132250-ktr.out

When I run schedgraph, all it shows for the whole period is just:
CPU 0
irq 17: pcm0 ath0
ath0 taskq
that's it.  I have a shared irq between ath and the sound.  Funnily
enough, within a few mins before the freeze, I was just listening
to a voicemail (i.e., I ran mplayer on a .wav file), but that was done
easily a minute or more before the freeze.  I tried repeating that,
listening to the file and waiting a bit, but no more freezes.  So not
sure if this is a possible indication of the cause, or not.  It's
annoying not to be able to find a way of triggering this problem on
demand, though.

Anyway, I looked at the other system where I've had long freezes.
It has a shared [irq9: pcm0 cbb0++*] and on cbb0 I have an ath card!
So there, too, is an irq shared between pcm and ath.  On this system,
if I find I'm in a long freeze and don't want to be, I've found that
pulling the ath card causes an immediate un-freeze.

Admittedly, based on the recent days' discussion in this thread, I
was more expecting to see moused or powerd or an xorg problem.

Oh! Another freeze, right then!  This dump shows pcm0/ath0 too, but
also a bit more activity just after the return.
http://opal.com/jr/freebsd/releng_7-freeze/200801132337-ktr.out
This time I was not listening to sound.  I was typing this email.

Wasn't there a thread about shared irqs here (or maybe on current)
recently?

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread J.R. Oldroyd
Yet another:
 http://opal.com/jr/freebsd/releng_7-freeze/200801132359-ktr.out

Shows just the same as the first, just:
CPU 0
irq 17: pcm0 ath0
ath0 taskq

-jr



signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-13 Thread Kostik Belousov
On Mon, Jan 14, 2008 at 12:03:54AM -0500, J.R. Oldroyd wrote:
 Yet another:
  http://opal.com/jr/freebsd/releng_7-freeze/200801132359-ktr.out
 
 Shows just the same as the first, just:
   CPU 0
   irq 17: pcm0 ath0
   ath0 taskq

BTW, I am experiencing hard hang on my laptop when ath0 is brought up and
powerd is running. It takes several seconds when this happens with no AP
nearby. It may take up to the hour when machine is able associate to the AP.

Reliable workaround for me was to stop powerd before turning on ath0 on the
laptop.

On the other hand, one of my desktop workstation has atheros card, and
runs powerd without problem, lowering the frequency of the CPU.


pgpyq2DqBvrmq.pgp
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-12 Thread J.R. Oldroyd
On Fri, 11 Jan 2008 12:49:29 -0500, I wrote:

 I have yet to experience a random freeze not directly attributable
 to a softupdate while running the lock profiling.  I am running with
 lock profiling on, and resetting the profiling counters once a minute.
 Yesterday and this morning, I've run for quite a while now with lock
 profiling on but without a random freeze.  I'll wait some more, but
 I'm hoping that enabling the lock profiling hasn't masked the freeze.
 I'll post again when I see one..
 

It is looking more likely to me that enabling lock profiling does mask
the freeze.  I ran for more than 10 hours yesterday with lock profiling
enabled and did not observe a single freeze.  After about 7 hours, I
stopped the lock profiling and within 20 mins or so, I experienced a
NINE MINUTE freeze!!  On re-enabling the lock profiling, I ran for about
3 more hours with no further freezes.

At the time of that long freeze, all I was doing was typing an email
message.  The load average was almost 0.  Mail client is claws-email.
Also running but idle were firefox, ical, several xterms, fvwm  its
children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock.  And
xorg which uses the xf86-video-intel driver.  Daemons running were
wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd,
sendmail, cron, moused and xdm.  That is all.

I had only one browser window open, the last page I'd viewed there
was one containing some flash, I have linux-flash7 installed and
nspluginwrapper to make that work in the native browser.

All code (kernel, world and ports) have been compiled locally - there
are no packages installed.  Kernel is RELENG_7 from two days ago,
world and ports are about one month old.

If I get time today I may have a go at getting schedgraph info
instead of continuing with lock profiling.

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-12 Thread J.R. Oldroyd
On Sat, 12 Jan 2008 12:30:53 -0500, I wrote:

 On Fri, 11 Jan 2008 12:49:29 -0500, I wrote:
 
  I have yet to experience a random freeze not directly attributable
 
 It is looking more likely to me that enabling lock profiling does mask
 ...
 
 At the time of that long freeze, all I was doing was typing an email
 message.  The load average was almost 0.  Mail client is claws-email.
 Also running but idle were firefox, ical, several xterms, fvwm  its
 children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock.  And
 xorg which uses the xf86-video-intel driver.  Daemons running were
 wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd,
 sendmail, cron, moused and xdm.  That is all.
 
 I had only one browser window open, the last page I'd viewed there
 was one containing some flash, I have linux-flash7 installed and
 nspluginwrapper to make that work in the native browser.
 

Ah!  Just experienced a short freeze (it seemed like about 5 secs) while
lock profiling was enabled.

The profile from the last minute is here:
http://opal.com/jr/freebsd/releng_7-freeze/200801121331-unknown.txt

There's an almost 4 second wait_avg for:
/usr/src/sys/sys/buf.h:280 (lockmgr:bufwait)

I had exactly the same mix of programs open as shown above.

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-12 Thread J.R. Oldroyd
On Sat, 12 Jan 2008 13:40:34 -0500, I wrote:

 Ah!  Just experienced a short freeze ...

And another!  This one about 3 or 4 seconds.

The profile from the last minute is here:
http://opal.com/jr/freebsd/releng_7-freeze/200801121342-unknown.txt

This one shows a long hold_avg (I meant hold_avg in the previous
message, too) for:
/usr/src/sys/kern/vfs_vnops.c:515 (lockmgr:ufs)

-jr


signature.asc
Description: PGP signature


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-12 Thread David G Lawrence
 On Fri, 11 Jan 2008 12:49:29 -0500, I wrote:
 
  I have yet to experience a random freeze not directly attributable
  to a softupdate while running the lock profiling.  I am running with
  lock profiling on, and resetting the profiling counters once a minute.
  Yesterday and this morning, I've run for quite a while now with lock
  profiling on but without a random freeze.  I'll wait some more, but
  I'm hoping that enabling the lock profiling hasn't masked the freeze.
  I'll post again when I see one..
  
 
 It is looking more likely to me that enabling lock profiling does mask
 the freeze.  I ran for more than 10 hours yesterday with lock profiling
 enabled and did not observe a single freeze.  After about 7 hours, I
 stopped the lock profiling and within 20 mins or so, I experienced a
 NINE MINUTE freeze!!  On re-enabling the lock profiling, I ran for about
 3 more hours with no further freezes.
 
 At the time of that long freeze, all I was doing was typing an email
 message.  The load average was almost 0.  Mail client is claws-email.
 Also running but idle were firefox, ical, several xterms, fvwm  its
 children (Fvwm{Buttons,Event,Pager,IconMan}), xload and xclock.  And
 xorg which uses the xf86-video-intel driver.  Daemons running were
 wpa_supplicant, dhclient, devd, syslogd, cupsd, ntpd, powerd, sshd,
 sendmail, cron, moused and xdm.  That is all.

   You might want to try disabling powerd and see if that mitigates the
problem. powerd is going to be messing with the CPU clock when it is near
idle. Your system would be less idle with lock profiling enabled, which
might explain why the problem seems to happen less often in that case.
 
-DG

David G. Lawrence
President
Download Technologies, Inc. - http://www.downloadtech.com - (866) 399 8500
The FreeBSD Project - http://www.freebsd.org
Pave the road of life with opportunities.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_7 2008/01/10 desktop system also periodically freezes

2008-01-11 Thread J.R. Oldroyd
On Thu, 10 Jan 2008 08:32:12 +0100, Kris Kennaway [EMAIL PROTECTED] wrote:

 J.R. Oldroyd wrote:
  On Wed, 09 Jan 2008 20:38:29 +0100, Kris Kennaway [EMAIL PROTECTED] wrote:
 
  OK, same requests as to the others then.
 
  
  I presume you mean hwpmc...
 
 LOCK_PROFILING, sched_graph, hwpmc.
 

Quick update on this so that folk know what's going on.

Firstly, I've updated the code here to RELENG_7 csup'd yesterday and
if I run a normal kernel, without debugging, I do see the same freezes
with that, too.

Secondly, I am unable to process hwpcm dumps using pmcstat because it
core dumps when using the -R option to decode a dump.  I will talk to
jkoshy@ about that.

I am now running a kernel with LOCK_PROFILING.  I am able to create a
freeze due to the softupdate activity by removing a large file; a lock
profile captured surrounding such a freeze can be seen here:
http://opal.com/jr/freebsd/releng_7-freeze/20080135-softupdate.txt
This is easily repeatable.  The sequence is:
create 1.5GB file
sysctl debug.lock.prof.enable=1
rm file
... wait, moving mouse until it freezes then unfreezes ...
sysctl debug.lock.prof.enable=0
sysctl debug.lock.prof.stats
Kostik, is this of any help to you?

I have yet to experience a random freeze not directly attributable
to a softupdate while running the lock profiling.  I am running with
lock profiling on, and resetting the profiling counters once a minute.
Yesterday and this morning, I've run for quite a while now with lock
profiling on but without a random freeze.  I'll wait some more, but
I'm hoping that enabling the lock profiling hasn't masked the freeze.
I'll post again when I see one..

-jr


signature.asc
Description: PGP signature