Re: SMP SS10 freezes hard intermittently

2000-07-10 Thread David Butts
Hello, all-

I apologize for taking so long to respond.  I'm (hopefully) not nearly
the ungrateful bastard I seem to be...

On Tue, 13 Jun 2000, Anton Blanchard wrote:

 In-Reply-To: [EMAIL PROTECTED]; from
[EMAIL PROTECTED] on Mon, Jun 12, 2000 at 08:53:49PM -0400
 
  
  I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP
  enabled.  It locks up hard periodically, and won't respond to
  stop-A, a break on the serial port, the three-fingered salute,
  or, in fact, anything short of a power-cycle.
 
 When I was fixing sparc32 SMP in 2.2, I only had access to one cpu and
 machine type (viking supersparc on 4m/690). As such it tends to be stable :)
 
 ftoomsh# uptime
 3:01am  up 99 days, 13:59, 31 users,  load average: 1.42, 1.38, 1.36
 
 ftoomsh# cat /proc/cpuinfo 
 cpu : Texas Instruments, Inc. - SuperSparc 50
 fpu : SuperSparc on-chip FPU
 promlib : Version 3 Revision 2
 prom: 2.14
 type: sun4m
 ncpus probed: 4
 ncpus active: 4
 Cpu0Bogo: 49.86
 Cpu1Bogo: 49.97
 Cpu2Bogo: 49.97
 Cpu3Bogo: 49.97
 MMU type: TI Viking/MXCC
 contexts: 65536
 CPU0: online
 CPU1: online
 CPU2: online
 CPU3: online

FWIW,

cpu : Texas Instruments, Inc. - MicroSparc
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.12
type: sun4m
ncpus probed: 2
ncpus active: 2
Cpu0Bogo: 39.83
Cpu1Bogo: 39.93
MMU type: TI Viking
invall  : 0
invmm   : 0
invrnge : 0
invpg   : 0
contexts: 65536
CPU0: online
CPU1: online

 There are too many things to fix in 2.2 (that I am fixing in 2.4) so
 random lockups are no surprise to me. 2.4 should be ready for sparc32 real
 soon.

Does that mean I should rush out and grab myself some 2.4.0-test2
sources and try them out, or have your changes not been incorporated
yet?

Also, if it's possible to describe generally, what sort of problems
exist with the Sparc32 SMP code in 2.2?

Thanks (belatedly)-

David




RE: SMP SS10 freezes hard intermittently

2000-07-10 Thread David Butts
Hello-

Again, my apologies for the embarassingly long delay...

On Mon, 12 Jun 2000, Gregory Leblanc wrote:

  -Original Message-
  From: David Butts [mailto:[EMAIL PROTECTED]
  Sent: Monday, June 12, 2000 8:00 PM
  To: debian-sparc@lists.debian.org
  Cc: sparclinux@vger.rutgers.edu
  Subject: RE: SMP SS10 freezes hard intermittently
  
  Hello-
  
  I've got a CG6.  I had included the other FBs for the sake of ...
  completeness (read: figuring out what I had in the box).
  
  From /var/log/messages:
  
  fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]
  
  The only (permanent) change I made to the devices was the addition
  of a symlink to /dev/fb0:
  
  lrwxrwxrwx1 root root3 Jun  9 17:56 
  /dev/cgsix0 - fb0
  crw--w--w-1 root tty   29,   0 May  4 08:32 /dev/fb0
  
  HTH (particularly since I'm the one it will ultimately end up
  helping :)
 
 ok, time for some more random guessing.  What CPU modules do you have in
 there?  Is this machine stable under Solaris (any version)?  Any other SBUS
 cards in there?  What about sound, is DBRI compiled in, and does it work?
 (DBRI is EXTREMELY flaky on my SS20).  Hopefully somebody else will have
 more guesses, I only have 1 situation where I can crash Linux/X, and that's
 about 100% reliable of a crash.  :-(  I just don't do that.  Later,

cpu : Texas Instruments, Inc. - MicroSparc
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.12
type: sun4m
ncpus probed: 2
ncpus active: 2
Cpu0Bogo: 39.83
Cpu1Bogo: 39.93
MMU type: TI Viking
invall  : 0
invmm   : 0
invrnge : 0
invpg   : 0
contexts: 65536
CPU0: online
CPU1: online

This particular beast is actually the unholy union of the innards of
two otherwise identical SS10's, so it's never booted anything else in
its current incarnation.  The existing OS on the disks was SunOS
4.something, which was not amused to find a second CPU.

As I expect you saw, Anton Blanchard suspects that this sort of
unpleasantness is to be expected from a 2.2 kernel, so I'll either
just be patient, or continue shotgun debugging.

Thanks-

David




RE: SMP SS10 freezes hard intermittently

2000-06-13 Thread Gregory Leblanc
 -Original Message-
 From: David Butts [mailto:[EMAIL PROTECTED]
 Sent: Monday, June 12, 2000 8:00 PM
 To: debian-sparc@lists.debian.org
 Cc: sparclinux@vger.rutgers.edu
 Subject: RE: SMP SS10 freezes hard intermittently
 
 Hello-
 
 I've got a CG6.  I had included the other FBs for the sake of ...
 completeness (read: figuring out what I had in the box).
 
 From /var/log/messages:
 
 fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]
 
 The only (permanent) change I made to the devices was the addition
 of a symlink to /dev/fb0:
 
 lrwxrwxrwx1 root root3 Jun  9 17:56 
 /dev/cgsix0 - fb0
 crw--w--w-1 root tty   29,   0 May  4 08:32 /dev/fb0
 
 HTH (particularly since I'm the one it will ultimately end up
 helping :)

ok, time for some more random guessing.  What CPU modules do you have in
there?  Is this machine stable under Solaris (any version)?  Any other SBUS
cards in there?  What about sound, is DBRI compiled in, and does it work?
(DBRI is EXTREMELY flaky on my SS20).  Hopefully somebody else will have
more guesses, I only have 1 situation where I can crash Linux/X, and that's
about 100% reliable of a crash.  :-(  I just don't do that.  Later,
Greg



Re: SMP SS10 freezes hard intermittently

2000-06-13 Thread Anton Blanchard
 
 I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP
 enabled.  It locks up hard periodically, and won't respond to
 stop-A, a break on the serial port, the three-fingered salute,
 or, in fact, anything short of a power-cycle.

When I was fixing sparc32 SMP in 2.2, I only had access to one cpu and
machine type (viking supersparc on 4m/690). As such it tends to be stable :)

ftoomsh# uptime
3:01am  up 99 days, 13:59, 31 users,  load average: 1.42, 1.38, 1.36

ftoomsh# cat /proc/cpuinfo 
cpu : Texas Instruments, Inc. - SuperSparc 50
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.14
type: sun4m
ncpus probed: 4
ncpus active: 4
Cpu0Bogo: 49.86
Cpu1Bogo: 49.97
Cpu2Bogo: 49.97
Cpu3Bogo: 49.97
MMU type: TI Viking/MXCC
contexts: 65536
CPU0: online
CPU1: online
CPU2: online
CPU3: online

There are too many things to fix in 2.2 (that I am fixing in 2.4) so
random lockups are no surprise to me. 2.4 should be ready for sparc32 real
soon.

Cheers,
Anton



RE: SMP SS10 freezes hard intermittently

2000-06-12 Thread Gregory Leblanc
 -Original Message-
 From: David Butts [mailto:[EMAIL PROTECTED]
 Sent: Monday, June 12, 2000 5:54 PM
 To: debian-sparc@lists.debian.org
 Cc: sparclinux@vger.rutgers.edu
 Subject: SMP SS10 freezes hard intermittently
 
 Hello, all-
 
 I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP
 enabled.  It locks up hard periodically, and won't respond to
 stop-A, a break on the serial port, the three-fingered salute,
 or, in fact, anything short of a power-cycle.
 
 It doesn't do it consistantly, but the times it happened were (a)
 when installing packages with apt-get (unfortunately, I didn't
 pay enough attention to which, but it froze immedaitely after it
 finished the download of the .deb files), and (b) in X (both when
 I was strace'ing the startup to find out why it thought I didn't
 have a screen, and later, while WindowMaker was starting up).
 
 There are no obviously strange log entries, and my strace output
 was in /tmp, so it was removed after teh power-cycle.  If there
 are any other diagnostic steps I should try, please let me know.
 
 I've included my .config below.  AFAIK, there were no modules
 loaded at the time of the crashes.
 
 I've cc'ed sparclinux@vger.rutgers.edu based on a suggestion in
 response to a similar problem a couple of months ago posted to
 debian-sparc, but I'm not on that list, so please cc me on any
 mail send to the vger list.

What's your console device?  CG3?
Greg



RE: SMP SS10 freezes hard intermittently

2000-06-12 Thread David Butts
Hello-

I've got a CG6.  I had included the other FBs for the sake of ...
completeness (read: figuring out what I had in the box).

From /var/log/messages:

fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]

The only (permanent) change I made to the devices was the addition
of a symlink to /dev/fb0:

lrwxrwxrwx1 root root3 Jun  9 17:56 /dev/cgsix0 - fb0
crw--w--w-1 root tty   29,   0 May  4 08:32 /dev/fb0

HTH (particularly since I'm the one it will ultimately end up
helping :)

David

On Mon, 12 Jun 2000, Gregory Leblanc wrote:

  -Original Message-
  From: David Butts [mailto:[EMAIL PROTECTED]
  Sent: Monday, June 12, 2000 5:54 PM
  To: debian-sparc@lists.debian.org
  Cc: sparclinux@vger.rutgers.edu
  Subject: SMP SS10 freezes hard intermittently
  
  Hello, all-
  
  I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP
  enabled.  It locks up hard periodically, and won't respond to
  stop-A, a break on the serial port, the three-fingered salute,
  or, in fact, anything short of a power-cycle.
  
  It doesn't do it consistantly, but the times it happened were (a)
  when installing packages with apt-get (unfortunately, I didn't
  pay enough attention to which, but it froze immediately after it
  finished the download of the .deb files), and (b) in X (both when
  I was strace'ing the startup to find out why it thought I didn't
  have a screen, and later, while WindowMaker was starting up).
  
  There are no obviously strange log entries, and my strace output
  was in /tmp, so it was removed after the power-cycle.  If there
  are any other diagnostic steps I should try, please let me know.
  
  I've included my .config below.  AFAIK, there were no modules
  loaded at the time of the crashes.
  
  I've cc'ed sparclinux@vger.rutgers.edu based on a suggestion in
  response to a similar problem a couple of months ago posted to
  debian-sparc, but I'm not on that list, so please cc me on any
  mail send to the vger list.
 
 What's your console device?  CG3?
   Greg