Re: SMP SS10 freezes hard intermittently
Hello, all-

I apologize for taking so long to respond. I'm (hopefully) not nearly the ungrateful bastard I seem to be...

On Tue, 13 Jun 2000, Anton Blanchard wrote:

In-Reply-To: [EMAIL PROTECTED]; from [EMAIL PROTECTED] on Mon, Jun 12, 2000 at 08:53:49PM -0400

I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP enabled. It locks up hard periodically, and won't respond to stop-A, a break on the serial port, the three-fingered salute, or, in fact, anything short of a power-cycle.

When I was fixing sparc32 SMP in 2.2, I only had access to one cpu and machine type (viking supersparc on 4m/690). As such it tends to be stable :)

ftoomsh# uptime
3:01am up 99 days, 13:59, 31 users, load average: 1.42, 1.38, 1.36
ftoomsh# cat /proc/cpuinfo
cpu : Texas Instruments, Inc. - SuperSparc 50
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.14
type: sun4m
ncpus probed: 4
ncpus active: 4
Cpu0Bogo: 49.86
Cpu1Bogo: 49.97
Cpu2Bogo: 49.97
Cpu3Bogo: 49.97
MMU type: TI Viking/MXCC
contexts: 65536
CPU0: online
CPU1: online
CPU2: online
CPU3: online

FWIW,

cpu : Texas Instruments, Inc. - MicroSparc
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.12
type: sun4m
ncpus probed: 2
ncpus active: 2
Cpu0Bogo: 39.83
Cpu1Bogo: 39.93
MMU type: TI Viking
invall : 0
invmm : 0
invrnge : 0
invpg : 0
contexts: 65536
CPU0: online
CPU1: online

There are too many things to fix in 2.2 (that I am fixing in 2.4) so random lockups are no surprise to me. 2.4 should be ready for sparc32 real soon.

Does that mean I should rush out and grab myself some 2.4.0-test2 sources and try them out, or have your changes not been incorporated yet? Also, if it's possible to describe generally, what sort of problems exist with the Sparc32 SMP code in 2.2?

Thanks (belatedly)-
David
RE: SMP SS10 freezes hard intermittently
Hello-

Again, my apologies for the embarrassingly long delay...

On Mon, 12 Jun 2000, Gregory Leblanc wrote:

-Original Message-
From: David Butts [mailto:[EMAIL PROTECTED]
Sent: Monday, June 12, 2000 8:00 PM
To: debian-sparc@lists.debian.org
Cc: sparclinux@vger.rutgers.edu
Subject: RE: SMP SS10 freezes hard intermittently

Hello-

I've got a CG6. I had included the other FBs for the sake of ... completeness (read: figuring out what I had in the box).

From /var/log/messages:

fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]

The only (permanent) change I made to the devices was the addition of a symlink to /dev/fb0:

lrwxrwxrwx 1 root root 3 Jun 9 17:56 /dev/cgsix0 -> fb0
crw--w--w- 1 root tty 29, 0 May 4 08:32 /dev/fb0

HTH (particularly since I'm the one it will ultimately end up helping :)

ok, time for some more random guessing. What CPU modules do you have in there? Is this machine stable under Solaris (any version)? Any other SBUS cards in there? What about sound, is DBRI compiled in, and does it work? (DBRI is EXTREMELY flaky on my SS20). Hopefully somebody else will have more guesses, I only have 1 situation where I can crash Linux/X, and that's about 100% reliable of a crash. :-( I just don't do that.

Later,

cpu : Texas Instruments, Inc. - MicroSparc
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.12
type: sun4m
ncpus probed: 2
ncpus active: 2
Cpu0Bogo: 39.83
Cpu1Bogo: 39.93
MMU type: TI Viking
invall : 0
invmm : 0
invrnge : 0
invpg : 0
contexts: 65536
CPU0: online
CPU1: online

This particular beast is actually the unholy union of the innards of two otherwise identical SS10's, so it's never booted anything else in its current incarnation. The existing OS on the disks was SunOS 4.something, which was not amused to find a second CPU.

As I expect you saw, Anton Blanchard suspects that this sort of unpleasantness is to be expected from a 2.2 kernel, so I'll either just be patient, or continue shotgun debugging.

Thanks-
David
RE: SMP SS10 freezes hard intermittently
-Original Message-
From: David Butts [mailto:[EMAIL PROTECTED]
Sent: Monday, June 12, 2000 8:00 PM
To: debian-sparc@lists.debian.org
Cc: sparclinux@vger.rutgers.edu
Subject: RE: SMP SS10 freezes hard intermittently

Hello-

I've got a CG6. I had included the other FBs for the sake of ... completeness (read: figuring out what I had in the box).

From /var/log/messages:

fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]

The only (permanent) change I made to the devices was the addition of a symlink to /dev/fb0:

lrwxrwxrwx 1 root root 3 Jun 9 17:56 /dev/cgsix0 -> fb0
crw--w--w- 1 root tty 29, 0 May 4 08:32 /dev/fb0

HTH (particularly since I'm the one it will ultimately end up helping :)

ok, time for some more random guessing. What CPU modules do you have in there? Is this machine stable under Solaris (any version)? Any other SBUS cards in there? What about sound, is DBRI compiled in, and does it work? (DBRI is EXTREMELY flaky on my SS20). Hopefully somebody else will have more guesses, I only have 1 situation where I can crash Linux/X, and that's about 100% reliable of a crash. :-( I just don't do that.

Later,
Greg
Re: SMP SS10 freezes hard intermittently
I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP enabled. It locks up hard periodically, and won't respond to stop-A, a break on the serial port, the three-fingered salute, or, in fact, anything short of a power-cycle.

When I was fixing sparc32 SMP in 2.2, I only had access to one cpu and machine type (viking supersparc on 4m/690). As such it tends to be stable :)

ftoomsh# uptime
3:01am up 99 days, 13:59, 31 users, load average: 1.42, 1.38, 1.36
ftoomsh# cat /proc/cpuinfo
cpu : Texas Instruments, Inc. - SuperSparc 50
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.14
type: sun4m
ncpus probed: 4
ncpus active: 4
Cpu0Bogo: 49.86
Cpu1Bogo: 49.97
Cpu2Bogo: 49.97
Cpu3Bogo: 49.97
MMU type: TI Viking/MXCC
contexts: 65536
CPU0: online
CPU1: online
CPU2: online
CPU3: online

There are too many things to fix in 2.2 (that I am fixing in 2.4) so random lockups are no surprise to me. 2.4 should be ready for sparc32 real soon.

Cheers,
Anton
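For anyone comparing /proc/cpuinfo dumps like the ones in this thread, here is a rough Python sketch that splits the "key : value" fields and checks that every probed CPU is active and online. The sample text is the two-CPU SS10 output quoted elsewhere in the thread. The helper names (parse_cpuinfo, cpu_summary) are made up for illustration, and the field names are assumed to match what the sparc32 2.2 kernel prints here.

```python
# Sketch only: parse sparc-style /proc/cpuinfo text as quoted in this thread.
# parse_cpuinfo and cpu_summary are hypothetical helper names, not a real API.

SAMPLE = """\
cpu : Texas Instruments, Inc. - MicroSparc
fpu : SuperSparc on-chip FPU
promlib : Version 3 Revision 2
prom: 2.12
type: sun4m
ncpus probed: 2
ncpus active: 2
Cpu0Bogo: 39.83
Cpu1Bogo: 39.93
MMU type: TI Viking
contexts: 65536
CPU0: online
CPU1: online
"""


def parse_cpuinfo(text):
    """Return a dict mapping each field name to its value string."""
    fields = {}
    for line in text.splitlines():
        # Split on the first colon only; values may contain punctuation.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def cpu_summary(fields):
    """Return (probed, active, sorted list of CPUn entries marked online)."""
    probed = int(fields["ncpus probed"])
    active = int(fields["ncpus active"])
    # "CPU0", "CPU1", ... are the per-CPU state lines; "Cpu0Bogo" is excluded
    # because the prefix match is case-sensitive.
    online = sorted(
        key for key, value in fields.items()
        if key.startswith("CPU") and value == "online"
    )
    return probed, active, online


if __name__ == "__main__":
    probed, active, online = cpu_summary(parse_cpuinfo(SAMPLE))
    print(f"probed={probed} active={active} online={online}")
    if probed != active or len(online) != active:
        print("warning: not every probed CPU is active and online")
```

On a live machine the same functions could be fed the contents of /proc/cpuinfo instead of SAMPLE. This only confirms that the kernel brought both CPUs up; it says nothing about why the box locks.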
SMP SS10 freezes hard intermittently
Hello, all-

I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP enabled. It locks up hard periodically, and won't respond to stop-A, a break on the serial port, the three-fingered salute, or, in fact, anything short of a power-cycle. It doesn't do it consistently, but the times it happened were (a) when installing packages with apt-get (unfortunately, I didn't pay enough attention to which, but it froze immediately after it finished the download of the .deb files), and (b) in X (both when I was strace'ing the startup to find out why it thought I didn't have a screen, and later, while WindowMaker was starting up). There are no obviously strange log entries, and my strace output was in /tmp, so it was removed after the power-cycle.

If there are any other diagnostic steps I should try, please let me know. I've included my .config below. AFAIK, there were no modules loaded at the time of the crashes.

I've cc'ed sparclinux@vger.rutgers.edu based on a suggestion in response to a similar problem a couple of months ago posted to debian-sparc, but I'm not on that list, so please cc me on any mail sent to the vger list.
Thanks,
David Butts

#
# Automatically generated make config: don't edit
#

#
# Code maturity level options
#
# CONFIG_EXPERIMENTAL is not set

#
# Loadable module support
#
CONFIG_MODULES=y
CONFIG_MODVERSIONS=y
CONFIG_KMOD=y

#
# General setup
#
CONFIG_VT=y
CONFIG_VT_CONSOLE=y
# CONFIG_AP1000 is not set
CONFIG_SMP=y
# CONFIG_SUN4 is not set
# CONFIG_PCI is not set

#
# Console drivers
#
CONFIG_PROM_CONSOLE=y
CONFIG_FB=y
CONFIG_DUMMY_CONSOLE=y
CONFIG_FB_SBUS=y
CONFIG_FB_CGSIX=y
CONFIG_FB_BWTWO=y
CONFIG_FB_CGTHREE=y
# CONFIG_FB_TCX is not set
# CONFIG_FB_CGFOURTEEN is not set
# CONFIG_FB_LEO is not set
# CONFIG_FB_VIRTUAL is not set
# CONFIG_FBCON_ADVANCED is not set
CONFIG_FBCON_MFB=y
CONFIG_FBCON_CFB8=y
CONFIG_FBCON_FONTWIDTH8_ONLY=y
CONFIG_FONT_SUN8x16=y
CONFIG_FBCON_FONTS=y
CONFIG_FONT_8x8=y
# CONFIG_FONT_8x16 is not set
# CONFIG_FONT_PEARL_8x8 is not set
# CONFIG_FONT_ACORN_8x8 is not set
CONFIG_SBUS=y
CONFIG_SBUSCHAR=y
CONFIG_SUN_MOUSE=y
CONFIG_SERIAL=y
CONFIG_SUN_SERIAL=y
CONFIG_SERIAL_CONSOLE=y
CONFIG_SUN_KEYBOARD=y
CONFIG_SUN_CONSOLE=y
CONFIG_SUN_AUXIO=y
CONFIG_SUN_IO=y

#
# Misc Linux/SPARC drivers
#
CONFIG_SUN_OPENPROMIO=m
CONFIG_SUN_MOSTEK_RTC=y
# CONFIG_SUN_OPENPROMFS is not set
CONFIG_NET=y
CONFIG_SYSVIPC=y
CONFIG_BSD_PROCESS_ACCT=y
CONFIG_SYSCTL=y
CONFIG_BINFMT_AOUT=y
CONFIG_BINFMT_ELF=y
CONFIG_BINFMT_MISC=m
# CONFIG_SUNOS_EMUL is not set

#
# Floppy, IDE, and other block devices
#
CONFIG_BLK_DEV_FD=y
CONFIG_BLK_DEV_MD=y
CONFIG_MD_LINEAR=m
CONFIG_MD_STRIPED=m
CONFIG_MD_MIRRORING=m
CONFIG_MD_RAID5=m
CONFIG_BLK_DEV_RAM=m
CONFIG_BLK_DEV_LOOP=m
# CONFIG_BLK_DEV_NBD is not set

#
# Networking options
#
CONFIG_PACKET=y
CONFIG_NETLINK=y
# CONFIG_RTNETLINK is not set
CONFIG_NETLINK_DEV=m
# CONFIG_FIREWALL is not set
# CONFIG_FILTER is not set
CONFIG_UNIX=y
CONFIG_INET=y
# CONFIG_IP_MULTICAST is not set
# CONFIG_IP_ADVANCED_ROUTER is not set
# CONFIG_IP_PNP is not set
# CONFIG_IP_ROUTER is not set
# CONFIG_NET_IPIP is not set
# CONFIG_NET_IPGRE is not set
# CONFIG_IP_ALIAS is not set
# CONFIG_SYN_COOKIES is not set

#
# (it is safe to leave these untouched)
#
CONFIG_INET_RARP=m
CONFIG_SKB_LARGE=y

#
#
#
# CONFIG_IPX is not set
# CONFIG_ATALK is not set

#
# Amateur Radio support
#
# CONFIG_HAMRADIO is not set

#
# ISDN subsystem
#
# CONFIG_ISDN is not set

#
# SCSI support
#
CONFIG_SCSI=y

#
# SCSI support type (disk, tape, CDrom)
#
CONFIG_BLK_DEV_SD=y
CONFIG_CHR_DEV_ST=m
CONFIG_BLK_DEV_SR=y
# CONFIG_BLK_DEV_SR_VENDOR is not set
CONFIG_CHR_DEV_SG=m

#
# Some SCSI devices (e.g. CD jukebox) support multiple LUNs
#
CONFIG_SCSI_MULTI_LUN=y
CONFIG_SCSI_CONSTANTS=y

#
# SCSI low-level drivers
#
CONFIG_SCSI_SUNESP=y
# CONFIG_SCSI_QLOGICPTI is not set

#
# Fibre Channel support
#
# CONFIG_FC4 is not set
# CONFIG_FC4_SOC is not set
# CONFIG_FC4_SOCAL is not set
# CONFIG_SCSI_PLUTO is not set
# CONFIG_SCSI_FCAL is not set

#
# Network device support
#
CONFIG_NETDEVICES=y
CONFIG_DUMMY=m
# CONFIG_BONDING is not set
CONFIG_PPP=m

#
# CCP compressors for PPP are only built as modules.
#
# CONFIG_SLIP is not set
CONFIG_SUNLANCE=y
CONFIG_HAPPYMEAL=m
CONFIG_SUNQE=m
# CONFIG_MYRI_SBUS is not set

#
# Unix98 PTY support
#
CONFIG_UNIX98_PTYS=y
CONFIG_UNIX98_PTY_COUNT=256

#
# Filesystems
#
# CONFIG_QUOTA is not set
# CONFIG_AUTOFS_FS is not set
# CONFIG_AFFS_FS is not set
# CONFIG_HFS_FS is not set
CONFIG_FAT_FS=m
CONFIG_MSDOS_FS=m
# CONFIG_UMSDOS_FS is not set
CONFIG_VFAT_FS=m
CONFIG_ISO9660_FS=y
# CONFIG_JOLIET is not set
CONFIG_MINIX_FS=m
# CONFIG_NTFS_FS is not set
# CONFIG_HPFS_FS is not set
CONFIG_PROC_FS=y
CONFIG_DEVPTS_FS=y
# CONFIG_ROMFS_FS is not set
CONFIG_EXT2_FS=y
# CONFIG_SYSV_FS is not set
CONFIG_UFS_FS=m
# CONFIG_UFS_FS_WRITE is not set

#
# Network File Systems
#
CONFIG_CODA_FS=m
CONFIG_NFS_FS=m
CONFIG_SUNRPC=m
CONFIG_LOCKD=m
# CONFIG_SMB_FS is not set
# CONFIG_NCP_FS is
RE: SMP SS10 freezes hard intermittently
-Original Message-
From: David Butts [mailto:[EMAIL PROTECTED]
Sent: Monday, June 12, 2000 5:54 PM
To: debian-sparc@lists.debian.org
Cc: sparclinux@vger.rutgers.edu
Subject: SMP SS10 freezes hard intermittently

Hello, all-

I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP enabled. It locks up hard periodically, and won't respond to stop-A, a break on the serial port, the three-fingered salute, or, in fact, anything short of a power-cycle. It doesn't do it consistently, but the times it happened were (a) when installing packages with apt-get (unfortunately, I didn't pay enough attention to which, but it froze immediately after it finished the download of the .deb files), and (b) in X (both when I was strace'ing the startup to find out why it thought I didn't have a screen, and later, while WindowMaker was starting up). There are no obviously strange log entries, and my strace output was in /tmp, so it was removed after the power-cycle.

If there are any other diagnostic steps I should try, please let me know. I've included my .config below. AFAIK, there were no modules loaded at the time of the crashes.

I've cc'ed sparclinux@vger.rutgers.edu based on a suggestion in response to a similar problem a couple of months ago posted to debian-sparc, but I'm not on that list, so please cc me on any mail sent to the vger list.

What's your console device? CG3?

Greg
RE: SMP SS10 freezes hard intermittently
Hello-

I've got a CG6. I had included the other FBs for the sake of ... completeness (read: figuring out what I had in the box).

From /var/log/messages:

fb0: cgsix at e. TEC Rev 4 CPU sparc Rev b [TurboGX]

The only (permanent) change I made to the devices was the addition of a symlink to /dev/fb0:

lrwxrwxrwx 1 root root 3 Jun 9 17:56 /dev/cgsix0 -> fb0
crw--w--w- 1 root tty 29, 0 May 4 08:32 /dev/fb0

HTH (particularly since I'm the one it will ultimately end up helping :)

David

On Mon, 12 Jun 2000, Gregory Leblanc wrote:

-Original Message-
From: David Butts [mailto:[EMAIL PROTECTED]
Sent: Monday, June 12, 2000 5:54 PM
To: debian-sparc@lists.debian.org
Cc: sparclinux@vger.rutgers.edu
Subject: SMP SS10 freezes hard intermittently

Hello, all-

I've got an SS10 with two CPUs, running a 2.2.15 kernel with SMP enabled. It locks up hard periodically, and won't respond to stop-A, a break on the serial port, the three-fingered salute, or, in fact, anything short of a power-cycle. It doesn't do it consistently, but the times it happened were (a) when installing packages with apt-get (unfortunately, I didn't pay enough attention to which, but it froze immediately after it finished the download of the .deb files), and (b) in X (both when I was strace'ing the startup to find out why it thought I didn't have a screen, and later, while WindowMaker was starting up). There are no obviously strange log entries, and my strace output was in /tmp, so it was removed after the power-cycle.

If there are any other diagnostic steps I should try, please let me know. I've included my .config below. AFAIK, there were no modules loaded at the time of the crashes.

I've cc'ed sparclinux@vger.rutgers.edu based on a suggestion in response to a similar problem a couple of months ago posted to debian-sparc, but I'm not on that list, so please cc me on any mail sent to the vger list.

What's your console device? CG3?

Greg