Olav, I don't remember from your first posts, is the server locking up or actually crashing in a kernel panic?
I have used top running once a minute from cron to help tracking down intermittent crashes in the past. A line like: * * * * * root /usr/bin/top -bin 1 >> /var/log/top_log in root's crontab will log the system state (memory, load, running processes) once a minute. May help you determine what is happening right before the system crashes. Pete -- http://www.elbnet.com ELB Internet Service, Inc. Web Design, Computer Consulting, Internet Hosting Olav Øiehaug wrote: > > Hi! > Few weeks ago I told you about a problem with server crash at Haroy school > (Norway). I still have problems with thw server. The problem is: > > We have two servers. One application server (sirius) with LTSP and one > fileserver (tellus) with /home and samba > We use RedHat 7.3 > Sirius, the LTSP server we have a dual AMD 1800+ MSI-mainboard with 1GB Ram. > Tellus, one AMD 1800+, Asus mainboard, 768MB ram > > We use NIS and NFS to connect the two servers. > We have 18 workstations. About 50 % of the ws have 100Mbit NIC, the rest have > 10Mbit. > Both servers have SMC Gigabit NIC, They are both connected to a SMC Gigabit > switch. > > Everything works works OK, but sirius server crashes several times during a > day. I have tried to disable Mozilla, and I have tried to disable printing, > new and better fileserver (tellus), swapped the ram, upgraded the kernel to > 2.4-18.3, but that did not help. > I have made several copies of /var/log/messages, but I can not find anything > significant. > > However, when running in level 3, occasionally some konsoll messages occured > on the screen. > A few times on sirius: NFS server not responding > > To day on sirius: > RPC: sendmsg returned error 22 > nfs: RPC call returned error 22 > At tellus at the same incident: > rejected NSM callback from 7f000001: 32771 > (five lines with the same message) > > rpcinfo -p localhost (on sirius after reboot) gave this: > program vers proto port > 100000 2 tcp 111 portmapper > 100000 2 udp 111 portmapper > 100024 1 udp 32768 status > 100024 1 tcp 32768 status > 100021 1 udp 32769 nlockmgr > 100021 3 udp 32769 nlockmgr > 100021 4 udp 32769 nlockmgr > 100007 2 udp 971 ypbind > 100007 1 udp 971 ypbind > 100007 2 tcp 974 ypbind > 100007 1 tcp 974 ypbind > 391002 2 tcp 32769 sgi_fam > 100011 1 udp 695 rquotad > 100011 2 udp 695 rquotad > 100011 1 tcp 698 rquotad > 100011 2 tcp 698 rquotad > 100005 1 udp 32771 mountd > 100005 1 tcp 32770 mountd > 100005 2 udp 32771 mountd > 100005 2 tcp 32770 mountd > 100005 3 udp 32771 mountd > 100005 3 tcp 32770 mountd > 100003 2 udp 2049 nfs > 100003 3 udp 2049 nfs > > nmap localhost (on sirius) gave this: > Starting nmap V. 2.54BETA31 ( www.insecure.org/nmap/ ) > Interesting ports on sirius (127.0.0.1): > (The 1544 ports scanned but not shown below are in state: closed) > Port State Service > 22/tcp open ssh > 25/tcp open smtp > 111/tcp open sunrpc > 515/tcp open printer > 698/tcp open unknown > 974/tcp open unknown > 6000/tcp open X11 > 10000/tcp open snet-sensor-mgmt > 32770/tcp open sometimes-rpc3 > 32771/tcp open sometimes-rpc5 > > Nmap run completed -- 1 IP address (1 host up) scanned in 1 second > > It seems to me that the problem is related to the communication bethween the > two servers. > Can anyone help? > > ------------------------------------------------------- > This sf.net email is sponsored by: viaVerio will pay you up to > $1,000 for every account that you consolidate with us. > http://ad.doubleclick.net/clk;4749864;7604308;v? > http://www.viaverio.com/consolidator/osdn.cfm > _____________________________________________________________________ > Ltsp-discuss mailing list. To un-subscribe, or change prefs, goto: > https://lists.sourceforge.net/lists/listinfo/ltsp-discuss > For additional LTSP help, try #ltsp channel on irc.openprojects.net ------------------------------------------------------- This sf.net email is sponsored by: viaVerio will pay you up to $1,000 for every account that you consolidate with us. http://ad.doubleclick.net/clk;4749864;7604308;v? http://www.viaverio.com/consolidator/osdn.cfm _____________________________________________________________________ Ltsp-discuss mailing list. To un-subscribe, or change prefs, goto: https://lists.sourceforge.net/lists/listinfo/ltsp-discuss For additional LTSP help, try #ltsp channel on irc.openprojects.net