Hi! Few weeks ago I told you about a problem with server crash at Haroy school (Norway). I still have problems with thw server. The problem is:
We have two servers. One application server (sirius) with LTSP and one fileserver (tellus) with /home and samba We use RedHat 7.3 Sirius, the LTSP server we have a dual AMD 1800+ MSI-mainboard with 1GB Ram. Tellus, one AMD 1800+, Asus mainboard, 768MB ram We use NIS and NFS to connect the two servers. We have 18 workstations. About 50 % of the ws have 100Mbit NIC, the rest have 10Mbit. Both servers have SMC Gigabit NIC, They are both connected to a SMC Gigabit switch. Everything works works OK, but sirius server crashes several times during a day. I have tried to disable Mozilla, and I have tried to disable printing, new and better fileserver (tellus), swapped the ram, upgraded the kernel to 2.4-18.3, but that did not help. I have made several copies of /var/log/messages, but I can not find anything significant. However, when running in level 3, occasionally some konsoll messages occured on the screen. A few times on sirius: NFS server not responding To day on sirius: RPC: sendmsg returned error 22 nfs: RPC call returned error 22 At tellus at the same incident: rejected NSM callback from 7f000001: 32771 (five lines with the same message) rpcinfo -p localhost (on sirius after reboot) gave this: program vers proto port 100000 2 tcp 111 portmapper 100000 2 udp 111 portmapper 100024 1 udp 32768 status 100024 1 tcp 32768 status 100021 1 udp 32769 nlockmgr 100021 3 udp 32769 nlockmgr 100021 4 udp 32769 nlockmgr 100007 2 udp 971 ypbind 100007 1 udp 971 ypbind 100007 2 tcp 974 ypbind 100007 1 tcp 974 ypbind 391002 2 tcp 32769 sgi_fam 100011 1 udp 695 rquotad 100011 2 udp 695 rquotad 100011 1 tcp 698 rquotad 100011 2 tcp 698 rquotad 100005 1 udp 32771 mountd 100005 1 tcp 32770 mountd 100005 2 udp 32771 mountd 100005 2 tcp 32770 mountd 100005 3 udp 32771 mountd 100005 3 tcp 32770 mountd 100003 2 udp 2049 nfs 100003 3 udp 2049 nfs nmap localhost (on sirius) gave this: Starting nmap V. 2.54BETA31 ( www.insecure.org/nmap/ ) Interesting ports on sirius (127.0.0.1): (The 1544 ports scanned but not shown below are in state: closed) Port State Service 22/tcp open ssh 25/tcp open smtp 111/tcp open sunrpc 515/tcp open printer 698/tcp open unknown 974/tcp open unknown 6000/tcp open X11 10000/tcp open snet-sensor-mgmt 32770/tcp open sometimes-rpc3 32771/tcp open sometimes-rpc5 Nmap run completed -- 1 IP address (1 host up) scanned in 1 second It seems to me that the problem is related to the communication bethween the two servers. Can anyone help? ------------------------------------------------------- This sf.net email is sponsored by: viaVerio will pay you up to $1,000 for every account that you consolidate with us. http://ad.doubleclick.net/clk;4749864;7604308;v? http://www.viaverio.com/consolidator/osdn.cfm _____________________________________________________________________ Ltsp-discuss mailing list. To un-subscribe, or change prefs, goto: https://lists.sourceforge.net/lists/listinfo/ltsp-discuss For additional LTSP help, try #ltsp channel on irc.openprojects.net