Hi!
Few weeks ago I told you about a problem with server crash at Haroy school 
(Norway). I still have problems with thw server. The problem is:

We have two servers. One application server (sirius) with LTSP and one 
fileserver (tellus) with /home and samba 
We use RedHat 7.3 
Sirius, the LTSP server we have a dual AMD 1800+ MSI-mainboard with 1GB Ram.
Tellus, one AMD 1800+, Asus mainboard, 768MB ram

We use NIS and NFS to connect the two servers. 
We have 18 workstations. About 50 % of the ws have 100Mbit NIC, the rest have 
10Mbit. 
Both servers have SMC Gigabit NIC, They are both connected to a SMC Gigabit 
switch.

Everything works works OK, but sirius server crashes several times during a 
day. I have tried to disable Mozilla, and I have tried to disable printing, 
new and better fileserver (tellus), swapped the ram, upgraded the kernel to 
2.4-18.3, but that did not help. 
I have made several copies of /var/log/messages, but I can not find anything 
significant.

However, when running in level 3, occasionally some konsoll messages occured 
on the screen.
A few times on sirius: NFS server not responding

To day on sirius: 
RPC: sendmsg returned error 22
nfs: RPC call returned error 22
At tellus at the same incident:
rejected NSM callback from 7f000001: 32771
(five lines with the same message)

rpcinfo -p localhost (on sirius  after reboot) gave this:
program vers proto   port
    100000    2   tcp    111  portmapper
    100000    2   udp    111  portmapper
    100024    1   udp  32768  status
    100024    1   tcp  32768  status
    100021    1   udp  32769  nlockmgr
    100021    3   udp  32769  nlockmgr
    100021    4   udp  32769  nlockmgr
    100007    2   udp    971  ypbind
    100007    1   udp    971  ypbind
    100007    2   tcp    974  ypbind
    100007    1   tcp    974  ypbind
    391002    2   tcp  32769  sgi_fam
    100011    1   udp    695  rquotad
    100011    2   udp    695  rquotad
    100011    1   tcp    698  rquotad
    100011    2   tcp    698  rquotad
    100005    1   udp  32771  mountd
    100005    1   tcp  32770  mountd
    100005    2   udp  32771  mountd
    100005    2   tcp  32770  mountd
    100005    3   udp  32771  mountd
    100005    3   tcp  32770  mountd
    100003    2   udp   2049  nfs
    100003    3   udp   2049  nfs

nmap localhost (on sirius) gave this:
Starting nmap V. 2.54BETA31 ( www.insecure.org/nmap/ )
Interesting ports on sirius (127.0.0.1):
(The 1544 ports scanned but not shown below are in state: closed)
Port       State       Service
22/tcp     open        ssh
25/tcp     open        smtp
111/tcp    open        sunrpc
515/tcp    open        printer
698/tcp    open        unknown
974/tcp    open        unknown
6000/tcp   open        X11
10000/tcp  open        snet-sensor-mgmt
32770/tcp  open        sometimes-rpc3
32771/tcp  open        sometimes-rpc5

Nmap run completed -- 1 IP address (1 host up) scanned in 1 second

It seems to me that the problem is related to the communication bethween the 
two servers.
Can anyone help?



-------------------------------------------------------
This sf.net email is sponsored by: viaVerio will pay you up to
$1,000 for every account that you consolidate with us.
http://ad.doubleclick.net/clk;4749864;7604308;v?
http://www.viaverio.com/consolidator/osdn.cfm
_____________________________________________________________________
Ltsp-discuss mailing list.   To un-subscribe, or change prefs, goto:
      https://lists.sourceforge.net/lists/listinfo/ltsp-discuss
For additional LTSP help,   try #ltsp channel on irc.openprojects.net

Reply via email to