Diskless boot problem

2005-05-19 Thread Sławek Żak
Hi,

I have a problem booting a Dell 2850 over the network. The machine
reads the kernel over the net, boots up to mounting / from NFS, and then
crashes.

Tcpdump output:

12:15:58.919683 arp who-has 10.158.190.73 tell 10.158.190.74
12:15:58.919702 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1
12:15:58.920058 IP 10.158.190.74.475209176 > 10.158.190.73.2049: 92 getattr [|nfs]
12:15:58.920134 IP 10.158.190.73.2049 > 10.158.190.74.475209176: reply ok 28 getattr ERROR: Stale NFS file handle
12:15:58.920432 arp who-has 10.158.190.73 tell 10.158.190.74
12:15:58.920442 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1
12:15:58.920681 IP 10.158.190.74.475209177 > 10.158.190.73.2049: 100 lookup [|nfs]
12:15:58.920707 IP 10.158.190.73.2049 > 10.158.190.74.475209177: reply ok 28 lookup ERROR: Stale NFS file handle
12:15:58.920932 IP 10.158.190.74.475209178 > 10.158.190.73.2049: 100 lookup [|nfs]
12:15:58.920963 IP 10.158.190.73.2049 > 10.158.190.74.475209178: reply ok 28 lookup ERROR: Stale NFS file handle
12:15:58.952180 IP 10.158.190.74.475209179 > 10.158.190.73.2049: 100 lookup [|nfs]
12:15:58.952277 IP 10.158.190.73.2049 > 10.158.190.74.475209179: reply ok 28 lookup ERROR: Stale NFS file handle
12:15:58.984785 IP 10.158.190.74.475209180 > 10.158.190.73.2049: 100 lookup [|nfs]
12:15:58.984866 IP 10.158.190.73.2049 > 10.158.190.74.475209180: reply ok 28 lookup ERROR: Stale NFS file handle
12:15:59.020500 IP 10.158.190.74.475209181 > 10.158.190.73.2049: 104 lookup [|nfs]
12:15:59.020573 IP 10.158.190.73.2049 > 10.158.190.74.475209181: reply ok 28 lookup ERROR: Stale NFS file handle
12:15:59.054130 IP 10.158.190.74.475209182 > 10.158.190.73.2049: 104 lookup [|nfs]
12:15:59.054224 IP 10.158.190.73.2049 > 10.158.190.74.475209182: reply ok 28 lookup ERROR: Stale NFS file handle

I wonder where the `Stale NFS handle' error comes from, as the client
doesn't seem to have mounted the filesystem over NFS from what I can
see. On the console of the diskless machine I have this:

NFS ROOT: 10.158.190.73:/var/www/FreeBSD-5.4-x86-PXE
em0: Link is up 100 Mbps Half Duplex
exec /sbin/init: error 70
exec /sbin/oinit: error 70
exec /sbin/init.bak: error 70
exec /rescue/init: error 70
exec /stand/sysinstall: error 70
init: not found in path /sbin/init:/sbin/oinit:/sbin/init.bak:/rescue/init:/stand/sysinstall
panic: no init
Uptime: 55s
Cannot dump. No dump device defined.
Automatic reboot in 15 seconds - press a key on the console to abort
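
A note on the `error 70' lines above: in FreeBSD's errno numbering, 70 is
ESTALE ("Stale NFS file handle"), so the failed exec attempts on the console
are most likely the same error the server returns in the tcpdump trace. The
lookup can be sketched in Python as below. This is only a sketch: the table
is a hand-copied subset of FreeBSD errno values and is not part of the
original post, and the helper name decode_exec_error is hypothetical.

```python
import re

# Hand-copied subset of FreeBSD errno values (assumption: matches sys/errno.h).
# Hard-coded on purpose: os.strerror() would use the host's numbering,
# which differs between FreeBSD and, for example, Linux.
FREEBSD_ERRNO = {
    2: ("ENOENT", "No such file or directory"),
    13: ("EACCES", "Permission denied"),
    70: ("ESTALE", "Stale NFS file handle"),
}

def decode_exec_error(line):
    """Turn 'exec PATH: error N' into (path, errno name, message)."""
    m = re.match(r"exec (\S+): error (\d+)$", line.strip())
    if m is None:
        return None  # not an exec failure line
    path, code = m.group(1), int(m.group(2))
    name, msg = FREEBSD_ERRNO.get(code, ("UNKNOWN", "not in this table"))
    return path, name, msg

print(decode_exec_error("exec /sbin/init: error 70"))
```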

The duplex setting reported for em0 is obviously wrong. The setting on
the switch is 100 full-duplex. Our network wizards can f***kup
autonegotiation on Cisco Catalyst, so it must stay that way. Intel em-s
tend to hang for a couple of seconds before getting on the net, so that
might be the problem. On the other hand, the kernel loads just fine over
TFTP.

Any thoughts?

Thanks, /S
-- 
Sławek Żak / UNIX Systems Administrator
___
freebsd-hackers@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to [EMAIL PROTECTED]


5.4-RC2 keyboard problem on Dell PowerEdge 2850

2005-04-18 Thread Sławek Żak
Hi,

After install from CD the keyboard doesn't work on this machine. Has
anyone else seen it?

/S
-- 
Sławek Żak / UNIX Systems Administrator


Re: 5.4-RC2 keyboard problem on Dell PowerEdge 2850

2005-04-18 Thread Sławek Żak
On 4/18/05, c0ldbyte [EMAIL PROTECTED] wrote:
> On Mon, 18 Apr 2005, Sławek Żak wrote:
>
>> Hi,
>>
>> After install from CD the keyboard doesn't work on this machine. Has
>> anyone else seen it?
>>
>> /S
>
> Select the correct key map screen map etc... ?

Erm. When I say keyboard doesn't work I *mean* it doesn't work at all.
The only key which works on the box is BRS, which doesn't give me
sufficient interaction with the system. I've skipped morse code
lessons and boy scouting in my life altogether.

/S
-- 
Sławek Żak / UNIX Systems Administrator


Performance trouble on AMD64 webserver

2005-04-13 Thread Sławek Żak
Hi,

I have a 2-processor Opteron server (Sun v20z) with 4GB of RAM. It runs
FreeBSD 5.4-PRERELEASE. Everything is roses for 1-2 minutes after
starting Apache. Then the server freezes for 10-20 seconds, resumes
normal operation, and chokes again every couple of minutes. I tried
with 512 and 1024 httpd processes running. It is a fairly dynamic
webpage. Apache is coupled with mod_php. Each process takes 18-19MB of
resident memory.

I was tuning the machine after getting the nefarious `collecting PV
entries ...' message. I tried increasing PMAP_SHPGPERPROC and all. It
doesn't make the performance any better.

Snapshot of vmstat -z gives:
...
PV ENTRY: 48,  4312872, 1611440, 616384,  8657151

The free value decreases to something close to 0, then the machine
chokes, and afterwards free gets back up to some large value.
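
For reading the PV ENTRY line above, a small Python sketch may help. It
assumes the vmstat -z columns on this release are SIZE, LIMIT, USED, FREE,
REQUESTS in that order (an assumption, since the header row is elided in the
snapshot), and the function name parse_zone_line is hypothetical.

```python
def parse_zone_line(line):
    """Split one 'NAME: size, limit, used, free, requests' zone line."""
    name, _, rest = line.partition(":")
    # int() tolerates the padding spaces vmstat puts around each field.
    size, limit, used, free, requests = (int(f) for f in rest.split(","))
    return {"name": name.strip(), "size": size, "limit": limit,
            "used": used, "free": free, "requests": requests}

pv = parse_zone_line("PV ENTRY: 48,  4312872, 1611440, 616384,  8657151")

used_bytes = pv["size"] * pv["used"]          # memory held by live entries
used_fraction = pv["used"] / pv["limit"]      # how close USED is to LIMIT

print("PV entries in use: %.1f MiB" % (used_bytes / 2**20))
print("USED as a fraction of LIMIT: %.2f" % used_fraction)
```

Sampling this line every few seconds and logging the free column would show
whether the freezes line up with it approaching 0.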

My loader.conf:

accf_data_load=YES
accf_http_load=YES

kern.ipc.nmbclusters=131072
kern.maxusers=1024

The sysctl.conf:

kern.ipc.somaxconn=1024

The customized part of kernel config:

options VM_BCACHE_SIZE_MAX=(500*1024*1024)
options MAXDSIZ=(1000*1024*1024)
options MAXSSIZ=(1000*1024*1024)
options DFLDSIZ=(1000*1024*1024)

options PMAP_SHPGPERPROC=4096
options PMAP_SHPGPERPROC=16384

options SHMALL=131072
options SHMMAXPGS=131072
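
A rough cross-check of the PMAP_SHPGPERPROC setting against the PV ENTRY
limit quoted earlier can be sketched as follows. Both formulas are
assumptions recalled from the 5.x sources and should be verified against
the tree actually in use: maxproc = 20 + 16 * maxusers, and
pv_entry_max = shpgperproc * maxproc + number of physical pages. The
function names are hypothetical.

```python
PAGE_SIZE = 4096  # bytes per page on amd64

def maxproc_from_maxusers(maxusers):
    # Assumed default: NPROC = 20 + 16 * maxusers
    return 20 + 16 * maxusers

def pv_entry_max(shpgperproc, maxproc, physical_pages):
    # Assumed pmap formula for the PV ENTRY zone limit
    return shpgperproc * maxproc + physical_pages

maxproc = maxproc_from_maxusers(1024)   # kern.maxusers from loader.conf
limit = 4312872                         # PV ENTRY limit from vmstat -z

# For each candidate value, work out how many physical pages the limit
# would imply. Only a value giving roughly 4GB of RAM is plausible.
for shpg in (200, 4096, 16384):
    pages = limit - shpg * maxproc
    print("shpgperproc=%d -> %d pages (%.2f GiB)"
          % (shpg, pages, pages * PAGE_SIZE / 2**30))
```

If the assumed formulas hold, only the value 200 yields a positive page count
close to 4GB, which would suggest the snapshot came from a kernel where the
larger PMAP_SHPGPERPROC had not taken effect. That is an inference from the
numbers, not something stated in the post.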

I've also removed lots of SCSI hardware, Wifi and USB parts which I
don't need from the GENERIC config.

Any ideas how to make it work?

Thanks, /S
--
Sławek Żak / UNIX Systems Administrator