Kyle McDonald wrote:
> Kyle McDonald wrote:
>
>> Hi all.
>>
>> I have a boot/install server with b74 installed. I'm trying to install
>> b76 to a client. I don't know if it's specific to the SUNWlibmsr
>> package, but the install has been hung there for more than 14 hours:
>>
>
> I rebooted the client, and the nfsd's are still running the CPU at 100%.
>
> I should add that this is a Dual Intel Xeon based machine with 2GB memory.
> psrinfo shows cpu's 0-7, so both processors must be dual core, and
> hyperthreaded?
>
> Can the hyperthreading be causing some sort of race condition? I know
> they aren't really full CPU's.
>
> I'm considering liveupgrading the server to b76, but I'm not sure if I
> should grab any more info about what the nfsd's are doing right now? How
> does one generate a stack dump?
>
Two things might help. Grab a truss output:

truss -p <nfsd pid>

To get the stack use

pstack <nfsd pid>

Regards,
Moinak.

> -Kyle
>
>> SUNWproduct-registry-root........done. 2126.20 Mbytes remaining.
>> SUNWwsr2.........................done. 2125.84 Mbytes remaining.
>> SUNWpkgcmdsu.....................done. 2123.44 Mbytes remaining.
>> SUNWswmt.........................done. 2123.00 Mbytes remaining.
>> SUNWccccrr.......................done. 2122.95 Mbytes remaining.
>> SUNWccccr........................done. 2122.33 Mbytes remaining.
>> SUNWccfw.........................done. 2122.23 Mbytes remaining.
>> SUNWperl584core..................done. 2117.21 Mbytes remaining.
>> SUNWlibmsr.......................
>>
>> On the boot server, everything has basically ground to a halt.
>>
>> prstat shows that nfsd/18 is taking up 100% of the CPU, and I can't
>> figure out why that would be?
>> I've included output from it and other programs below. I didn't copy
>> much of each since it's been consistent the whole time. The snoop output
>> is very slow. it's not like
>>
>> Any ideas? Could this be because I have the ISO files mounted through
>> lofi, and then shared out through NFS?
>>
>> Thanks again for all the great help on these lists,
>>
>> -Kyle
>>
>> prstat:
>> --------
>>    PID USERNAME  SIZE   RSS STATE  PRI NICE      TIME  CPU PROCESS/NLWP
>>    648 daemon   2744K 1360K cpu7    60  -20 103:58:53 100% nfsd/18
>>   6198 root     4696K 2972K cpu5    59    0   0:00:00 0.0% prstat/1
>>    753 noaccess   87M   18M run     59    0   0:20:10 0.0% java/31
>>    691 root       35M 9024K run     59    0   0:01:43 0.0% Xorg/1
>>    608 root     6784K 3172K sleep   59    0   0:00:31 0.0% intrd/1
>>   4736 root     9196K 8344K stop    59    0   5:54:26 0.0% bzip2/1
>>    560 smmsp    7996K 1036K sleep   59    0   0:00:00 0.0% sendmail/1
>>    756 root       11M 1336K sleep   59    0   0:00:00 0.0% dtlogin/1
>>    928 root     3296K 1244K sleep   59    0   0:00:00 0.0% bash/1
>>    562 root     7988K 1288K run     59    0   0:00:05 0.0% sendmail/1
>>    536 root     2460K  752K sleep   59    0   0:00:00 0.0% in.rarpd/4
>>    532 root     4152K 1272K sleep   59    0   0:00:00 0.0% syslogd/11
>>    476 root     5088K  848K sleep   59    0   0:00:00 0.0% automountd/4
>>
>>
>> vmstat 1:
>> ----------
>>  kthr      memory             page            disk       faults       cpu
>>  r b w    swap   free re mf pi po fr de sr f0 lf lf lf  in  sy  cs us  sy id
>> 19 0 0 1982224 213464  0  2  0  0  0  0  0  0  0  0  0 753  79 284  0 100  0
>> 22 0 0 1982224 213464  0  1  0  0  0  0  0  0  0  0  0 782  50 254  0 100  0
>> 17 0 0 1982224 213464  0  3  0  0  0  0  0  0  0  0  0 732  69 242  0 100  0
>> 21 0 0 1982224 213464  0  3  0  0  0  0  0  0  0  0  0 737  94 209  0 100  0
>> 19 0 0 1982220 213460  0  1  0  0  0  0  0  0  0  0  0 605  56 254  0 100  0
>> 19 0 0 1982216 213460  0  1  0  0  0  0  0  0  0  0  0 785  34 252  0 100  0
>>
>>
>> iostat 1:
>> ----------
>>    tty      lofi1        lofi2        lofi3        lofi4         cpu
>> tin tout kps tps serv kps tps serv kps tps serv kps tps serv us  sy wt id
>>   0   20   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0   16   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0    4   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0    6   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0   10   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0    7   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>   0   33   0   0    0   0   0    0   0   0    0   0   0    0  0 100  0  0
>>
>>
>> snoop:
>> -------
>> KeyMaster -> GateKeeper TCP D=1022 S=2049 Ack=153729846 Seq=4282747754 Len=0 Win=49640
>> GateKeeper -> (broadcast) ARP C Who is 172.30.172.90, GateKeeper ?
>> GateKeeper -> KeyMaster NFS C 4 (readdir ) PUTFH FH=894D READDIR Cookie=564 (0000000000000000) for 8192/1048576 (retransmit)
>> KeyMaster -> GateKeeper TCP D=1022 S=2049 Ack=153730030 Seq=4282747754 Len=0 Win=49640
>> GateKeeper -> KeyMaster NFS C 4 (readdir ) PUTFH FH=894D READDIR Cookie=564 (0000000000000000) for 8192/1048576 (retransmit)
>> GateKeeper -> KeyMaster ARP R 172.30.172.90, GateKeeper is 0:14:5e:2b:a6:bd
>> KeyMaster -> GateKeeper TCP D=1022 S=2049 Ack=153730214 Seq=4282747754 Len=0 Win=49640
>> GateKeeper -> (broadcast) ARP C Who is 172.30.172.90, GateKeeper ?
>> GateKeeper -> (broadcast) ARP C Who is 172.30.172.10, KeyMaster ?
>> GateKeeper -> KeyMaster NFS C 4 (readdir ) PUTFH FH=894D READDIR Cookie=564 (0000000000000000) for 8192/1048576 (retransmit)
>> KeyMaster -> GateKeeper TCP D=1022 S=2049 Ack=153730398 Seq=4282747754 Len=0 Win=49640
>> GateKeeper -> (broadcast) ARP C Who is 172.30.172.90, GateKeeper ?
>> GateKeeper -> KeyMaster NFS C 4 (readdir ) PUTFH FH=894D READDIR Cookie=564 (0000000000000000) for 8192/1048576 (retransmit)
>> KeyMaster -> GateKeeper TCP D=1022 S=2049 Ack=153730582 Seq=4282747754 Len=0 Win=49640
>> _______________________________________________
>> install-discuss mailing list
>> install-discuss at opensolaris.org
>> http://mail.opensolaris.org/mailman/listinfo/install-discuss
>>
>
> _______________________________________________
> opensolaris-discuss mailing list
> opensolaris-discuss at opensolaris.org
>
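
For anyone who wants to capture both pieces of information from the reply (the truss trace and the pstack output) in one go, here is a rough sketch. The function name collect_diag, its arguments, and the output file names are made up for illustration and are not part of any Solaris tool. Since truss -p keeps tracing until it is stopped, the sketch runs it in the background for a few seconds and then signals it. It assumes truss releases the traced process cleanly when it receives the default kill signal, which is worth checking against the truss man page on your build.

```sh
# Hypothetical helper (not a Solaris command): save a stack dump and a
# short system-call trace for one process.
# Usage: collect_diag <pid> [seconds] [output-dir]
collect_diag() {
    pid=$1
    secs=${2:-10}       # how long to let truss run (assumed default)
    out=${3:-/tmp}      # where to write the two files (assumed default)

    # pstack prints the stack of each thread (LWP) in the process.
    pstack "$pid" > "$out/pstack.$pid" 2>&1

    # truss -p attaches to a running process and traces until stopped,
    # so run it in the background and stop it after $secs seconds.
    truss -p "$pid" > "$out/truss.$pid" 2>&1 &
    truss_pid=$!
    sleep "$secs"
    # Assumption: truss detaches cleanly on the default signal (SIGTERM).
    kill "$truss_pid" 2>/dev/null || true
    wait "$truss_pid" 2>/dev/null || true

    # Report where the output went.
    echo "$out/pstack.$pid $out/truss.$pid"
}
```

With the prstat output above, the call would be something like collect_diag 648 10 /var/tmp, run as root on the boot server. The two files can then be attached to a follow-up mail.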
