Kyle McDonald wrote:
> Hi all.
>
> I have a boot/install server with b74 installed. I'm trying to install 
> b76 to a client. I don't know if it's specific to the SUNWlibmsr 
> package, but the install has been hung there for more then 14 hours:
>
>   

I rebooted the client, and the nfsd's are still running the CPU at 100%.

I should add that this is a Dual Intel Xeon based machine with 2GB memory.
psrinfo shows cpu's 0-7, so both processors must be dual core, and 
hyperthreaded?

Can the hyperthreading be causing some sort of race condition? I know 
they aren't really full CPU's.

I'm considering liveupgrading the server to b76, but I'm not sure if I 
should grab anymore info about what the nfsd's are doing right now? How 
does one generate a stack dump?

   -Kyle

>         SUNWproduct-registry-root........done.  2126.20 Mbytes remaining.
>         SUNWwsr2.........................done.  2125.84 Mbytes remaining.
>         SUNWpkgcmdsu.....................done.  2123.44 Mbytes remaining.
>         SUNWswmt.........................done.  2123.00 Mbytes remaining.
>         SUNWccccrr.......................done.  2122.95 Mbytes remaining.
>         SUNWccccr........................done.  2122.33 Mbytes remaining.
>         SUNWccfw.........................done.  2122.23 Mbytes remaining.
>         SUNWperl584core..................done.  2117.21 Mbytes remaining.
>         SUNWlibmsr.......................
>
>
>
> On the boot server, everything has basically ground to a halt.
>
> prstat show that nfsd/18 is taking up 100% of the CPU, and i can't 
> figure out why that would be?
> I've included output from it and other programs below. I didn't copy 
> much of each since it's been consisyent the whole time. The snoop output 
> is very slow. it's not like
>
> Any ideas? Could this be because I have the ISO files mounted through 
> lofi, and then shared out through NFS?
>
> Thanks again for all the great help on these lists,
>
>    -Kyle
>
> prstat:
> --------
>    PID USERNAME  SIZE   RSS STATE  PRI NICE      TIME  CPU 
> PROCESS/NLWP      
>    648 daemon   2744K 1360K cpu7    60  -20 103:58:53 100% nfsd/18
>   6198 root     4696K 2972K cpu5    59    0   0:00:00 0.0% prstat/1
>    753 noaccess   87M   18M run     59    0   0:20:10 0.0% java/31
>    691 root       35M 9024K run     59    0   0:01:43 0.0% Xorg/1
>    608 root     6784K 3172K sleep   59    0   0:00:31 0.0% intrd/1
>   4736 root     9196K 8344K stop    59    0   5:54:26 0.0% bzip2/1
>    560 smmsp    7996K 1036K sleep   59    0   0:00:00 0.0% sendmail/1
>    756 root       11M 1336K sleep   59    0   0:00:00 0.0% dtlogin/1
>    928 root     3296K 1244K sleep   59    0   0:00:00 0.0% bash/1
>    562 root     7988K 1288K run     59    0   0:00:05 0.0% sendmail/1
>    536 root     2460K  752K sleep   59    0   0:00:00 0.0% in.rarpd/4
>    532 root     4152K 1272K sleep   59    0   0:00:00 0.0% syslogd/11
>    476 root     5088K  848K sleep   59    0   0:00:00 0.0% automountd/4
>
>
> vmstat 1:
> ----------
>  kthr      memory            page            disk          faults      cpu
>  r b w   swap  free  re  mf pi po fr de sr f0 lf lf lf   in   sy   cs us 
> sy id
>  19 0 0 1982224 213464 0  2  0  0  0  0  0  0  0  0  0  753   79  284  0 
> 100 0
>  22 0 0 1982224 213464 0  1  0  0  0  0  0  0  0  0  0  782   50  254  0 
> 100 0
>  17 0 0 1982224 213464 0  3  0  0  0  0  0  0  0  0  0  732   69  242  0 
> 100 0
>  21 0 0 1982224 213464 0  3  0  0  0  0  0  0  0  0  0  737   94  209  0 
> 100 0
>  19 0 0 1982220 213460 0  1  0  0  0  0  0  0  0  0  0  605   56  254  0 
> 100 0
>  19 0 0 1982216 213460 0  1  0  0  0  0  0  0  0  0  0  785   34  252  0 
> 100 0
>
>
> iostat 1:
> ----------
>    tty       lofi1         lofi2         lofi3         lofi4           cpu
>  tin tout kps tps serv  kps tps serv  kps tps serv  kps tps serv   us sy 
> wt id
>    0   20   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0   16   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0    4   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0    6   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0   10   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0    7   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>    0   33   0   0    0    0   0    0    0   0    0    0   0    0    0 
> 100  0  0
>
>
> snoop:
> -------
>    KeyMaster -> GateKeeper   TCP D=1022 S=2049 Ack=153729846 
> Seq=4282747754 Len=0 Win=49640
>   GateKeeper -> (broadcast)  ARP C Who is 172.30.172.90, GateKeeper ?
>   GateKeeper -> KeyMaster    NFS C 4 (readdir     ) PUTFH FH=894D 
> READDIR Cookie=564 (0000000000000000) for 8192/1048576  (retransmit)
>    KeyMaster -> GateKeeper   TCP D=1022 S=2049 Ack=153730030 
> Seq=4282747754 Len=0 Win=49640
>   GateKeeper -> KeyMaster    NFS C 4 (readdir     ) PUTFH FH=894D 
> READDIR Cookie=564 (0000000000000000) for 8192/1048576  (retransmit)
>   GateKeeper -> KeyMaster    ARP R 172.30.172.90, GateKeeper is 
> 0:14:5e:2b:a6:bd
>    KeyMaster -> GateKeeper   TCP D=1022 S=2049 Ack=153730214 
> Seq=4282747754 Len=0 Win=49640
>   GateKeeper -> (broadcast)  ARP C Who is 172.30.172.90, GateKeeper ?
>   GateKeeper -> (broadcast)  ARP C Who is 172.30.172.10, KeyMaster ?
>   GateKeeper -> KeyMaster    NFS C 4 (readdir     ) PUTFH FH=894D 
> READDIR Cookie=564 (0000000000000000) for 8192/1048576  (retransmit)
>    KeyMaster -> GateKeeper   TCP D=1022 S=2049 Ack=153730398 
> Seq=4282747754 Len=0 Win=49640
>   GateKeeper -> (broadcast)  ARP C Who is 172.30.172.90, GateKeeper ?
>   GateKeeper -> KeyMaster    NFS C 4 (readdir     ) PUTFH FH=894D 
> READDIR Cookie=564 (0000000000000000) for 8192/1048576  (retransmit)
>    KeyMaster -> GateKeeper   TCP D=1022 S=2049 Ack=153730582 
> Seq=4282747754 Len=0 Win=49640
> _______________________________________________
> install-discuss mailing list
> install-discuss at opensolaris.org
> http://mail.opensolaris.org/mailman/listinfo/install-discuss
>   


Reply via email to