We have an overloaded server (V490 with one CPU board) - CPU bound.
Here is a sample prstat -mL taken during a time of high load(uptime
Total: 278 processes, 1710 lwps, load averages: 20.72, 13.21, 6.74):
PID USERNAME USR SYS TRP TFL DFL LCK SLP LAT VCX ICX SCL SIG
PROCESS/LWPID
5617 cognos8 53 0.5 0.0 0.0 0.0 2.1 0.0 45 1K 200 3K 0
BIBusTKServe/18
5617 cognos8 51 0.5 0.0 0.0 0.0 3.6 0.0 45 1K 274 3K 0
BIBusTKServe/17
6084 cognos8 43 0.6 0.0 0.0 0.0 1.9 0.0 54 2K 222 5K 0
BIBusTKServe/20
6084 cognos8 43 0.6 0.0 0.0 0.0 1.1 0.0 55 1K 244 4K 0
BIBusTKServe/15
6084 cognos8 39 0.6 0.0 0.0 0.0 1.8 0.0 59 2K 212 4K 0
BIBusTKServe/22
5617 cognos8 39 0.4 0.0 0.0 0.0 1.4 0.0 59 1K 223 3K 0
BIBusTKServe/22
6084 cognos8 35 0.4 0.0 0.0 0.0 1.1 0.0 64 1K 262 2K 0
BIBusTKServe/19
5617 cognos8 34 0.4 0.0 0.0 0.0 2.2 0.0 64 1K 465 2K 0
BIBusTKServe/23
29514 oracle 28 1.2 0.1 0.0 0.0 0.0 8.6 62 217 990 899 0
oracle/1
29948 root 2.4 0.4 0.0 0.0 0.0 0.0 77 20 109 561 961 0
cfagent/1
5610 oracle 1.5 0.5 0.0 0.0 0.0 0.0 98 0.1 3 8 871 0
oracle/1
942 oracle 1.2 0.6 0.0 0.0 0.0 0.0 98 0.0 15 50 506 0
oracle/1
9378 root 0.4 1.1 0.1 0.0 0.0 0.0 98 0.9 40 9 994 0
prstat/1
1475 oracle 1.1 0.2 0.4 0.0 0.0 0.0 98 0.2 111 55 945 0
emagent/3047304
11646 oracle 0.8 0.0 0.0 0.0 0.0 0.0 91 8.7 1 45 80 0
java/56
11479 oracle 0.6 0.1 0.0 0.0 0.0 0.0 98 1.0 4 4 615 0
oracle/1
10520 oracle 0.6 0.0 0.0 0.0 0.0 0.0 98 1.4 5 0 45 5
nmccollector/1
835 sysnav 0.1 0.2 0.1 0.0 0.0 0.0 57 42 19 240 471 0
bb-local.sh/1
7375 oracle 0.2 0.0 0.0 0.0 0.0 0.0 100 0.0 9 3 192 0
oracle/1
11712 oracle 0.2 0.0 0.0 0.0 0.0 0.0 100 0.0 8 2 178 0
oracle/1
11815 oracle 0.2 0.0 0.0 0.0 0.0 100 0.0 0.2 1 3 18 0
java/37
576 root 0.1 0.1 0.0 0.0 0.0 0.0 100 0.1 331 1 1K 0
nscd/11
17855 oracle 0.1 0.0 0.0 0.0 0.0 100 0.0 0.1 5 0 5 0 java/2
11805 oracle 0.1 0.1 0.0 0.0 0.0 0.0 96 3.8 4 7 62 2 perl/1
11649 oracle 0.1 0.0 0.0 0.0 0.0 0.0 100 0.0 9 0 118 0
oracle/1
11780 oracle 0.0 0.1 0.0 0.0 0.0 0.0 92 8.3 52 0 354 47
webcached/1
1 root 0.0 0.1 0.0 0.0 0.0 0.0 100 0.2 13 0 361 14 init/1
4987 cognos8 0.0 0.1 0.0 0.0 0.0 0.0 57 43 338 4 232 0 java/5
4972 cognos8 0.1 0.0 0.0 0.0 0.0 0.0 91 8.5 68 0 77 0
cogbootstrap/3
17855 oracle 0.0 0.1 0.0 0.0 0.0 0.0 51 49 312 2 209 0 java/5
>From looking at the LAT column how to I compute the CPU resources
needed to reduce LAT to more "normal levels".
Page 24 of Solaris Performance and Tools includes the following
statement referring to LAT:
"This is an extremely useful metric--we can use it to estimate the
potential speedup for a thread if more CPU resources are added ..."
I have been unable to find any information on how to turn LAT into CPU
resources. I'm reluctant to use USR + SYS (370.5 the top 9 processes) +
LAT (507 for the same top 9 processes) / 100. This seems way too
simple.
Thanks
gleng
Glen Gunselman
Systems Software Specialist
TCS
Emporia State University
_______________________________________________
perf-discuss mailing list
[email protected]