-------- Original Message --------
Subject: Re: [Fwd: Re: Parsing Olio runtimeStats]
Date: Wed, 12 May 2010 15:18:36 -0700
From: Akara Sucharitakul <[email protected]>
Reply-To: [email protected]
Organization: Oracle
To: Vasileios Kontorinis <[email protected]>
CC: [email protected], "akara.sucharitakul"
<[email protected]>, Shanti Subramanyam
<[email protected]>
References: <[email protected]>
<[email protected]>
I don't think so. CErr is very straightforward logic. If the requests
all wait for the response but have not yet timed out, we cannot conclude
there are any errors. So the CErr count is 0 for that period. This is
very common behavior when the system runs out of resources.
-Akara
On 05/12/10 02:53 PM, Vasileios Kontorinis wrote:
Akara hi again,
Something weird is going on. Check the following driver.log part
from the steady state...
May 4, 2010 10:22:00 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 115.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=13.997/3.999/17.397/12.398/1.800/0.400/0.800
OThru=12.763/5.141/17.398/12.399/1.388/0.518/0.953
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000
CResp=0.008/0.008/0.020/0.012/0.075/0.024/0.140
OResp=0.056/0.058/0.076/0.057/0.150/0.035/0.186
CSD=0.000/0.000/0.059/0.000/0.000/0.005/0.000
OSD=0.329/0.302/0.349/0.345/0.307/0.033/0.200
C90%Resp=0.020/0.020/0.040/0.020/0.140/0.020/0.160
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.160
May 4, 2010 10:22:05 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 120.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=12.803/5.401/14.603/12.202/2.200/0.000/1.400
OThru=12.765/5.155/17.243/12.388/1.433/0.489/0.978
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000
CResp=0.008/0.008/0.018/0.010/0.070/-/0.149
OResp=0.053/0.055/0.073/0.055/0.143/0.035/0.184
CSD=0.007/0.000/0.000/0.000/0.000/-/0.000
OSD=0.319/0.293/0.341/0.336/0.292/0.033/0.193
C90%Resp=0.020/0.020/0.040/0.020/0.120/-/0.180
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:10 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 125.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=11.400/5.800/18.400/11.800/0.800/0.600/1.000
OThru=12.694/5.189/17.304/12.357/1.400/0.495/0.979
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000
CResp=0.008/0.007/0.018/0.011/0.129/0.031/0.138
OResp=0.051/0.052/0.070/0.052/0.142/0.035/0.181
CSD=0.000/0.019/0.017/0.006/0.000/0.000/0.010
OSD=0.312/0.285/0.332/0.328/0.288/0.032/0.188
C90%Resp=0.020/0.020/0.040/0.020/0.180/0.040/0.160
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:19 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 133.57s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=9.116/4.324/12.973/9.233/0.468/0.117/0.468
OThru=12.398/5.118/16.946/12.099/1.323/0.463/0.937
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000
CResp=0.021/0.007/0.033/0.075/0.094/0.033/0.177
OResp=0.049/0.049/0.068/0.054/0.141/0.035/0.181
CSD=0.115/2.938/0.108/0.372/0.000/0.000/0.380
OSD=0.304/0.276/0.322/0.331/0.284/0.031/0.199
C90%Resp=0.020/0.020/0.060/0.040/0.120/0.020/0.220
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:20 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 135.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=3.463/1.385/4.155/4.155/0.000/0.000/0.000
OThru=12.275/5.066/16.770/11.989/1.305/0.457/0.924
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000
CResp=0.775/0.170/0.131/1.253/-/-/-
OResp=0.052/0.050/0.068/0.059/0.141/0.035/0.181
CSD=1.237/0.078/13.954/0.935/-/-/-
OSD=0.316/0.276/0.321/0.346/0.284/0.031/0.199
C90%Resp=0.320/0.160/0.180/1.700/-/-/-
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:25 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 140.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=0.000/0.000/0.000/0.000/0.200/0.000/0.000
OThru=11.717/4.836/16.008/11.444/1.254/0.436/0.882
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000 CResp=-/-/-/-/1.751/-/-
OResp=0.052/0.050/0.068/0.059/0.153/0.035/0.181 CSD=-/-/-/-/0.000/-/-
OSD=0.316/0.276/0.321/0.346/0.315/0.031/0.199
C90%Resp=-/-/-/-/0.020/-/-
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:30 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 145.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=0.000/0.000/0.000/0.000/0.000/0.000/0.000
OThru=11.208/4.626/15.312/10.947/1.200/0.417/0.843
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000 CResp=-/-/-/-/-/-/-
OResp=0.052/0.050/0.068/0.059/0.153/0.035/0.181 CSD=-/-/-/-/-/-/-
OSD=0.316/0.276/0.321/0.346/0.315/0.031/0.199 C90%Resp=-/-/-/-/-/-/-
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:35 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 150.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=0.000/0.000/0.000/0.000/0.000/0.000/0.000
OThru=10.741/4.433/14.674/10.491/1.150/0.400/0.808
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000 CResp=-/-/-/-/-/-/-
OResp=0.052/0.050/0.068/0.059/0.153/0.035/0.181 CSD=-/-/-/-/-/-/-
OSD=0.316/0.276/0.321/0.346/0.315/0.031/0.199 C90%Resp=-/-/-/-/-/-/-
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:40 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
INFO: 155.01s - UIDriver:
HomePage/Login/TagSearch/EventDetail/PersonDetail/AddPerson/AddEvent
CThru=0.000/0.000/0.000/0.000/0.000/0.000/0.000
OThru=10.311/4.256/14.087/10.071/1.104/0.384/0.776
CErr=0.000/0.000/0.000/0.000/0.000/0.000/0.000 CResp=-/-/-/-/-/-/-
OResp=0.052/0.050/0.068/0.059/0.153/0.035/0.181 CSD=-/-/-/-/-/-/-
OSD=0.316/0.276/0.321/0.346/0.315/0.031/0.199 C90%Resp=-/-/-/-/-/-/-
O90%Resp=0.020/0.020/0.060/0.040/0.180/0.080/0.180
May 4, 2010 10:22:45 PM
com.sun.faban.driver.engine.MasterImpl$StatsWriter dumpStats
I have highlighted the interesting parts.
The execution proceeds nornally till a certain point, when (I have
verified with vmstat) it starts swapping (there is not enough memory
in the virtual machine). At this point cpu goes to 100% and CThru is
stack at 0 after that and no CErr are reported.
Any intuition on CErr is stuck at 0 as well?
Is there any way to detect this situation, besides looking at the
throughput numbers? The expected behavior would be to log a huge
amount of CErr, all the operations basically during the interval. Right?
Thanks
-------------------------------------------------------------------
Kontorinis Vasileios
Phd student, University of California San Diego
San Diego, CA 92122
Cell. phone: (858) 717 6899
[email protected] <mailto:[email protected]>,
[email protected] <mailto:[email protected]>
-------------------------------------------------------------------
2010/5/5 Akara Sucharitakul <[email protected]
<mailto:[email protected]>>
Hmmm, the apache mailing list won't let my current email post a
reply. Trying again.
-Akara
-------- Original Message --------
Subject: Re: Parsing Olio runtimeStats
Date: Wed, 05 May 2010 14:17:35 -0700
From: Akara Sucharitakul <[email protected]>
<mailto:[email protected]>
Reply-To: [email protected]
<mailto:[email protected]>
Organization: Oracle
To: [email protected]
<mailto:[email protected]>
CC: akara.sucharitakul <[email protected]>
<mailto:[email protected]>, Shanti Subramanyam
<[email protected]> <mailto:[email protected]>
References:
<[email protected]>
<mailto:[email protected]>
Let me try to address this below:
On 05/05/10 12:12, Vasileios Kontorinis wrote:
Akara and Shanti hi,
I am parsing the runtimeStats from the driver.log file and I
run into this interesting issue.
In the output for _steady state_ I get "-" for the response time
when there are no successful operations since the last time
runtimeStats were printed.
This can happen for two reasons:
1) The interval for printing the runtimeStats is small (in my
case 5secs) and some operations, especially the ones that take
long and have small frequency in matrix (add event, add user)
just never happened.
Yes, if the number of successful operations in that period is 0,
most calculations will be a divide by 0. That's why it is printing
a '-'.
2) There are a bunch of requests going on and none of them is
successful. This can happen when there is no sufficient memory in
the machine. In that case there is lots of swapping, the cpu goes
to 100% utilization and all the operations time-out.
Is there any small change I can do to distinguish between the two
cases? Maybe printing a small "t" when there are many requests
timing out, so that I can distinguish between the two cases? Any
ideas are welcome.
There is the error count in the runtime stats that tell you about
error cases. This would also include timeouts. But the problem is
not that simple. The real problem is not the operations timing out
but rather the operations waiting that would time out. And we
cannot distinguish between these waiting operations and other
operations waiting just with slow response time.
One more thing. I use the variableLoad setting and in order to
know how many users are simulated while the benchmark run, I
parse log.xml looking for "Active threads: ". The problem is that
log.xml is big especially when many requests timeout. Is it easy
to also log the changes of active threads in the drive.log file ??
Please look into log configuration in
$FABAN/config/logging.properties. You should be able to make
certain loggers log to a particular file. I don't have the detail
off the top of my head.
-Akara