Ok, let's go through this one at a time. See inserted comments.
Alan Ackerman wrote:
Creating new thread.
1. The folks that receive the data at my shop are z/OS folks. Historically
the capture ratio of MVS was really poor. The notion was that you should
use SMF data and never RMF data. I don't know if z/OS has cleaned up its
act or not.
But I have heard the same thing from VM folks. (I've said it myself.)
As Barton says, the capture ratio in VM has always been quite high, due t
the way the data is captured in the VMDBK. However, Barton computes this
(I think) by comparing different record types in the monitor data, not by
comparing monitor to accounting data.
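To make the metric being debated concrete: a capture ratio is just the CPU
time attributed to identifiable users divided by the total CPU time the
system actually consumed. A minimal sketch (the numbers and field layout
are invented for illustration, not actual monitor or accounting record
formats):

```python
def capture_ratio(user_cpu_seconds, total_cpu_seconds):
    """Fraction of total consumed CPU time that was attributed to users.

    A ratio of 1.0 (100%) means no CPU time went unaccounted for;
    anything below that is uncaptured "overhead"."""
    if total_cpu_seconds == 0:
        return 0.0
    return sum(user_cpu_seconds) / total_cpu_seconds

# Example: 59.5 user-attributed seconds out of 60 consumed in the interval.
ratio = capture_ratio([30.0, 20.0, 9.5], 60.0)
print(f"{ratio:.1%}")
```

Whether the denominator comes from other monitor record types or from
accounting records is exactly the methodological difference noted above.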
There is system overhead, but it is captured in the SYSTEM VMDBK block.
Accounting data and monitor data draw on the same underlying counters, so
they should get the same results. Of course, some time gets charged to the
wrong user, for example between the time an interrupt comes in and the new
user is identified. But it shows up the same in the monitor and the
accounting data. (User CPU time is more reproducible than total CPU time,
for this reason.)
Is "some time gets charged to the wrong user" a validated and relevant
issue? I've not seen any "overhead" issues in accounting or monitor data
in MANY years.
2. Monitor sample data is taken at one-minute intervals. It used to be
that data for users that logged on or off between samples was dropped for
the partial minutes. Is this still true? Was it ever true? Or is it urban
folklore?
Transaction records are cut at logon/logoff, that is how we get 100.00%
capture ratio. Nothing is lost.
3. On our systems, we sometimes see messages from CP that say the monitor
data has been thrown away because the user connected to *MONITOR did not
respond in time. This happens when the system is overloaded, either in CP
or storage. So we lose some minutes of monitor data, but not, I think,
accounting data.
Often you can fix this by increasing the segment sizes or giving
MONWRITE/ESAWRITE a bigger SHARE. Not always, though. In some cases the
monitor segments get paged out. (We reported it to Velocity, who said it
was a CP problem.) I think IBM could do things to make collection of
monitor data more reliable in the extreme cases.
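For example, the two knobs mentioned above are the writer's scheduler
SHARE and the monitor sample configuration area. The commands below are
standard CP syntax, but the user ID and values are illustrative only;
check them against your z/VM level before use:

```
CP SET SHARE MONWRITE RELATIVE 1500
CP MONITOR SAMPLE CONFIG SIZE 241
```

A higher relative SHARE makes it likelier the writer gets dispatched in
time to drain the monitor DCSS before CP discards data.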
Unfortunately, I'm not responsible for this and it is "only performance
data". I think this can be dealt with, but it does take diligence and work
to keep your monitor data accurate. You don't have to do this work for
accounting data.
I think IBM could do things to make collection of monitor data easier.
This still does happen occasionally when systems are thrashing so much
that everything stops. At that point, accounting is probably a lower
priority. Capacity planning and performance tuning are still needed on
this platform.
IBM could stop the DCSS from being paged out when the system starts to
thrash.
4. On our systems, we switch files (I think hourly) to keep them from
getting too big. We lose a minute or two of data each time.
ESALPS does not lose data each hour; the capture ratio is 100%.
5. The default for ESAWRITE is to collect User history records only for
userids using more than 0.5% CPU. So when we go back to process CPU
utilization for users, we get smaller totals for monitor than from
accounting data. I assume this could be fixed by setting the threshold to
zero.
I don't know which of these, if any, affect the ESALPS data collection
that Barton mentioned. We have tested ESALPS, but are not yet licensed.
The default for ESAWRITE is a 100% capture ratio. ALL USER DATA is
captured and retained for capacity planning and accounting. The
thresholds only apply to current performance data. This has been the
case for 20 years. I'll repeat: the capture ratio for user data is ALWAYS
100.00%. You can't look at the interval data collected for performance
and use it for accounting. The summary data for each hour is 100% and
is what one would use for accounting and capacity planning.
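The distinction being drawn here, that thresholded interval records
undercount while unfiltered summary records do not, can be sketched as
follows (the user names, numbers, and a seconds-based threshold are all
invented for illustration; the real ESAWRITE threshold is expressed as a
percentage of CPU):

```python
# Hypothetical per-user CPU seconds for one interval.
users = {"MAINT": 30.0, "LINUX01": 20.0, "SMALLSVM": 0.2, "IDLEUSR": 0.1}

THRESHOLD = 0.5  # only users above this appear in performance records

# Performance (interval) view: small users are filtered out.
perf_total = sum(t for t in users.values() if t > THRESHOLD)

# Summary (accounting) view: every user is retained.
summary_total = sum(users.values())

print(f"interval view: {perf_total:.1f}s, summary view: {summary_total:.1f}s")
```

The interval view comes up short by exactly the small users' time, which
is why the summary data, not the thresholded interval data, is what you
reconcile against accounting.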
Alan Ackerman
Alan (dot) Ackerman (at) Bank of America (dot) com