On 1/04/2016 8:13 AM, Andrew Rowley wrote:

I do know that the effects of allocation patterns are very different for GC languages and languages like C/C++. In C++, allocating and freeing short-lived objects is expensive because the memory needs to be tracked. In GC languages, short-lived objects are cheap; it is objects that survive a GC that are expensive. This can make it a bit difficult to compare performance directly, because the two encourage different design decisions.


In C++, small-object allocation can be done with pool allocators: http://www.boost.org/doc/libs/1_60_0/libs/pool/doc/html/index.html
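
The same idea translates to Java when allocation itself is the bottleneck: keep a small pool of buffers alive and hand them out again rather than allocating one per record. A toy sketch (the class and method names here are mine, not from any library):

import java.util.ArrayDeque;

// Minimal buffer pool for fixed-size records: borrow() reuses a previously
// returned buffer when one is available and only allocates on a miss.
final class BufferPool {
    private final ArrayDeque<byte[]> free = new ArrayDeque<>();
    private final int bufSize;

    BufferPool(int bufSize) {
        this.bufSize = bufSize;
    }

    byte[] borrow() {
        byte[] buf = free.poll();
        return (buf != null) ? buf : new byte[bufSize];
    }

    void give(byte[] buf) {
        free.push(buf);  // keep it for the next caller instead of letting it die
    }
}

The flip side, as you say, is that a generational collector makes pooled objects long-lived by design, so this only pays off when the pooled buffers are big enough that allocating and copying them per record would otherwise dominate.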

Most of my CPU time seems to be I/O related (processing SMF data). Even removing all the reporting seems to make very little difference to the CPU consumed. I'm trying to work out if I can reduce the I/O overhead.


I guess you're using the JZOS RecordReader? From what I know of the implementation, it's micro-optimized and about as good as it gets for Java.
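
For reference, the read loop I'd expect looks roughly like this - a sketch from memory of the JZOS samples, so treat the DD name and the exact method names as assumptions to check against the Javadoc. The point is that the record buffer is allocated once and reused, so the loop itself produces almost no garbage:

import com.ibm.jzos.RecordReader;

public class SmfReadLoop {
    public static void main(String[] args) throws Exception {
        // DD name is just an example; point it at the SMF dump data set.
        RecordReader reader = RecordReader.newReaderForDD("SMFIN");
        try {
            byte[] record = new byte[reader.getLrecl()];  // one reusable buffer
            int len;
            long count = 0;
            while ((len = reader.read(record)) >= 0) {
                count++;
                // record[0..len) holds the raw SMF record; parse it in place
                // rather than copying it into new objects.
            }
            System.out.println("Records read: " + count);
        } finally {
            reader.close();
        }
    }
}

If your loop already looks like that, I doubt there's much I/O overhead left to squeeze out on the Java side.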

I don't know about JIT overhead - I think it's probably insignificant, because it only compiles methods that are actually invoked (and typically only once they get hot).

The FTS1 CPU is running CICS and IMS, and that one Java batch job spikes past it.

Without other work competing for it, anything CPU-bound will max out the CPU, so that's not necessarily a negative - it just means the job isn't waiting for anything else. E.g. at one site the DBAs greatly increased the Adabas buffer space; CPU contention increased massively because virtually nothing was waiting for I/O any more. That caused a few problems, but it's a good problem to have. It just means that if the job can only get 5% of a CPU instead of 50%, it will run roughly 10 times as long. Total CPU time is a better measure than CPU%.

Agreed. I probably didn't articulate my point very well. You can't compare a COBOL batch job to a Java batch job: the COBOL program will use 200K of memory and very little CPU, whereas the Java program will be a multi-threaded monster and gobble up as much memory as it can get. That's the nature of the beast. It probably means the density of Java workloads that can run on a zIIP is far lower than that of conventional programs on CPs.

My Java program creates Redis pipeline buffers, so it uses quite a bit of memory. When I profile the program in APA I see the following:

Name       Description             Percent of CPU Time * 10.00%  +3.9%
                                          *....1....2....3....4....5....6....7....8....9....*
APPLCN     Application Code         96.35 ************************************************
*PATHNAM   Application Program      96.35 ************************************************
 > Parallel  CSECT in *PATHNAM      34.86 *****************
 > MarkingS  CSECT in *PATHNAM      27.41 **************
 > CompactS  CSECT in *PATHNAM      17.11 *********
 > ObjectHe  CSECT in *PATHNAM       6.49 ***
 > Concurre  CSECT in *PATHNAM       3.80 **
 > HeapMapI  CSECT in *PATHNAM       1.90 *
 > J9ZERZ10  CSECT in *PATHNAM       1.26 *
 > Scavenge  CSECT in *PATHNAM       0.95
 > CardTabl  CSECT in *PATHNAM       0.63

Half of the CPU time is spent in GC. Ironically, most of this program's CPU time is spent running the C++ GC code inside the JVM! ParallelScavenger, ConcurrentScavenger and MarkingScheme, which I assume are part of the mark-and-sweep GC engine.
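
If it's the pipeline buffers that keep surviving collections, one obvious thing to try is syncing the pipeline every few thousand commands so the buffered commands and replies stay short-lived instead of piling up until the end of the job. A sketch of the idea (the Jedis client and the batch size are just illustrative assumptions, not what the job actually uses):

import java.util.Iterator;
import redis.clients.jedis.Jedis;
import redis.clients.jedis.Pipeline;

public class BatchedLoad {
    // Flush every N commands so buffered commands/replies stay short-lived.
    // 5000 is an arbitrary example value, not a recommendation.
    private static final int BATCH = 5000;

    public static void load(Jedis jedis, Iterator<String[]> keyValues) {
        while (keyValues.hasNext()) {
            Pipeline p = jedis.pipelined();
            for (int i = 0; i < BATCH && keyValues.hasNext(); i++) {
                String[] kv = keyValues.next();
                p.set(kv[0], kv[1]);
            }
            p.sync();  // drain this batch's responses so the buffers can die young
        }
    }
}

The other knob is the collector itself: on the IBM JVM, a larger nursery (-Xmn, under -Xgcpolicy:gencon) is the usual first thing to try when most of the garbage is short-lived, though I haven't measured it against this workload.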

What I really want for Christmas is to run C++ programs on a zIIP. But IBM won't let me :)
