At 15:53 -0600 on 12/24/2015, Joel C. Ewing wrote about Re: Is there
a source for detailed, instruction-level perfo:
As Tom has noted, the most dramatic performance enhancements typically
come from a change in the strategy or algorithm used. In my experience
you get better results by finding ways to accomplish the end result
with fewer actions rather than by micro-optimizing the individual
actions.
This story (and the others) reminds me of an incident that occurred
early in my programming life.
We had an application that read Column Binary data on a 2540 Card
Reader. The gotcha was that the card was not pure Column Binary but
half CB, with the other half being normal EBCDIC. The program would
read the card as CB but not eject it, leaving the card image in the
reader's buffer. It would then do a second read (from the buffer) as
EBCDIC with the bad-format flag on and eject the card. The result was
a 160-byte image of the card as CB and an 80-byte EBCDIC image with
the CB columns as random junk.
This worked until the Bean Counters wanted to replace the 2540 with a
2501 (since we were no longer punching output cards, we did not need
the punch capability of the 2540). Since the 2501 was an unbuffered
read-and-eject device, you got one crack at reading the card, so it
could only be read as CB (unless we did a second pass of the deck to
get the EBCDIC data). It was decided to have the program take the CB
image and convert the EBCDIC section from CB. The task of writing the
conversion routine was given to another programmer, who built a table
of all 256 2-byte bit patterns that represented the holes in the
card. His program would then search the table one column at a time (I
do not remember whether this was a binary or hash search). In any
case, the program was slow and inefficient.
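For contrast, the original table-search approach might have looked
something like the sketch below (Python rather than assembler; the
12-bit hole-pattern encoding, the function names, and the tiny
stand-in for the full 256-entry table are all hypothetical): one
search per column, 80 per card.

```python
import bisect

# Hypothetical subset of the real table: (12-bit hole pattern, character),
# with bit 11 = row 12, bit 10 = row 11, bit 9 = row 0, bit 8 = row 1, etc.
TABLE = sorted([(0x000, ' '), (0x080, '2'), (0x100, '1'),
                (0x900, 'A'), (0x500, 'J')])
KEYS = [k for k, _ in TABLE]

def lookup(pattern):
    """Binary-search the table for one column's hole pattern."""
    i = bisect.bisect_left(KEYS, pattern)
    if i < len(KEYS) and KEYS[i] == pattern:
        return TABLE[i][1]
    return '?'                      # unrecognized hole pattern

def convert_by_search(cb_image):
    # One search per 2-byte column -- the per-column cost that made
    # the original routine slow.
    return ''.join(lookup(int.from_bytes(cb_image[i:i + 2], 'big'))
                   for i in range(0, len(cb_image), 2))
```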
I was asked to look at his code and see if I could speed it up. I was
able to do so by starting from scratch, using a few TRs and an OC.
The basic idea was to use a TR to separate the top 6 rows from the
bottom 6 rows of the card image in the CB buffer, then TR each of the
two sets of rows to form a 5-bit map showing whether Row 12/11/0/8/9
was punched and a 3-bit binary number from 0 to 7 showing which row
in the 1-7 range was punched (i.e., Row 5 yielded 101). OC'ing the
top row over the bottom yielded a value showing which punches were on
the card. Running the result through one final TR converted the
Card-Image EBCDIC into the Internal Mapping.
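The TR/OC sequence maps nicely onto Python, where bytes.translate()
behaves like TR (a 256-entry byte-for-byte table lookup over a whole
buffer) and a byte-wise OR stands in for OC. The sketch below is only
an illustration under assumed formats: the exact 2540 column-binary
byte layout and the full EBCDIC hole-pattern table are not in the
post, so the punch layout, the intermediate code, and the ASCII
(rather than EBCDIC) output table here are all hypothetical.

```python
def punch(rows):
    """Assumed layout: 2 bytes per column, the first carrying rows
    12,11,0,1,2,3 in its high six bits, the second rows 4..9."""
    top = bot = 0
    for r in rows:
        if r == 12:   top |= 0x80
        elif r == 11: top |= 0x40
        elif r == 0:  top |= 0x20
        elif r <= 3:  top |= 0x10 >> (r - 1)   # rows 1-3
        else:         bot |= 0x80 >> (r - 4)   # rows 4-9
    return bytes([top, bot])

def code(rows):
    """Intermediate byte: flag bits for rows 12/11/0/8/9, plus the
    punched row number (1-7) in the low three bits."""
    c = 0
    for r in rows:
        if r == 12:   c |= 0x80
        elif r == 11: c |= 0x40
        elif r == 0:  c |= 0x20
        elif r == 8:  c |= 0x10
        elif r == 9:  c |= 0x08
        else:         c |= r                   # rows 1-7 as binary
    return c

# The two TR tables: top-half byte -> partial code, bottom-half -> partial code.
TOP = bytes(code([r for r, m in ((12, 0x80), (11, 0x40), (0, 0x20),
                                 (1, 0x10), (2, 0x08), (3, 0x04)) if b & m])
            for b in range(256))
BOT = bytes(code([r for r, m in ((4, 0x80), (5, 0x40), (6, 0x20),
                                 (7, 0x10), (8, 0x08), (9, 0x04)) if b & m])
            for b in range(256))

# Final TR table: hole pattern -> character (ASCII here for readability;
# only the standard digit and zone-punch graphics are filled in).
FINAL = bytearray(b'?' * 256)
FINAL[code([])] = ord(' ')
FINAL[code([0])] = ord('0')
for d in range(1, 10):
    FINAL[code([d])] = ord('0') + d
for i, ch in enumerate('ABCDEFGHI'):               # 12 + 1..9
    FINAL[code([12, i + 1])] = ord(ch)
for i, ch in enumerate('JKLMNOPQR'):               # 11 + 1..9
    FINAL[code([11, i + 1])] = ord(ch)
for i, ch in enumerate('STUVWXYZ'):                # 0 + 2..9
    FINAL[code([0, i + 2])] = ord(ch)
FINAL = bytes(FINAL)

def convert(cb_image):
    tops = cb_image[0::2].translate(TOP)               # first TR
    bots = cb_image[1::2].translate(BOT)               # second TR
    merged = bytes(a | b for a, b in zip(tops, bots))  # the OC step
    return merged.translate(FINAL)                     # final TR

card = b''.join(punch(r) for r in
                ([12, 9], [12, 2], [11, 4], [], [3], [6], [0]))
convert(card)   # -> b'IBM 360'
```

The point of the design is that every per-column decision is baked
into the tables once, so the per-card work is just three table passes
and an OR, with no searching at all.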
Using the same sequence with a different set of TR tables, and
replacing the final TR with a TRT that checked for more than 1 bit
on, acted as a sanity check on the EBCDIC part of the CB. My version
ran VERY fast. The major effort was creating the TR tables (and all
of the mapping info needed was already there, since it had been
worked out by the original programmer when he created his tables).
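The TRT-style check can be sketched the same way: translate the
merged hole-pattern bytes through a table that yields nonzero for any
invalid code, then scan for the first nonzero byte, which is roughly
what TRT does in a single instruction. The VALID set below is a
made-up handful of codes purely for illustration.

```python
# Hypothetical set of legal intermediate codes (e.g. blank, '1', '2',
# 'A' = 12+1, 'J' = 11+1 under the encoding sketched earlier).
VALID = {0x00, 0x01, 0x02, 0x81, 0x41}
BAD = bytes(0 if b in VALID else 1 for b in range(256))

def first_bad_column(merged):
    """TRT analog: return the index of the first column whose hole
    pattern is not valid EBCDIC, or -1 if the whole card passes."""
    return merged.translate(BAD).find(1)
```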
----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN