Re: Can this implementation of Damm algorithm be optimized?

Era Scarecrow via Digitalmars-d-learn Sun, 12 Feb 2017 17:16:55 -0800

On Monday, 13 February 2017 at 00:56:37 UTC, Nestor wrote:

On Sunday, 12 February 2017 at 05:54:34 UTC, Era Scarecrowwrote:
Ran some more tests.
Wow!
Thanks for the interest and effort.

Certainly. But the bulk of the answer comes down that the 2levels that I've already provided are the fastest you're probablygoing to get. Certainly we can test using shorts or bytesinstead, but it's likely the results will only go down.

To note my tests are strictly on my x86 system and it would bebetter to also test this on other systems like PPC, Linux, ARM,and other architectures to see how they perform, and possiblytweak them as appropriate.

Still we did find out there is some optimization that can bedone and successfully for the Damm algorithm, it just isn't goingto be a lot.

Hmmm... A thought does come to mind. Parallelizing the code;However that would require probably 11 instances to get a 2xspeedup (calculating the second half with all 10 possibilitiesfor the carry over, and also calculating the first half, thenchoosing which of the 10 based on the first half's output), whichonly really works if you have a ton of cores, and the input isREALLY REALLY large, like a meg or something. While the usage ofthe Damm code is more useful for adding a digit to the end of acode like UPC or Barcodes as error detection, and expectinglarger than 32 for real applications is unlikely.


 But at this point I'm rambling.

Re: Can this implementation of Damm algorithm be optimized?

Reply via email to