Re: [Bench!][Mir] +54%..+185% performance boost for Mersenne Twister.

Joseph Rushton Wakeling via Digitalmars-d Wed, 14 Dec 2016 00:26:15 -0800

On Saturday, 26 November 2016 at 16:31:40 UTC, Ilya Yaroshenkowrote:

1. Improve RNG generation performance by making code morefriendly for CPU pipelining. Tempering (finalization)operations was mixed with internal payload update operations.

A note on this. The `opCall` (or, in the range version,`popFront`) of Ilya's implementation mixes together twosuperficially independent actions:

(1) calculating the current random variate from the currentindex

      of the internal state array;

  (2) updating the current index of the internal state array, and
      moving to the next entry.

It's straightforward to split out these two procedures into twoseparate methods (or at least two clearly separated sequenceswithin the `opCall`), but doing so results in a notableperformance hit (on my machine, something in the order of 1 GB/sless random bits).

Intertwining these steps in this way is therefore a very smartoptimization (although TBH it feels a little worrying that it'snecessary).

Re: [Bench!][Mir] +54%..+185% performance boost for Mersenne Twister.

Reply via email to