Re: Normal/Gaussian random number generation for D

jerro Fri, 26 Oct 2012 14:05:46 -0700

I'm not expert on this, but there are various tests describedin this paper which compares different normal random numbergeneration techniques:
http://www.cse.cuhk.edu.hk/%7Ephwl/mt/public/archives/papers/grng_acmcs07.pdf


I'll implement the tests described in this paper.

... and the topic is also discussed in this paper:
http://www.doornik.com/research/ziggurat.pdf

I don't have sufficient knowhow to really feel confident abouthow to test these things though, and part of me feels it mightbe worth at some point approaching some stats experts andasking them to help define a test suite for D's random numberfunctionality.


Maybe there are testsuites already written for this.

Well, as I defined the function, the UniformRNG input has adefault value of rndGen, so if you call it just as normal(mean,sigma); it should work as intended. But if there's some factorwhich means it would be better to define a separate functionwhich doesn't receive the RNG as input, I can do that.


I don't see any downside to this.

I think it probably depends on the details of the technique.E.g. with Box-Muller you cache 2 random values, and you need tore-generate them for every pair of normal random variates theuser requests, so there's no problem with having it initializedin opCall on the first call to the engine.
In general I favour that approach of initializing in opCallbecause it means that you don't have to ever pass anything tothe constructor and also if I have a simulation which uses alot of random numbers, its output won't be changed simply bycreating an instance of a normal random number generator --only when I start actually _using_ that generator in place ofsome other source of randomness.

I have only been thinking about the Ziggurat algorithm, but youare right, it does depend on the details of the technique. ForBox-Muller (and other engines that cache samples) it only makessense to compute the first samples in the opCall. But for theZiggurat algorithm, tables that must be computed before you canstart sampling aren't changed during sampling and computing thetables doesn't require any additional arguments. So it makes themost sense for those tables to be initialized in the struct'sconstructor in the struct based API.

In my previous post I was talking about initializing staticinstances of the engine used in the normal() function. Theadvantage of initializing in a static constructor is that youdon't need an additional check every time the normal() functionis called. But because we will also have a struct based API, thatwill not require such checks (at least not for all engines), thisisn't really that important. So we can also initialize the globalengine instance in a call to normal(), if this simplifies things.

Did you get the code I attached to my original post on thistopic? Is there something extra that you need?

I didn't notice the attachments, sorry (but I see them now). Itseems it will be easy enough to write NormalZigguratEngine,there's just one thing that I think should be added to the Normalstruct. Engines should have a way to do their initialization inNormal's constructor (where possible), so I think something likethis:


static if(is(typeof(_engine.initialize())))
    _engine.initialize();

should be added to Normal's constructor. Or maybe

_engine = NormalRandomNumberEngine();

so that a static opCall can be used.

I'm going to turn that into patches for Phobos which I'll putup on GitHub in the next days, so we can pull and push andtest/write together as needed.

Maybe we should also have a separate file in a separate branchfor tests. There will probably be a lot of code needed to testthis well and the tests could take a long time to run, so I don'tthink they should go into unit test blocks (we can put somesimpler tests in unit test blocks later). It may also be usefulto use external libraries such as dstats for tests.

Re: Normal/Gaussian random number generation for D

Reply via email to