Re: [computer-go] Monte Carlo (MC) vs Quasi-Monte Carlo (QMC)

Matt Gokey Tue, 06 Feb 2007 17:23:54 -0800

Tapani Raiko wrote:

It seems that there are at least three cases:
1: Choosing a random move from a uniform distribution
2: Choosing a random move from a nonuniform distribution (patterns etc.)
3: Choosing a move taking into account what has been chosen before
The consensus seems to be that numbers 1 and 2 are MC and 3 is QMC. Mogo uses QMC within the tree in memory and MC for the leaves, so which should it be called?

Yes, programs will quickly diverge into many different "classifications" so that is why in the end it probably doesn't mean much to distinguish between just MC and QMC.

And about reducing variance: In games you only care about estimating the goodness of the best moves (in order to select the best one). You don't care how bad a move is, if you are fairly certain that it is not the best one. You should thus reduce the variance of the best moves, that is, study them more often. This is exactly what UCT is about, reducing the variance of variables of interest.
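To make the "study the best moves more often" point concrete, here is a minimal Python sketch of the UCB1 rule that UCT applies at each tree node. It is only an illustration under stated assumptions: each move is modelled as a fixed win probability (a stand-in for real playouts), and the names `ucb1_allocate`, `win_probs`, and `c` are hypothetical, not taken from any Go program.

```python
import math
import random


def ucb1_allocate(win_probs, total_playouts, c=math.sqrt(2), seed=0):
    """Distribute playouts over moves with the UCB1 rule.

    win_probs holds assumed true win rates, used only to simulate
    0/1 playout results. Returns (visits, wins) per move.
    """
    rng = random.Random(seed)
    n = len(win_probs)
    visits = [0] * n
    wins = [0] * n
    for t in range(1, total_playouts + 1):
        if t <= n:
            # Try every move once so each mean is defined.
            move = t - 1
        else:
            # Observed mean plus an exploration bonus that shrinks
            # as a move accumulates visits.
            move = max(
                range(n),
                key=lambda i: wins[i] / visits[i]
                + c * math.sqrt(math.log(t) / visits[i]),
            )
        visits[move] += 1
        if rng.random() < win_probs[move]:
            wins[move] += 1
    return visits, wins


if __name__ == "__main__":
    visits, wins = ucb1_allocate([0.2, 0.5, 0.8], 3000)
    print("playouts per move:", visits)
```

Most playouts end up on the strongest move, so its estimate has the lowest variance, while clearly bad moves are sampled only often enough to confirm that they are bad.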

I understand, but you must still correctly correlate good and bad moves and doesn't this require a goodness factor? Running simulations is the only way MC can begin to classify them and the only way for UCT to decide to further study them. So better simulations will produce better results. "One dimensional" simulations will probably not produce the best results.

I could see a case where it is possible to reduce the variance of a single variable even in the 0-1 case. Let us say that black has about a 5% chance of winning. If we could (exactly) double the chance of black winning by changing the nonuniform sampling somehow (say, enforce bad moves by white), we could sample from that and divide the estimate of black's winning chance by 2 in the end. This would of course be very difficult in practice. (A binary random variable gives more information when the chances are closer to 50-50.) This could be useful in practice in handicap games, by for instance enforcing a black pass with 1% chance every move. Sampling would be distorted towards a white win, which is realistic since white is assumed to be the stronger player, anyway.
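The arithmetic behind this idea can be checked with a small Python sketch. It assumes the idealised case described above: the distorted sampling multiplies black's true win probability by an exactly known factor. The function names and the `factor` parameter are hypothetical, and a single Bernoulli draw stands in for a whole playout.

```python
import random


def estimator_variance(p, n, factor=1.0):
    """Variance of the rescaled win-rate estimate from n playouts.

    Playouts are sampled with win probability factor * p, and the
    observed mean is divided by factor to recover an estimate of p.
    """
    q = factor * p
    return q * (1.0 - q) / (n * factor ** 2)


def rescaled_estimate(p, n, rng, factor=1.0):
    """Simulate n distorted playouts and undo the distortion."""
    q = factor * p
    wins = sum(1 for _ in range(n) if rng.random() < q)
    return wins / n / factor


if __name__ == "__main__":
    p, n = 0.05, 1000
    rng = random.Random(0)
    print("direct variance :", estimator_variance(p, n))
    print("doubled variance:", estimator_variance(p, n, factor=2.0))
    print("direct estimate :", rescaled_estimate(p, n, rng))
    print("doubled estimate:", rescaled_estimate(p, n, rng, factor=2.0))
```

With p = 0.05, the direct estimator has variance p(1-p)/n = 0.0475/n, while the doubled-then-halved estimator has variance 0.1 * 0.9 / (4n) = 0.0225/n, roughly half. The gain depends entirely on knowing the factor exactly, which is the hard part in a real game.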

I don't understand this line of reasoning.

To summarise, I agree that there are links to other MC research, and theyshould be explored.

Yes, I agree.

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
