Re: [computer-go] Goal-directedness of Monte-Carlo

Brett Koonce Mon, 08 Sep 2008 17:04:03 -0700

Greetings from a lurker,

Forgive me if I am talking out of my hat. It has been a long time since I have done any real coding.

It seems most of the gains in MC/UCT come fairly quickly (or rather you can get within 50% of a good move guess with a few iterations). It would be interesting to perhaps do a progressive stepping down/widening, i.e. 1k playouts with komi + 3 as the cutoff, then feed this tree into 2k playouts with komi + 2, then 4k playouts with komi + 1, and then finally do the usual full blown regular analysis, say 50k playouts (numbers can be tweaked of course). You would lose the initial simulations from your final one, so you would be sacrificing say 10% of the possible simulations, but on the other hand it would seem to bias the tree toward making moves that have a greater chance of winning by a greedy amount without explicitly telling the computer it has to win by a certain number, which would seem dangerous if the simulations are near the threshold.
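
To make the schedule above concrete, here is a rough sketch in Python. It is only an illustration under loud assumptions: it uses flat UCB1 over the root moves rather than a full UCT tree, the playout is a toy stand-in that draws a score margin from a Gaussian, and the names staged_search, toy_playout and SCHEDULE are made up for this example. The one thing it does try to show faithfully is the staging: the win threshold starts at komi + 3 and steps down to komi, while the visit and win counts carry over from one stage to the next.

```python
import math
import random

# Hypothetical schedule taken from the numbers in the post:
# (number of playouts, extra points added to komi for that stage).
SCHEDULE = [(1000, 3), (2000, 2), (4000, 1), (50000, 0)]


def toy_playout(move, rng):
    """Toy stand-in for a real playout (an assumption, not a Go engine).

    Each move is a (mean, spread) pair; the return value plays the role
    of the final board margin before komi is applied.
    """
    mean, spread = move
    return rng.gauss(mean, spread)


def staged_search(moves, komi, schedule=SCHEDULE, seed=0):
    """Flat UCB1 over root moves, keeping statistics across komi stages."""
    rng = random.Random(seed)
    visits = [0] * len(moves)
    wins = [0] * len(moves)
    total = 0

    def ucb(j):
        # Standard UCB1 score: observed win rate plus exploration bonus.
        return wins[j] / visits[j] + math.sqrt(2 * math.log(total) / visits[j])

    for playouts, bonus in schedule:
        # Margin a playout must exceed to count as a win in this stage.
        threshold = komi + bonus
        for _ in range(playouts):
            if total < len(moves):
                i = total  # try every move once before using UCB1
            else:
                i = max(range(len(moves)), key=ucb)
            margin = toy_playout(moves[i], rng)
            wins[i] += int(margin > threshold)
            visits[i] += 1
            total += 1

    # Pick the most-visited move, as is common in MC/UCT programs.
    best = max(range(len(moves)), key=lambda j: visits[j])
    return best, visits
```

Calling staged_search([(5.0, 10.0), (12.0, 10.0)], komi=7.5) runs all four stages on two toy moves. With the numbers as written, the three early stages use 7,000 of 57,000 playouts in total, roughly 12%, which is in the same ballpark as the "say 10%" above.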

I apologize if this is an obvious idea, was just wondering if there was a flaw with it/someone had done experiments in this direction already.


-Brett
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
