Some months ago I did several experiments with using tactics and
patterns in playouts. Generally I found a big boost in strength from
tactics. I also found a boost from patterns, but with severely
diminishing returns after a certain number, and it even became
detrimental with a large number of patterns (thousands to tens of
thousands). Since I was using a generalized pattern-matcher, it slowed
things down considerably. Although the program played a lot better
with the same number of playouts, when I compared MC playouts with
patterns to MC playouts without patterns using the same amount of CPU
time, the gain was not so obvious. Since most of the strength gain
came from just a few patterns, I concluded, as David did, that it was
probably better to use just a handful of hard-coded patterns during
playouts.
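To make the idea concrete, here is a minimal sketch of what "a handful of hard-coded patterns" in a light playout policy might look like. The board encoding, the pattern set, and all function names are my own illustrative assumptions, not taken from any actual program discussed here:

```python
# Sketch: a handful of hard-coded 3x3 patterns in a light playout policy.
# All names, patterns, and the neighborhood encoding are illustrative
# assumptions, not an actual engine's implementation.

import random

# Hard-coded 3x3 patterns around a candidate point, written as
# 9-character strings (rows top to bottom). 'X' = our stone,
# 'O' = opponent, '.' = empty, '?' = don't care. These particular
# patterns are placeholders, not a tuned set.
HARD_CODED_PATTERNS = [
    "XO?"
    "..."
    "???",
    "?O?"
    "X.X"
    "???",
]

def matches(pattern: str, neighborhood: str) -> bool:
    """Match a concrete 9-char neighborhood against one pattern ('?' matches anything)."""
    return all(p == '?' or p == n for p, n in zip(pattern, neighborhood))

def pattern_moves(candidates):
    """Return candidate moves whose 3x3 neighborhood matches any pattern.

    `candidates` is a list of (move, neighborhood) pairs; in a real
    program the neighborhood would be read off the board around the move.
    """
    return [m for m, nb in candidates
            if any(matches(p, nb) for p in HARD_CODED_PATTERNS)]

def select_playout_move(candidates, rng=random):
    """Prefer a pattern-matching move; otherwise fall back to a uniformly random one."""
    hits = pattern_moves(candidates)
    pool = hits if hits else [m for m, _ in candidates]
    return rng.choice(pool)
```

Because the set is tiny and fixed, the matching cost per move stays near-constant, which is exactly the trade-off versus a generalized matcher with thousands of patterns.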
I only recently started to do real experiments with hard-coded
patterns, and so far my results are rather inconclusive. I found that
when mixing different things it's not always clear what contributes
to any observed increase in strength, so I'm still in the process of
trying to dissect what is actually contributing where. For example, I
found that a lot of the increased level of play from patterns does
not come from using them during playouts, but from the effect they
have on move-exploration. I don't know if this is due to my
particular way of implementing MC playouts in combination with UCT
search, but moves matching a pattern are (usually) automatically
tried first during tree-expansion as well. So far I'm observing that
most of the increase in level comes from the selection during
exploration, and only a small part from the selection during simulation.
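One plausible mechanism for this effect is giving pattern-matching children prior (virtual) win/visit statistics at expansion time, so UCT tries them first in the tree. The sketch below shows that idea; the prior sizes, the first-play-urgency constant, and the node fields are all my assumptions for illustration, not a description of anyone's actual implementation:

```python
# Sketch: biasing UCT *tree* selection with pattern priors, as opposed
# to biasing the playouts themselves. Prior sizes, the FPU constant,
# and field names are illustrative assumptions.

import math

class Node:
    def __init__(self, move, matches_pattern=False,
                 prior_wins=5, prior_visits=10):
        self.move = move
        # Pattern-matching children start with optimistic virtual
        # statistics (here 5 wins in 10 visits), so UCT explores them
        # first; other children start unvisited.
        self.wins = prior_wins if matches_pattern else 0
        self.visits = prior_visits if matches_pattern else 0

def uct_value(child, parent_visits, c=1.0, fpu=0.5):
    """UCT score for one child. Unvisited children get a first-play-
    urgency constant instead of +infinity, so the pattern priors
    actually change the exploration order."""
    if child.visits == 0:
        return fpu
    return (child.wins / child.visits
            + c * math.sqrt(math.log(parent_visits) / child.visits))

def select_child(children, parent_visits):
    """Pick the child maximizing the UCT score."""
    return max(children, key=lambda ch: uct_value(ch, parent_visits))
```

Under this scheme the patterns shape which branches get visits early in the search, which would show up as a gain in "exploration" even if the playout policy itself were unchanged.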
For example, in one particular experiment using just 5 patterns, I
saw a win-rate of 65% against the same program not using patterns
(with the same number of playouts). But when I did not use the
patterns during exploration, the win-rate dropped to just 55%.
I still have a lot of testing to do, and it's too early to draw any
hard conclusions. But I think it's worthwhile trying to distinguish
where the strength is actually gained. Better yet would be finding
out exactly 'why' it gained strength, because with MC playouts I
often find test results highly counter-intuitive, occasionally to the
point of being (seemingly) nonsensical.
I also think what Don was proposing with his reference bot could be
interesting: trying to make it play at around 1700 Elo on CGOS using
just 5,000 (light) playouts. I don't know if it's possible, but I
think it's a fruitful exercise. At a time when most people are
looking at using more and more hardware to increase playing strength,
knowing what plays best at the other end of the spectrum is valuable
as well. By that I mean finding what plays best under severely
constrained resources.
Mark
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/