Re: [computer-go] RAVE formula of David Silver (reposted)

Mark Boon Fri, 28 Nov 2008 05:35:15 -0800

Thanks for posting that Remi. I do remember seeing that before butsomehow I didn't notice it when looking for RAVE-related stuff recently.

Mathematics is not my strong point, so I have a hard time makingsense of those formula's. I do get the gist that it uses a UCT valueand a RAVE value in a similar fashion, one based on actual playoutsand the other based on virtual playouts (based on AMAF). The balancein which the two values influences node-selection is calculated bybeta, which favours UCT for frequently visited nodes and RAVE forunfrequently visited notes. But I'm not toally clear on what b_r andq_ur actually are in formula (11). (I don't know how to denotesubscription symbols in mail.) At first glance this seems to be a bitmore sophisticated version of what Denis was trying to explain.

What is also not clear to me from the article is how this UCT_RAVEvalue is used after it's calculated. In plain UCT search you selectthe node with the highest win/loss+UCT value. How does the virtualwin/loss ratio get used in combination with the UCT-RAVE valueresulting from formula (14)? Is this explained in the original byGelly and Silver?


        Mark


On 28-nov-08, at 07:38, Rémi Coulom wrote:

Hi Mark,
Maybe you missed the nice RAVE formula that David Silver posted inthat message:
http://computer-go.org/pipermail/computer-go/2008-February/014095.html
Unfortunately, the list archive does not keep attachments. Iattached another copy to this message.
I am not sure it is better than your formula, but I thought itwould be good to repost it, since it seems that it is not availableonline anywhere.
Rémi<rave.pdf>_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] RAVE formula of David Silver (reposted)

Reply via email to