Instead of using different policies to choose a child node, another possibility is to run different playouts over the same tree, with each playout using a different policy. Standard UCT-MC would be one of the policies. I think this would achieve the same results as RAVE.
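To make the idea concrete, here is a rough sketch of what I mean, not a tested Go implementation. It assumes a toy game (Nim: take 1 to 3 stones, whoever takes the last stone wins) instead of Go, a round-robin schedule over the policies, and a uniformly random rollout below the tree. All names (Node, uct_policy, greedy_policy, random_policy, playout, search) are made up for illustration.

```python
import math
import random


class Node:
    def __init__(self, stones):
        self.stones = stones   # stones left; the player to move takes 1-3
        self.children = {}     # move -> Node
        self.visits = 0
        self.wins = 0.0        # wins for the player who moved INTO this node


def legal_moves(stones):
    return [m for m in (1, 2, 3) if m <= stones]


def uct_policy(node, rng):
    """Standard UCB1 selection; unvisited children are tried first."""
    def score(child):
        if child.visits == 0:
            return float("inf")
        explore = math.sqrt(2 * math.log(node.visits + 1) / child.visits)
        return child.wins / child.visits + explore
    return max(node.children, key=lambda m: score(node.children[m]))


def greedy_policy(node, rng):
    """Pick the best mean win rate; unvisited children count as 0.5."""
    def mean(child):
        return child.wins / child.visits if child.visits else 0.5
    return max(node.children, key=lambda m: mean(node.children[m]))


def random_policy(node, rng):
    """Uniformly random child."""
    return rng.choice(list(node.children))


def playout(root, policy, rng):
    """One playout: descend with `policy`, expand one node, roll out, back up."""
    path = [root]
    node = root
    while node.stones > 0:
        if not node.children:
            # Leaf of the tree: expand it and step into one random child.
            node.children = {m: Node(node.stones - m)
                             for m in legal_moves(node.stones)}
            node = node.children[rng.choice(list(node.children))]
            path.append(node)
            break
        node = node.children[policy(node, rng)]
        path.append(node)

    # Random rollout below the tree (assumption: uniform random moves).
    stones = node.stones
    moves_made = len(path) - 1
    while stones > 0:
        stones -= rng.choice(legal_moves(stones))
        moves_made += 1

    # The player who made the last move wins. path[i] was reached by move i,
    # so its mover wins when i has the same parity as the total move count.
    for i, n in enumerate(path):
        n.visits += 1
        if i > 0 and (moves_made - i) % 2 == 0:
            n.wins += 1.0


def search(stones, policies, n_playouts, seed=0):
    """All playouts share one tree; the policy changes from playout to playout."""
    rng = random.Random(seed)
    root = Node(stones)
    for i in range(n_playouts):
        playout(root, policies[i % len(policies)], rng)
    best = max(root.children, key=lambda m: root.children[m].visits)
    return best, root


if __name__ == "__main__":
    best, root = search(5, [uct_policy, greedy_policy, random_policy], 3000)
    for move, child in sorted(root.children.items()):
        print(move, child.visits, round(child.wins / child.visits, 3))
    print("most visited move:", best)
```

The point is that every policy reads and writes the same visit and win counts, so the statistics gathered by one policy steer the others. Whether that really matches RAVE would have to be measured on Go; this sketch only shows the mechanics.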
DL
