Re: [Computer-go] Monte-Carlo Tree Search as Regularized Policy Optimization

Rémi Coulom Thu, 16 Jul 2020 10:48:15 -0700

This looks very interesting.

From a quick glance, it seems the improvement shows up mainly when the number
of playouts is small. Also, they don't test on the game of Go. Has anybody
tried it?


I will take a deeper look later.

On Thu, Jul 16, 2020 at 9:49 AM Ray Tayek <[email protected]> wrote:

>
> https://old.reddit.com/r/MachineLearning/comments/hrzooh/r_montecarlo_tree_search_as_regularized_policy/
>
>
> --
> Honesty is a very expensive gift. So, don't expect it from cheap people -
> Warren Buffett
> http://tayek.com/
>
> _______________________________________________
> Computer-go mailing list
> [email protected]
> http://computer-go.org/mailman/listinfo/computer-go
>

