In particular, they had no way to train a value net, so it was back to
AlphaGo v1 style of training just a policy net and reusing it as the
rollout policy.
On Fri, Apr 6, 2018 at 6:31 AM Fidel Santiago wrote:
> Hello,
>
> Apparently the lessons of Alphago (and many others) are being applied to
Hello,
Apparently the lessons of Alphago (and many others) are being applied to
other fields:
https://www.nature.com/articles/d41586-018-03774-5
"The authors devised a computational process that starts by automatically
extracting chemical transformations from a large commercial database, being
c