Would the expected improvement be reduced training time or improved
accuracy?


2016-06-11 23:06 GMT+03:00 Stefan Kaitschick <stefan.kaitsch...@hamburg.de>:

> If I understood it right, the playout NN in AlphaGo was created by using
> the same training set as the one used for the large NN that is used in the
> tree. There would be an alternative though. I don't know if this is the
> best source, but here is one example: https://arxiv.org/pdf/1312.6184.pdf
> The idea is to teach a shallow NN to mimic the outputs of a deeper net.
> For one thing, this seems to give better results than direct training on
> the same set. But also, more importantly, this could be done after the
> large NN has been improved with selfplay.
> And after that, the selfplay could be restarted with the new playout NN.
> So it seems to me, there is real room for improvement here.
>
> Stefan
>
> _______________________________________________
> Computer-go mailing list
> Computer-go@computer-go.org
> http://computer-go.org/mailman/listinfo/computer-go
>
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to