Hi Gian-Carlo,
I am replying to the list, because the answer might interest other
readers.
On 05/24/2010 09:50 PM, Gian-Carlo Pascutto wrote:
> I have 2 based on a quick look:
> 1) What does one do with draws? I could continue matches until there is
> a winner, but I suspect this is suboptimal wrt the optimization procedure?
Right now, QLR does not support draws. That will probably come in the
next version. It should take no more than half a day of programming and
testing.
There are also some important features that are not supported yet, and
that could be useful in the future:
- Allow replications at one point in parameter space. Right now, QLR
will only ask for one game result per point. But a tester might wish to
play two games from a random opening, one with White and the other with
Black, or 2N games against N opponents.
- Integer parameters. Right now, all parameters are assumed to be
continuous. Of course, you can round them in your program, but QLR
would behave better if it were aware of the rounding.
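As a minimal sketch of that workaround (the parameter names below are
hypothetical, purely for illustration): the connection script receives
continuous values from the optimizer and rounds the integer-valued ones
before passing them to the engine.

```python
def to_engine_params(raw, integer_params=("futility_margin", "lmr_depth")):
    """Round the parameters the engine expects as integers;
    pass the genuinely continuous ones through unchanged."""
    return {name: (int(round(value)) if name in integer_params else value)
            for name, value in raw.items()}

params = to_engine_params({"futility_margin": 119.7, "king_safety": 0.83})
print(params)  # {'futility_margin': 120, 'king_safety': 0.83}
```

The drawback the paragraph above points out remains: the optimizer still
models the parameter as continuous, so it cannot exploit the fact that
119.7 and 120.2 produce identical engine behavior.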
> 2) Do you have any advice regarding high-dimensional problems? I have a
> few hundred parameters to tune. In theory your program can handle all at
> once, if I get it correctly. In practice, surely there are some tradeoffs?
A good automatic parameter optimizer should always perform better when
optimizing all parameters together than when optimizing them one by one
(or a few at a time). It is my objective to achieve this property, but
QLR probably does not have it yet, unfortunately.
QLR does a quadratic regression. That is to say, it fits a model to the
data. With N parameters to be optimized, this model has (N+1)*(N+2)/2
coefficients. For large N, that is a lot of coefficients, so it requires
strong regularization. My plan for the future is to have a strong prior
on the covariance terms. That is to say, assume parameters are
independent unless the data proves there is a correlation. This should
allow direct optimization of really many parameters.
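To make the (N+1)*(N+2)/2 count concrete, here is a small sketch (not
QLR's actual code) of the full quadratic feature expansion that a
quadratic model fits a coefficient to: one constant term, N linear
terms, and N*(N+1)/2 products of pairs of parameters.

```python
from itertools import combinations_with_replacement

def quadratic_features(x):
    """Constant + linear + all quadratic terms of the parameter vector x."""
    feats = [1.0]                      # 1 constant term
    feats += list(x)                   # N linear terms
    feats += [x[i] * x[j]              # N*(N+1)/2 quadratic terms
              for i, j in combinations_with_replacement(range(len(x)), 2)]
    return feats

for n in (2, 10, 300):
    m = len(quadratic_features([0.0] * n))
    assert m == (n + 1) * (n + 2) // 2
    print(n, m)   # 2 -> 6, 10 -> 66, 300 -> 45451
```

For a few hundred tuned parameters, the model already has tens of
thousands of coefficients, which is why strong regularization (such as
the independence prior on covariance terms mentioned above) matters.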
Right now, my regression does not have strong regularization: the prior
over the regression coefficients is a Gaussian with high variance. In
practice, I have tested that it works in dimension 10, on artificial
problems. QLR makes no miracles, anyway. If you wish to optimize
hundreds of parameters, you will have to play millions of games.
I suggest you try to optimize a few parameters first. And if you have
prior intuition that some parameters tend to be independent of others,
then it is better to optimize them separately. For instance, move
ordering and position evaluation could probably be optimized separately
without losing much.
A feature I will add in the future is the ability to re-use data from an
optimization of a subset of the parameters. For instance, you start by
optimizing the bishop and knight values with rook=5; then you can use
the data you collected to optimize bishop, knight, and rook together.
Another important feature QLR is missing is the ability to use reference
values for parameters. You surely have good guesses for all your
parameters, and giving them to QLR would help. Right now, you only give
QLR an interval, and it might waste some time in the beginning
discovering reasonable values. Seeding QLR with a few hundred games at
your current reference values would help it a lot in the beginning.
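The seeding idea above can be sketched as follows. QLR has no such API
yet, so the function and names here are hypothetical: the point is just
to generate a batch of parameter vectors at (or slightly around) your
reference values and play the first games there, before the optimizer
starts exploring the full intervals.

```python
import random

def seed_points(reference, n_games, jitter=0.0):
    """Generate n_games parameter vectors at the reference values,
    optionally jittered to cover the immediate neighbourhood."""
    return [[v + random.uniform(-jitter, jitter) for v in reference]
            for _ in range(n_games)]

bishop_knight_rook = [3.0, 3.0, 5.0]   # your current best guesses
batch = seed_points(bishop_knight_rook, n_games=200)
```

The game results from such a batch would give the regression a reliable
estimate of the winning rate near the known-good point, instead of
spending the first games on implausible corners of the intervals.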
So QLR is just at a starting point now. It will improve in the future.
But you can start writing a connection script for your program right
now: it will be useful soon.
Rémi
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go