On 30-nov-08, at 16:51, Jason House wrote:
You've claimed to be non-statistical, so I'm hoping the following
is useful... You can compute the likelihood that you made an
improvement as:
Phi(# of standard deviations)
Where Phi is the normal CDF and # of standard deviations =
(win rate - 0.5) / (0.5 / sqrt(#games))
Phi has no closed-form expression, and in practice people use lookup
tables to translate between standard deviations and confidence
levels. More commonly, people set a goal confidence and translate it
directly into a number of standard deviations (3.0 for 99.87%). This
situation calls for a one-tailed test.
After about 20 or 30 games, this approximation is accurate and can
be used for early termination of your test.
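The recipe above can be sketched in a few lines. This is a minimal illustration, assuming each game is an independent coin flip so that the standard error of the observed win rate under the no-improvement hypothesis is 0.5/sqrt(#games); the function name is mine, not anything from the thread:

```python
import math

def improvement_confidence(wins, games):
    """One-tailed confidence that the true win rate exceeds 0.5.

    Assumes games are independent; under the null hypothesis of no
    improvement the standard error of the win rate is 0.5/sqrt(games).
    """
    win_rate = wins / games
    std_err = 0.5 / math.sqrt(games)
    z = (win_rate - 0.5) / std_err
    # Normal CDF expressed via erf: Phi(z) = (1 + erf(z/sqrt(2))) / 2
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
```

For example, 65 wins out of 100 games puts you 3 standard deviations above 0.5, which maps to roughly 99.87% confidence, while 50 out of 100 is exactly 50%: no evidence either way.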
Lately I use twogtp for my test runs. It computes the winning
percentage and puts a ± value after it in parentheses. Is that the
value of one standard deviation? (I had always assumed so.) Even
after 1,000 games it stays in the 1.5% neighbourhood.
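I can't confirm from the thread what twogtp actually reports, but the ~1.5% figure is consistent with one standard deviation of a binomial win rate. A quick check, assuming the usual sqrt(p(1-p)/n) formula:

```python
import math

def win_rate_std_err(p, n):
    """Standard error of an observed win rate p over n games,
    treating each game as an independent Bernoulli trial."""
    return math.sqrt(p * (1.0 - p) / n)

# At p = 0.5 and n = 1000 this is about 0.0158, i.e. ~1.58%,
# which matches the ~1.5% value quoted above.
print(win_rate_std_err(0.5, 1000))
```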
Maybe the approximation is usually accurate after 20-30 games. But
if you run tests often, you'll occasionally bump into that unlikely
event where what you thought was a big improvement turns out to be
no improvement at all, or the other way around. Only when I see 20+
games with a zero winning percentage do I stop a run, assuming I
made a mistake.
Mark
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/