subject:"Finding best NearestNUserNeighborhood size"

Re: Finding best NearestNUserNeighborhood size

2013-01-23 Thread Sean Owen

That is good for making a test repeatable because you are picking the same random sample repeatedly. For evaluation purposes here that's not a good thing and you do want several actually different samples of the result. On Jan 23, 2013 1:19 PM, "Stevo Slavić" wrote: > When evaluating recommender

Re: Finding best NearestNUserNeighborhood size

2013-01-23 Thread Sean Owen

The stochastic nature of the evaluation means your results will vary randomly from run to run. This looks to my eyeballs like most of the variation you see. You probably want to average over many runs. You will probably find that accuracy peaks around some neighborhood size: adding more useful nei

Re: Finding best NearestNUserNeighborhood size

2013-01-23 Thread Stevo Slavić

When evaluating recommender before running evaluator put RandomUtils.useTestSeed(); to make splitting of data set consistent; don't use it in production, just for evaluation. This is all explained more thoroughly in Mahout in Action book. Kind regards, Stevo Slavic. On Wed, Jan 23, 2013 at 2:0