Re: Finding best NearestNUserNeighborhood size

Sean Owen Wed, 23 Jan 2013 05:22:56 -0800

That is good for making a test repeatable because you are picking the same
random sample repeatedly. For evaluation purposes here that's not a good
thing and you do want several actually different samples of the result.
On Jan 23, 2013 1:19 PM, "Stevo Slavić" <ssla...@gmail.com> wrote:


> When evaluating recommender before running evaluator put
>
> RandomUtils.useTestSeed();
>
> to make splitting of data set consistent; don't use it in production, just
> for evaluation.
> This is all explained more thoroughly in Mahout in Action book.
>
> Kind regards,
> Stevo Slavic.
>
>
> On Wed, Jan 23, 2013 at 2:01 PM, Zia mel <ziad.kame...@gmail.com> wrote:
>
> > Hi
> > I used NearestNUserNeighborhood with RMSE in a user recommender that
> > use PearsonCorrelationSimilarity , I found that changing the
> > neighborhood size has no clear pattern or effect. Sometimes it
> > increase others decrease. While using the neighborhood size with
> > precision has a better pattern. Any reason? Another point is that the
> > RMSE change for every run since it choose different sample , so would
> > running the code for 10 or 20 times and taking the average be a good
> > idea or there is better thing to do?
> >
> > //-- RUN 1
> >  2,  0.5523623146152608
> >  3,  0.5425283201773704
> >  4,  0.669846658662311
> >  5,  0.5956616542334392
> >  6,  0.6033911039809353
> >  7,  0.6135206544496685
> >  8,  0.5740444208649034
> >  9,  0.642798288443049
> >  10,  0.6266535555651472
> >
> > //-- RUN 2
> >  2,  0.5415411343523825
> >  3,  0.6784589323396696
> >  4,  0.6347069968141124
> >  5,  0.6968820296725008
> >  6,  0.5953849874479478
> >  7,  0.6791828191904128
> >  8,  0.6072462830257853
> >  9,  0.6461346217476011
> >  10,  0.6043919119341171
> >
> > Thanks !
> >
>

Re: Finding best NearestNUserNeighborhood size

Reply via email to