Re: ItemSimilarityJob's results differ from non-distributed version

Sebastian Schelter Thu, 24 Nov 2011 23:04:36 -0800

Hi Greg,

You should get the same results. Can you describe exactly how you
converted the dataset? I'd like to try this myself; maybe you found a
subtle bug.
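
One way to narrow this down is to diff the two outputs and list the item
pairs that only the non-distributed run produced. The following is a minimal
sketch, not Mahout code. It assumes both runs were exported as text lines of
the form itemA<TAB>itemB<TAB>similarity; that layout and the names
load_pairs and missing_pairs are hypothetical and chosen for illustration.

```python
def load_pairs(lines):
    """Parse 'itemA<TAB>itemB<TAB>similarity' lines into {(low, high): value}.

    The tab-separated layout is an assumption made for this sketch.
    Pairs are stored with the smaller item ID first so that (1, 2) and
    (2, 1) count as the same pair.
    """
    pairs = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        a, b, sim = line.split("\t")
        key = (min(int(a), int(b)), max(int(a), int(b)))
        pairs[key] = float(sim)
    return pairs


def missing_pairs(reference, candidate):
    """Return the pairs present in `reference` but absent from `candidate`."""
    return sorted(set(reference) - set(candidate))


# Toy data standing in for the two outputs.
local = load_pairs(["1\t2\t0.9", "1\t3\t0.4", "2\t3\t0.7"])
distributed = load_pairs(["2\t1\t0.9", "2\t3\t0.7"])
print(missing_pairs(local, distributed))  # [(1, 3)]
```

Looking at which users rated both items of a missing pair may show whether
sampling or a pruning step dropped it.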


I also have doubts whether taking the precision of the top 5 recommended
items is really a good quality measure.
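
For reference, precision at 5 for a single user can be sketched as below.
This is a minimal illustration of the metric under its usual definition, not
the evaluator used in this thread; the function name and the toy item IDs
are hypothetical.

```python
def precision_at_k(recommended, relevant, k=5):
    """Fraction of the top-k recommended items that are in `relevant`.

    `recommended` is a ranked list of item IDs, best first.
    `relevant` is the set of held-out items the user actually liked.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = recommended[:k]
    hits = sum(1 for item in top_k if item in relevant)
    return hits / k


# Two of the five recommended items are relevant.
print(precision_at_k([10, 20, 30, 40, 50], {20, 50, 99}))  # 0.4
```

With k = 5, a single user's score can take only six values (0, 0.2, ..., 1.0),
so one item entering or leaving the list moves it by 0.2.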

--sebastian

On 25.11.2011 02:41, Greg H wrote:
> Thanks for the replies Sebastian and Sean. I looked at the similarity
> values and they are the same, but ItemSimilarityJob is calculating fewer of
> them. So it must still be doing some sort of sampling. I thought that I
> could force it to use all of the data by setting maxPrefsPerUser
> sufficiently large. Could there be another reason for it not to calculate
> all of the similarity values?
> 
> I also tried a smaller value for similarItemsPerItem, but this leads
> to worse results.
> 
> Thanks again,
> Greg
>
