I don't recall that it has ever caused a problem, no. The values are
just keys in a hashtable, so don't need to be sequential.

On Fri, Jul 6, 2012 at 8:26 PM, Dan Brickley <dan...@danbri.org> wrote:
> I recall having problems with this before, using the non-Mahout Taste
> code. I have meaningful strings for content IDs and had mapped them
> systematically to pseudo-meaningful (but non-sequential) numbers. I
> remember that causing some problems a year or so back, ... but I'm
> trying it again now with the itemsimilarity Hadoop job. If I need to
> iterate through all rows in the log and generate consecutive counts to
> identify items and users I guess I could, though it doesn't seem very
> Hadoop-friendly. Or should I be OK with anything that's int-shaped?
>
> Dan

Reply via email to