Hi,

 

For your clustering/grouping, what is your expectation? Do you have pre-defined 
clusters/groups that you want to cluster the items within those, or do you 
envision a system where clusters/groups will change and evolve as the data 
changes?

 

In each case, it seems you are looking for unsupervised approaches. Is that 
correct?

 

I am new to this email list, so pardon my ignorance, but from what work I have 
done in the past with IR, ML (clustering, More like this, categorization, topic 
detection etc.), my advice to you is to identify your requirements, use cases 
and page flow interactions as the first step. :)

 

Hope this helps!

Vivek.
 
> Date: Tue, 22 Jun 2010 15:50:18 -0700
> Subject: User/Items Reco Engine clustering
> From: [email protected]
> To: [email protected]
> 
> I'm looking to enhance a product recommendation engine. It currently works
> with all data as a whole. I want to introduce clustering/grouping. Its
> model based and the relationship is the common User-Items relationship.
> Originally I was thinking of using a Canopy / kmeans cluster. And then
> determine which cluster a user is in and execute Item Similarity against
> only that cluster of items. However I can't figure out how to build a
> SequenceFile using vectors with the User/Items relationship. I don't know
> which data points to feed the vector. So I scratched that idea and turned
> my attention to Lucene, thinking that this is really a document issue. Where
> users are documents and the items are the content. I should be able to ask
> Lucene, give me documents that look like this "productId3 productId9056
> productId234".
> 
> I'm looking for any and all feedback from those experienced in the
> recommendation world, specifically with the grouping of users and items.
> 
> Thanks,
> -Jay
                                          
_________________________________________________________________
The New Busy is not the old busy. Search, chat and e-mail from your inbox.
http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_3

Reply via email to