Inline
On Sun, Jul 21, 2013 at 3:31 PM, B Lyon <bradfl...@gmail.com> wrote: > Paper and presentation are very interesting to me as well. I am fairly new > to this, and coming to terms with some of the terms, etc. I assume that > "action matrix" here is just the raw matrix of how each user has > "interacted with" the items/types-of-items. Yes. Also called the history matrix. For graph problems, this is the adjacency matrix. > I didn't quite get the > incorporation into SOLR (not familiar with that much, either), in > particular the "indexing" related to the generated (root LLR-based?) > co-occurence matrices for the different types of things so that it can be > used in searches - so, a real newbie question: how can the co-occurence > matrix be implemented as a search index in SOLR? Just point me at the RTFM > docs is fine :) > The idea is that the rows of the sparse item-item matrix after processing by the RowSimilarityJob can be fields in the corresponding item documents in Solr. This works because the rows are quite sparse. The contents of these fields is not text, but rather a list of items. If you have different kinds of actions for the same items or if you have different sets of items, you can compute the indicators for each separately. Then you can store each row from each of the item-item matrices in a different field in Solr. Recommendation consists of using a set of items as a query against one of these fields. If the recent history has multiple kinds of items or multiple kinds of actions, then the query can refer to multiple fields. > On Sun, Jul 21, 2013 at 5:17 PM, Pat Ferrel <pat.fer...@gmail.com> wrote: > > > Read the paper, and the preso. > > > > As to the 'offline to Solr' part. It sounds like you are suggesting an > > item item similarity matrix be stored and indexed in Solr. One would have > > to create the action matrix from user profile data (preference history), > do > > a rowsimiarity job on it (using LLR similarity) and move the result to > > Solr. The first part of this is nearly identical to the current > recommender > > job workflow and could pretty easily be created from it I think. The new > > part is taking the DistributedRowMatrix and storing it in a particular > way > > in Solr, right? > > > > BTW Is there some reason not to use an existing real data set? > > > > On Jul 19, 2013, at 3:45 PM, Ted Dunning <ted.dunn...@gmail.com> wrote: > > > > OK. I think the crux here is the off-line to Solr part so let's see who > > else pops up. > > > > Having a solr maven could be very helpful. > > > > > > > > > -- > BF Lyon > http://www.nowherenearithaca.com >