Re: Website, urgent help needed

2014-03-12 Thread Juan José Ramos
Hi Sebastian, I am afraid I am only familiar with the recommendation part. In previous posts, I pointed out a couple of errors in this wiki page: https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line If you are planning to keep it in the new

Re: Recommend items not rated by any user

2014-03-05 Thread Juan José Ramos
José Ramos jjar...@gmail.com wrote: First thing is that I know this requirement would not make sense in a CF Recommender. In my case, I am trying to use Mahout to create something closer to a Content-Based Recommender. In particular, I am pre-computing a similarity matrix between all

Re: Recommend items not rated by any user

2014-03-05 Thread Juan José Ramos
interface to add a parameter that says whether to keep or remove the current user's items? We could even do this in the abstract base class then. --sebastian On 03/05/2014 10:42 AM, Juan José Ramos wrote: In case somebody runs into the same situation, the key seems

Re: Recommend items not rated by any user

2014-03-05 Thread Juan José Ramos
-NaN similarity value that is with at least one of the items preferred by the user. Tevfik On Wed, Mar 5, 2014 at 2:30 PM, Sebastian Schelter s...@apache.org wrote: On 03/05/2014 01:23 PM, Juan José Ramos wrote: Thanks for the reply, Sebastian. I am not sure if that should

Re: Recommend items not rated by any user

2014-03-05 Thread Juan José Ramos
as the query on the indexed articles and Solr will return articles ranked by similarity. This is a content based recommender based solely on Solr. Does this describe your situation? On Mar 4, 2014, at 1:16 AM, Juan José Ramos jjar...@gmail.com wrote: First thing is that I know this requirement

Re: Recommend items not rated by any user

2014-03-05 Thread Juan José Ramos
and if the similarity value between X and at least one of the items rated (preferred) by the user is not NaN, then X will not be returned by AllSimilarItemsCandidateItemsStrategy, but it will be returned by AllUnknownItemsCandidateItemsStrategy. On Wed, Mar 5, 2014 at 4:42 PM, Juan José Ramos

Re: Issue updating a FileDataModel

2014-03-04 Thread Juan José Ramos
between the time of the call to refresh() and the last modified time of the file. --sebastian On 03/03/2014 04:45 PM, Juan José Ramos wrote: Thanks for the reply, Sebastian. I do not have concurrent updates, but they actually may happen very, very close in time. Would the fact of adding

Re: Issue updating a FileDataModel

2014-03-03 Thread Juan José Ramos
directly append new data to the original file. If you want to have a DataModel that can be concurrently updated, I suggest moving your data to a database. --sebastian On 03/02/2014 11:11 PM, Juan José Ramos wrote: I am having issues refreshing my recommender, in particular with the DataModel. I am

Re: how to recommend users already consumed items

2014-03-02 Thread Juan José Ramos
I am using a GenericItemBasedRecommender and have faced the same problem. In my case the items are news and the user does not provide an explicit rating, but the system infers one based on the actions he/she takes: reading, sharing, favouriting... The approach I have followed has

Issue updating a FileDataModel

2014-03-02 Thread Juan José Ramos
I am having issues refreshing my recommender, in particular with the DataModel. I am using a FileDataModel and a GenericItemBasedRecommender that also has a CachingItemSimilarity wrapping a FileItemSimilarity. But for the test I am running I am making things even simpler. By the time I

Wiki - 'Quick tour of text analysis using the Mahout command line' clarification

2014-02-25 Thread Juan José Ramos
In the wiki page: 'Quick tour of text analysis using the Mahout command line'. https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line At the very bottom it is said that 1. This will generate the 10 most similar docs to each doc in the

Re: Wiki - 'Quick tour of text analysis using the Mahout command line' clarification

2014-02-25 Thread Juan José Ramos
On Feb 25, 2014, at 9:22 AM, Juan José Ramos jjar...@gmail.com wrote: In the wiki page: 'Quick tour of text analysis using the Mahout command line'. https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line At the very bottom

Load output of rowsimilarity to memory

2014-02-24 Thread Juan José Ramos
Is there a way to reproduce this process: https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line inside Java code and not using the command line tool? I am not interested in the clustering part but in 'Calculate several similar docs to each

Re: Load output of rowsimilarity to memory

2014-02-24 Thread Juan José Ramos
can be loaded by the FileItemSimilarity. --sebastian On 02/24/2014 08:31 PM, Juan José Ramos wrote: Is there a way to reproduce this process: https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line inside Java code and not using

Re: Load output of rowsimilarity to memory

2014-02-24 Thread Juan José Ramos
with FileItemSimilarity. --sebastian On 02/24/2014 09:30 PM, Juan José Ramos wrote: Thanks for the prompt reply. RowSimilarityJob produces an output in the form of: Key: 0: Value: {61112:0.21139380179557016,52144:0.23797846026935565,...} whereas FileItemSimilarity is expecting a comma or tab
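
The format mismatch described in the last snippet could be bridged with a small conversion step. What follows is only a sketch, not the approach suggested in the thread: it assumes the RowSimilarityJob output has already been dumped to text, one record per line, in exactly the "Key: ...: Value: {...}" shape quoted above, and it writes the comma-separated "itemID1,itemID2,similarity" lines that FileItemSimilarity reads. The function name convert_line is hypothetical. Note also that the keys may be row indices rather than the original item IDs, in which case they would need to be mapped back first.

```python
import re

# Matches one dumped record, e.g. "Key: 0: Value: {61112:0.21,52144:0.23}".
# The exact dump layout is an assumption taken from the snippet above.
LINE_RE = re.compile(r"^Key:\s*(\d+):\s*Value:\s*\{(.*)\}\s*$")


def convert_line(line):
    """Turn one dumped record into 'itemID1,itemID2,similarity' lines.

    Returns an empty list when the line is not a record.
    """
    match = LINE_RE.match(line.strip())
    if match is None:
        return []
    row_id, body = match.group(1), match.group(2)
    rows = []
    for entry in body.split(","):
        entry = entry.strip()
        # Skip empty or malformed entries (anything without "id:value").
        if ":" not in entry:
            continue
        other_id, value = entry.split(":", 1)
        other_id, value = other_id.strip(), value.strip()
        float(value)  # raises ValueError if the similarity is not numeric
        # Keep the similarity text as-is so no precision is lost.
        rows.append(f"{row_id},{other_id},{value}")
    return rows
```

Applying convert_line to every line of the dump and writing the returned rows to a file should give something FileItemSimilarity can load, provided the assumptions above hold for the actual output.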