Hi Sebastian,
I am afraid I am only familiar with the recommendation part.
In previous posts, I pointed a couple of errors in this wiki page:
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
If you are planning to keep it in the new
José Ramos jjar...@gmail.com wrote:
First thing is thatI know this requirement would not make sense in a CF
Recommender. In my case, I am trying to use Mahout to create something
closer to a Content-Based Recommender.
In particular, I am pre-computing a similarity matrix between all
interface to add a
parameter that says whether to keep or remove the current users items?
We could even do this in the abstract base class then.
--sebastian
On 03/05/2014 10:42 AM, Juan José Ramos wrote:
In case somebody runs into the same situation, the key seems
-NaN similarity value that is with at
least one of the items preferred by the user.
Tevfik
On Wed, Mar 5, 2014 at 2:30 PM, Sebastian Schelter s...@apache.org
wrote:
On 03/05/2014 01:23 PM, Juan José Ramos wrote:
Thanks for the reply, Sebastian.
I am not sure if that should
as the query on
the indexed articles and Solr will return articles ranked by similarity.
This is a content based recommender based solely on Solr.
Does this describe your situation?
On Mar 4, 2014, at 1:16 AM, Juan José Ramos jjar...@gmail.com wrote:
First thing is thatI know this requirement
and if the similarity value between X and at least one of
the items rated (preferred) by the user is not NaN, then X will be not
be returned by AllSimilarItemsCandidateItemsStrategy, but it will be
returned by AllUnknownItemsCandidateItemsStrategy.
On Wed, Mar 5, 2014 at 4:42 PM, Juan José Ramos
between the time of the call to
refresh() and the last modified time of the file.
--sebastian
On 03/03/2014 04:45 PM, Juan José Ramos wrote:
Thanks for the reply, Sebastian.
I do not have concurrent updates, but they actually may happen very, very
close in time.
Would the fact of adding
directly append new data to the original file.
If you want a to have a DataModel that can be concurrently updated, I
suggest your data to a database.
--sebastian
On 03/02/2014 11:11 PM, Juan José Ramos wrote:
I am having issues refreshing my recommender, in particular with the
DataModel.
I am
I am using a GenericItemBasedRecommender and have faced the same problem.
In my case the items are news and the user does not provide an explicit
rating but the system would infer one for him/her based on the actions he
makes: Reading, sharing, favouriting...
The approach I have followed has
I am having issues refreshing my recommender, in particular with the
DataModel.
I am using a FileDataModel and a GenericItemBasedRecommender that also has
a CachingItemSimilarity wrapping a FileItemSimilarity. But for the test I
am running I am making things even simpler.
By the time I
In the wiki page: 'Quick tour of text analysis using the Mahout command
line'.
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
At the very bottom it is said that
1. This will generate the 10 most similar docs to each doc in the
On Feb 25, 2014, at 9:22 AM, Juan José Ramos jjar...@gmail.com wrote:
In the wiki page: 'Quick tour of text analysis using the Mahout command
line'.
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
At the very bottom
Is there a way to reproduce this process:
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
inside Java code and not using the command line tool? I am not interested
in the clustering part but in 'Calculate several similar docs to each
can be loaded by the FileItemSimilarity.
--sebastian
On 02/24/2014 08:31 PM, Juan José Ramos wrote:
Is there a way to reproduce this process:
https://cwiki.apache.org/confluence/display/MAHOUT/
Quick+tour+of+text+analysis+using+the+Mahout+command+line
inside Java code and not using
with FileItemSimilarity.
--sebastian
On 02/24/2014 09:30 PM, Juan José Ramos wrote:
Thanks for the prompt reply.
RowSimilarityJob produces an output in the form of:
Key: 0: Value: {61112:0.21139380179557016,52144:0.23797846026935565,...}
whereas FileItemSimilarity is expecting a comma or tab
15 matches
Mail list logo