Yes, open a JIRA issue here and attach a patch:
https://issues.apache.org/jira/browse/MAHOUT
OK - I put it here:
https://issues.apache.org/jira/browse/MAHOUT-951
If you're writing SequenceFiles, sure, you can just write them straight
to S3 by writing to a Path on s3:// -- is that something you've tried
That part is easy. On the job's Configuration, call:
set("fs.s3.awsAccessKeyId", YourAccessKey);
set("fs.s3.awsSecretAccessKey", YourSecretKey);
If you use s3n:// URLs, do the same with the fs.s3n.* properties.
I also set fs.defaultFS and fs.default.name to s3://mybucket.
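
A minimal sketch of the property names involved, in case it helps. The class and method names (S3CredentialProps, credentialProps) are made up for illustration and are not part of Hadoop or Mahout. The sketch only builds the key/value pairs, so it runs without Hadoop on the classpath; applying them to the job's Configuration is shown in a comment.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class S3CredentialProps {

    /**
     * Builds the two credential properties for the given URL scheme
     * ("s3" or "s3n"). Hypothetical helper, for illustration only.
     */
    public static Map<String, String> credentialProps(String scheme,
                                                      String accessKey,
                                                      String secretKey) {
        if (!scheme.equals("s3") && !scheme.equals("s3n")) {
            throw new IllegalArgumentException("unsupported scheme: " + scheme);
        }
        Map<String, String> props = new LinkedHashMap<>();
        props.put("fs." + scheme + ".awsAccessKeyId", accessKey);
        props.put("fs." + scheme + ".awsSecretAccessKey", secretKey);
        return props;
    }

    public static void main(String[] args) {
        // Placeholder credentials; substitute your own.
        Map<String, String> props =
            credentialProps("s3n", "YourAccessKey", "YourSecretKey");

        // With Hadoop available, each entry would be applied with
        // Configuration.set(String, String), e.g.:
        //   for (Map.Entry<String, String> e : props.entrySet()) {
        //       conf.set(e.getKey(), e.getValue());
        //   }
        props.forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```

The scheme in the property name has to match the scheme in the URLs you read and write, so a job that uses s3n:// paths needs the fs.s3n.* pair.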
On Fri, Jan 20, 2012 at 9:49 AM,
After analyzing the Mahout Bayes code, I found that priors are not taken into
account. Mahout just provides a different variant of Naive Bayes. Today
I evaluated MALLET, a Java machine learning library from http://mallet.cs.umass.edu.
For the trivial test data presented below, it gives the results I was
From the example below, Solr search results should be clustered in the
following way:
list all the items that have matching RiskLevels, e.g.
Cluster 1:
Title  RiskLevel1  RiskLevel2  RiskLevel3
abc    High        Medium      Low
xyz
On Fri, Jan 20, 2012 at 4:01 PM, Vikas Pandya vika...@yahoo.com wrote:
From the example below, Solr search results should be clustered in the
following way:
list all the items that have matching RiskLevels, e.g.
Cluster 1:
Title RiskLevel1 RiskLevel2 RiskLevel3
Hi,
'maxSimilaritiesPerRow' denotes the maximum number of similar rows
(documents in your use case) to keep per document.
'excludeSelfSimilarity' means that rows (documents) should not be
compared to themselves.
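
To make the two options concrete, here is a small in-memory sketch of what they mean. It is not the RowSimilarityJob implementation: the class and method names are invented for illustration, the rows are plain dense arrays, and cosine similarity is assumed as the measure.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class RowSimilaritySketch {

    /** Cosine similarity of two equal-length dense vectors. */
    public static double cosine(double[] a, double[] b) {
        double dot = 0, na = 0, nb = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            na += a[i] * a[i];
            nb += b[i] * b[i];
        }
        return (na == 0 || nb == 0) ? 0 : dot / Math.sqrt(na * nb);
    }

    /**
     * For each row, returns the indices of its most similar rows, best first.
     * At most maxSimilaritiesPerRow indices are kept per row; when
     * excludeSelfSimilarity is true, a row is never compared to itself.
     */
    public static List<List<Integer>> topSimilar(double[][] rows,
                                                 int maxSimilaritiesPerRow,
                                                 boolean excludeSelfSimilarity) {
        List<List<Integer>> result = new ArrayList<>();
        for (int i = 0; i < rows.length; i++) {
            final double[] row = rows[i];
            List<Integer> candidates = new ArrayList<>();
            for (int j = 0; j < rows.length; j++) {
                if (excludeSelfSimilarity && i == j) {
                    continue; // skip the row itself
                }
                candidates.add(j);
            }
            // Sort by similarity to the current row, highest first.
            candidates.sort(Comparator
                .comparingDouble((Integer j) -> cosine(row, rows[j]))
                .reversed());
            int keep = Math.min(maxSimilaritiesPerRow, candidates.size());
            result.add(new ArrayList<>(candidates.subList(0, keep)));
        }
        return result;
    }

    public static void main(String[] args) {
        double[][] docs = { {1, 0}, {1, 0.1}, {0, 1} };
        System.out.println(topSimilar(docs, 1, true));
    }
}
```

With excludeSelfSimilarity set to false, each document's own index would normally come first in its list, since a non-zero vector has cosine similarity 1 with itself.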
Sorry for the lack of documentation; RowSimilarityJob was originally only
an internal
Hello Mahout Users,
I have developed a Mahout Taste recommender on my system by following
Mahout in Action. Now that I have built mahoutapp-1.0-SNAPSHOT.jar, how
can I run it on Hadoop 0.19? Has anyone tried this?
Thanks
Jams
I doubt it will work on Hadoop 0.19. Mahout requires 0.20 and pretty
much always has. Changing that will be difficult to verify even if it isn't
difficult to do.
In any case, you should probably get off 0.19 as soon as possible as
well, since there are known stability problems with that