Re: stackoverflow - recursion in LuceneIterator.computeNext

2012-01-20 Thread Paul Rudin
Yes, open a JIRA here with a patch: https://issues.apache.org/jira/browse/MAHOUT OK, I put it here: https://issues.apache.org/jira/browse/MAHOUT-951 If you're writing SequenceFiles, sure, you can just write them straight to S3 by writing to a Path on s3:// -- is that something you've tried

Re: stackoverflow - recursion in LuceneIterator.computeNext

2012-01-20 Thread Sean Owen
That part is easy. On the job's Configuration, call: set("fs.s3.awsAccessKeyId", YourAccessKey); set("fs.s3.awsSecretAccessKey", YourSecretKey); If you use s3n:// URLs, do the same with the fs.s3n.* keys. I also set fs.defaultFS and fs.default.name to s3://mybucket. On Fri, Jan 20, 2012 at 9:49 AM,
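
A minimal sketch of the property names mentioned in the message above, assuming the scheme is either s3 or s3n. The class and method names are hypothetical, and the sketch only builds the key/value pairs; in a real job each pair would be passed to set(key, value) on the job's Configuration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: collects the S3 credential properties named in the message above.
// The class and method names are hypothetical; in a real job each pair would be
// passed to Configuration.set(key, value) on the job's Hadoop Configuration.
class S3CredentialProperties {

    // scheme is "s3" or "s3n", matching the URL scheme the job reads or writes.
    static Map<String, String> forScheme(String scheme, String accessKey, String secretKey) {
        if (!scheme.equals("s3") && !scheme.equals("s3n")) {
            throw new IllegalArgumentException("expected s3 or s3n, got: " + scheme);
        }
        Map<String, String> props = new LinkedHashMap<>();
        props.put("fs." + scheme + ".awsAccessKeyId", accessKey);
        props.put("fs." + scheme + ".awsSecretAccessKey", secretKey);
        return props;
    }

    public static void main(String[] args) {
        // Placeholder credentials, not real values.
        forScheme("s3n", "YourAccessKey", "YourSecretKey")
            .forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

Keeping the scheme as a parameter avoids repeating the two calls by hand when a job mixes s3:// and s3n:// paths.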

Re: Bayes classification - strange results

2012-01-20 Thread Daniel Korzekwa
After analyzing the Mahout Bayes code, I found that priors are not taken into account. Mahout just provides a different version of Naive Bayes. Today I evaluated a Java machine learning library from http://mallet.cs.umass.edu . For the trivial test data presented below, it gives the results I was
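
To illustrate why the missing priors matter, here is a minimal sketch of a naive Bayes log score with and without the class prior. It is not Mahout's or MALLET's code; the class name, probabilities, and counts are made up for illustration.

```java
// Sketch only: shows where a class prior enters a naive Bayes score.
// Not Mahout's or MALLET's code; all names and numbers are illustrative assumptions.
class NaiveBayesPriorSketch {

    // log P(class) + sum over features of count_i * log P(feature_i | class)
    static double logScore(double prior, double[] featureProbs, int[] counts) {
        double score = Math.log(prior);
        for (int i = 0; i < counts.length; i++) {
            score += counts[i] * Math.log(featureProbs[i]);
        }
        return score;
    }

    // Same score with the prior term dropped, which implies a uniform prior.
    static double logScoreWithoutPrior(double[] featureProbs, int[] counts) {
        return logScore(1.0, featureProbs, counts);
    }

    public static void main(String[] args) {
        double[] probsA = {0.4, 0.6};   // P(feature | A), hypothetical
        double[] probsB = {0.6, 0.4};   // P(feature | B), hypothetical
        int[] doc = {1, 0};             // the document contains feature 0 once

        boolean aWinsWithPrior =
            logScore(0.9, probsA, doc) > logScore(0.1, probsB, doc);
        boolean aWinsWithoutPrior =
            logScoreWithoutPrior(probsA, doc) > logScoreWithoutPrior(probsB, doc);

        System.out.println("A wins with prior:    " + aWinsWithPrior);
        System.out.println("A wins without prior: " + aWinsWithoutPrior);
    }
}
```

With a strongly skewed prior (0.9 versus 0.1 in this made-up example) the predicted class flips once the prior is dropped, which is the kind of difference that shows up on small, unbalanced test data.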

Re: How to present mahout cluster in combination with Solr results

2012-01-20 Thread Vikas Pandya
From the example below, Solr search results should be clustered in the following way: list all the items which have matching RiskLevels, e.g.
Cluster 1:
Title          RiskLevel1          RiskLevel2         RiskLevel3
abc            High                Medium             Low
xyz

Re: How to present mahout cluster in combination with Solr results

2012-01-20 Thread Frank Scholten
On Fri, Jan 20, 2012 at 4:01 PM, Vikas Pandya vika...@yahoo.com wrote: From the example below, Solr search results should be clustered in the following way: list all the items which have matching RiskLevels, e.g.
Cluster 1:
Title          RiskLevel1          RiskLevel2         RiskLevel3

Re: Question on RowSimilarityJob

2012-01-20 Thread Sebastian Schelter
Hi, 'maxSimilaritiesPerRow' denotes the maximum number of similar rows (documents in your use case) to keep per document. 'excludeSelfSimilarity' means that rows (documents) should not be compared to themselves. Sorry for the lack of documentation, RowSimilarityJob was originally only an internal
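
The meaning of the two options can be sketched on a tiny in-memory matrix. This is not RowSimilarityJob's implementation: the class name, the use of cosine similarity, and the method signature are assumptions chosen only to show what each option does.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Sketch only: illustrates the two options described above on a small dense matrix.
// This is not RowSimilarityJob's code; cosine similarity and all names are assumptions.
class RowSimilaritySketch {

    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return (normA == 0 || normB == 0) ? 0 : dot / Math.sqrt(normA * normB);
    }

    // Returns the indices of the rows most similar to the given row, best first.
    static List<Integer> mostSimilar(double[][] rows, int row,
                                     int maxSimilaritiesPerRow,
                                     boolean excludeSelfSimilarity) {
        List<Integer> candidates = new ArrayList<>();
        for (int other = 0; other < rows.length; other++) {
            if (excludeSelfSimilarity && other == row) {
                continue; // do not compare a row to itself
            }
            candidates.add(other);
        }
        // Sort by descending similarity to the given row.
        candidates.sort(Comparator.comparingDouble(
            (Integer other) -> -cosine(rows[row], rows[other])));
        // Keep at most maxSimilaritiesPerRow entries.
        int keep = Math.min(maxSimilaritiesPerRow, candidates.size());
        return new ArrayList<>(candidates.subList(0, keep));
    }

    public static void main(String[] args) {
        double[][] rows = {{1, 0}, {1, 0.1}, {0, 1}};
        System.out.println(mostSimilar(rows, 0, 1, true));
        System.out.println(mostSimilar(rows, 0, 2, false));
    }
}
```

When excludeSelfSimilarity is false, a row is always its own best match under cosine similarity, so it takes up one of the maxSimilaritiesPerRow slots.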

Mahout Taste Deployment On Hadoop

2012-01-20 Thread jams gost
Hello Mahout Users, I have developed a Mahout Taste recommender on my system by following Mahout in Action. Now that I have mahoutapp-1.0-SNAPSHOT.jar, how can I execute it on Hadoop 0.19? Has anyone tried this? Thanks, Jams

Re: Mahout Taste Deployment On Hadoop

2012-01-20 Thread Ted Dunning
I doubt if it will work on Hadoop 0.19. Mahout requires 0.20 and pretty much always has. Changing that will be difficult to check even if it isn't difficult to do. In any case, you should probably get off of 0.19 as soon as possible as well since there are known stability problems with that