Re: Problems installing Mahout

2010-04-06 Thread Sean Owen
I'm ready to patch this issue, but I went the other way -- fixed the output to use Locale.ENGLISH. Either way works, what's preferred to you guys? Is it making the output deterministic, or locale-friendly? I opted for fixing it to Locale.ENGLISH because I like not depending on the platform, and

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Sean Owen
Weak, surely my changes that did it but I don't know why I didn't see this in a local build / test. On Tue, Apr 6, 2010 at 10:41 AM, Apache Hudson Server hud...@hudson.zones.apache.org wrote: See http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/584/changes Changes: [srowen]

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Robin Anil
Running org.apache.mahout.cf.taste.hadoop.similarity.item.ItemSimilarityTest Tests run: 7, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.039 sec FAILURE! On Tue, Apr 6, 2010 at 3:13 PM, Sean Owen sro...@gmail.com wrote: Weak, surely my changes that did it but I don't know why I didn't

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Sean Owen
I see all tests pass in a full clean / test. :( I will look at Hudson's output to see why it think it failed. On Tue, Apr 6, 2010 at 10:48 AM, Robin Anil robin.a...@gmail.com wrote: I have tasted this before, That was when I didn't do a clean install before checking in.

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Sean Owen
I can't reproduce this at all and don't see how to get details out of Hudson. Does anyone know where it sticks test output? or can anyone repro this? On Tue, Apr 6, 2010 at 10:58 AM, Sean Owen sro...@gmail.com wrote: I see all tests pass in a full clean / test. :( I will look at Hudson's output

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Sean Owen
That must be it. I had removed the '/' earlier since on OS X the temp dir path ends with '/', and at the time I believed it was the cause of some other failures (which I'm guessing I was wrong about). I can easily make the logic account for both cases. Sean On Tue, Apr 6, 2010 at 11:24 AM,

Re: Build failed in Hudson: Mahout Trunk #584

2010-04-06 Thread Sebastian Schelter
Hi Sean, I think I saw another potential problem, lines 233 to 237 should be changed from if (tmpDir.exists()) { recursiveDelete(tmpDir); } else { tmpDir.mkdirs(); } to if (tmpDir.exists()) { recursiveDelete(tmpDir); } tmpDir.mkdirs();

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853919#action_12853919 ] Hui Wen Han commented on MAHOUT-358: I used the latest code test again , the final

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853929#action_12853929 ] Sean Owen commented on MAHOUT-358: -- Maybe it also clarifies to say: those valus are *not*

[jira] Commented: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-06 Thread Kris Jack (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853930#action_12853930 ] Kris Jack commented on MAHOUT-356: -- No, I haven't set a CLASSPATH var (not intentionally

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853977#action_12853977 ] Hui Wen Han commented on MAHOUT-358: I have no negative ratings. the pref value

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853983#action_12853983 ] Hui Wen Han commented on MAHOUT-358: I will debug and tell you the result . Thanks :)

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854073#action_12854073 ] Hui Wen Han commented on MAHOUT-358: if use Text as the out format ,everything is ok.

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854077#action_12854077 ] Sean Owen commented on MAHOUT-358: -- You mean that you do not see those negative values?

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Benson Margulies
Hearing no other remarks, I will proceed to disconnect and make the version 1.0-SNAPSHOT, and call a release vote RSN. On Sun, Apr 4, 2010 at 7:58 PM, Benson Margulies bimargul...@gmail.com wrote: Last question: What's the first version going to be? I propose '1.0'. 0.4 would get mighty

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Sean Owen
This still lives in Mahout, just has a different version number? what's the substance of the change in the short-term; I think I missed that step. On Tue, Apr 6, 2010 at 6:41 PM, Benson Margulies bimargul...@gmail.com wrote: Hearing no other remarks, I will proceed to disconnect and make the

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Benson Margulies
Substance: 1: remove collections-codegen and collections from the top-level pom's module list. 2: change their parents to point to the apache parent. 3: tweak their poms so that the release plugin works right with them. 4: release them 5: change rest of mahout to consume release. On Tue, Apr 6,

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Ted Dunning
For what it is worth, I actually prefer this approach to the multi-pom approach in many cases. If it really is a separate thing, it might as well have a separate release schedule and artifact. If it isn't a separate thing, then you might as well use a single pom. This heuristic doesn't always

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Jake Mannix
I agree in principal, but having a whole different set of versionings seems kinda... messy? If m-collections goes 1.0, and then 1.1, and then m-math goes 1.0, and core goes to 0.5, we have a whole pile of different version numbers to keep track of. Didn't Lucene and Solr just intentionally do

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Ted Dunning
The Lucene/Solr community have decided to loosely couple release schedules and explicitly decided to not lock version numbers. One of their arguments was that it would confuse users, which doesn't apply for us. The other argument was that either side should be free to have a release that was

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Ted Dunning
I should have said there should SOON be a vanishingly small number of collections releases. Clearly that isn't so just yet. On Tue, Apr 6, 2010 at 12:09 PM, Ted Dunning ted.dunn...@gmail.com wrote: if only because there should be a vanishingly small number of collections releases

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Benson Margulies
We gain the ability to release collections more frequently. *because* it is less mature, it needs that. On Tue, Apr 6, 2010 at 2:48 PM, Jake Mannix jake.man...@gmail.com wrote: I agree in principal, but having a whole different set of versionings seems kinda... messy?  If m-collections goes

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Benson Margulies
Where are we on the consensus process? Jake, have Ted and I satisfied you? Does this call for a VOTE to be sure that we're on the same page? On Tue, Apr 6, 2010 at 3:33 PM, Benson Margulies bimargul...@gmail.com wrote: On Tue, Apr 6, 2010 at 3:10 PM, Ted Dunning ted.dunn...@gmail.com wrote: I

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Ted Dunning
Very cool. Very exciting. Benson, that sounds like consensus to me. On Tue, Apr 6, 2010 at 1:02 PM, Jake Mannix jake.man...@gmail.com wrote: ... I'm in favor, I guess, of: 1: remove collections-codegen and collections from the top-level pom's module list. 2: change their parents to point

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Benson Margulies
Indeed. Off I go. On Tue, Apr 6, 2010 at 4:23 PM, Ted Dunning ted.dunn...@gmail.com wrote: Very cool.  Very exciting. Benson, that sounds like consensus to me. On Tue, Apr 6, 2010 at 1:02 PM, Jake Mannix jake.man...@gmail.com wrote: ... I'm in favor, I guess, of: 1: remove

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-06 Thread Robin Anil
Great proposal. Hopefully this will push Mahout core to have faster releases Robin On Wed, Apr 7, 2010 at 3:29 AM, Grant Ingersoll gsing...@apache.org wrote: +1. Release early, release often. -Grant On Apr 6, 2010, at 5:12 PM, Benson Margulies wrote: Indeed. Off I go. On Tue, Apr

VOTE: release mahout-collections-codegen 1.0

2010-04-06 Thread Benson Margulies
In order to decouple the mahout-collections library from the rest of Mahout, to allow more frequent releases and other good things, we propose to release the code generator for the collections library as a separate Maven artifact. (Followed, in short order, by the collections library proper.) This

[jira] Created: (MAHOUT-364) [GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop

2010-04-06 Thread Zaid Md. Abdul Wahab Sheikh (JIRA)
[GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop --- Key: MAHOUT-364 URL: https://issues.apache.org/jira/browse/MAHOUT-364 Project:

GSOC [mentor idea]: Clustering visualization with GraphViz

2010-04-06 Thread Robin Anil
Here is a good project wish list, If anyone wishes to take it forward I would be willing to help mentor. http://www.graphviz.org/ Check out one of the graphs which i believe is a good way to represent clusters. Creating this graph is as easy was writing cluster output to the graphviz format

[jira] Updated: (MAHOUT-364) [GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop

2010-04-06 Thread Zaid Md. Abdul Wahab Sheikh (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zaid Md. Abdul Wahab Sheikh updated MAHOUT-364: --- Description: Proposal Title: Implement Multi-Layer Perceptrons with

[jira] Updated: (MAHOUT-364) [GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop

2010-04-06 Thread Zaid Md. Abdul Wahab Sheikh (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zaid Md. Abdul Wahab Sheikh updated MAHOUT-364: --- Comment: was deleted (was: formatting :() [GSOC] Proposal to

Re: VOTE: release mahout-collections-codegen 1.0

2010-04-06 Thread Ted Dunning
Is that possible here instead: https://repository.apache.org/content/repositories/staging/org/apache/mahout/? On Tue, Apr 6, 2010 at 6:08 PM, Benson Margulies bimargul...@gmail.comwrote: In order to decouple the mahout-collections library from the rest of Mahout, to allow more frequent

Re: VOTE: release mahout-collections-codegen 1.0

2010-04-06 Thread Benson Margulies
On Tue, Apr 6, 2010 at 9:40 PM, Ted Dunning ted.dunn...@gmail.com wrote: Is that possible here instead: https://repository.apache.org/content/repositories/staging/org/apache/mahout/? No, that's not right. That path has our last (0.3) release in it. However, I had forgotten to close it.

Re: A request for prospective GSOC students

2010-04-06 Thread Zaid Md Abdul Wahab Sheikh
I just submitted a proposal to implement Neural Network with backpropagation learning Jira issue: http://issues.apache.org/jira/browse/MAHOUT-364 On Sat, Apr 3, 2010 at 9:07 PM, Robin Anil robin.a...@gmail.com wrote: I am having a tough time separating Mahout proposals from rest of Apache on

[GSoC 2010] Requesting feedback on my proposal for implementing Neural Network with backpropagation learning

2010-04-06 Thread Zaid Md Abdul Wahab Sheikh
Hi all, I just submitted a GSoC proposal for implementing Neural Network with backpropagation on Hadoop. Jira issue: http://issues.apache.org/jira/browse/MAHOUT-364 I would appreciate your feedback and comments on the proposal and on the working or implementation plan.

[jira] Commented: (MAHOUT-364) [GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop

2010-04-06 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854304#action_12854304 ] Jake Mannix commented on MAHOUT-364: I've got to say, this is a fantastically well

[jira] Commented: (MAHOUT-358) the pref value field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-06 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854317#action_12854317 ] Hui Wen Han commented on MAHOUT-358:

[jira] Commented: (MAHOUT-364) [GSOC] Proposal to implement Neural Network with backpropagation learning on Hadoop

2010-04-06 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854349#action_12854349 ] Ted Dunning commented on MAHOUT-364: This is a very nicely written proposal. One

Re: VOTE: release mahout-collections-codegen 1.0

2010-04-06 Thread Ted Dunning
I confirm that the components exist and appear in good order. Is there a way for me to test this component? Is there any testing needed beyond checking existence? On Tue, Apr 6, 2010 at 7:13 PM, Benson Margulies bimargul...@gmail.comwrote: On Tue, Apr 6, 2010 at 9:40 PM, Ted Dunning

Re: VOTE: release mahout-collections-codegen 1.0

2010-04-06 Thread Robin Anil
Is there a patch which pulls this dependency to build Mahout. Thats the good test for it Robin On Wed, Apr 7, 2010 at 10:45 AM, Ted Dunning ted.dunn...@gmail.com wrote: I confirm that the components exist and appear in good order. Is there a way for me to test this component? Is there any