[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205262#comment-13205262 ] Sean Owen commented on MAHOUT-947: -- My only issue with this is that this has brought in a

Jenkins build is still unstable: Mahout-Quality #1346

2012-02-09 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-929) Refactor Clustering (Vector Classification) into a Separate Postprocess with Outlier Pruning

2012-02-09 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205226#comment-13205226 ] Paritosh Ranjan commented on MAHOUT-929: I would prefer committing the code becaus

AbstractJob, CommandLine Parsing and MAHOUT-947

2012-02-09 Thread Grant Ingersoll
In looking at MAHOUT-947, I did some work to convert the Seq dumper to AbstractJob cmd line parsing. In doing so, I realized AJ has no support for multiple arguments of the same name, which the seq dumper needs. I thus added it, which of course broke a lot of other things, which meant I tried

[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205104#comment-13205104 ] Jake Mannix commented on MAHOUT-947: I wasn't suggesting supporting multiple args, jus

[jira] [Updated] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Grant Ingersoll (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-947: --- Attachment: MAHOUT-947.patch This patch is quite a bit bigger than Tom's b/c as I was digging

Jenkins build is still unstable: Mahout-Quality #1345

2012-02-09 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Grant Ingersoll (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205018#comment-13205018 ] Grant Ingersoll commented on MAHOUT-947: I have a patch for this that cleans this

[jira] [Commented] (MAHOUT-784) Exception at 20 Newsgroups examples

2012-02-09 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205019#comment-13205019 ] Lance Norskog commented on MAHOUT-784: -- Hi- This patch makes so many reformatting ch

[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205017#comment-13205017 ] Jake Mannix commented on MAHOUT-947: so one comment: instead of --seqDirectory vs --se

[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205003#comment-13205003 ] Lance Norskog commented on MAHOUT-947: -- I won't be able to try it. The patch looks cl

[jira] [Commented] (MAHOUT-929) Refactor Clustering (Vector Classification) into a Separate Postprocess with Outlier Pruning

2012-02-09 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204936#comment-13204936 ] Jeff Eastman commented on MAHOUT-929: - Sequential version looks good but lacks tests o

[jira] [Assigned] (MAHOUT-929) Refactor Clustering (Vector Classification) into a Separate Postprocess with Outlier Pruning

2012-02-09 Thread Jeff Eastman (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman reassigned MAHOUT-929: --- Assignee: Jeff Eastman > Refactor Clustering (Vector Classification) into a Separate Post

Re: Multi Layer Perceptron for Mahout

2012-02-09 Thread Herta, Christian
Hello Atul, thanks for your interest. Yes, I created a jira issue(MAHOUT-976 [https://issues.apache.org/jira/browse/MAHOUT-976] ) and started with the implementation. I will upload a patch in the next days. Then you can get an impression what's going on. I am interested in your feedback. The mos

Jenkins build is still unstable: Mahout-Quality #1344

2012-02-09 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-948) Improved error reporting when ARFF index does not exist in arff.vector [fix provided]

2012-02-09 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204866#comment-13204866 ] Hudson commented on MAHOUT-948: --- Integrated in Mahout-Quality #1344 (See [https://builds.ap

[jira] [Commented] (MAHOUT-915) OutOfMemoryError in EigenVerificationJob

2012-02-09 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204865#comment-13204865 ] Hudson commented on MAHOUT-915: --- Integrated in Mahout-Quality #1344 (See [https://builds.ap

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters-II #37

2012-02-09 Thread Apache Jenkins Server
See

Re: Multi Layer Perceptron for Mahout

2012-02-09 Thread Atul Kulkarni
Have you created a JIRA for this? I would like to help. On Tue, Feb 7, 2012 at 9:43 AM, Ted Dunning wrote: > The JIRA will be the best place to talk about your strategy for > implementation. Parallelization of NN training isn't a trivial problem. > We should learn from recent experience of oth

Jenkins build is still unstable: Mahout-Quality #1343

2012-02-09 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-784) Exception at 20 Newsgroups examples

2012-02-09 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204663#comment-13204663 ] Hudson commented on MAHOUT-784: --- Integrated in Mahout-Quality #1343 (See [https://builds.ap

[jira] [Updated] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-973: - Affects Version/s: (was: 0.7) Fix Version/s: 0.7 Assignee: Grant Ingersoll Grant

[jira] [Resolved] (MAHOUT-948) Improved error reporting when ARFF index does not exist in arff.vector [fix provided]

2012-02-09 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-948. -- Resolution: Fixed Fix Version/s: 0.7 Assignee: Sean Owen > Improved error reporting

[jira] [Resolved] (MAHOUT-915) OutOfMemoryError in EigenVerificationJob

2012-02-09 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-915. -- Resolution: Fixed Fix Version/s: 0.7 > OutOfMemoryError in EigenVerificationJob > -

[jira] [Commented] (MAHOUT-972) Implement Taste DynamoDBDataModel

2012-02-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204538#comment-13204538 ] Sean Owen commented on MAHOUT-972: -- Ok, good start. This will go in integration/ and it w

[jira] [Updated] (MAHOUT-972) Implement Taste DynamoDBDataModel

2012-02-09 Thread Nick Jordan (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Jordan updated MAHOUT-972: --- Attachment: DynamoDBDataModel.java Attached is an early version of this that should be fully function

[jira] [Updated] (MAHOUT-704) Refactor PredictionJob to use MultipleInputs for reduce side joins

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-704: - Fix Version/s: (was: 1.0) > Refactor PredictionJob to use MultipleInputs for reduce side joins >

[jira] [Resolved] (MAHOUT-784) Exception at 20 Newsgroups examples

2012-02-09 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-784. -- Resolution: Fixed Fix Version/s: 0.7 Assignee: Sean Owen (was: Robin Anil) > Excep

[jira] [Resolved] (MAHOUT-629) FP Growth performance improvement

2012-02-09 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-629. -- Resolution: Invalid Looks interesting, but we can't act on this without a current patch.

[jira] [Resolved] (MAHOUT-758) seqdirectory does not preserve full, correct pathnames in sequence files

2012-02-09 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-758. -- Resolution: Cannot Reproduce Assignee: Sean Owen > seqdirectory does not preserve full, corre

[jira] [Updated] (MAHOUT-845) Make cluster top terms code more reusable

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-845: - Resolution: Fixed Assignee: Jake Mannix Status: Resolved (was: Patch Available) > Make

[jira] [Commented] (MAHOUT-845) Make cluster top terms code more reusable

2012-02-09 Thread Frank Scholten (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204512#comment-13204512 ] Frank Scholten commented on MAHOUT-845: --- I guess this one can be closed?

Re: Mandatory svnpubsub migration by Jan 2013

2012-02-09 Thread Dan Brickley
On 9 February 2012 13:34, Grant Ingersoll wrote: > I just migrated Lucene (I think y'all will find the look and feel quite > familiar) over to the new CMS, so I can help if needed.  It is quite nice and > easy to use.  For us, it should be trivial for the main page.  The Confluence > stuff we r

Re: Mandatory svnpubsub migration by Jan 2013

2012-02-09 Thread Grant Ingersoll
I just migrated Lucene (I think y'all will find the look and feel quite familiar) over to the new CMS, so I can help if needed. It is quite nice and easy to use. For us, it should be trivial for the main page. The Confluence stuff we really need to fix. On Feb 9, 2012, at 3:55 AM, Sean Owen

Fwd: Mandatory svnpubsub migration by Jan 2013

2012-02-09 Thread Sean Owen
We have a small to-do here, and I can do it. As I undersrtand, this just means that /www/mahout.apache.org needs to be in Subversion somewhere, and checked out there. We already have the main page in SVN at https://svn.apache.org/repos/asf/mahout/site/new_website I propose to just check everythin

Re: Apache Mahout 0.6 Released

2012-02-09 Thread Ioan Eugen Stan
Congratulations and keep it going. Pe 06.02.2012 23:19, Shannon Quinn a scris: Apache Mahout has reached version 0.6. All developers are encouraged to begin using version 0.6, as much has changed and will continue to do so as we march inexorably toward a 1.0 release. Highlights of 0.6 include:

MAHOUT-945 and MAHOUT-954

2012-02-09 Thread IKumasa Mukai
Hi deneche-san. I wish to fix these two bug issues on df asap. Your prompt reviewing/checking would be appreciated. https://issues.apache.org/jira/browse/MAHOUT-945 https://issues.apache.org/jira/browse/MAHOUT-954 Regards & Thanks, 2012/1/24 IKumasa Mukai : > Hi Deneche-san. > > Sorry for this

[jira] [Commented] (MAHOUT-944) LuceneIndexToSequenceFiles (lucene2seq) utility

2012-02-09 Thread Frank Scholten (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204382#comment-13204382 ] Frank Scholten commented on MAHOUT-944: --- Ah, seq2encoded currently supports text onl