[jira] [Commented] (MAHOUT-1047) CVB hangs after completion

2013-04-06 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13624758#comment-13624758 ] Saikat Kanjilal commented on MAHOUT-1047: - I'd like to help out on this issue, pl

Re: Any interest in Data Preparation?

2013-04-06 Thread Gokhan Capan
I consider this as a tiny step of a larger making-it-more-usable action. Ted, I actually started this to evaluate my implementation of factorization machines, I'm going to write about it after trying on some data on the thread you started, we could talk about details there. About data tools,

[jira] [Commented] (MAHOUT-1186) OpenKeyTypeObjectHashMap#clear() has been broken forever.

2013-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13624514#comment-13624514 ] Hudson commented on MAHOUT-1186: Integrated in Mahout-Quality #1950 (See [https://builds

Re: Any interest in Data Preparation?

2013-04-06 Thread Ted Dunning
I differ a bit in that these are important to have in general. Unfortunately, however, our current command line structure would make these really inefficient to use. On Sat, Apr 6, 2013 at 9:22 AM, Sebastian Schelter wrote: > In general, I think it is great to have such tools. But they should

RE: Mahout-1025, Need input

2013-04-06 Thread Saikat Kanjilal
Didnt hear back on this, added an explanation of CVB, please review and let me know if anything else needs to be documented. If not I will move to the next JIRA issue. Regards > From: ted.dunn...@gmail.com > Date: Tue, 2 Apr 2013 22:44:32 +0200 > Subject: Re: Mahout-1025, Need input > To: dev@m

[jira] [Commented] (MAHOUT-1025) Update documentation for LDA before the release.

2013-04-06 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13624474#comment-13624474 ] Saikat Kanjilal commented on MAHOUT-1025: - Ok, added an explanation of CVB, pleas

Re: Any interest in Data Preparation?

2013-04-06 Thread Sebastian Schelter
In general, I think it is great to have such tools. But they should be developed in context with a specific algorithm or problem. On 06.04.2013 17:47, Gokhan Capan wrote: > Hi, > > Are you guys interested in Weka like filters implementation, > like NominalToBinary, Discretize etc. > > I started

Re: Mahout Suggestions - Refactoring Effort

2013-04-06 Thread Gokhan Capan
Hi, As a first step I started to implement data preparation tools. I started a separate thread for that, so we can talk on that specifically. On Thu, Mar 28, 2013 at 7:37 PM, Dan Filimon wrote: > Guys, a couple of proposals for GSOC are about refactoring. > > Could you have a look in that thr

Any interest in Data Preparation?

2013-04-06 Thread Gokhan Capan
Hi, Are you guys interested in Weka like filters implementation, like NominalToBinary, Discretize etc. I started to implement in-memory versions running on Mahout Matrix, and plan to extend the implementations so they could run on sequence files of IntWritable, VectorWritable pairs. -- Gokhan