Re: Any interest in Data Preparation?

2013-04-06 Thread Gokhan Capan
I consider this as a tiny step of a larger making-it-more-usable action. Ted, I actually started this to evaluate my implementation of factorization machines, I'm going to write about it after trying on some data on the thread you started, we could talk about details there. About data tools,

Re: Any interest in Data Preparation?

2013-04-06 Thread Ted Dunning
I differ a bit in that these are important to have in general. Unfortunately, however, our current command line structure would make these really inefficient to use. On Sat, Apr 6, 2013 at 9:22 AM, Sebastian Schelter wrote: > In general, I think it is great to have such tools. But they should

Re: Any interest in Data Preparation?

2013-04-06 Thread Sebastian Schelter
In general, I think it is great to have such tools. But they should be developed in context with a specific algorithm or problem. On 06.04.2013 17:47, Gokhan Capan wrote: > Hi, > > Are you guys interested in Weka like filters implementation, > like NominalToBinary, Discretize etc. > > I started

Any interest in Data Preparation?

2013-04-06 Thread Gokhan Capan
Hi, Are you guys interested in Weka like filters implementation, like NominalToBinary, Discretize etc. I started to implement in-memory versions running on Mahout Matrix, and plan to extend the implementations so they could run on sequence files of IntWritable, VectorWritable pairs. -- Gokhan