Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
Hmmm... that's an issue. Since both Dirichlet and Meanshift clustering have been removed from 0.9, cluster-syntheticcontrol.sh options 4,5 are not gonna work and should have been removed for 0.9. To PMC,  - rollback the release, fix this issue (and other patches that were submitted in the

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo

[OT] Uses Cases for Taming Text, 2nd ed.

2014-01-20 Thread Grant Ingersoll
Hi Mahout Users, Drew Farris, Tom Morton and I are currently working on the 2nd Edition of Taming Text (http://www.manning.com/ingersoll for first ed.) and are soliciting interested parties who would be willing to contribute to a chapter on practical use cases (i.e. you have something in

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Sebastian Schelter
Hi Yoonmin, we removed a bunch of algorithms either because they were rarely used or not actively maintained anymore. IIRC the first thing was true for PFG. --sebastian On 01/20/2014 03:42 AM, Yoonmin Nam wrote: Hello, everyone! Is there anyone know about the reason why PFG is deprecated

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Dmitriy Lyubimov
that's a bit weird though. Association mining is still a pretty popular technique. (our scientists use it, albeit not in exact FPGrowth form) On Mon, Jan 20, 2014 at 3:15 PM, Sebastian Schelter s...@apache.org wrote: Hi Yoonmin, we removed a bunch of algorithms either because they were

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Suneel Marthi
I was asked this question too and I had no clear answer. May be it wasn't right to remove FP from the codebase. Not having this may well be one another reason for users to look at options other than Mahout. Given the issues that Frank's reported with Streaming KMeans (and I am seeing them too)

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Dmitriy Lyubimov
it seems more like it is not supported. I'd port it into spark counterpart and make sure there's a support (i.e. a person to go after when it breaks :) On Mon, Jan 20, 2014 at 3:44 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: I was asked this question too and I had no clear answer. May be

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Ted Dunning
On Mon, Jan 20, 2014 at 5:44 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: I was asked this question too and I had no clear answer. May be it wasn't right to remove FP from the codebase. The major problem was that we had no maintainers for the code.

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Qinghao Dai
May I ask what is the qualification to be the maintainer? I have read this part of code, and would like to have a try. Best Regards, Qinghao 2014/1/20 Ted Dunning ted.dunn...@gmail.com On Mon, Jan 20, 2014 at 5:44 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: I was asked this question

Re: About Parallel Frequent Growth algorithm

2014-01-20 Thread Dmitriy Lyubimov
(1) diligently answer inqueries about method use and theoretical foundation on the user list (2) fix arising issues related to that stuff, diligently as well. when (1) and (2) stops happening, the method gets axed in a year or couple releases (which what pretty much happend this time i think).

The maintainer of FPG algorithm

2014-01-20 Thread Yoonmin Nam
If it is available, then I also want to become a maintainer of FGP algorithm. Thanks Yoonmin