Re: Why do most algorithms use sequencefile as input and output?

2014-11-09 Thread Bertrand Dechoux
SequenceFile is/was also the standard for binary data on Hadoop. The question is rather : what else would you expect? Surely not a text format? Bertrand On Fri, Nov 7, 2014 at 3:51 AM, Lee S sle...@gmail.com wrote: any other reasons or can you give a thorough analysis? 2014-11-05 11:00

Re: Why do most algorithms use sequencefile as input and output?

2014-11-09 Thread Serega Sheypak
Also it's the easiest way to SerDe any complex stuff and get split + block compression features since SeqFiles are splittable and could be compressed by default. See the code, it has really complex stuff to transfer between jobs. 2014-11-10 3:06 GMT+03:00 Bertrand Dechoux decho...@gmail.com:

why rbm was removed from mahout?

2014-11-09 Thread zhonghong...@yy.com
Can anyone tell me why the Restricted Boltzmann Machines algorithm was removed ?

Re: Re: why rbm was removed from mahout?

2014-11-09 Thread zhonghong...@yy.com
So is there any scalable rbms available ? I'm going to implement a recommender based on it. From: Ted Dunning Date: 2014-11-10 15:34 To: user@mahout.apache.org Subject: Re: why rbm was removed from mahout? The algorithm wasn't particularly scalable. Nobody was around to support it. Nobody

Re: why rbm was removed from mahout?

2014-11-09 Thread Suneel Marthi
The algorithm never made it to the codebase and remained a patch for sometime when the original author recalled the patch while we were working on 0.8. It wasn't scalable and the author didn't think it was worth committing to trunk. On Mon, Nov 10, 2014 at 2:34 AM, Ted Dunning

Re: Re: why rbm was removed from mahout?

2014-11-09 Thread Ted Dunning
Check out H2O. http://0xdata.com/ On Mon, Nov 10, 2014 at 1:38 AM, zhonghong...@yy.com zhonghong...@yy.com wrote: So is there any scalable rbms available ? I'm going to implement a recommender based on it. From: Ted Dunning Date: 2014-11-10 15:34 To: user@mahout.apache.org Subject: Re: