SequenceFile is/was also the standard for binary data on Hadoop. The
question is rather : what else would you expect? Surely not a text format?
Bertrand
On Fri, Nov 7, 2014 at 3:51 AM, Lee S sle...@gmail.com wrote:
any other reasons or can you give a thorough analysis?
2014-11-05 11:00
Also it's the easiest way to SerDe any complex stuff and get split + block
compression features since SeqFiles are splittable and could be compressed
by default. See the code, it has really complex stuff to transfer between
jobs.
2014-11-10 3:06 GMT+03:00 Bertrand Dechoux decho...@gmail.com:
Can anyone tell me why the Restricted Boltzmann Machines algorithm was removed ?
So is there any scalable rbms available ?
I'm going to implement a recommender based on it.
From: Ted Dunning
Date: 2014-11-10 15:34
To: user@mahout.apache.org
Subject: Re: why rbm was removed from mahout?
The algorithm wasn't particularly scalable. Nobody was around to support
it. Nobody
The algorithm never made it to the codebase and remained a patch for
sometime when the original author recalled the patch while we were working
on 0.8.
It wasn't scalable and the author didn't think it was worth committing to
trunk.
On Mon, Nov 10, 2014 at 2:34 AM, Ted Dunning
Check out H2O.
http://0xdata.com/
On Mon, Nov 10, 2014 at 1:38 AM, zhonghong...@yy.com zhonghong...@yy.com
wrote:
So is there any scalable rbms available ?
I'm going to implement a recommender based on it.
From: Ted Dunning
Date: 2014-11-10 15:34
To: user@mahout.apache.org
Subject: Re: