What should the input be?
On Tue, Nov 4, 2014 at 12:28 AM, Lee S <sle...@gmail.com> wrote: > Hi all: > I'm wondering why the input and output of most algorithm like > kmeans,naivebayes are all sequencefiles. One more step of conversion need > to be done if we want the algorithm works.And > I think the step is time consuming. Because it's also a mapreduce job. > For the reason to deal with small files and compress to save disk space? >