Thanks Luca, but what other way to sort a directory of sequence files?

I don't plan to write a sorting algorithm in mappers/reducers, but hoping to
use the sequenceFile.sorter instead.

Any ideas?

Mark

On Mon, May 23, 2011 at 12:33 AM, Luca Pireddu <pire...@crs4.it> wrote:

>
> On May 22, 2011 03:21:53 Mark question wrote:
> > I'm trying to sort Sequence files using the Hadoop-Example TeraSort. But
> > after taking a couple of minutes .. output is empty.
>
> <snip>
>
> > I'm trying to find what the input format for the TeraSort is, but it is
> not
> > specified.
> >
> > Thanks for any thought,
> > Mark
>
> Terasort sorts lines of text.  The InputFormat (for version 0.20.2) is in
>
>
> hadoop-0.20.2/src/examples/org/apache/hadoop/examples/terasort/TeraInputFormat.java
>
> The documentation at the top of the class says "An input format that reads
> the
> first 10 characters of each line as the key and the rest of the line as the
> value."
>
> HTH
>
> --
> Luca Pireddu
> CRS4 - Distributed Computing Group
> Loc. Pixina Manna Edificio 1
> Pula 09010 (CA), Italy
> Tel:  +39 0709250452
>

Reply via email to