Thanks, Tom
The problem that content was different was that
I converted one sample to Base64 byte-by-byte, and converted the other
from-byte-array to-byte-array (Strange, that they cause different outputs).
Thanks for good points.

Rasit

2009/2/2 Tom White <t...@cloudera.com>

> The SequenceFile format is described here:
>
> http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/io/SequenceFile.html
> .
> The format of the keys and values depends on the serialization classes
> used. For example, BytesWritable writes out the length of its byte
> array followed by the actual bytes in the array (see the write()
> method in BytesWritable).
>
> Hope this helps.
> Tom
>
> On Mon, Feb 2, 2009 at 3:21 PM, Rasit OZDAS <rasitoz...@gmail.com> wrote:
> > I tried to use SequenceFile.Writer to convert my binaries into Sequence
> > Files,
> > I read the binary data with FileInputStream, getting all bytes with
> > reader.read(byte[])  , wrote it to a file with SequenceFile.Writer, with
> > parameters NullWritable as key, BytesWritable as value. But the content
> > changes,
> > (I can see that by converting to Base64)
> >
> > Binary File:
> > 73 65 65 65 81 65 65 65 65 65 81 81 65 119 84 81 65 111 67 81 65 52 57 81
> 65
> > 103 54 81 65 65 97 81 65 65 65 81 ...
> >
> > Sequence File:
> > 73 65 65 65 65 69 65 65 65 65 65 65 65 69 66 65 65 77 66 77 81 103 67 103
> 67
> > 69 77 65 52 80 86 67 65 73 68 114 ...
> >
> > Thanks for any points..
> > Rasit
> >
> > 2009/2/2 Rasit OZDAS <rasitoz...@gmail.com>
> >
> >> Hi,
> >> I tried to use SequenceFileInputFormat, for this I appended "SEQ" as
> first
> >> bytes of my "binary" files (with hex editor).
> >> but I get this exception:
> >>
> >> A record version mismatch occured. Expecting v6, found v32
> >>     at
> >> org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1460)
> >>     at
> >> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1428)
> >>     at
> >> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1417)
> >>     at
> >> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1412)
> >>     at
> >>
> org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43)
> >>     at
> >>
> org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:58)
> >>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:321)
> >>     at org.apache.hadoop.mapred.Child.main(Child.java:155)
> >>
> >> What could it be? Is it not enough just to add "SEQ" to binary files?
> >> I use Hadoop v.0.19.0 .
> >>
> >> Thanks in advance..
> >> Rasit
> >>
> >>
> >> different *version* of *Hadoop* between your server and your client.
> >>
> >> --
> >> M. Raşit ÖZDAŞ
> >>
> >
> >
> >
> > --
> > M. Raşit ÖZDAŞ
> >
>



-- 
M. Raşit ÖZDAŞ

Reply via email to