I tried to use SequenceFile.Writer to convert my binaries into Sequence Files, I read the binary data with FileInputStream, getting all bytes with reader.read(byte[]) , wrote it to a file with SequenceFile.Writer, with parameters NullWritable as key, BytesWritable as value. But the content changes, (I can see that by converting to Base64)
Binary File: 73 65 65 65 81 65 65 65 65 65 81 81 65 119 84 81 65 111 67 81 65 52 57 81 65 103 54 81 65 65 97 81 65 65 65 81 ... Sequence File: 73 65 65 65 65 69 65 65 65 65 65 65 65 69 66 65 65 77 66 77 81 103 67 103 67 69 77 65 52 80 86 67 65 73 68 114 ... Thanks for any points.. Rasit 2009/2/2 Rasit OZDAS <rasitoz...@gmail.com> > Hi, > I tried to use SequenceFileInputFormat, for this I appended "SEQ" as first > bytes of my "binary" files (with hex editor). > but I get this exception: > > A record version mismatch occured. Expecting v6, found v32 > at > org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1460) > at > org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1428) > at > org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1417) > at > org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1412) > at > org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43) > at > org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:58) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:321) > at org.apache.hadoop.mapred.Child.main(Child.java:155) > > What could it be? Is it not enough just to add "SEQ" to binary files? > I use Hadoop v.0.19.0 . > > Thanks in advance.. > Rasit > > > different *version* of *Hadoop* between your server and your client. > > -- > M. Raşit ÖZDAŞ > -- M. Raşit ÖZDAŞ