Hello Mike, Thank you and you were right in your first comment, the expected field, Lucene46FieldInfos is within the file _0.cfs. We have taken a closer look and in details. The problem was because copy bytes in hex form from cassandra to the byte array was wrong because the source offset was set wrongly. It was set from 0 all the time when it should be set based on lucene called seek(position). Thank you again.
Jack, it is educational purpose and we think lucene is a fantastic software and we would like to learn it in details. Jason On Mon, Feb 17, 2014 at 10:31 PM, Jack Krupansky <j...@basetechnology.com>wrote: > Are you using or aware of Solandra? See: > > https://github.com/tjake/Solandra > > Solandra has been superceded by a commercial product, DataStax Enterprise > that combines Solr/Lucene and Cassandra. Solr/Lucene indexing of Cassandra > data is supported, but the actual Lucene indexes are stored in the native > file system for greater performance. Solrandra stored the Lucene indexes in > Cassandra, but the performance penalty was too high. > > -- Jack Krupansky > > -----Original Message----- From: Jason Wee > Sent: Friday, February 14, 2014 3:13 AM > To: java-user@lucene.apache.org > Subject: codec mismatch > > > Hello, > > This is my first question to lucene mailing list, sorry if the question > sounds funny. > > I have been experimenting to store lucene index files on cassandra, > unfortunately the exception got overwhelmed. Below are the stacktrace. > > org.apache.lucene.index.CorruptIndexException: codec mismatch: actual > codec=CompoundFileWriterData vs expected codec=Lucene46FieldInfos > (resource: SlicedIndexInput(SlicedIndexInput(_0.fnm in > lucene-cassandra-desc) in lucene-cassandra-desc slice=31:340)) > at org.apache.lucene.codecs.CodecUtil.checkHeaderNoMagic( > CodecUtil.java:140) > at org.apache.lucene.codecs.CodecUtil.checkHeader(CodecUtil.java:130) > at > org.apache.lucene.codecs.lucene46.Lucene46FieldInfosReader.read( > Lucene46FieldInfosReader.java:56) > at > org.apache.lucene.index.SegmentReader.readFieldInfos( > SegmentReader.java:214) > at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:94) > at > org.apache.lucene.index.StandardDirectoryReader$1.doBody( > StandardDirectoryReader.java:62) > at > org.apache.lucene.index.SegmentInfos$FindSegmentsFile. > run(SegmentInfos.java:843) > at > org.apache.lucene.index.StandardDirectoryReader.open( > StandardDirectoryReader.java:52) > at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:66) > at org.apache.lucene.store.Search.<init>(Search.java:41) > at org.apache.lucene.store.Search.main(Search.java:34) > > I'm not sure what does it means, can anybody help? > > When I check the hex representation of _0.fnm in cassandra, and translated > to ascii. It is something like this: > ??l??Lucene46FieldInfos??????path?Q?????????????? > PerFieldPostingsFormat.format?Lucene41?PerFieldPostingsFormat.suffix? > 0?modified?Q??????????????PerFieldPostingsFormat.format?Lucene41? > PerFieldPostingsFormat.suffix?0?contents???????????????? > PerFieldPostingsFormat.format?Lucene41?PerFieldPostingsFormat.suffix?0 > > It looks to me the expected codec is found in the _0.fnm file or am I > wrong? > > Thank you and please let me know if you need additional information. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > >