I'm trying to query a large index using Lucene.NET (version 2.9.2.1).
The index contains ~1.2 Million docs and its size is 1.7GB
When trying to open an IndexSearcher, I get the following Exception:
Lucene.Net.Index.CurruptIndexException - {"Incompatible format version: 2
expected 1 or lower"}
at Lucene.Net.Index.SegmentInfos.FindSegmentsFile.Run(IndexCommit commit)
at Lucene.Net.Index.DirectoryReader.Open(Directory directory,
IndexDeletionPolicy deletionPolicy, IndexCommit commit, Boolean readOnly, Int32
termInfosIndexDivisor)
at Lucene.Net.Index.IndexReader.Open(Directory directory,
IndexDeletionPolicy deletionPolicy, IndexCommit commit, Boolean readOnly, Int32
termInfosIndexDivisor)
at Lucene.Net.Index.IndexReader.Open(Directory directory, Boolean readOnly)
at Lucene.Net.Search.IndexSearcher..ctor(Directory path, Boolean readOnly)
at CET.KotarIndexBuilder.KotarParagraphsSearcher.Search(String sQueryTerm)
in C:\Users\odedo\documents\visual studio
2010\Projects\CET.LucenePOC\CET.KotarIndexBuilder\KotarParagraphsSearcher.cs:line
42
at CET.KotarIndexBuilder.Program.Main(String[] args) in
C:\Users\odedo\documents\visual studio
2010\Projects\CET.LucenePOC\CET.KotarIndexBuilder\Program.cs:line 23
at System.AppDomain._nExecuteAssembly(RuntimeAssembly assembly, String[]
args)
at System.AppDomain.ExecuteAssembly(String assemblyFile, Evidence
assemblySecurity, String[] args)
at Microsoft.VisualStudio.HostingProcess.HostProc.RunUsersAssembly()
at System.Threading.ThreadHelper.ThreadStart_Context(Object state)
at System.Threading.ExecutionContext.Run(ExecutionContext executionContext,
ContextCallback callback, Object state, Boolean ignoreSyncCtx)
at System.Threading.ExecutionContext.Run(ExecutionContext executionContext,
ContextCallback callback, Object state)
at System.Threading.ThreadHelper.ThreadStart()
The strange thing is, that I can open the index for writing/updating, and I can
also open the index using Luke, and query it.
I tried to optimize the index using Luke, but it did not do any good.
When I rebuild this index with a subset of the data, for example only 500K or
750K of documents, I can open and query it successfully... but somewhere over
1M docs, something goes wrong.
I'm running VS2010, on .NET 4.0, using 64bit machine over Win7Pro.
Thanks,
Oded