Oh boy!

It seems I have found the problem in my case, which, as far as I can tell, has nothing to do with Lucene but rather with the library we use to tokenize HTML documents. We changed our HTML parser at the same time as our version of Lucene, and NekoHTML (CyberNeko) does not close its HTML reader even when we call parser.abort()/parser.close() (which is placed in the close() of the Lucene Tokenizer).
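
For what it's worth, the workaround on our side is simply to close the reader ourselves in the Tokenizer instead of trusting the parser. Roughly something like this (a sketch against the Lucene 2.x-era API; the parser hookup is elided and the class name is made up):

    import java.io.IOException;
    import java.io.Reader;
    import org.apache.lucene.analysis.Token;
    import org.apache.lucene.analysis.Tokenizer;

    public class HtmlTokenizer extends Tokenizer {

        public HtmlTokenizer(Reader input) {
            super(input);  // Tokenizer keeps the Reader in its protected "input" field
            // ... hand "input" to the NekoHTML parser here ...
        }

        public Token next() throws IOException {
            // the real implementation pulls tokens out of the HTML parse; omitted
            return null;
        }

        public void close() throws IOException {
            // parser.abort()/parser.close() would go here, but since NekoHTML
            // leaves the reader open, we close "input" explicitly as well.
            input.close();
        }
    }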

Before that, the old HTML parser would close the reader itself, so I wrongly assumed the Lucene version change was the cause.

The bad news is that I had you all worked up for nothing; the good news is that you don't have any bugs here.

However, there may be something to the fact that Lucene's Analyzers automatically close the reader when they are done analyzing. I think this encourages people not to close readers explicitly, which creates the potential for leaked file descriptors if an exception is thrown in the middle of analysis or before addDocument()/updateDocument() is called.
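
Until something changes there, the safe pattern on the application side is probably to close the reader yourself in a finally block; a second close() on most readers is a harmless no-op. A minimal sketch (assuming an already-open IndexWriter and the Field(String, Reader) constructor):

    import java.io.FileReader;
    import java.io.IOException;
    import java.io.Reader;
    import org.apache.lucene.document.Document;
    import org.apache.lucene.document.Field;
    import org.apache.lucene.index.IndexWriter;

    class SafeIndexing {
        static void indexFile(IndexWriter writer, String path) throws IOException {
            Reader reader = new FileReader(path);
            try {
                Document doc = new Document();
                doc.add(new Field("body", reader));  // reader is consumed at addDocument()
                writer.addDocument(doc);             // the Analyzer closes it on success
            } finally {
                reader.close();                      // also closes it if an exception was thrown first
            }
        }
    }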

I don't think changing the API of Field to accept a "ReaderFactory" would solve anything, because there are cases where you must index a reader that is already open (like a network connection), and wrapping it in a dummy ReaderFactory does not look very good.
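
To make the objection concrete, here is roughly what that dummy wrapper would have to look like (ReaderFactory is the hypothetical interface under discussion, not an existing Lucene type):

    import java.io.Reader;

    interface ReaderFactory {
        Reader newReader();
    }

    // A factory in name only: it cannot produce a fresh reader, it just
    // hands back the single already-open one (e.g. a network stream).
    class DummyReaderFactory implements ReaderFactory {
        private final Reader alreadyOpen;

        DummyReaderFactory(Reader alreadyOpen) {
            this.alreadyOpen = alreadyOpen;
        }

        public Reader newReader() {
            return alreadyOpen;
        }
    }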

Daniel Shane
