Re: [jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

Michael McCandless Thu, 05 Apr 2007 16:30:28 -0700

"Mike Klaas" <[EMAIL PROTECTED]> wrote:
> On 4/5/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
> >
> > : Thanks!  But remember many Lucene apps won't see these speedups since I've
> > : carefully minimized cost of tokenization and cost of document retrieval.  I
> > : think for many Lucene apps these are a sizable part of time spent indexing.
> >
> > true, but as long as the changes you are making have no impact on the
> > tokenization/docbuilding times, that shouldn't be a factor -- that should
> > be considered a "constant time" adjunct to the code you are varying ...
> > people with expensive analysis may not see any significant increases, but
> > that's their own problem -- people concerned about performance will
> > already have that as fast as they can get it, and now the internals of
> > document adding will get faster as well.
> 
> Especially since it is relatively easy for users to tweak the analysis
> bits for performance--compared to the messy guts of index creation.
> 
> I am eagerly tracking the progress of your work.


Thanks Mike (and Hoss).

Hoss, what you said is correct: I'm only affecting the actual indexing of
a document, nothing before that.

I just want to make sure I get that disclaimer out, as much as possible, so
nobody tries the patch and says "Hey!  My app only got 10% faster!  This was
false advertising!".

People who indeed have minimized their doc retrieval and tokenization time
should see speedups around what I'm seeing with the benchmarks (I hope!).
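
To put rough numbers on that disclaimer, here is a minimal sketch (not part
of the patch). It assumes total time splits cleanly into an "indexing" part
that gets faster and an "everything else" part (doc retrieval, tokenization)
that does not change. The class name, method name, fractions and the 3x
factor are made up for illustration; they are not benchmark results.

```java
public class OverallSpeedup {

    /**
     * Amdahl-style estimate of the end-to-end speedup.
     *
     * @param indexingFraction fraction of total time spent in the part that
     *                         gets faster (0.0 to 1.0); hypothetical input
     * @param indexingSpeedup  how many times faster that part becomes (> 0)
     */
    static double overallSpeedup(double indexingFraction, double indexingSpeedup) {
        if (indexingFraction < 0.0 || indexingFraction > 1.0 || indexingSpeedup <= 0.0) {
            throw new IllegalArgumentException("fraction must be in [0,1], speedup must be > 0");
        }
        // Time that is unchanged, plus the shrunken indexing time, relative to 1.0.
        double newTime = (1.0 - indexingFraction) + indexingFraction / indexingSpeedup;
        return 1.0 / newTime;
    }

    public static void main(String[] args) {
        // Illustrative values only: apps dominated by analysis vs. by indexing.
        double[] fractions = {0.2, 0.5, 0.9};
        double assumedIndexingSpeedup = 3.0; // assumed, not a measured number
        for (double f : fractions) {
            System.out.printf("indexing fraction %.1f -> overall speedup %.2fx%n",
                    f, overallSpeedup(f, assumedIndexingSpeedup));
        }
    }
}
```

The smaller the share of time an app spends in actual indexing, the closer
its overall speedup stays to 1x, no matter how much faster indexing gets.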

Mike
