Re: Flexible indexing

Michael Busch Sun, 11 Mar 2007 13:42:13 -0800

Hi Grant,

I certainly agree that it would be great if we could make some progressand commit the payloads patch soon. I think it is quite independent fromFI. FI will introduce different posting formats (see Wiki:http://wiki.apache.org/lucene-java/FlexibleIndexing). Payloads will bepart of some of those formats, but not all (i. e. per-position payloadsonly make sense if positions are stored).

The only concern some people had was about the API the patch introduces.It extends Token and TermPositions. Doug's argument was, that if weintroduce new APIs now but want to change them with FI, then it will behard to support those APIs. I think that is a valid point, but at thesame time it slows down progress to have to plan ahead in too manydirections. That's why I'd vote for marking the new APIs as experimentalso that people can try them out at own risk.If we could agree on that approach then I'd go ahead and submit anupdated payloads patch in the next days, that applies cleanly on thecurrent trunk and contains the additional warnings in the javadocs.

In regard of FI and 662 however I really believe we should split it upand plan ahead (in a way I mentioned already), so that we have moreisolated patches. It is really great that we have 662 already (Nicolas,thank you so much for your hard work, I hope you'll keep working with uson FI!!). We'll probably use some of that code, and it will definitelybe helpful.


Michael

Grant Ingersoll wrote:

Hi Michael,
This is very good. I know 662 is different, just wasn't sure ifNicolas patch was meant to be applied after 662, b/c I know we haddiscussed this before.
I do agree with you about planning this out, but I also know thatpatches seem to motivate people the best and provide a certainconcreteness to it all. I mostly started asking questions on thesetwo issues b/c I wanted to spur some more discussion and see if we canget people motivated to move on it.
I was hoping that I would be able to apply each patch to two differentcheckouts so I could start seeing where the overlap is and how theycould fit together (I also admit I was procrastinating on my ApacheContalk...). In the new, flexible world, the payloads implementationcould be a separate implementation of the indexing or it could be partof the core/existing file format implementation. Sometimes I justneed to get my hands on the code to get a real feel for what I feel isthe best way to do it.
I agree about the XML storage for Index information. We do that inour in-house wrapper around Lucene, storing info about the language,analyzer used, etc. We may also want a binary index-level storagecapability. I know most people just create a single document usuallyto store binary info about the index, but an binary storage might begood too.
Part of me says to apply the Payloads patch now, as it provides a lotof bang for the buck and I think the FI is going to take a lot longerto hash out. However, I know that it may pin us in or force us tochange things for FI. Ultimately, I would love to see both thesefeatures for the next release, but that isn't a requirement. Also, onFI, I would love to see two different implementations of whatever APIwe choose before releasing it, as I always find two implementations ofan Interface really work out the API details.
-Grant



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Flexible indexing

Reply via email to