Re: who clears attributes?

Mark Miller Mon, 10 Aug 2009 18:50:20 -0700

Grant Ingersoll wrote:

On Aug 10, 2009, at 6:28 PM, Mark Miller wrote:
Grant Ingersoll wrote:
On Aug 10, 2009, at 5:12 PM, Shai Erera wrote:
Maybe we should follow what I seem to read from Earwin and Grant - come up w/ real use cases, try to implement them w/ the current API, then if it's impossible, discuss how we can make the current API more adaptive. If at the end of this we'll get back to the new API, then we'll at least feel better about it, and more convinced it is the way to go.
Well, I have real use cases for it, but all of it is still missing the biggest piece: search side support. It's the 900 lb. elephant in the room. The 500 lb. elephant is the fact that all these attributes, AIUI, require you to hook in your own indexing chain, etc. in order to even be indexed, which is all package private stuff. It's not even clear to me what happens right now if you were to, say, have a Token Stream that had only one Attribute on it and none of the existing attributes (term buffer, length, position, etc.). Please correct me if I am wrong, I still don't have a deep understanding of it all.
Michael has always been up front that this new API is in preparation for flexible indexing. It doesn't give us the goodness - he has laid out the reasons for moving before the goodness comes more than once, I think. From my understanding, Michael looked at what Mike was doing in one of his flexible indexing patches, wondered how some of the TokenStream stuff was going to work well with it, and came up with this new API as a solution. Yes - it gets us nothing now. But it's a big move, and there is no need to do everything at once - in fact it would probably be harder to do it all at once - the rest has always been on the table. 3.0 has always been convenient to push it before, as deprecations can then be removed. Nothing is forcing us to make that decision now though.
Honestly, though, it really gives you very little over the current, well functioning payloads capability other than stronger typing, the ability to pick only those attributes that you want indexed (in theory) and a byte (or so) of savings per any token that has a payload, and we _HAVE_ right now, search support for payloads.
Payloads gives us nothing as developers - you can't use that functionality without taking it from the users - payloads are for users.
Flexible indexing will lead to all kinds of little cool things - the likes of which have been discussed a lot in older emails. It will likely lead to things we cannot predict as well. Everything will be more flexible. It also could play a part in CSF, and work on allowing custom files to plug into merging. Plus everything else that's been mentioned (pfor, etc). I've been sold on the long term benefits. I don't think you need these APIs for them, but it's my understanding it helps solve part of the equation.
A bunch of issues have come up. To my knowledge, they have been addressed with vigor every time. If someone is unhappy with how something has been addressed, and it needs to be addressed further, please speak up.
Um, that's what I've been doing. Vigor is good. I very much appreciate everyone's work. From what I can tell, most devs here are unsure at best what to do with their existing Analyzer capabilities. I've actually implemented a couple of new TokenFilters using the new APIs. I like that aspect of it. I'm just not sure on the back compat hoops (and yes, I asked for them). But I'm also operating under the assumption that our BC approach isn't going to change anytime soon, such that it is very important that these new capabilities are worked out (and I don't just mean little performance nicks here and there, I mean in terms of usability and performance).

I'm not responding to just you there, but more to the growing pack of those speaking against the new API. I don't see specific issues being brought up - the only issues I have seen brought up have been addressed in JIRA issues that have received no comments indicating the fix was not good enough. So we are seeing a lot of general complaints, but specific complaints have been addressed as far as I can tell.

As far as back compat - is it really still considered an issue? We have broken back compat in this release wherever it was convenient to do so. I suspect that will continue. I just wish our policy reflected how things actually work (and I think they work as they should, based on the circumstances that lead to each decision).

Let's put it this way: We expect to release 2.9 within the month (which is very short in Lucene time). That will give us a sum total of, what, 2.5 weeks of review by devs for some very major changes? I want 3.0 as much as anyone (I've been pushing for 1.5 support for at least 2 years now), but I don't want us to be in a hole going into it because we felt rushed right when the "finish" was so close.
Otherwise, I don't think the sky is falling - I think the new API is being shaken out.
I agree it's not falling. It never is. This is in fact how the process works. People are doing the right thing here by discussing it and working on it.

That's kind of in response to the groundswell that appeared to be building to roll back or hold off on the new API. To me, we would do that if the sky was falling. As long as specific issues are being addressed (and the number of issues has not been that high), I just don't see a reason to hold off on the current plan.


Oh, and now it seems the new QP is dependent on it all.

Dependent how?


Attribute and a whole slew of AttributeImpls.

Oh, because it uses the Attributes. I think the new QueryParser is its own kettle of fish. It really shouldn't have a back compat promise while it lives in contrib. It needs to be shaken out before it could possibly replace the current parser.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



--
- Mark

http://www.lucidimagination.com



