Re: Multiword Highlighting

Mark Miller Fri, 16 Feb 2007 09:19:36 -0800

1> my test cases throw some exceptions with the code as-is. The spans.get(0) is a problem in that it's not guaranteed that the spans returned will have anything in them. Also, I don't think that the test for reqSpans.get(0).next
in queryClauses[i].isRequired is correct (even if it doesn't throw
exceptions). Isn't the sense there that we want to include the spans if we
*do* have entries??

You should get a Spans object back no matter what, hence the .get(0). If the Spans object returned has no Spans in it, then the first call to next will return false.

2> But more importantly, I think this throws things in the "span bucket"
across documents. Consider two documents: the text "a b c d e f" is in one
document, and "x y z" is in another. If we query on "a AND z", it seems
like extractSpansFromTermQuery would return one span from each document,
which would satisfy the tests in getSpansFromBooleanQuery inappropriately.

This might be the case. I have not considered it... I am working with real hit highlighting and so I only work with a single document at a time.
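
A minimal sketch of the cross-document concern, assuming spans are collected per clause along with their document id. The class SpansByDoc, the Hit holder, and docsMatchingAll are hypothetical names for illustration, not part of the SpansExtractor code or of Lucene. The idea is to require that all clauses match within the same document rather than anywhere in the bucket.

```java
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class SpansByDoc {
    /** Hypothetical stand-in for one span: a document id and a position range. */
    static final class Hit {
        final int doc, start, end;
        Hit(int doc, int start, int end) { this.doc = doc; this.start = start; this.end = end; }
    }

    /** Returns the ids of documents in which every required clause has at least one span. */
    static Set<Integer> docsMatchingAll(List<List<Hit>> requiredClauses) {
        Set<Integer> result = null;
        for (List<Hit> clause : requiredClauses) {
            Set<Integer> docs = new TreeSet<>();
            for (Hit h : clause) docs.add(h.doc);
            // Intersect with the documents matched by the earlier clauses.
            if (result == null) result = docs; else result.retainAll(docs);
        }
        return result == null ? new TreeSet<>() : result;
    }

    public static void main(String[] args) {
        // doc 0 holds "a b c d e f", doc 1 holds "x y z"; the query is "a AND z".
        List<Hit> a = List.of(new Hit(0, 0, 1));
        List<Hit> z = List.of(new Hit(1, 2, 3));
        // Prints [] because no single document contains both terms.
        System.out.println(docsMatchingAll(List.of(a, z)));
    }
}
```

With a single document at a time, as in the highlighting case, the intersection is trivially that one document, which would explain why the problem does not show up there.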

Is it just me or is working with Spans really intended to be "one pass
through and only forward"? There are several places in the SpansExtractor
code where we want to ask "are there any spans in here?". But to ask that, you have to call next(). Which changes the state of the Spans such that you
have to be really careful when you use any Spans that have had this test
performed already and do a do..while (spans.next()); rather than a while (
spans.next()) {}..... Ditto with skipTo.

Could be. I'll do some testing.
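
A small sketch of the issue and one possible way around it, assuming a forward-only cursor with a next() method. SimpleSpans, ArraySpans, and PeekableSpans are hypothetical stand-ins written for this example and are not the Lucene Spans interface. The wrapper buffers the result of one next() call so that "are there any spans?" can be asked without losing the first entry.

```java
import java.util.ArrayList;
import java.util.List;

public class PeekableSpansDemo {
    /** Hypothetical forward-only cursor, reduced to what the example needs. */
    interface SimpleSpans {
        boolean next();
        int start();
    }

    /** Forward-only cursor over a fixed list of start positions. */
    static final class ArraySpans implements SimpleSpans {
        private final int[] starts;
        private int pos = -1;
        ArraySpans(int... starts) { this.starts = starts; }
        public boolean next() { return ++pos < starts.length; }
        public int start() { return starts[pos]; }
    }

    /** Wrapper that can test for a further span without consuming it from the caller's view. */
    static final class PeekableSpans implements SimpleSpans {
        private final SimpleSpans in;
        private boolean peeked;        // true when the wrapped cursor is one step ahead
        private boolean peekedResult;  // what that extra next() call returned
        PeekableSpans(SimpleSpans in) { this.in = in; }

        boolean hasNext() {
            if (!peeked) { peekedResult = in.next(); peeked = true; }
            return peekedResult;
        }
        public boolean next() {
            if (peeked) { peeked = false; return peekedResult; }
            return in.next();
        }
        // Only meaningful after next() and before the following hasNext().
        public int start() { return in.start(); }
    }

    /** Drains a cursor with the plain while (spans.next()) loop. */
    static List<Integer> collect(SimpleSpans spans) {
        List<Integer> out = new ArrayList<>();
        while (spans.next()) out.add(spans.start());
        return out;
    }

    public static void main(String[] args) {
        SimpleSpans raw = new ArraySpans(3, 7, 9);
        boolean any = raw.next();                       // the "any spans?" test
        System.out.println(any + " " + collect(raw));   // true [7, 9] : first span lost

        PeekableSpans safe = new PeekableSpans(new ArraySpans(3, 7, 9));
        System.out.println(safe.hasNext() + " " + collect(safe)); // true [3, 7, 9]
    }
}
```

The alternative mentioned above, switching to do..while after the test, also works but leaves every caller responsible for remembering whether next() has already been called.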

I haven't thrown any exceptions yet, but I am working with a single doc in a MemoryIndex. So far I have yet to see a problem. I will keep looking.



- Mark
