At 12:54 PM +0200 8/23/00, Quim Sanmarti wrote:
>1. I was particularly interested in how the boolean queries are parsed. The
>first thing I tried was:

Well, first off, I can say that boolean queries are even stranger 
than what you've seen. Try mixing a phrase into a boolean query! 
Whoops...

>Hmm. Does this imply that the parser generates *binary* children when
>trying boolean? Trying only with 'and' operators, does much the same...

Yes, the boolean parser doesn't try to handle more than two children 
per node (n=2) right now. Even that's causing some problems that are 
hard to test, as you've seen. You can also see in the code that it's 
doing binary matching.

>Hmm. No precedence is defined between 'or' and 'and'.

I left precedence for both to be the same, based on left to right parsing.
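
To illustrate what equal precedence with left-to-right parsing means for the resulting tree, here is a minimal Python sketch. It is not the ht://Dig parser; the function name `parse_left_to_right` and the tuple-based node layout are assumptions made for this example only. Each operator takes everything parsed so far as its left child, so every node stays binary.

```python
def parse_left_to_right(tokens):
    """Fold a flat 'word op word op word ...' list into nested binary nodes.

    Hypothetical helper for illustration: 'and' and 'or' share one
    precedence level, so grouping depends only on token order.
    """
    if not tokens:
        raise ValueError("empty query")
    if len(tokens) % 2 == 0:
        raise ValueError("expected: word (operator word)*")

    tree = tokens[0]
    for i in range(1, len(tokens), 2):
        op = tokens[i].lower()
        if op not in ("and", "or"):
            raise ValueError(f"expected 'and' or 'or', got {tokens[i]!r}")
        # The tree built so far becomes the left child of the new node.
        tree = (op, tree, tokens[i + 1])
    return tree


print(parse_left_to_right("a or b and c".split()))
# ('and', ('or', 'a', 'b'), 'c')
```

With conventional precedence, where 'and' binds tighter than 'or', the same query would instead group as a or (b and c).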

>It seems to be ignoring the parens.

Yes, right now the HtWordToken function used for splitting queries 
tosses out parens. I wrote a new tokenizer which will be needed 
anyway to keep field:word together.
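
As a rough idea of what such a tokenizer has to do, the following Python sketch keeps parentheses as their own tokens and leaves field:word intact. It is only an assumption about the general approach, not the new ht://Dig tokenizer; the names `TOKEN_RE` and `tokenize` and the rule that a double-quoted phrase is a single token are invented for this example.

```python
import re

# Hypothetical token rules, tried in this order:
#   1. a single parenthesis
#   2. a double-quoted phrase, kept as one token
#   3. a run of characters that are not whitespace, parens, or quotes
#      (so "field:word" stays together)
TOKEN_RE = re.compile(r'[()]|"[^"]*"|[^\s()"]+')


def tokenize(query):
    """Split a query string into tokens without discarding parentheses."""
    # findall skips whitespace; an unbalanced quote is silently dropped.
    return TOKEN_RE.findall(query)


print(tokenize('(title:foo or bar) and "a phrase"'))
# ['(', 'title:foo', 'or', 'bar', ')', 'and', '"a phrase"']
```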

>Parsing as a boolean query FAILED
>[silence, infinite loop]

The loop is from AndParseTree parsing. I fixed that particular bug, 
but it uncovered others--I need to gobble up whitespace between 
tokens sometimes, but the String class doesn't offer a useful method 
(yet).
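
The whitespace step can be sketched as follows, in Python rather than with the ht://Dig String class. The helpers `skip_whitespace` and `read_words` are hypothetical. The sketch shows one way a scanning loop can guarantee progress: the position always moves past any whitespace before the next token is read, so the loop cannot stall on a blank.

```python
def skip_whitespace(text, pos):
    """Return the index of the first non-whitespace character at or after pos."""
    while pos < len(text) and text[pos].isspace():
        pos += 1
    return pos


def read_words(text):
    """Collect whitespace-separated words, gobbling blanks between them."""
    words = []
    pos = skip_whitespace(text, 0)
    while pos < len(text):
        end = pos
        # pos sits on a non-whitespace character, so end always advances.
        while end < len(text) and not text[end].isspace():
            end += 1
        words.append(text[pos:end])
        pos = skip_whitespace(text, end)
    return words


print(read_words("  foo   and bar "))
# ['foo', 'and', 'bar']
```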

I hope to commit a pile of cleanups tonight.
-Geoff

