On Mon, 29 Nov 1999, Gilles Detillieux wrote:
> Just a hunch, but you wouldn't happen to have a � in valid_punctuation,
> would you? In any case, could you run htdig -vvv twice, searching
> first for ANL�NDE, and then for anl�nde? How do the initial debugging
> messages differ. What's happening to the � - is it getting stripped
> out or changed to another character? Is the upper case � getting changed
> to a �, or to another character? Are you using the exact same config
> file for htdig, htmerge and htsearch?
I use the default for "valid_punctuation", I even tried adding it as
"extra_word_characters: �".
Here's the debugging info for the second (237th! :) try.
su10-2 <74> htsearch -vvv
Enter value for words: anl�nde
tempWords: 'anl�nde:0 '
Boolean: 'anl�nde:0 '
initial: ''
Add: anl�nde
searchWords: 'anl�nde:0 '
LogicalWords: anl�nde
Pattern:
Enter value for format:
su10-2 <75> htsearch -vvv
Enter value for words: ANL�NDE
tempWords: 'anl�nde:0 '
Boolean: 'anl�nde:0 '
initial: ''
Fuzzy on: anl�nde
(null) anl�nde
(null) word=anl�nde prefix_suffix=* prefix_suffix_length=1
minimum_prefix_length=1
endings anl�nda anl�ndandet anl�ndandets anl�ndande anl�nd- anl�nder
anl�nt anl�nds anl�ndes anl�nts anl�ndes
synonyms
searchWords: '(:0 anl�nde:0 |:0 anl�nda:0 |:0 anl�ndandet:0 |:0
anl�ndandets:0 |:0 anl�ndande:0 |:0 anl�nd-:0 |:0 anl�nder:0 |:0 anl�nt:0
|:0 anl�nds:0 |:0 anl�ndes:0 |:0 anl�nts:0 |:0 anl�ndes:0 ):0 '
LogicalWords: (anl�nde or anl�nda or anl�ndandet or anl�ndandets or
anl�ndande or anl�nd- or anl�nder or anl�nt or anl�nds or anl�ndes or
anl�nts or anl�ndes)
Pattern: anl�nde
Enter value for format:
looks ok to me... what do you say?
> Not that I know of, but you could put a originalWords.uppercase(); right
> after the originalWords.chop(" \t\r\n"); in htsearch/htsearch.cc. If the
> htsearch -vvv above doesn't get to the root of the problem, it might be
> interesting to see if this hack has any effect.
I'll try this too. If the above looks ok.
I got a mail from another Swedish subscriber of this list and according to
him everything worked well using sv_SE (which I don't have) and indexing
using an English dictionary (which shouldn't change anything).
I'll try to get hold of that locale and try it...
/Philippe
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.