According to Lachlan Andrew:
> Greetings Gilles,
> 
> As I understand Dominique's problem, it is not an issue of chaining 
> the "accents" fuzzy rule with the "endings" fuzzy rule.  He is 
> searching for a word with an accent.  If it matches a word in the 
> database with an accent, then it should not need a fuzzy algorithm to 
> register a match.  I thought that "accents" causes "acouph�ne" to 
> match "acouphene" (without the accend).

I think you're right.  I likely jumped to the wrong conclusion, as it
resembles a problem that's come up many times before.  More likely, the
problem is that "herbe" exists in Dominique's francais.0 dictionary, or
whatever he used to generate the endings database, but "acouph�ne" isn't.
He'd need to add it, with the appropriates suffixes for pluralization
(usually "/S"), and then do "htfuzzy endings" again.

The big problem with the current endings algorithm is its dependence on
a static dictionary which may be incomplete.  The stemming algorithms
which Neal talked about, and wants to add to ht://Dig, adapt automatically
to whatever words get indexed, based on a set of rules for stemming words.

> Dominique, could you confirm that both the document and the query have 
> an accent?  If that is the case, then we *may* be able to fix this 
> problem without needing to chain fuzzy rules.

Also, please check the dictionary file used by your endings_dictionary
attribute in htdig.conf, to make sure it contains the proper pluralization
rules for any words for which you have a problem, as for acouph�ne in
this case.

> On Thu, 3 Jun 2004 02:07 am, Gilles Detillieux wrote:
> > Dominique had written:
> > > > 3- I have a problem with the accent and plurials.
> > > >
> > > > If a search for "herbe" or "herbes", no problems.  But, if the
> > > > works have an accent, like "acouph�ne", htdig have a problem to
> > > > find the plural.
> > > >
> > > > herbe: 136 results
> > > > herbes: 136 results
> > > >
> > > > acouph�ne:   6 results
> > > > acouph�nes:  25 results
> > > >
> > > >
> > > > search_algorithm:       exact:1 endings:1 prefix:1 accent:1
> > > > synonyms:0,5
> >
> > htsearch does not yet support chaining of fuzzy match algorithms,
> > so the results of the accents algorithms don't have the endings
> > algorithm applied to them (nor vice-versa).


-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This SF.Net email is sponsored by: GNOME Foundation
Hackers Unite!  GUADEC: The world's #1 Open Source Desktop Event.
GNOME Users and Developers European Conference, 28-30th June in Norway
http://2004/guadec.org
_______________________________________________
ht://Dig Developer mailing list:
[EMAIL PROTECTED]
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to