According to Emma Jane Hogbin: > On Fri, Jan 24, 2003 at 01:55:37AM +0100, Olivier Korn wrote: > > Of course, they have *not*. Look at <http://ecogest.info/recherche/>... > > Good!! I was worried that what I was getting was the only way of getting > the search results.
Are you sure you haven't applied a patch to ht://Dig that would force it to strip out accented characters? There were two accent patches available in the patch archives for 3.1.5: one was the accents fuzzy algorithm by Robert Marchand, which eventually became a standard part of 3.1.6 and 3.2 betas, and the other was a hack that mapped all ISO-8859-1 (Latin 1) accented letters to their unaccented counterparts. If you've applied that latter patch, ftp://ftp.ccsf.org/htdig-patches/3.1.5/accents.zip, to any ht://Dig version, then that would explain the problem. In an earlier e-mail, you mentioned that you "also grabbed the language pack (for lack of a better term) from the web site." What pack are you referring to specifically? It would be helpful to know the actual file name and the location from which you got it. Also, which version of ht://Dig are you running, and what patches, if any, have been applied. See http://www.htdig.org/FAQ.html#q5.33 > > This ht://Dig implementation uses the accents fuzzy algorithm AND the > > endings fuzzy algorithm (with french dictionary and affix files). > > Did you work from the language package on the web site? I will go through > it again and try to understand what steps I missed. I've just found at > one mistake. I was using fr_FR but I only have fr in /usr/share/locale > Found that tid bit in: > http://htdig.org/FAQ.html#q4.13 > I also noticed that I don't have an LC_CTYPE in my /usr/share/locale/fr > folder. I'll bug the list again once I've figured out how to fix this > situation. I believe it involves "installing" fr_CA instead of the generic > fr. Also have a look in /usr/lib/locale, as some systems (namely Linux systems using more recent versions of glibc) put locale definitions there. Any French locale that has the LC_CTYPE file should do, as any national variations of a language shouldn't affect the character set used. Based on what you've reported, it doesn't sound like a locale problem to me. Usually, if the locale you pick doesn't support accented letters, these letters are treated as punctuation, causing words to be split up wherever an accented character appears in a word, but the accented characters still show up in the excerpts. You just can't search for accented words because these words aren't put in the database. However, you reported that the accents are stripped from the results page. Am I misunderstanding you in interpretting this as meaning that accented letters are replaced with their unaccented counterparts, for example, that "�" appears as "e"? Or do you mean they disappear altogether? > Oliver thanks for your offer of next week. I hope I will have things > solved by then. I have some new things to work on now and I am absolutely > encouraged by the URL you sent along. > > emma :) > PS When I get the locale installed will I be able to see accented > characters in my email? I can already see them in my browser. That is likely an entirely different matter, and probably depends more on what program you're using for e-mail. Of your recent messages to this list, most seem to be sent from "Mutt", with one of them from "SquirrelMail". Check the docs and/or web pages for these packages, and if you run up empty, maybe try their support mailing lists or forums. Mutt runs in text-mode, so the display of accents may also depend on the settings of the terminal program you use, and what character set it uses for display. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/ Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada) ------------------------------------------------------- This SF.NET email is sponsored by: SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See! http://www.vasoftware.com _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

