Hi!

I was using the HTDig for many years and I've just switched from 3.20b4 to
3.20b5 HTDig.  I don't know why, but I discovered a lot of locale-related
problems (I guess they may be due to the upgrade of gcc).

I'm running Slackware-9.1 with gcc 3.2.3 and the system locale=ru_RU.KOI8-R
(LANG=ru_RU.KOI8-R).  The HTDig config file's locale is ru_RU.KOI8-R, too.
The system locale files were generated from scratch (the i18 library).

It appeared that many basic C calls in HTDig code (like tolower, toupper,
isalpha, etc) give wrong results.  For example, tolower doesn't transform
capital Russian letters to lowercase.  As a result, I can't find any word
in the Russian text, which starts with a capital letter (English capitals
are OK because locale doesn't influence the 032-127 ascii codes in the C
functions).  Besides, for every Russian word in the stop list I get an
error message "ignored because is NOALPHA".

Simple testing C programs convinced me that the C functions in gcc 3.2.3
are working properly with the Russian locale, and so the problems come
from bugs in HTDig code.  I've solved some of the problems by adding the
setlocale(LC_ALL, "") command in two places in htlib/String.cc file, but
there are still a lot of other problems.  Can I ask the authors to correct
these bugs and issue the corrected code?

Best regards,
- Alexander

P.S.  I guess there are serious problems for an user who want to use HTDig
for multilanguage support because the system locale (LANG) can point only
one language (in addition to English).  Probably, different settings of
LC_CTYPE in the config file may help.



-------------------------------------------------------
SF.Net is sponsored by: Speed Start Your Linux Apps Now.
Build and deploy apps & Web services for Linux with
a free DVD software kit from IBM. Click Now!
http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to