According to Isam Bayazidi:
> Hi all
> I wanted to ask if htdig could help me in providing a search engine for
> pages that contain Arabic text that is most probably encoded in iso8859-6 or
> cp1256 or maybe utf-8 ..
htdig supports only 8-bit encodings, and only if those encodings are
supported by a "locale" on your system. See
http://www.htdig.org/FAQ.html#q5.8 and http://www.htdig.org/FAQ.html#q4.10

> What I mean by searching is using the exact search algorithm .. I know the
> fuzzy will require some affix setting .. am I right ?
> I hope to get a response soon .. and I hope that I am on the right list ..

You have the right list, and I hope this response is soon enough. You are
correct that most fuzzy match algorithms will require extra configuration.
The endings algorithm needs a dictionary and an affix file. The synonyms
algorithm needs a list of equivalent words. The soundex and metaphone
algorithms are pretty much English-specific, and right now the accents
algorithm (in the 3.1.6 and 3.2.0b4 snapshots) only works with the
ISO-8859-1 (Latin 1) encoding.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
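
As a rough illustration of the 8-bit point in the reply, the sketch below compares how the three encodings mentioned in the question store a short Arabic word. It is plain Python and has nothing to do with htdig's own code; the sample word and the helper name is_single_byte are made up for this example. The idea is simply that iso8859-6 and cp1256 use one byte per character, whereas utf-8 uses more than one byte for Arabic letters, which is why it falls outside the 8-bit case.

```python
# Sample text: the Arabic word "marhaba" (hello), 5 letters.
# Written with escapes so the file itself stays ASCII.
SAMPLE = "\u0645\u0631\u062d\u0628\u0627"


def is_single_byte(text, encoding):
    """Return True if every character of text encodes to exactly one byte.

    Hypothetical helper for this illustration only.
    """
    return len(text.encode(encoding)) == len(text)


# Python codec names for the three encodings from the question.
for enc in ("iso8859_6", "cp1256", "utf-8"):
    encoded = SAMPLE.encode(enc)
    print(enc, len(encoded), "bytes, single-byte:", is_single_byte(SAMPLE, enc))
```

Pages in utf-8 would presumably need converting to one of the 8-bit encodings before indexing, and a matching locale would still have to be installed on the system, as the FAQ entries above describe.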

