On Wed, 22 Jun 2005, Andrej wrote:
we would like to index a Japanese website. The pages are using utf-8
character encoding. The ht://Dig FAQ states that ht://Dig cannot index
Japanese pages yet, since they require 16-bit characters, which is not
supported by ht://Dig.
Has there been an update lately concerning this problem or do you know of a
possible workaround that will enable us to index the pages nonetheless?
I am not aware of any progress in this area. The 3.2.x code still lacks
support for multi-byte characters. There are still plans to add some
level of Unicode support to a future version, but when such a version
might become available is a complete unknown.
Jim
-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general