Bug#495448: html2text: Japanese handling fails even with -utf8 option
Kenshi Muto wrote:
> Hi,

Hi! Sorry for the late answer.

> Eugene V. Lyubimkin wrote:
>> Please, try the same with option '-nobs'. Does the problem remain?
>
> Great, it solves. I got the perfect plain text.
> (So -utf8 option should better enable -nobs option internally?)

Great. Yes, I think so too. I will close this bug after uploading a version
that enables '-nobs' automatically when '-utf8' is supplied.

--
Eugene V. Lyubimkin aka JackYF, Ukrainian C++ developer.
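The behaviour agreed on above ('-utf8' implying '-nobs') can be sketched as follows. This is only an illustration in Python, not the actual html2text source (which is C++); the function name parse_options and the dictionary layout are hypothetical.

```python
def parse_options(argv):
    """Hypothetical option parser: '-utf8' switches '-nobs' on as well."""
    opts = {"utf8": False, "nobs": False}
    for arg in argv:
        if arg == "-utf8":
            opts["utf8"] = True
        elif arg == "-nobs":
            opts["nobs"] = True
    # The change proposed in this thread: UTF-8 output implies no backspaces.
    if opts["utf8"]:
        opts["nobs"] = True
    return opts


print(parse_options(["-utf8"]))
print(parse_options(["-nobs"]))
print(parse_options([]))
```

With this approach, passing '-nobs' explicitly together with '-utf8' stays harmless, and output without '-utf8' is unchanged.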
Bug#495448: html2text: Japanese handling fails even with -utf8 option
Hi,

At Sun, 17 Aug 2008 22:49:13 +0300,
Eugene V. Lyubimkin wrote:
> Please, try the same with option '-nobs'. Does the problem remain?

Great, it solves. I got the perfect plain text.
(So -utf8 option should better enable -nobs option internally?)

Thanks,
--
Kenshi Muto [EMAIL PROTECTED]
Bug#495448: html2text: Japanese handling fails even with -utf8 option
Kenshi Muto wrote:
> Package: html2text
> Version: 1.3.2a-6
> Severity: normal
> Tags: experimental l10n
>
> Hi,
>
> As I replied at debian-devel, html2text 1.3.2a-6 couldn't handle
> (at least) Japanese UTF-8 web page.
>
> I attached the example tarball.
>
> * index.html: sample page, was taken from www.debian.org and
>   modified smaller, and converted the encoding to UTF-8.
> * h2t-1.png: browse index.html.
> * i1.txt: converted text with -utf8 option.
> * h2t-2.png: browse i1.txt.
> * i2.txt: converted text without any options.
> * h2t-3.png: browse i2.txt.
>
> In my quick view, there are problems around decorated strings, such
> as or .
>
> Thanks,

Please, try the same with option '-nobs'. Does the problem remain?

--
Eugene V. Lyubimkin aka JackYF, Ukrainian C++ developer.
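For readers wondering why '-nobs' could matter here: that option turns off the rendering of boldface and underlining with backspaces. The sketch below assumes the breakage comes from such overstriking being applied byte by byte to multibyte UTF-8 text. That cause is an assumption, not something confirmed in this thread, and the Python code is an illustration with hypothetical function names, not html2text's own code.

```python
def overstrike_bytes(text: str) -> bytes:
    """Bold each *byte* as b, BS, b (assumed byte-wise rendering)."""
    out = bytearray()
    for b in text.encode("utf-8"):
        out += bytes([b, 0x08, b])
    return bytes(out)


def overstrike_chars(text: str) -> bytes:
    """Bold each *character* as c, BS, c, then encode to UTF-8."""
    return "".join(ch + "\b" + ch for ch in text).encode("utf-8")


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


sample = "日本語"
# A backspace lands inside each multibyte sequence, so decoding fails.
print(is_valid_utf8(overstrike_bytes(sample)))
# Overstriking whole characters keeps every sequence intact.
print(is_valid_utf8(overstrike_chars(sample)))
# Pure ASCII is unaffected either way: one byte is one character.
print(is_valid_utf8(overstrike_bytes("abc")))
```

Under that assumption, ASCII pages would convert fine while Japanese text would break only around decorated strings, which matches the symptom reported above.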