Your message dated Mon, 23 Feb 2009 17:50:59 +0200
with message-id <[email protected]>
and subject line Re: Bug#516694: html2text: fails to convert valid html
has caused the Debian Bug report #516694,
regarding html2text: fails to convert valid html
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
516694: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=516694
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: html2text
Version: 1.3.2a-11
Severity: normal
Hello,
html2text fails to convert a file/web page that it used to convert in
version 1.3.2a-3.
The file can be obtained from
http://www.cse.ohio-state.edu/~gurari/TeX4ht/bugfixes2.html
The command issued was:
html2text -nobs -style compact -ascii -width 79 -o bugfixes2.txt bugfixes2.html
The error message is:
Input recoding failed due to invalid input sequence. Unconverted part of text
follows.
��→ /usr/bin/t4ht
Below, use ‘/usr/bin/’ (without quotes) — or whatever you got as
...
A copy of this file is is used during the build of "tex4ht" which is currently
failing to build properly due to this bug.
Regards,
Kapil.
-- System Information:
Debian Release: 5.0
APT prefers unstable
APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)
Kernel: Linux 2.6.26-1-vserver-amd64 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=en_US.UTF-8 (charmap=ANSI_X3.4-1968) (ignored: LC_ALL
set to C)
Shell: /bin/sh linked to /bin/bash
Versions of packages html2text depends on:
ii libc6 2.9-3 GNU C Library: Shared libraries
ii libgcc1 1:4.3.3-4 GCC support library
ii libstdc++6 4.3.3-4 The GNU Standard C++ Library v3
html2text recommends no packages.
Versions of packages html2text suggests:
pn curl | wget <none> (no description available)
-- no debconf information
--- End Message ---
--- Begin Message ---
Version: 1.3.2a-12
Kapil Hari Paranjape wrote:
> Thanks for the rapid response. I checked your fix and it works.
> "html2text" no longer exits with error for the original command line.
>
> However, there still seems to be some sort of bug as it gives the same
> error message if the "-ascii" option is dropped from the command
> line.
>
> At the same time, I read the man page more carefully (as I should
> have done before filing the bug :) ) and found that "html2text" is
> not meant to do a reasonable job for html4 or xhtml. The doc in
> question _is_ xhtml so I suppose I should really be looking at
> a different solution for my package.
You can also try new '-utf8' or '-nometa' options, though I don't know
whether they have something to work in xhtml case or not.
New version with the fix has been uploaded recently.
--
Eugene V. Lyubimkin aka JackYF, JID: jackyf.devel(maildog)gmail.com
C++/Perl developer, Debian Maintainer
signature.asc
Description: OpenPGP digital signature
--- End Message ---