Your message dated Sun, 16 Apr 2006 20:36:58 +0200
with message-id <[EMAIL PROTECTED]>
and subject line parseString doesn't handle unicode strings
has caused the attached Bug report to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what I am
talking about this indicates a serious mail system misconfiguration
somewhere.  Please contact me immediately.)

Debian bug tracking system administrator
(administrator, Debian Bugs database)

--- Begin Message ---
Package: python-utidylib
Version: 0.2-2

The parseString function does not handle unicode strings; it seems to
treat them the same as an empty string:

In [4]: print tidy.parseString("<p>hello")
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN">
<html>
<head>
<meta name="generator" content=
"HTML Tidy for Linux/x86 (vers 12 April 2005), see www.w3.org">
<title></title>
</head>
<body>
<p>hello</p>
</body>
</html>


In [5]: print tidy.parseString(u"<p>hello")
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN">
<html>
<head>
<meta name="generator" content=
"HTML Tidy for Linux/x86 (vers 12 April 2005), see www.w3.org">
<title></title>
</head>
<body>
</body>
</html>

- Josh Triplett

Attachment: signature.asc
Description: OpenPGP digital signature


--- End Message ---
--- Begin Message ---
Hi,

utidilib actually does handle unicode, you just have to encode
your unicode objects appropriately first and pass the character_encoding
option to parseString:

>>> import tidy
>>> print tidy.parseString(u"<p>hello".encode("utf8"), char_encoding="utf8")
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN">
<html>
<head>
<meta name="generator" content=
"HTML Tidy for Linux/x86 (vers 1 September 2005), see www.w3.org">
<title></title>
</head>
<body>
<p>hello</p>
</body>
</html>

I'm closing this bug :)

Cheers,
Igor

P.S.: For a full list of possible options see
http://tidy.sourceforge.net/docs/quickref.html

--- End Message ---

Reply via email to