#7267: clean_html() bug
-------------------------------------+-------------------------------------
               Reporter:  Nikolay    |          Owner:  nobody
  <djangoproject@…>                  |         Status:  new
                   Type:  Bug        |      Component:  Core (Other)
              Milestone:             |       Severity:  Normal
                Version:  SVN        |       Keywords:  html
             Resolution:             |      Has patch:  1
           Triage Stage:  Accepted   |    Needs tests:  0
    Needs documentation:  0          |  Easy pickings:  0
Patch needs improvement:  1          |
-------------------------------------+-------------------------------------
Changes (by lukeplant):

 * needs_better_patch:  0 => 1
 * stage:  Ready for checkin => Accepted


Comment:

 The issue regarding `'\xe2\x80\xa2'` does need to be addressed. This is a
 UTF-8 bytestring sequence for
 [http://www.fileformat.info/info/unicode/char/2026/index.htm Unicode
 character 2022]. The correct way to convert it to a unicode object is:
 {{{
     '\xe2\x80\xa2'.decode('utf-8')
 }}}

 which results in `u'\u2022'`, not `u'\xe2\x80\xa2'`

-- 
Ticket URL: <http://code.djangoproject.com/ticket/7267#comment:11>
Django <http://code.djangoproject.com/>
The Web framework for perfectionists with deadlines.

-- 
You received this message because you are subscribed to the Google Groups 
"Django updates" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/django-updates?hl=en.

Reply via email to