Updates:
Status: Fixed
Comment #1 on issue 120 by ja...@hoppipolla.co.uk: Deprecate BeautifulSoup
http://code.google.com/p/html5lib/issues/detail?id=120
Added a warning when bs is used
--
You received this message because you are listed in the owner
or CC fields of this issue, or becaus
Comment #1 on issue 132 by EmilStenstrom: html5lib maximum recursion depth
with broken starting object tag
http://code.google.com/p/html5lib/issues/detail?id=132
Found in the wild at: http://www.webfinanser.com/
Comment #3 on issue 131 by Ysbaddaden: Failure with custom namespace
elements (PHP lib)
http://code.google.com/p/html5lib/issues/detail?id=131
I heavily simplified the patch, which was wrong as geoffers pointed out.
I changed Treebuilder::insertElement() to create an element with
"http://www
Comment #2 on issue 131 by ezy...@mit.edu: Failure with custom namespace
elements (PHP lib)
http://code.google.com/p/html5lib/issues/detail?id=131
I did some experimentation with XML namespaces and PHP's DOM implementation
a while
back, and I think I decided that the situation was absolutel
Updates:
Status: Accepted
Labels: Port-PHP
Comment #1 on issue 131 by geoffers: Failure with custom namespace elements
(PHP lib)
http://code.google.com/p/html5lib/issues/detail?id=131
That patch is wrong: per the HTML 5 spec, that should create an element
whose
localname is
Status: New
Owner:
New issue 131 by Ysbaddaden: Failure with custom namespace elements (PHP
lib)
http://code.google.com/p/html5lib/issues/detail?id=131
Using a custom namespaced element (apart from the hardcoded SVG and MathML
namespaces) causes a failure with DOMDocument::createElementNS(
Updates:
Status: Fixed
Comment #6 on issue 96 by ja...@hoppipolla.co.uk: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
(No comment was entered for this change.)
Comment #2 on issue 121 by ja...@hoppipolla.co.uk: nonXmlBMPRegexp is
totally bogus
http://code.google.com/p/html5lib/issues/detail?id=121
Is this any less bogus now?
Updates:
Status: Fixed
Comment #5 on issue 113 by ja...@hoppipolla.co.uk: cannot handle mailformed
attribute names with html5lib and lxml
http://code.google.com/p/html5lib/issues/detail?id=113
This should be fixed now.
Updates:
Status: Fixed
Comment #7 on issue 129 by geoffers: Crash when parsing windows-style quotes
http://code.google.com/p/html5lib/issues/detail?id=129
Fixed in a848a1225f.
Updates:
Status: Started
Owner: geoffers
Comment #6 on issue 129 by geoffers: Crash when parsing windows-style quotes
http://code.google.com/p/html5lib/issues/detail?id=129
Now I get an error. Fun.
Updates:
Status: Duplicate
Mergedinto: 113
Comment #2 on issue 125 by geoffers: InfosetFilter.toXmlName doesn't filter
first character properly
http://code.google.com/p/html5lib/issues/detail?id=125
(No comment was entered for this change.)
Comment #4 on issue 113 by geoffers: cannot handle mailformed attribute
names with html5lib and lxml
http://code.google.com/p/html5lib/issues/detail?id=113
Issue 125 has been merged into this issue.
Comment #5 on issue 129 by geoffers: Crash when parsing windows-style quotes
http://code.google.com/p/html5lib/issues/detail?id=129
I get no error here.
Comment #1 on issue 130 by geoffers: Genshi Tree Walker Broken
http://code.google.com/p/html5lib/issues/detail?id=130
I can't reproduce this here. What version of Genshi are you using?
Comment #4 on issue 62 by chris.vigelius: Sanitizer does not allow
stripping of tags
http://code.google.com/p/html5lib/issues/detail?id=62
To Batiste (and anyone else this may concern): It seems you can achieve this
behaviour by overriding HtmlSanitizer - see attached example.
Note this has NO
Comment #4 on issue 122 by Christian.Oudard: Comments beginning a file
crashes the xml parser
http://code.google.com/p/html5lib/issues/detail?id=122
Thank you for the link to XML5, I realize now that I was using html5lib in
place of a
standalone liberal XML parser.
Comment #4 on issue 129 by ja...@hoppipolla.co.uk: Crash when parsing
windows-style quotes
http://code.google.com/p/html5lib/issues/detail?id=129
The reason for the crash here is that when we hit a meta tag, we assume
that the
value of the content attribute can always be encoded in the same e
Updates:
Status: Fixed
Comment #1 on issue 119 by ja...@hoppipolla.co.uk: Update to LC spec
http://code.google.com/p/html5lib/issues/detail?id=119
This meta bug isn't really that useful. We now pass almost all the tests we
have. Sure, more tests would be nice.
Comment #3 on issue 122 by ja...@hoppipolla.co.uk: Comments beginning a
file crashes the xml parser
http://code.google.com/p/html5lib/issues/detail?id=122
It's not a good idea to use the liberal XML parser to parse HTML. If you
want to
parse HTML you should use the HTML parser. If you want to
Comment #2 on issue 122 by Christian.Oudard: Comments beginning a file
crashes the xml parser
http://code.google.com/p/html5lib/issues/detail?id=122
This is very disappointing, because a solid parser is the backbone of any
html library.
I can't maintain the library, but I may be able to provi
Updates:
Status: WFM
Comment #3 on issue 112 by ja...@hoppipolla.co.uk: assertion in
processSpaceCharacters in InTableTextPhase
http://code.google.com/p/html5lib/issues/detail?id=112
I think this is fixed.
Updates:
Status: Fixed
Comment #1 on issue 123 by ja...@hoppipolla.co.uk: simpletree cloneNode
only works for Elements (+patch)
http://code.google.com/p/html5lib/issues/detail?id=123
Thanks for the patch and sorry for the delay.
Updates:
Status: Fixed
Comment #4 on issue 81 by ja...@hoppipolla.co.uk: Verision info
http://code.google.com/p/html5lib/issues/detail?id=81
I fixed this
Updates:
Status: WontFix
Comment #1 on issue 122 by ja...@hoppipolla.co.uk: Comments beginning a
file crashes the xml parser
http://code.google.com/p/html5lib/issues/detail?id=122
The liberal XML parser is no longer being developed. Unless someone steps
up to
maintain this module, thi
Comment #5 on issue 101 by starsareblueandfaraway: DeprecationWarning for
inputstream.py
http://code.google.com/p/html5lib/issues/detail?id=101
I am having the same issue in Python 2.6 with html5lib. Thanks for making
this library.
File "build/bdist.linux-i686/egg/html5lib/inputstream.p
Comment #4 on issue 86 by aaron.devore: BeautifulSoup treebuilder string
attribute is missing
http://code.google.com/p/html5lib/issues/detail?id=86
The Tag.string bug is now fixed in Beautiful Soup as of version 3.0.8,
which was
released on November 30 (3 days ago). The bug hasn't been fixed
Comment #3 on issue 129 by EmilStenstrom: Crash when parsing windows-style
quotes
http://code.google.com/p/html5lib/issues/detail?id=129
A workaround that seems to work is to simply replace all ”-characters with
double quotes before parsing. Easy as that:
f = f.replace('”', '"')
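The one-line workaround can be wrapped in a small helper. This is only a sketch: the name normalize_quotes is hypothetical, and also replacing the opening quote (U+201C) is an assumption that goes beyond what the comment describes.

```python
# -*- coding: utf-8 -*-

def normalize_quotes(text):
    """Replace typographic double quotes with plain ASCII double quotes.

    Hypothetical helper. The original comment only replaces U+201D;
    handling U+201C as well is an assumption made for this sketch.
    """
    for ch in (u"\u201c", u"\u201d"):
        text = text.replace(ch, u'"')
    return text


cleaned = normalize_quotes(u"<p title=\u201dhello\u201d>text</p>")
```

The cleaned string would then be handed to the parser in place of the raw input.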
Comment #2 on issue 129 by EmilStenstrom: Crash when parsing windows-style
quotes
http://code.google.com/p/html5lib/issues/detail?id=129
This is also seen at: www.etc.se
Status: New
Owner:
New issue 130 by armin.ro...@active-4.com: Genshi Tree Walker Broken
http://code.google.com/p/html5lib/issues/detail?id=130
Currently the genshi tree walker does not work. It fails here:
Traceback (most recent call last):
...
File "inmhpc\html.py", line 28, in par
Updates:
Status: Fixed
Comment #2 on issue 128 by ja...@hoppipolla.co.uk: Crash when parsing
strangely nested form and table tags
http://code.google.com/p/html5lib/issues/detail?id=128
(No comment was entered for this change.)
Updates:
Status: Fixed
Comment #3 on issue 126 by ja...@hoppipolla.co.uk: Dom treebuilder crashes
with unclosed font tag
http://code.google.com/p/html5lib/issues/detail?id=126
(No comment was entered for this change.)
Comment #3 on issue 127 by EmilStenstrom: html5lib crashes with open table
tag and empty line
http://code.google.com/p/html5lib/issues/detail?id=127
Thanks for the work, but now I get a new error:
C:\Emils\Kod\sammanfatta\fetch\management\commands>randomtask.py
Traceback (most recent call last
Comment #2 on issue 126 by EmilStenstrom: Dom treebuilder crashes with
unclosed font tag
http://code.google.com/p/html5lib/issues/detail?id=126
Another testcase that gives the same error with a slightly different
stacktrace:
Stacktrace:
C:\Emils\Kod\sammanfatta\fetch\management\commands>ra
Comment #1 on issue 129 by EmilStenstrom: Crash when parsing windows-style
quotes
http://code.google.com/p/html5lib/issues/detail?id=129
Found in the wild: www.enytt.com
Status: New
Owner:
New issue 129 by EmilStenstrom: Crash when parsing windows-style quotes
http://code.google.com/p/html5lib/issues/detail?id=129
What steps will reproduce the problem?
# Example code
# -*- coding: utf-8 -*-
import html5lib
parser = html5lib.HTMLParser()
doc = parser.parse(""
Comment #1 on issue 128 by EmilStenstrom: Crash when parsing strangely
nested form and table tags
http://code.google.com/p/html5lib/issues/detail?id=128
Found in the wild: http://www.gourmet.se/
Updates:
Status: Fixed
Comment #2 on issue 127 by ja...@hoppipolla.co.uk: html5lib crashes with
open table tag and empty line
http://code.google.com/p/html5lib/issues/detail?id=127
Should be fixed in latest tip
Thanks for all the great bug reports btw.
Status: New
Owner:
New issue 128 by EmilStenstrom: Crash when parsing strangely nested form
and table tags
http://code.google.com/p/html5lib/issues/detail?id=128
What steps will reproduce the problem?
# Example code
import html5lib
parser = html5lib.HTMLParser()
doc = parser.parse("")
St
Comment #1 on issue 127 by EmilStenstrom: html5lib crashes with open table
tag and empty line
http://code.google.com/p/html5lib/issues/detail?id=127
Edit: 7harad2.html should of course be free from quotes, just a table tag
followed by
an empty line
Status: New
Owner:
New issue 127 by EmilStenstrom: html5lib crashes with open table tag and
empty line
http://code.google.com/p/html5lib/issues/detail?id=127
What steps will reproduce the problem?
# Example code
import html5lib
f = open("7harad2.html")
parser = html5lib.HTMLParser()
doc =
Comment #1 on issue 126 by EmilStenstrom: Dom treebuilder crashes with
unclosed font tag
http://code.google.com/p/html5lib/issues/detail?id=126
Found in the wild at: http://7-harad.nu/
Status: New
Owner:
New issue 126 by EmilStenstrom: Dom treebuilder crashes with unclosed font
tag
http://code.google.com/p/html5lib/issues/detail?id=126
What steps will reproduce the problem?
# Example code
import html5lib
parser =
html5lib.HTMLParser(tree=html5lib.treebuilders.getTreeB
Updates:
Status: Duplicate
Mergedinto: 80
Comment #3 on issue 124 by excors: Crash when parsing three swedish news
sites with beautifulsoup treebuilder
http://code.google.com/p/html5lib/issues/detail?id=124
(No comment was entered for this change.)
Comment #8 on issue 80 by excors: TypeError when serializing some pages to
BeautifulSoup
http://code.google.com/p/html5lib/issues/detail?id=80
Issue 124 has been merged into this issue.
Comment #1 on issue 125 by EmilStenstrom: InfosetFilter.toXmlName doesn't
filter first character properly
http://code.google.com/p/html5lib/issues/detail?id=125
Example where this is encountered in the wild: http://www.allehanda.se
Status: Accepted
Owner: excors
New issue 125 by excors: InfosetFilter.toXmlName doesn't filter first
character properly
http://code.google.com/p/html5lib/issues/detail?id=125
echo '' | ./parse.py - --treebuilder=lxml
ValueError: Invalid attribute name u'0'
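To illustrate the kind of first-character check the issue title refers to, here is a minimal sketch. It is not the InfosetFilter code: to_xml_name is a hypothetical function, the character classes are ASCII-only approximations of the XML name rules, and the UXXXXX escape format is an assumption chosen for the example.

```python
import re

# ASCII-only approximations of XML NameStartChar and NameChar (assumption:
# the real classes also cover many non-ASCII ranges).
NAME_START = re.compile(u"[A-Za-z_:]")
NAME_CHAR = re.compile(u"[A-Za-z0-9_:.\\-]")


def escape_char(ch):
    """Encode a disallowed character as 'U' plus five hex digits."""
    return u"U%05X" % ord(ch)


def to_xml_name(name):
    """Hypothetical sketch: make a string usable as an XML name.

    The first character is checked against the stricter start-character
    class; the remaining characters against the general name class.
    """
    if not name:
        return name
    first = name[0] if NAME_START.match(name[0]) else escape_char(name[0])
    rest = u"".join(
        ch if NAME_CHAR.match(ch) else escape_char(ch) for ch in name[1:]
    )
    return first + rest
```

The point of the sketch is that a digit is acceptable inside a name but not as its first character, so the two positions need different checks.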
Comment #2 on issue 124 by EmilStenstrom: Crash when parsing three swedish
news sites with beautifulsoup treebuilder
http://code.google.com/p/html5lib/issues/detail?id=124
Also happens on the latest source checkout, but with an extra
DataLossWarning:
C:\Program Files
(x86)\python\lib\site-
Comment #1 on issue 124 by EmilStenstrom: Crash when parsing three swedish
news sites with beautifulsoup treebuilder
http://code.google.com/p/html5lib/issues/detail?id=124
Bug does not appear with the "dom" treebuilder.
Status: New
Owner:
New issue 123 by rnairn: simpletree cloneNode only works for Elements
(+patch)
http://code.google.com/p/html5lib/issues/detail?id=123
Seems wrong to define it in the base class when it has to be different in
each
subclass.
Attachments:
simpletree-clone.patch
Status: New
Owner:
New issue 122 by Christian.Oudard: Comments beginning a file crashes the
xml parser
http://code.google.com/p/html5lib/issues/detail?id=122
What steps will reproduce the problem?
>>> import html5lib
>>> html5lib.XMLParser().parse('')
Traceback (most recent call last):
Comment #4 on issue 101 by schmi...@wsucougars.com: DeprecationWarning for
inputstream.py
http://code.google.com/p/html5lib/issues/detail?id=101
This is not fixed on the latest release. I am having the same issue using
it on
Ubuntu's Karmic Koala. Any suggestions to getting this working?
Updates:
Labels: Milestone-Release1.0 Type-Defect Port-Python
Comment #1 on issue 121 by ja...@hoppipolla.co.uk: nonXmlBMPRegexp is
totally bogus
http://code.google.com/p/html5lib/issues/detail?id=121
(No comment was entered for this change.)
Updates:
Labels: -Milestone-Release1.0 -Priority-High Priority-Low
Comment #5 on issue 96 by ja...@hoppipolla.co.uk: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
This should be fixed but the code is never actually run in a release, it is
just used
to g
Status: Accepted
Owner: excors
New issue 121 by excors: nonXmlBMPRegexp is totally bogus
http://code.google.com/p/html5lib/issues/detail?id=121
It says:
nonXmlBMPRegexp = re.compile(u'[\x00-,|/|:-@|[-^|`|{-\xb6|\xb8-
\xbf|\xd7|\xf7|...')
which is wrong (e.g. it matches '[' but not
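One general pitfall with a pattern written this way is that '|' has no alternation meaning inside a character class; it is just another literal member of the set. The short sketch below shows this with made-up patterns (buggy and fixed are hypothetical and far simpler than the expression quoted above).

```python
import re

# Inside [...] the '|' is a literal character, so this class matches
# a-c, x-z, AND the pipe character itself.
buggy = re.compile(u"[a-c|x-z]")

# Ranges in a character class are simply listed next to each other.
fixed = re.compile(u"[a-cx-z]")

buggy_matches_pipe = buggy.match(u"|") is not None
fixed_matches_pipe = fixed.match(u"|") is not None
```

Here buggy_matches_pipe is True and fixed_matches_pipe is False.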
Comment #9 on issue 93 by geoffers: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
I don't think we need to try and get the spec to quote anything else.
Comment #3 on issue 86 by aaron.devore: BeautifulSoup treebuilder string
attribute is missing
http://code.google.com/p/html5lib/issues/detail?id=86
This bug will be fixed in the next version of Beautiful Soup in the 3.0.x
series.
Note that Beautiful Soup support will be deprecated as of vers
Status: Accepted
Owner: geoffers
Labels: Type-Defect Milestone-Release1.0 Priority-Critical Port-Python
New issue 120 by geoffers: Deprecate BeautifulSoup
http://code.google.com/p/html5lib/issues/detail?id=120
As I discussed with jgraham, we'll deprecate BS in 1.0 as it cannot
correctly repres
Status: Accepted
Owner: geoffers
Labels: Type-Enhancement Milestone-Release1.0 Priority-Critical Port-Python
NeedsTests
New issue 119 by geoffers: Update to LC spec
http://code.google.com/p/html5lib/issues/detail?id=119
We need to update the parser (tokenizer, tree-construction, resetting the
Updates:
Status: Invalid
Comment #2 on issue 25 by geoffers: Lack of trailing empty line generates a
parse error
http://code.google.com/p/html5lib/issues/detail?id=25
(No comment was entered for this change.)
Comment #3 on issue 74 by geoffers: AttributeError: 'module' object has no
attribute 'isValidEncoding'
http://code.google.com/p/html5lib/issues/detail?id=74
Why is this still open?
Updates:
Status: WontFix
Comment #1 on issue 75 by geoffers: [PATCH] Filters should pass
contentModelFlag changes to source
http://code.google.com/p/html5lib/issues/detail?id=75
Spec changes removing contentModelFlag make this irrelevant (and we'll soon
remove it).
Updates:
Status: WontFix
Comment #2 on issue 76 by geoffers: Validator complains about type and
global attrs on input tags.
http://code.google.com/p/html5lib/issues/detail?id=76
The validator was removed a few months ago due to lack of interest, so this
is wontfix
Updates:
Labels: -Priority-Medium Priority-High Milestone-Release1.0 Port-Python
Comment #1 on issue 79 by geoffers: getElementById doesn't work with minidom
http://code.google.com/p/html5lib/issues/detail?id=79
(No comment was entered for this change.)
Updates:
Labels: Port-Python
Comment #7 on issue 80 by geoffers: TypeError when serializing some pages
to BeautifulSoup
http://code.google.com/p/html5lib/issues/detail?id=80
(No comment was entered for this change.)
Updates:
Status: Accepted
Labels: Milestone-Release1.0
Comment #3 on issue 81 by geoffers: Verision info
http://code.google.com/p/html5lib/issues/detail?id=81
Mid-November of next year? :)
Updates:
Labels: -0.12-blocker Type-Defect Priority-Critical
Milestone-Release1.0
Port-Python
Comment #2 on issue 89 by geoffers: Installation using setup.py fails under
Windows
http://code.google.com/p/html5lib/issues/detail?id=89
(No comment was entered for this change.)
Updates:
Labels: -0.12-blocker Type-Defect Milestone-Release1.0 Port-Python
Comment #8 on issue 92 by geoffers: Possible to make IE run script after
roundtripping in html5lib
http://code.google.com/p/html5lib/issues/detail?id=92
(No comment was entered for this change.)
Updates:
Labels: -0.12-blocker Priority-Critical Type-Defect Milestone-Release1.0
Comment #8 on issue 93 by geoffers: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
(No comment was entered for this change.)
Updates:
Labels: Type-Enhancement Milestone-Release1.0 Priority-High
Comment #4 on issue 96 by geoffers: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
(No comment was entered for this change.)
Updates:
Labels: -0.12-blocker Priority-Critical Type-Defect Port-Python
Milestone-Release1.0
Comment #3 on issue 113 by geoffers: cannot handle mailformed attribute
names with html5lib and lxml
http://code.google.com/p/html5lib/issues/detail?id=113
(No comment was entered for this c
Updates:
Status: WontFix
Comment #2 on issue 86 by geoffers: BeautifulSoup treebuilder string
attribute is missing
http://code.google.com/p/html5lib/issues/detail?id=86
Bug in BS, as above.
Comment #1 on issue 88 by geoffers: Reading from stdin broken
http://code.google.com/p/html5lib/issues/detail?id=88
If this really is critical, can you do something about it? :P
Updates:
Labels: 0.12-blocker
Comment #1 on issue 89 by geoffers: Installation using setup.py fails under
Windows
http://code.google.com/p/html5lib/issues/detail?id=89
(No comment was entered for this change.)
Updates:
Status: Duplicate
Comment #2 on issue 90 by geoffers: All files doubled in archive
html5lib-0.11.1.zip
http://code.google.com/p/html5lib/issues/detail?id=90
(No comment was entered for this change.)
Updates:
Status: Accepted
Labels: 0.12-blocker
Comment #7 on issue 92 by geoffers: Possible to make IE run script after
roundtripping in html5lib
http://code.google.com/p/html5lib/issues/detail?id=92
(No comment was entered for this change.)
--
You received this message becaus
Updates:
Status: Accepted
Labels: 0.12-blocker
Comment #7 on issue 93 by geoffers: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
Accepted, though we still need to decide how much to quote.
Updates:
Labels: -0.12-blocker
Comment #3 on issue 96 by geoffers: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
(No comment was entered for this change.)
Updates:
Labels: 0.12-blocker
Comment #2 on issue 96 by geoffers: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
(No comment was entered for this change.)
Updates:
Status: Accepted
Comment #1 on issue 96 by geoffers: a better intToUnicodeStr
http://code.google.com/p/html5lib/issues/detail?id=96
(No comment was entered for this change.)
Updates:
Status: Accepted
Owner: ja...@hoppipolla.co.uk
Comment #1 on issue 103 by geoffers: Can't easy_install/pip install
html5lib==dev
http://code.google.com/p/html5lib/issues/detail?id=103
(No comment was entered for this change.)
Updates:
Status: WFM
Comment #1 on issue 110 by geoffers: re module has no method "find"
http://code.google.com/p/html5lib/issues/detail?id=110
(No comment was entered for this change.)
Updates:
Status: Fixed
Comment #2 on issue 116 by geoffers: treewalker escapes from subtree if
root of subtree has a next sibling
http://code.google.com/p/html5lib/issues/detail?id=116
This appears to be fixed in the latest version in hg, can you please verify
this?
Comment #2 on issue 112 by geoffers: assertion in processSpaceCharacters in
InTableTextPhase
http://code.google.com/p/html5lib/issues/detail?id=112
Is this not fixed?
Comment #1 on issue 112 by geoffers: assertion in processSpaceCharacters in
InTableTextPhase
http://code.google.com/p/html5lib/issues/detail?id=112
Is this not fixed?
Updates:
Status: Accepted
Labels: 0.12-blocker
Comment #2 on issue 113 by geoffers: cannot handle mailformed attribute
names with html5lib and lxml
http://code.google.com/p/html5lib/issues/detail?id=113
(No comment was entered for this change.)
Comment #1 on issue 113 by eromirou: cannot handle mailformed attribute
names with html5lib and lxml
http://code.google.com/p/html5lib/issues/detail?id=113
I found out that using 'sanitizer.HTMLSanitizer' as the tokenizer works
fine:
import html5lib
from html5lib import treebuilders
from ht
Updates:
Status: fixed
Comment #2 on issue 118 by geoffers: python memory leak in dom.TreeBuilder
http://code.google.com/p/html5lib/issues/detail?id=118
Committed/pushed. Thanks!
Comment #6 on issue 92 by geoffers: Possible to make IE run script after
roundtripping in html5lib
http://code.google.com/p/html5lib/issues/detail?id=92
` is now non-conforming at the start of an unquoted attribute.
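A serializer that wants to stay on the safe side can simply quote any attribute value containing a character that is problematic when unquoted. The sketch below is an assumption about how such a check might look, not html5lib's serializer: needs_quoting is a hypothetical name, and the character set (whitespace, both quote characters, =, <, >, and the backtick) is deliberately conservative, quoting whenever one of them appears anywhere in the value.

```python
import re

# Characters that make an unquoted attribute value unsafe or non-conforming:
# ASCII whitespace, double and single quotes, '=', '<', '>', and '`'.
_UNSAFE_UNQUOTED = re.compile(u"[\t\n\x0c\r \"'=<>`]")


def needs_quoting(value):
    """Hypothetical check: should this attribute value be quoted?

    Conservative by design: an empty value, or one containing any unsafe
    character at any position, is reported as needing quotes.
    """
    return value == u"" or _UNSAFE_UNQUOTED.search(value) is not None
```

With this check, a value such as u"`x" is quoted on output, while a plain token such as u"abc" can be left unquoted.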
Updates:
Status: Fixed
Comment #1 on issue 115 by geoffers: param/source parsed incorrectly in body
http://code.google.com/p/html5lib/issues/detail?id=115
Now fixed.
Updates:
Status: Fixed
Comment #4 on issue 111 by geoffers: With lxml treebuilder, with
namespaceHTMLElements=False, root element is in namespace
http://code.google.com/p/html5lib/issues/detail?id=111
Should've been fixed as of yesterday.
Updates:
Status: Invalid
Comment #1 on issue 114 by ja...@hoppipolla.co.uk: [HAS PATCH]
inject_meta_charset inserts broken meta tag
http://code.google.com/p/html5lib/issues/detail?id=114
The form we emit is valid per HTML5, works in all browsers, and considerably
easier to
author/read
Comment #1 on issue 118 by devex.soft: python memory leak in dom.TreeBuilder
http://code.google.com/p/html5lib/issues/detail?id=118
patch for dom.py
Attachments:
dom.py.patch 103 bytes
Status: New
Owner:
New issue 118 by devex.soft: python memory leak in dom.TreeBuilder
http://code.google.com/p/html5lib/issues/detail?id=118
html5lib/treebuilders/dom.py
on line 122:
def documentClass(self):
self.dom = Dom.getDOMImplementation().createDocument(None,None,None)
re
Status: Accepted
Owner:
New issue 117 by geoffers: Tokenizer tests are not JSON
http://code.google.com/p/html5lib/issues/detail?id=117
The wiki page says the tokenizer tests are JSON, but they are not, as they
contain surrogate codepoints (which are not allowed in JSON). This causes
problem
Comment #3 on issue 62 by sa3ruby: Sanitizer does not allow stripping of
tags
http://code.google.com/p/html5lib/issues/detail?id=62
My inclination is to flip this entirely. It seems inconsistent that evil
CSS is
stripped, and unknown attributes are stripped, but unknown elements are
escap
Comment #6 on issue 93 by Simetrical: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
I posted this a couple of days ago:
http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2009-September/022711.html
Comment #5 on issue 93 by zcorpan: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
I did, and Hixie rejected it saying that it's an issue that will go away
over time.
Feel free to bring it up again (citing that sites who implement the
Comment #5 on issue 92 by zcorpan: Possible to make IE run script after
roundtripping in html5lib
http://code.google.com/p/html5lib/issues/detail?id=92
No, because xmp is a RAWTEXT element. So it's equivalent to the following
XML
`>
but in IE it's equivalent to the
Comment #4 on issue 93 by geoffers: Quote attributes containing weird
whitespace or '<'
http://code.google.com/p/html5lib/issues/detail?id=93
IMO yes, just someone needs to get around to it. :)