Re: Issue 120 in html5lib: Deprecate BeautifulSoup

2010-01-17 Thread codesite-noreply
Updates: Status: Fixed Comment #1 on issue 120 by ja...@hoppipolla.co.uk: Deprecate BeautifulSoup http://code.google.com/p/html5lib/issues/detail?id=120 Added a warning when bs is used -- You received this message because you are listed in the owner or CC fields of this issue, or becaus

Re: Issue 132 in html5lib: html5lib maximum recursion depth with broken starting object tag

2010-01-11 Thread codesite-noreply
Comment #1 on issue 132 by EmilStenstrom: html5lib maximum recursion depth with broken starting object tag http://code.google.com/p/html5lib/issues/detail?id=132 Found in the wild at: http://www.webfinanser.com/ -- You received this message because you are listed in the owner or CC fields of

Re: Issue 131 in html5lib: Failure with custom namespace elements (PHP lib)

2010-01-06 Thread codesite-noreply
Comment #3 on issue 131 by Ysbaddaden: Failure with custom namespace elements (PHP lib) http://code.google.com/p/html5lib/issues/detail?id=131 I heavily simplified the patch, which was wrong as geoffers pointed out. I changed Treebuilder::insertElement() to create an element with "http://www

Re: Issue 131 in html5lib: Failure with custom namespace elements (PHP lib)

2010-01-05 Thread codesite-noreply
Comment #2 on issue 131 by ezy...@mit.edu: Failure with custom namespace elements (PHP lib) http://code.google.com/p/html5lib/issues/detail?id=131 I did some experimentation with XML namespaces and PHP's DOM implementation a while back, and I think I decided that the situation was absolutel

Re: Issue 131 in html5lib: Failure with custom namespace elements (PHP lib)

2010-01-05 Thread codesite-noreply
Updates: Status: Accepted Labels: Port-PHP Comment #1 on issue 131 by geoffers: Failure with custom namespace elements (PHP lib) http://code.google.com/p/html5lib/issues/detail?id=131 That patch is wrong: per the HTML 5 spec, that should create an element whose localname is

Issue 131 in html5lib: Failure with custom namespace elements (PHP lib)

2010-01-05 Thread codesite-noreply
Status: New Owner: New issue 131 by Ysbaddaden: Failure with custom namespace elements (PHP lib) http://code.google.com/p/html5lib/issues/detail?id=131 Using a custom namespaced element (apart from the hardcoded SVG and MathML namespaces) causes a failure with DOMDocument::createElementNS(

Re: Issue 96 in html5lib: a better intToUnicodeStr

2010-01-03 Thread codesite-noreply
Updates: Status: Fixed Comment #6 on issue 96 by ja...@hoppipolla.co.uk: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields of this issue, o

Re: Issue 121 in html5lib: nonXmlBMPRegexp is totally bogus

2010-01-03 Thread codesite-noreply
Comment #2 on issue 121 by ja...@hoppipolla.co.uk: nonXmlBMPRegexp is totally bogus http://code.google.com/p/html5lib/issues/detail?id=121 Is this any less bogus now? -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue.

Re: Issue 113 in html5lib: cannot handle mailformed attribute names with html5lib and lxml

2010-01-03 Thread codesite-noreply
Updates: Status: Fixed Comment #5 on issue 113 by ja...@hoppipolla.co.uk: cannot handle mailformed attribute names with html5lib and lxml http://code.google.com/p/html5lib/issues/detail?id=113 This should be fixed now. -- You received this message because you are listed in the owner or

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2010-01-02 Thread codesite-noreply
Updates: Status: Fixed Comment #7 on issue 129 by geoffers: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 Fixed in a848a1225f. -- You received this message because you are listed in the owner or CC fields of this issue, or because you star

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2010-01-02 Thread codesite-noreply
Updates: Status: Started Owner: geoffers Comment #6 on issue 129 by geoffers: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 Now I get an error. Fun. -- You received this message because you are listed in the owner or CC fields of t

Re: Issue 125 in html5lib: InfosetFilter.toXmlName doesn't filter first character properly

2010-01-02 Thread codesite-noreply
Updates: Status: Duplicate Mergedinto: 113 Comment #2 on issue 125 by geoffers: InfosetFilter.toXmlName doesn't filter first character properly http://code.google.com/p/html5lib/issues/detail?id=125 (No comment was entered for this change.) -- You received this message because

Re: Issue 113 in html5lib: cannot handle mailformed attribute names with html5lib and lxml

2010-01-02 Thread codesite-noreply
Comment #4 on issue 113 by geoffers: cannot handle mailformed attribute names with html5lib and lxml http://code.google.com/p/html5lib/issues/detail?id=113 Issue 125 has been merged into this issue. -- You received this message because you are listed in the owner or CC fields of this issue, or

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2010-01-02 Thread codesite-noreply
Comment #5 on issue 129 by geoffers: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 I get no error here. -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue. You may adjust

Re: Issue 130 in html5lib: Genshi Tree Walker Broken

2010-01-02 Thread codesite-noreply
Comment #1 on issue 130 by geoffers: Genshi Tree Walker Broken http://code.google.com/p/html5lib/issues/detail?id=130 I can't reproduce this here. What version of Genshi are you using? -- You received this message because you are listed in the owner or CC fields of this issue, or because you sta

Re: Issue 62 in html5lib: Sanitizer does not allow stripping of tags

2009-12-29 Thread codesite-noreply
Comment #4 on issue 62 by chris.vigelius: Sanitizer does not allow stripping of tags http://code.google.com/p/html5lib/issues/detail?id=62 To Batiste (and anyone else this may concern): It seems you can achieve this behaviour by overriding HtmlSanitizer - see attached example. Note this has NO

Re: Issue 122 in html5lib: Comments beginning a file crashes the xml parser

2009-12-24 Thread codesite-noreply
Comment #4 on issue 122 by Christian.Oudard: Comments beginning a file crashes the xml parser http://code.google.com/p/html5lib/issues/detail?id=122 Thank you for the link to XML5, I realize now that I was using html5lib in place of a standalone liberal XML parser. -- You received this messa

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2009-12-24 Thread codesite-noreply
Comment #4 on issue 129 by ja...@hoppipolla.co.uk: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 The reason for the crash here is that when we hit a meta tag, we assume that the value of the content attribute can always be encoded in the same e

Re: Issue 119 in html5lib: Update to LC spec

2009-12-24 Thread codesite-noreply
Updates: Status: Fixed Comment #1 on issue 119 by ja...@hoppipolla.co.uk: Update to LC spec http://code.google.com/p/html5lib/issues/detail?id=119 This meta bug isn't really that useful. We now pass almost all the tests we have. Sure more tests would be nice. -- You received this messa

Re: Issue 122 in html5lib: Comments beginning a file crashes the xml parser

2009-12-24 Thread codesite-noreply
Comment #3 on issue 122 by ja...@hoppipolla.co.uk: Comments beginning a file crashes the xml parser http://code.google.com/p/html5lib/issues/detail?id=122 It's not a good idea to use the liberal XML parser to parse HTML. If you want to parse HTML you should use the HTML parser. If you want to

Re: Issue 122 in html5lib: Comments beginning a file crashes the xml parser

2009-12-24 Thread codesite-noreply
Comment #2 on issue 122 by Christian.Oudard: Comments beginning a file crashes the xml parser http://code.google.com/p/html5lib/issues/detail?id=122 This is very disappointing, because a solid parser is the backbone of any html library. I can't maintain the library, but I may be able to provi

Re: Issue 112 in html5lib: assertion in processSpaceCharacters in InTableTextPhase

2009-12-24 Thread codesite-noreply
Updates: Status: WFM Comment #3 on issue 112 by ja...@hoppipolla.co.uk: assertion in processSpaceCharacters in InTableTextPhase http://code.google.com/p/html5lib/issues/detail?id=112 I think this is fixed. -- You received this message because you are listed in the owner or CC fields of

Re: Issue 123 in html5lib: simpletree cloneNode only works for Elements (+patch)

2009-12-24 Thread codesite-noreply
Updates: Status: Fixed Comment #1 on issue 123 by ja...@hoppipolla.co.uk: simpletree cloneNode only works for Elements (+patch) http://code.google.com/p/html5lib/issues/detail?id=123 Thanks for the patch and sorry for the delay. -- You received this message because you are listed in th

Re: Issue 81 in html5lib: Verision info

2009-12-24 Thread codesite-noreply
Updates: Status: Fixed Comment #4 on issue 81 by ja...@hoppipolla.co.uk: Verision info http://code.google.com/p/html5lib/issues/detail?id=81 I fixed this -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue. You ma

Re: Issue 122 in html5lib: Comments beginning a file crashes the xml parser

2009-12-24 Thread codesite-noreply
Updates: Status: WontFix Comment #1 on issue 122 by ja...@hoppipolla.co.uk: Comments beginning a file crashes the xml parser http://code.google.com/p/html5lib/issues/detail?id=122 The liberal XML parser is no longer being developed. Unless someone steps up to maintain this module, thi

Re: Issue 101 in html5lib: DeprecationWarning for inputstream.py

2009-12-09 Thread codesite-noreply
Comment #5 on issue 101 by starsareblueandfaraway: DeprecationWarning for inputstream.py http://code.google.com/p/html5lib/issues/detail?id=101 I am having the same issue in Python 2.6 with html5lib. Thanks for making this library. File "build/bdist.linux-i686/egg/html5lib/inputstream.p

Re: Issue 86 in html5lib: BeautifulSoup treebuilder string attribute is missing

2009-12-03 Thread codesite-noreply
Comment #4 on issue 86 by aaron.devore: BeautifulSoup treebuilder string attribute is missing http://code.google.com/p/html5lib/issues/detail?id=86 The Tag.string bug is now fixed in Beautiful Soup as of version 3.0.8, which was released on November 30 (3 days ago). The bug hasn't been fixed

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2009-11-26 Thread codesite-noreply
Comment #3 on issue 129 by EmilStenstrom: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 I workaround that seems to work is to simple replace all ”-characters with double quotes before parsing. Easy as that: f = f.replace('”', '"') -- You rece

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2009-11-26 Thread codesite-noreply
Comment #2 on issue 129 by EmilStenstrom: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 This is also seen at: www.etc.se -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this is

Issue 130 in html5lib: Genshi Tree Walker Broken

2009-11-26 Thread codesite-noreply
Status: New Owner: New issue 130 by armin.ro...@active-4.com: Genshi Tree Walker Broken http://code.google.com/p/html5lib/issues/detail?id=130 Currently the genshi tree walker does not work. It fails here: Traceback (most recent call last): ... File "inmhpc\html.py", line 28, in par

Re: Issue 128 in html5lib: Crash when parsing strangely nested form and table tags

2009-11-26 Thread codesite-noreply
Updates: Status: Fixed Comment #2 on issue 128 by ja...@hoppipolla.co.uk: Crash when parsing strangely nested form and table tags http://code.google.com/p/html5lib/issues/detail?id=128 (No comment was entered for this change.) -- You received this message because you are listed in the

Re: Issue 126 in html5lib: Dom treebuilder crashes with unclosed font tag

2009-11-26 Thread codesite-noreply
Updates: Status: Fixed Comment #3 on issue 126 by ja...@hoppipolla.co.uk: Dom treebuilder crashes with unclosed font tag http://code.google.com/p/html5lib/issues/detail?id=126 (No comment was entered for this change.) -- You received this message because you are listed in the owner or

Re: Issue 127 in html5lib: html5lib crashes with open table tag and empty line

2009-11-23 Thread codesite-noreply
Comment #3 on issue 127 by EmilStenstrom: html5lib crashes with open table tag and empty line http://code.google.com/p/html5lib/issues/detail?id=127 Thanks for the work, but now I get a new error: C:\Emils\Kod\sammanfatta\fetch\management\commands>randomtask.py Traceback (most recent call last

Re: Issue 126 in html5lib: Dom treebuilder crashes with unclosed font tag

2009-11-23 Thread codesite-noreply
Comment #2 on issue 126 by EmilStenstrom: Dom treebuilder crashes with unclosed font tag http://code.google.com/p/html5lib/issues/detail?id=126 Another testcase that gives the same error with a slightly different stacktrace: Stacktrace: C:\Emils\Kod\sammanfatta\fetch\management\commands>ra

Re: Issue 129 in html5lib: Crash when parsing windows-style quotes

2009-11-23 Thread codesite-noreply
Comment #1 on issue 129 by EmilStenstrom: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 Found in the wild: www.enytt.com -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this is

Issue 129 in html5lib: Crash when parsing windows-style quotes

2009-11-23 Thread codesite-noreply
Status: New Owner: New issue 129 by EmilStenstrom: Crash when parsing windows-style quotes http://code.google.com/p/html5lib/issues/detail?id=129 What steps will reproduce the problem? # Example code # -*- coding: utf-8 -*- import html5lib parser = html5lib.HTMLParser() doc = parser.parse(""

Re: Issue 128 in html5lib: Crash when parsing strangely nested form and table tags

2009-11-23 Thread codesite-noreply
Comment #1 on issue 128 by EmilStenstrom: Crash when parsing strangely nested form and table tags http://code.google.com/p/html5lib/issues/detail?id=128 Found in the wild: http://www.gourmet.se/ -- You received this message because you are listed in the owner or CC fields of this issue, or be

Re: Issue 127 in html5lib: html5lib crashes with open table tag and empty line

2009-11-23 Thread codesite-noreply
Updates: Status: Fixed Comment #2 on issue 127 by ja...@hoppipolla.co.uk: html5lib crashes with open table tag and empty line http://code.google.com/p/html5lib/issues/detail?id=127 Should be fixed in latest tip Thanks for all the great bug reports btw. -- You received this message be

Issue 128 in html5lib: Crash when parsing strangely nested form and table tags

2009-11-22 Thread codesite-noreply
Status: New Owner: New issue 128 by EmilStenstrom: Crash when parsing strangely nested form and table tags http://code.google.com/p/html5lib/issues/detail?id=128 What steps will reproduce the problem? # Example code import html5lib parser = html5lib.HTMLParser() doc = parser.parse("") St

Re: Issue 127 in html5lib: html5lib crashes with open table tag and empty line

2009-11-22 Thread codesite-noreply
Comment #1 on issue 127 by EmilStenstrom: html5lib crashes with open table tag and empty line http://code.google.com/p/html5lib/issues/detail?id=127 Edit: 7harad2.html should of course be free from quotes, just a table tag followed by an empty line -- You received this message because you a

Issue 127 in html5lib: html5lib crashes with open table tag and empty line

2009-11-22 Thread codesite-noreply
Status: New Owner: New issue 127 by EmilStenstrom: html5lib crashes with open table tag and empty line http://code.google.com/p/html5lib/issues/detail?id=127 What steps will reproduce the problem? # Example code import html5lib f = open("7harad2.html") parser = html5lib.HTMLParser() doc =

Re: Issue 126 in html5lib: Dom treebuilder crashes with unclosed font tag

2009-11-22 Thread codesite-noreply
Comment #1 on issue 126 by EmilStenstrom: Dom treebuilder crashes with unclosed font tag http://code.google.com/p/html5lib/issues/detail?id=126 Found in the wild at: http://7-harad.nu/ -- You received this message because you are listed in the owner or CC fields of this issue, or because you

Issue 126 in html5lib: Dom treebuilder crashes with unclosed font tag

2009-11-22 Thread codesite-noreply
Status: New Owner: New issue 126 by EmilStenstrom: Dom treebuilder crashes with unclosed font tag http://code.google.com/p/html5lib/issues/detail?id=126 What steps will reproduce the problem? # Example code import html5lib parser = html5lib.HTMLParser(tree=html5lib.treebuilders.getTreeB

Re: Issue 124 in html5lib: Crash when parsing three swedish news sites with beautifulsoup treebuilder

2009-11-22 Thread codesite-noreply
Updates: Status: Duplicate Mergedinto: 80 Comment #3 on issue 124 by excors: Crash when parsing three swedish news sites with beautifulsoup treebuilder http://code.google.com/p/html5lib/issues/detail?id=124 (No comment was entered for this change.) -- You received this message

Re: Issue 80 in html5lib: TypeError when serializing some pages to BeautifulSoup

2009-11-22 Thread codesite-noreply
Comment #8 on issue 80 by excors: TypeError when serializing some pages to BeautifulSoup http://code.google.com/p/html5lib/issues/detail?id=80 Issue 124 has been merged into this issue. -- You received this message because you are listed in the owner or CC fields of this issue, or because you

Re: Issue 125 in html5lib: InfosetFilter.toXmlName doesn't filter first character properly

2009-11-22 Thread codesite-noreply
Comment #1 on issue 125 by EmilStenstrom: InfosetFilter.toXmlName doesn't filter first character properly http://code.google.com/p/html5lib/issues/detail?id=125 Example where this is encountered in the wild: http://www.allehanda.se -- You received this message because you are listed in the ow

Issue 125 in html5lib: InfosetFilter.toXmlName doesn't filter first character properly

2009-11-22 Thread codesite-noreply
Status: Accepted Owner: excors New issue 125 by excors: InfosetFilter.toXmlName doesn't filter first character properly http://code.google.com/p/html5lib/issues/detail?id=125 echo '' | ./parse.py - --treebuilder=lxml ValueError: Invalid attribute name u'0' -- You received this message becaus

Re: Issue 124 in html5lib: Crash when parsing three swedish news sites with beautifulsoup treebuilder

2009-11-22 Thread codesite-noreply
Comment #2 on issue 124 by EmilStenstrom: Crash when parsing three swedish news sites with beautifulsoup treebuilder http://code.google.com/p/html5lib/issues/detail?id=124 Also happens on the latest source chekout, but with an extra DataLossWarning: C:\Program Files (x86)\python\lib\site-

Re: Issue 124 in html5lib: Crash when parsing three swedish news sites with beautifulsoup treebuilder

2009-11-22 Thread codesite-noreply
Comment #1 on issue 124 by EmilStenstrom: Crash when parsing three swedish news sites with beautifulsoup treebuilder http://code.google.com/p/html5lib/issues/detail?id=124 Bug does not appear with the "dom" treebuilder. -- You received this message because you are listed in the owner or CC fi

Issue 123 in html5lib: simpletree cloneNode only works for Elements (+patch)

2009-11-08 Thread codesite-noreply
Status: New Owner: New issue 123 by rnairn: simpletree cloneNode only works for Elements (+patch) http://code.google.com/p/html5lib/issues/detail?id=123 Seems wrong to define it in the base class when it has to be different in each subclass. Attachments: simpletree-clone.patch

Issue 122 in html5lib: Comments beginning a file crashes the xml parser

2009-11-05 Thread codesite-noreply
Status: New Owner: New issue 122 by Christian.Oudard: Comments beginning a file crashes the xml parser http://code.google.com/p/html5lib/issues/detail?id=122 What steps will reproduce the problem? >>> import html5lib >>> html5lib.XMLParser().parse('') Traceback (most recent call last):

Re: Issue 101 in html5lib: DeprecationWarning for inputstream.py

2009-11-01 Thread codesite-noreply
Comment #4 on issue 101 by schmi...@wsucougars.com: DeprecationWarning for inputstream.py http://code.google.com/p/html5lib/issues/detail?id=101 This is not fixed on the latest release. I am having the same issue using it on Ubuntu's Karmic Koala. Any suggestions to getting this working? --

Re: Issue 121 in html5lib: nonXmlBMPRegexp is totally bogus

2009-11-01 Thread codesite-noreply
Updates: Labels: Milestone-Release1.0 Type-Defect Port-Python Comment #1 on issue 121 by ja...@hoppipolla.co.uk: nonXmlBMPRegexp is totally bogus http://code.google.com/p/html5lib/issues/detail?id=121 (No comment was entered for this change.) -- You received this message because you a

Re: Issue 96 in html5lib: a better intToUnicodeStr

2009-11-01 Thread codesite-noreply
Updates: Labels: -Milestone-Release1.0 -Priority-High Priority-Low Comment #5 on issue 96 by ja...@hoppipolla.co.uk: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 This should be fixed but the code is never actually run in a release, it is just used to g

Issue 121 in html5lib: nonXmlBMPRegexp is totally bogus

2009-11-01 Thread codesite-noreply
Status: Accepted Owner: excors New issue 121 by excors: nonXmlBMPRegexp is totally bogus http://code.google.com/p/html5lib/issues/detail?id=121 It says: nonXmlBMPRegexp = re.compile(u'[\x00-,|/|:-@|[-^|`|{-\xb6|\xb8- \xbf|\xd7|\xf7|...') which is wrong (e.g. it matches '[' but not

Re: Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-10-30 Thread codesite-noreply
Comment #9 on issue 93 by geoffers: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 I don't think we need to try and get the spec to quote anything else. -- You received this message because you are listed in the owner or CC fields of

Issue 86 in html5lib: BeautifulSoup treebuilder string attribute is missing

2009-10-28 Thread codesite-noreply
Comment #3 on issue 86 by aaron.devore: BeautifulSoup treebuilder string attribute is missing http://code.google.com/p/html5lib/issues/detail?id=86 This bug will be fixed in the next version of Beautiful Soup in the 3.0.x series. Note that Beautiful Soup support will be deprecated as of vers

Issue 120 in html5lib: Deprecate BeautifulSoup

2009-10-28 Thread codesite-noreply
Status: Accepted Owner: geoffers Labels: Type-Defect Milestone-Release1.0 Priority-Critical Port-Python New issue 120 by geoffers: Deprecate BeautifulSoup http://code.google.com/p/html5lib/issues/detail?id=120 As I discussed with jgraham, we'll deprecate BS in 1.0 as it cannot correctly repres

Issue 119 in html5lib: Update to LC spec

2009-10-28 Thread codesite-noreply
Status: Accepted Owner: geoffers Labels: Type-Enhancement Milestone-Release1.0 Priority-Critical Port-Python NeedsTests New issue 119 by geoffers: Update to LC spec http://code.google.com/p/html5lib/issues/detail?id=119 We need to update the parser (tokenizer, tree-construction, resetting the

Issue 25 in html5lib: Lack of trailing empty line generates a parse error

2009-10-28 Thread codesite-noreply
Updates: Status: Invalid Comment #2 on issue 25 by geoffers: Lack of trailing empty line generates a parse error http://code.google.com/p/html5lib/issues/detail?id=25 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC field

Issue 74 in html5lib: AttributeError: 'module' object has no attribute 'isValidEncoding'

2009-10-28 Thread codesite-noreply
Comment #3 on issue 74 by geoffers: AttributeError: 'module' object has no attribute 'isValidEncoding' http://code.google.com/p/html5lib/issues/detail?id=74 Why is this still open? -- You received this message because you are listed in the owner or CC fields of this issue, or because you star

Issue 75 in html5lib: [PATCH] Filters should pass contentModelFlag changes to source

2009-10-28 Thread codesite-noreply
Updates: Status: WontFix Comment #1 on issue 75 by geoffers: [PATCH] Filters should pass contentModelFlag changes to source http://code.google.com/p/html5lib/issues/detail?id=75 Spec changes removing contentModelFlag make this irrelevant (and we'll soon remove it). -- You received t

Issue 76 in html5lib: Validator complains about type and global attrs on input tags.

2009-10-28 Thread codesite-noreply
Updates: Status: WontFix Comment #2 on issue 76 by geoffers: Validator complains about type and global attrs on input tags. http://code.google.com/p/html5lib/issues/detail?id=76 The validator was removed a few months ago due to lack of interest, so this is wontfix -- You received th

Issue 79 in html5lib: getElementById doesn't work with minidom

2009-10-28 Thread codesite-noreply
Updates: Labels: -Priority-Medium Priority-High Milestone-Release1.0 Port-Python Comment #1 on issue 79 by geoffers: getElementById doesn't work with minidom http://code.google.com/p/html5lib/issues/detail?id=79 (No comment was entered for this change.) -- You received this message beca

Issue 80 in html5lib: TypeError when serializing some pages to BeautifulSoup

2009-10-28 Thread codesite-noreply
Updates: Labels: Port-Python Comment #7 on issue 80 by geoffers: TypeError when serializing some pages to BeautifulSoup http://code.google.com/p/html5lib/issues/detail?id=80 (No comment was entered for this change.) -- You received this message because you are listed in the owner or C

Issue 81 in html5lib: Verision info

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Labels: Milestone-Release1.0 Comment #3 on issue 81 by geoffers: Verision info http://code.google.com/p/html5lib/issues/detail?id=81 Mid-November of next year? :) -- You received this message because you are listed in the owner or CC fields of this issu

Issue 89 in html5lib: Installation using setup.py fails under Windows

2009-10-28 Thread codesite-noreply
Updates: Labels: -0.12-blocker Type-Defect Priority-Critical Milestone-Release1.0 Port-Python Comment #2 on issue 89 by geoffers: Installation using setup.py fails under Windows http://code.google.com/p/html5lib/issues/detail?id=89 (No comment was entered for this change.) -- You r

Issue 92 in html5lib: Possible to make IE run script after roundtripping in html5lib

2009-10-28 Thread codesite-noreply
Updates: Labels: -0.12-blocker Type-Defect Milestone-Release1.0 Port-Python Comment #8 on issue 92 by geoffers: Possible to make IE run script after roundtripping in html5lib http://code.google.com/p/html5lib/issues/detail?id=92 (No comment was entered for this change.) -- You receive

Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-10-28 Thread codesite-noreply
Updates: Labels: -0.12-blocker Priority-Critical Type-Defect Milestone-Release1.0 Comment #8 on issue 93 by geoffers: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 (No comment was entered for this change.) -- You received thi

Issue 96 in html5lib: a better intToUnicodeStr

2009-10-28 Thread codesite-noreply
Updates: Labels: Type-Enhancement Milestone-Release1.0 Priority-High Comment #4 on issue 96 by geoffers: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 (No comment was entered for this change.) -- You received this message because you are listed in the ow

Issue 113 in html5lib: cannot handle mailformed attribute names with html5lib and lxml

2009-10-28 Thread codesite-noreply
Updates: Labels: -0.12-blocker Priority-Critical Type-Defect Port-Python Milestone-Release1.0 Comment #3 on issue 113 by geoffers: cannot handle mailformed attribute names with html5lib and lxml http://code.google.com/p/html5lib/issues/detail?id=113 (No comment was entered for this c

Issue 86 in html5lib: BeautifulSoup treebuilder string attribute is missing

2009-10-28 Thread codesite-noreply
Updates: Status: WontFix Comment #2 on issue 86 by geoffers: BeautifulSoup treebuilder string attribute is missing http://code.google.com/p/html5lib/issues/detail?id=86 Bug in BS, as above. -- You received this message because you are listed in the owner or CC fields of this issue, or

Issue 88 in html5lib: Reading from stdin broken

2009-10-28 Thread codesite-noreply
Comment #1 on issue 88 by geoffers: Reading from stdin broken http://code.google.com/p/html5lib/issues/detail?id=88 If this really is critical, can you do something about it? :P -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred t

Issue 89 in html5lib: Installation using setup.py fails under Windows

2009-10-28 Thread codesite-noreply
Updates: Labels: 0.12-blocker Comment #1 on issue 89 by geoffers: Installation using setup.py fails under Windows http://code.google.com/p/html5lib/issues/detail?id=89 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fiel

Issue 90 in html5lib: All files doubled in archive html5lib-0.11.1.zip

2009-10-28 Thread codesite-noreply
Updates: Status: Duplicate Comment #2 on issue 90 by geoffers: All files doubled in archive html5lib-0.11.1.zip http://code.google.com/p/html5lib/issues/detail?id=90 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields

Issue 92 in html5lib: Possible to make IE run script after roundtripping in html5lib

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Labels: 0.12-blocker Comment #7 on issue 92 by geoffers: Possible to make IE run script after roundtripping in html5lib http://code.google.com/p/html5lib/issues/detail?id=92 (No comment was entered for this change.) -- You received this message becaus

Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Labels: 0.12-blocker Comment #7 on issue 93 by geoffers: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 Accepted, though we still need to decide how much to quote. -- You received this message

Issue 96 in html5lib: a better intToUnicodeStr

2009-10-28 Thread codesite-noreply
Updates: Labels: -0.12-blocker Comment #3 on issue 96 by geoffers: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields of this issue, or bec

Issue 96 in html5lib: a better intToUnicodeStr

2009-10-28 Thread codesite-noreply
Updates: Labels: 0.12-blocker Comment #2 on issue 96 by geoffers: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields of this issue, or beca

Issue 96 in html5lib: a better intToUnicodeStr

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Comment #1 on issue 96 by geoffers: a better intToUnicodeStr http://code.google.com/p/html5lib/issues/detail?id=96 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields of this issue, or because

Issue 103 in html5lib: Can't easy_install/pip install html5lib==dev

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Owner: ja...@hoppipolla.co.uk Comment #1 on issue 103 by geoffers: Can't easy_install/pip install html5lib==dev http://code.google.com/p/html5lib/issues/detail?id=103 (No comment was entered for this change.) -- You received this message because you a

Issue 110 in html5lib: re module has no method "find"

2009-10-28 Thread codesite-noreply
Updates: Status: WFM Comment #1 on issue 110 by geoffers: re module has no method "find" http://code.google.com/p/html5lib/issues/detail?id=110 (No comment was entered for this change.) -- You received this message because you are listed in the owner or CC fields of this issue, or becau

Issue 116 in html5lib: treewalker escapes from subtree if root of subtree has a next sibling

2009-10-28 Thread codesite-noreply
Updates: Status: Fixed Comment #2 on issue 116 by geoffers: treewalker escapes from subtree if root of subtree has a next sibling http://code.google.com/p/html5lib/issues/detail?id=116 This appears to be fixed in the latest version in hg, can you please verify this? -- You received

Issue 112 in html5lib: assertion in processSpaceCharacters in InTableTextPhase

2009-10-28 Thread codesite-noreply
Comment #2 on issue 112 by geoffers: assertion in processSpaceCharacters in InTableTextPhase http://code.google.com/p/html5lib/issues/detail?id=112 Is this not fixed? -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue

Issue 112 in html5lib: assertion in processSpaceCharacters in InTableTextPhase

2009-10-28 Thread codesite-noreply
Comment #1 on issue 112 by geoffers: assertion in processSpaceCharacters in InTableTextPhase http://code.google.com/p/html5lib/issues/detail?id=112 Is this not fixed? -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue

Issue 113 in html5lib: cannot handle mailformed attribute names with html5lib and lxml

2009-10-28 Thread codesite-noreply
Updates: Status: Accepted Labels: 0.12-blocker Comment #2 on issue 113 by geoffers: cannot handle mailformed attribute names with html5lib and lxml http://code.google.com/p/html5lib/issues/detail?id=113 (No comment was entered for this change.) -- You received this message bec

Issue 113 in html5lib: cannot handle mailformed attribute names with html5lib and lxml

2009-10-27 Thread codesite-noreply
Comment #1 on issue 113 by eromirou: cannot handle mailformed attribute names with html5lib and lxml http://code.google.com/p/html5lib/issues/detail?id=113 I found out that using 'sanitizer.HTMLSanitizer' as the tokenizer works fine: import html5lib from html5lib import treebuilders from ht

Issue 118 in html5lib: python memory leak in dom.TreeBuilder

2009-10-18 Thread codesite-noreply
Updates: Status: fixed Comment #2 on issue 118 by geoffers: python memory leak in dom.TreeBuilder http://code.google.com/p/html5lib/issues/detail?id=118 Committed/pushed. Thanks! -- You received this message because you are listed in the owner or CC fields of this issue, or because you

Issue 92 in html5lib: Possible to make IE run script after roundtripping in html5lib

2009-10-18 Thread codesite-noreply
Comment #6 on issue 92 by geoffers: Possible to make IE run script after roundtripping in html5lib http://code.google.com/p/html5lib/issues/detail?id=92 ` is now non-conforming at the start of an unquoted attribute. -- You received this message because you are listed in the owner or CC fields

Issue 115 in html5lib: param/source parsed incorrectly in body

2009-10-18 Thread codesite-noreply
Updates: Status: Fixed Comment #1 on issue 115 by geoffers: param/source parsed incorrectly in body http://code.google.com/p/html5lib/issues/detail?id=115 Now fixed. -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this

Issue 111 in html5lib: With lxml treebuilder, with namespaceHTMLElements=False, root element is in namespace

2009-10-18 Thread codesite-noreply
Updates: Status: Fixed Comment #4 on issue 111 by geoffers: With lxml treebuilder, with namespaceHTMLElements=False, root element is in namespace http://code.google.com/p/html5lib/issues/detail?id=111 Should've been fixed as of yesterday. -- You received this message because you are l

Issue 114 in html5lib: [HAS PATCH] inject_meta_charset inserts broken meta tag

2009-10-11 Thread codesite-noreply
Updates: Status: Invalid Comment #1 on issue 114 by ja...@hoppipolla.co.uk: [HAS PATCH] inject_meta_charset inserts broken meta tag http://code.google.com/p/html5lib/issues/detail?id=114 The form we emit is valid per HTML5, woks in all browsers, and considerably easier to author/read

Issue 118 in html5lib: python memory leak in dom.TreeBuilder

2009-10-11 Thread codesite-noreply
Comment #1 on issue 118 by devex.soft: python memory leak in dom.TreeBuilder http://code.google.com/p/html5lib/issues/detail?id=118 patch for dom.py Attachments: dom.py.patch 103 bytes -- You received this message because you are listed in the owner or CC fields of this issue, or beca

Issue 118 in html5lib: python memory leak in dom.TreeBuilder

2009-10-08 Thread codesite-noreply
Status: New Owner: New issue 118 by devex.soft: python memory leak in dom.TreeBuilder http://code.google.com/p/html5lib/issues/detail?id=118 html5lib/treebuilders/dom.py on line 122: def documentClass(self): self.dom = Dom.getDOMImplementation().createDocument(None,None,None) re

Issue 117 in html5lib: Tokenizer tests are not JSON

2009-09-17 Thread codesite-noreply
Status: Accepted Owner: New issue 117 by geoffers: Tokenizer tests are not JSON http://code.google.com/p/html5lib/issues/detail?id=117 The wiki page says the tokenizer tests are JSON, but they are not, as they contain surrogate codepoints (which are not allowed in JSON). This causes problem

Issue 62 in html5lib: Sanitizer does not allow stripping of tags

2009-09-09 Thread codesite-noreply
Comment #3 on issue 62 by sa3ruby: Sanitizer does not allow stripping of tags http://code.google.com/p/html5lib/issues/detail?id=62 My inclination is to flip this entirely. It seems inconsistent that evil CSS is stripped, and unknown attributes are stripped, but unknown elements are escap

Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-09-07 Thread codesite-noreply
Comment #6 on issue 93 by Simetrical: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 I posted this a couple of days ago: http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2009-September/022711.html -- You received this message beca

Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-09-06 Thread codesite-noreply
Comment #5 on issue 93 by zcorpan: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 I did, and Hixie rejected it saying that it's an issue that will go away over time. Feel free to bring it up again (citing that sites who implement the

Issue 92 in html5lib: Possible to make IE run script after roundtripping in html5lib

2009-09-06 Thread codesite-noreply
Comment #5 on issue 92 by zcorpan: Possible to make IE run script after roundtripping in html5lib http://code.google.com/p/html5lib/issues/detail?id=92 No, because xmp is a RAWTEXT element. So it's equivalent to the following XML `> but in IE it's equivalent to the

Issue 93 in html5lib: Quote attributes containing weird whitespace or '<'

2009-09-05 Thread codesite-noreply
Comment #4 on issue 93 by geoffers: Quote attributes containing weird whitespace or '<' http://code.google.com/p/html5lib/issues/detail?id=93 IMO yes, just someone needs to get around to it. :) -- You received this message because you are listed in the owner or CC fields of this issue, or bec

  1   2   3   >