Hello all,

Having quashed most of the regular test failures in my PHP HTML 5 tokenizer, I've started looking at the tests involving doctypes and parse errors. As I comb through my 58 remaining failures, it seems to me that my behavior is more correct than what the test cases expect. For example, "<!DOCTYPEa SYSTEM''!" in test3.test supposedly emits a doctype token that doesn't have force-quirks mode set, but my intuition and my reading of the spec say that quirks mode should be turned on for this doctype. (Upon further inspection, all of my failures stem from test3.test.)
However, I've been emailing the whatwg and html5lib-discuss lists a lot recently, and 90% of the time I've been wrong about these things. So I went ahead and ran the test suite against the Python implementation to see whether it passes 100% of test_tokenizer.py. It didn't. In fact, it didn't even finish, because I got a fatal exception: "'ascii' codec can't encode characters in position 22-23: ordinal not in range(128)". This seems pretty bad, and it's certainly not the "one test failure" mentioned in the 0.11 Known Issues. I'm running Python 2.5.2 on Windows Vista, courtesy of Cygwin.

What's up?

Cheers,
Edward
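
For context, a message of this shape generally appears when text containing non-ASCII characters is encoded with the ASCII codec; one common trigger in Python 2 is printing a unicode string to an output stream whose encoding is ASCII or unknown. The sketch below only illustrates that general mechanism and assumes nothing about where the html5lib test runner actually fails. The names try_ascii and sample are made up for illustration, and the code uses Python 3 syntax rather than the 2.5.2 mentioned above.

```python
# Hypothetical helper, not part of html5lib: attempts an ASCII-only
# encode and hands back the exception instead of crashing.
def try_ascii(text):
    try:
        return text.encode("ascii")
    except UnicodeEncodeError as exc:
        return exc

# 22 ASCII characters followed by two non-ASCII ones, so the
# offending characters sit at positions 22 and 23.
sample = "x" * 22 + "\u00e9\u00e8"

result = try_ascii(sample)
print(result)
```

If the console encoding turns out to be the cause, encoding the output explicitly (for example as UTF-8) before writing it is the usual workaround, though that is a guess until the traceback shows which line raises the error.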
