[EMAIL PROTECTED] wrote:
> I understand that the web is full of ill-formed XHTML web pages but
> this is Microsoft:
>
> http://moneycentral.msn.com/companyreport?Symbol=BBBY
>
> I can't validate it and xml.dom.minidom.parseString won't work on it.
Interestingly, no-one mentioned lxml so far:
On 4 Mar, 20:21, Nikita the Spider <[EMAIL PROTECTED]> wrote:
> In article <[EMAIL PROTECTED]>,
>
> > I can't validate it and xml.dom.minidom.parseString won't work on it.
[...]
> Valid XHTML is scarcer than hen's teeth.
It probably doesn't need to be valid: being well-formed would be
sufficient
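
To illustrate the distinction, here is a minimal sketch using only the standard library (the module path is xml.dom.minidom). The helper name is_well_formed and both sample strings are made up for illustration; they are not taken from the MSN page. The first string is well-formed but would not validate as XHTML (no DOCTYPE, no head or title), yet minidom accepts it. The second is not well-formed, so the parser rejects it.

```python
from xml.dom import minidom
from xml.parsers.expat import ExpatError


def is_well_formed(markup):
    """Return True if markup parses as XML, False otherwise.

    Hypothetical helper: it checks well-formedness only and performs
    no validation against a DTD or schema.
    """
    try:
        minidom.parseString(markup)
    except ExpatError:
        return False
    return True


# Well-formed, though not valid XHTML (no DOCTYPE, <head> or <title>).
good = "<html><body><p>BBBY</p></body></html>"

# Not well-formed: the attribute value is unquoted and <br> is never closed.
bad = "<html><body><p class=quote>BBBY<br></p></body></html>"

print(is_well_formed(good))
print(is_well_formed(bad))
```

So minidom only needs every tag closed and every attribute quoted; whether the page conforms to the XHTML DTD does not matter to it.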
[EMAIL PROTECTED] wrote:
>
>Chris> http://moneycentral.msn.com/companyreport?Symbol=BBBY
>
>Chris> I can't validate it and xml.dom.minidom.parseString won't work on
>Chris> it.
>
>Chris> If this was just some teenager's web site I'd move on. Is there
>Chris> any hope of avoiding r
P.S. Please send me 1% of all the money you make from your automated-
stock speculation program. On the other hand, if you lose money with
your program, don't bother sending me a bill.
-- Paul
On Mar 4, 11:42 am, "[EMAIL PROTECTED]"
<[EMAIL PROTECTED]> wrote:
> I understand that the web is full of ill-formed XHTML web pages but
> this is Microsoft:
>
> http://moneycentral.msn.com/companyreport?Symbol=BBBY
>
> I can't validate it and xml.dom.minidom.parseString won't work on it.
>
> If th
[EMAIL PROTECTED] wrote:
> I understand that the web is full of ill-formed XHTML web pages but
> this is Microsoft:
>
> http://moneycentral.msn.com/companyreport?Symbol=BBBY
Yes, thank you Microsoft!
> I can't validate it and xml.dom.minidom.parseString won't work on it.
>
> If this was just some
In article <[EMAIL PROTECTED]>,
"[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
> I understand that the web is full of ill-formed XHTML web pages but
> this is Microsoft:
>
> http://moneycentral.msn.com/companyreport?Symbol=BBBY
>
> I can't validate it and xml.dom.minidom.parseString won't work
"[EMAIL PROTECTED]" <[EMAIL PROTECTED]> writes:
> I understand that the web is full of ill-formed XHTML web pages but
> this is Microsoft:
Yes... And Microsoft is responsible for a lot of the ill-formed pages on the
web, be it on their website or made by their applications.
>
> http://moneycentr
Chris> http://moneycentral.msn.com/companyreport?Symbol=BBBY
Chris> I can't validate it and xml.dom.minidom.parseString won't work on
Chris> it.
Chris> If this was just some teenager's web site I'd move on. Is there
Chris> any hope of avoiding regular expression hacks to extrac
I understand that the web is full of ill-formed XHTML web pages but
this is Microsoft:
http://moneycentral.msn.com/companyreport?Symbol=BBBY
I can't validate it and xml.dom.minidom.parseString won't work on it.
If this was just some teenager's web site I'd move on. Is there any
hope of avoiding re
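
On avoiding regular expressions: one option that needs nothing outside the standard library is the tolerant html.parser module, which does not require well-formed input. The sketch below is only an illustration. The class CellCollector, the function extract_cells, and the sample fragment (including its label and number) are invented here and do not reflect the actual structure of the MSN page.

```python
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of every <td> cell, tolerating sloppy markup.

    Hypothetical helper for illustration; a real page would need
    selection logic matched to its actual layout.
    """

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.cells = []

    def handle_starttag(self, tag, attrs):
        # A new <td> starts a new cell even if the previous one was
        # never closed.
        if tag == "td":
            self.in_cell = True
            self.cells.append("")

    def handle_endtag(self, tag):
        # Closing the cell, row or table ends the current cell.
        if tag in ("td", "tr", "table"):
            self.in_cell = False

    def handle_data(self, data):
        if self.in_cell:
            self.cells[-1] += data


def extract_cells(markup):
    """Return the stripped text of each table cell in markup."""
    parser = CellCollector()
    parser.feed(markup)
    parser.close()
    return [cell.strip() for cell in parser.cells]


# Made-up fragment: unquoted attribute, unclosed <td> and <br>.
sample = "<table><tr><td class=label>Last Price<td>41.25</tr><br></table>"
print(extract_cells(sample))
```

The same fragment would make xml.dom.minidom.parseString raise an error, since it is not well-formed XML.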