Re: How use XML parsing tools on this one specific URL?

2007-03-05 Thread Stefan Behnel
[EMAIL PROTECTED] schrieb: > I understand that the web is full of ill-formed XHTML web pages but > this is Microsoft: > > http://moneycentral.msn.com/companyreport?Symbol=BBBY > > I can't validate it and xml.minidom.dom.parseString won't work on it. Interestingly, no-one mentioned lxml so far:

Re: How use XML parsing tools on this one specific URL?

2007-03-05 Thread Paul Boddie
On 4 Mar, 20:21, Nikita the Spider <[EMAIL PROTECTED]> wrote: > In article <[EMAIL PROTECTED]>, > > > I can't validate it and xml.minidom.dom.parseString won't work on it. [...] > Valid XHTML is scarcer than hen's teeth. It probably doesn't need to be valid: being well-formed would be sufficient

Re: How use XML parsing tools on this one specific URL?

2007-03-05 Thread Fredrik Lundh
[EMAIL PROTECTED] wrote: > >Chris> http://moneycentral.msn.com/companyreport?Symbol=BBBY > >Chris> I can't validate it and xml.minidom.dom.parseString won't work on >Chris> it. > >Chris> If this was just some teenager's web site I'd move on. Is there >Chris> any hope avoiding r

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread Paul McGuire
P.S. Please send me 1% of all the money you make from your automated- stock speculation program. On the other hand, if you lose money with your program, don't bother sending me a bill. -- Paul -- http://mail.python.org/mailman/listinfo/python-list

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread Paul McGuire
On Mar 4, 11:42 am, "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote: > I understand that the web is full of ill-formed XHTML web pages but > this is Microsoft: > > http://moneycentral.msn.com/companyreport?Symbol=BBBY > > I can't validate it and xml.minidom.dom.parseString won't work on it. > > If th

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread Paul Boddie
[EMAIL PROTECTED] wrote: > I understand that the web is full of ill-formed XHTML web pages but > this is Microsoft: > > http://moneycentral.msn.com/companyreport?Symbol=BBBY Yes, thank you Microsoft! > I can't validate it and xml.minidom.dom.parseString won't work on it. > > If this was just some

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread Nikita the Spider
In article <[EMAIL PROTECTED]>, "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote: > I understand that the web is full of ill-formed XHTML web pages but > this is Microsoft: > > http://moneycentral.msn.com/companyreport?Symbol=BBBY > > I can't validate it and xml.minidom.dom.parseString won't work

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread Jorge Godoy
"[EMAIL PROTECTED]" <[EMAIL PROTECTED]> writes: > I understand that the web is full of ill-formed XHTML web pages but > this is Microsoft: Yes... And Microsoft is responsible for a lot of the ill-formed pages on the web be it on their website or made by their applications. > > http://moneycentr

Re: How use XML parsing tools on this one specific URL?

2007-03-04 Thread skip
Chris> http://moneycentral.msn.com/companyreport?Symbol=BBBY Chris> I can't validate it and xml.minidom.dom.parseString won't work on Chris> it. Chris> If this was just some teenager's web site I'd move on. Is there Chris> any hope avoiding regular expression hacks to extrac

How use XML parsing tools on this one specific URL?

2007-03-04 Thread [EMAIL PROTECTED]
I understand that the web is full of ill-formed XHTML web pages but this is Microsoft: http://moneycentral.msn.com/companyreport?Symbol=BBBY I can't validate it and xml.minidom.dom.parseString won't work on it. If this was just some teenager's web site I'd move on. Is there any hope avoiding re