Tangentially, Dubai Airport has just changed its Web site in a manner that 
makes it far harder to parse, and among other things replaces the names of 
airlines with little GIF logos. Oh yes, and where once each flight was a tr 
with class "data-row", now 50% of them and the others are trs with no class 
attribute and some random colour value instead. Grrr.

This resulted:

output = [[td.string or td.img["src"] for td in tr.findAll(True) if td.img or 
td.string] for tr in soup.findAll('tr', bgcolor=lambda(value): if value == 
'White' or value == '#F7F7DE')]

Cthulhu fhtagn!

-- 
The only thing worse than e-mail disclaimers...is people who send e-mail to 
lists complaining about them

Attachment: signature.asc
Description: This is a digitally signed message part.

_______________________________________________
Mailing list [email protected]
Archive, settings, or unsubscribe:
https://secure.mysociety.org/admin/lists/mailman/listinfo/developers-public

Reply via email to