Hi,

I had some experience writing a simple feed aggregator that then loaded in the HTML pages from the remote URL's and tried to clean up the HTML into XHTML.

I used the Pear XML_HTMLSAX libraries for this and extended the examples to create my own classes to parse HTML from specific sites to extract out the blog content and then clean some of the XHTML up.

   You are welcome to have a look at the code which I have posted here...

   http://glenpike.co.uk/sd/ContentGrabber.zip

The ContentGrabber is given a URL of an article to get from the PharmaTimes or NYTimes websites and it creates a HTMLHandler object that deals with the parsing.

   HTH.

Glen
<http://glenpike.co.uk/sd/ContentGrabber.zip>
David Townsend wrote:
Heya All,

I am currently working on a project where I am receiving a huge database of
html pages that I need to convert into xml to drive a flash application.
>From memory, many current html codes do not work in Flash and I was
wondering if anyone knew of a way of cleaning up the code, preferably in
php, so that it will work in Flash, and not make the xml file invalid.
Has anyone ever done something like this? I'd love some guidance on the best
way of approaching the problem.

Thanks!

David Townsend
Mobile: 0403 055 383
The Australian Imagery Company
----------------------------------------
Address: PO Box 5075, Sandhurst East, Vic 3550
Phone: (03) 54442340 - Fax: (03) 54442341
Email:  <mailto:[EMAIL PROTECTED]>
[EMAIL PROTECTED]
Web:  <http://www.australianimagery.com.au/>
http://www.australianimagery.com.au

_______________________________________________
Flashcoders@chattyfig.figleaf.com
To change your subscription options or search the archive:
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders

Brought to you by Fig Leaf Software
Premier Authorized Adobe Consulting and Training
http://www.figleaf.com
http://training.figleaf.com


_______________________________________________
Flashcoders@chattyfig.figleaf.com
To change your subscription options or search the archive:
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders

Brought to you by Fig Leaf Software
Premier Authorized Adobe Consulting and Training
http://www.figleaf.com
http://training.figleaf.com

Reply via email to