[phpug] Re: regex and scraping html page

Bob M Brown Wed, 15 Jul 2009 21:35:28 -0700

I recently blogged about my experiences with this:
    http://www.guru.net.nz/blog/2009/06/screen-scraping-with-jquery.html


Also, the PHP Simple HTML DOM Parser looked promising:
    http://simplehtmldom.sourceforge.net/

ctx2002 wrote:
> Jochen posted a question about using regex to extract information
> from an HTML page.
>
> As everyone can see, the regex is not easy to read or understand.
>
> I was thinking: why not use XSLT to process the HTML file? PHP 5 has
> good support for XSLT processing.
>
> The only extra step we need is to run the page through HTML Tidy so
> that it becomes well-formed XML.
>
> For me, an XSL file is easier to understand than a regular expression.
>
> Are there other ways or tools to extract information from HTML without
> using regex?


--~--~---------~--~----~------------~-------~--~----~
NZ PHP Users Group: http://groups.google.com/group/nzphpug
To post, send email to [email protected]
To unsubscribe, send email to
[email protected]
-~----------~----~----~----~------~----~------~--~---
