On Mon, Dec 16, 2013 at 1:56 PM, Paul Spencer <pagam...@gmail.com> wrote:

> I think you should consider using http://phantomjs.org/ for this, it will
> handle a lot of details for you like running javascript on the page that
> might include additional content etc that you won't get by just loading it
> with the http module.  There are a couple of node wrappers, I've used
> https://github.com/sgentle/phantomjs-node to write automated testing
> scripts
>

Note that phantomjs-node isn't terribly stable. For a commercial
PhantomJS-based scraping project I ended up writing my own wrapper, which
has proved very stable (aside from the fact that PhantomJS itself segfaults
a LOT, and a wrapper can't fix that). It's on npm as node-phantom-simple.
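
Since the segfaults can't be fixed from the Node side, one way to live with
them is to rerun the whole job when the browser dies. Below is a rough
sketch of that idea, not code from node-phantom-simple: retryOnCrash and
flakyJob are hypothetical names, and the stand-in job only simulates a crash
so the snippet runs without PhantomJS installed.

```javascript
// Hypothetical helper, not part of node-phantom-simple: rerun a job when it
// fails, e.g. because the PhantomJS process crashed mid-scrape.
// `job(attempt, cb)` must call cb(err) on failure or cb(null, result) on success.
function retryOnCrash(job, maxAttempts, done) {
  function attempt(n) {
    job(n, function (err, result) {
      if (!err) return done(null, result);
      if (n >= maxAttempts) return done(err); // out of attempts, report last error
      attempt(n + 1); // start over with a fresh attempt
    });
  }
  attempt(1);
}

// Stand-in job (an assumption for illustration): fails twice, then succeeds.
// A real job would start PhantomJS, load the page, and report back via cb.
function flakyJob(n, cb) {
  if (n < 3) return cb(new Error('browser crashed on attempt ' + n));
  cb(null, 'scraped on attempt ' + n);
}

retryOnCrash(flakyJob, 5, function (err, result) {
  console.log(err ? 'gave up: ' + err.message : result);
});
```

In a real scraper the job would also need to make sure the dead browser
process is cleaned up before the next attempt starts.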

-- 
-- 
Job Board: http://jobs.nodejs.org/
Posting guidelines: 
https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines
You received this message because you are subscribed to the Google
Groups "nodejs" group.
To post to this group, send email to nodejs@googlegroups.com
To unsubscribe from this group, send email to
nodejs+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/nodejs?hl=en?hl=en
