On 06/14/2011 02:52 AM, Natty wrote: > We can make it a web-scraping oriented meeting and giving another talk on HTML::Query and WWW::Mechanize > > or even WWW::HtmlUnit (built on-top Java's amazing HtmlUnit libary using Inline::Java to get JS support in the crawler) >
On 06/14/2011 01:46 PM, Shlomi Fish wrote: > Just a question - why are you using HTML::TreeBuilder::XPath instead > of HTML::TreeBuilder::LibXML : > See Gabor? The finer points of web scraping can inspire lively discussion. I second Natty's suggestion, and I could add 5-10 minutes about scraping with Mojolicious tools. Also, here's a pitch for another 10 minute talk Title: #myjourneytoperl - graphing a meme Agenda: on March 30th, at the urging of Matt Trout, perl people on Twitter posted text graphs showing their path to Perl from and through other programming languages. Inspired by the directed graph format used, I decided to capture these posts and combine them into a single giant graph which I would render using the graphviz toolkit. I'll talk about the tools I used (mojolicious, GraphViz, the twitter JSON api), the problems I encountered (Unicode!) and the snapshot of the Perl community that emerged. Time: 15 minutes -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean. _______________________________________________ Perl mailing list [email protected] http://mail.perl.org.il/mailman/listinfo/perl
