Hi list,

FYI, I'll be giving a talk on Apache Nutch at Berlin Buzzwords (http://berlinbuzzwords.de/): http://berlinbuzzwords.de/content/web-scale-crawling-apache-nutch

This talk will give an overview of Apache Nutch. I will describe its main components and how it fits with other Apache projects such as Hadoop, Lucene, Solr, Tika and HBase. The presentation will include examples of real-world use cases. The second part will focus on the latest developments in Nutch and the changes introduced by the forthcoming version 2.0.

There will be plenty of interesting talks at this conference, and at least one other Nutch committer (Andrzej) will be there. If you can't make it, the talks will probably be available online after the conference.

Julien

-- 
Open Source Solutions for Text Engineering
http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

