Kewl !! Thanks Lewis :) I have circulated this news to big data group of my company and DB fellows @ UCI and my geek friends. Also posted over Bigdata and Hadoop pages over Facebook.
On Sat, Jun 8, 2013 at 2:53 PM, lewis john mcgibbney <lewi...@apache.org>wrote: > Good Afternoon Everyone, > > The Apache Nutch PMC are extremely pleased to announce the immediate > release of Apache Nutch v2.2. > > Apache Nutch is an open source web-search software project. Stemming > from Apache > Lucene <http://lucene.apache.org/java/>, it now builds on Apache > Solr<http://lucene.apache.org/solr/>adding web-specifics, such as a > crawler, a link-graph database and parsing > support handled by Apache Tika <http://tika.apache.org/> for HTML and and > array other document formats. > > This release includes over 30 bug fixes and over 25 improvements > representing the third release of increasingly popular 2.x Nutch series. > This release features inclusion of > Crawler-Commons<http://code.google.com/p/crawler-commons/>which Nutch > now utilizes for improved robots.txt parsing, library upgrades > to Apache Hadoop <http://hadoop.apache.org> 1.1.1, Apache > Gora<http://gora.apache.org>0.3, Apache > Tika <http://tika.apache.org> 1.2 and > Automaton<http://www.brics.dk/automaton/automaton>1.11-8. Please see > the list > of changes <http://www.apache.org/dist/nutch/2.2/2.2-CHANGES.txt> or > the release > report <http://s.apache.org/LPB> made in this version for a full > breakdown. > As usual in the 2.x series, this release is made available only as source, > but is also available within Maven Central <http://search.maven.org/>. The > release is available here <http://www.apache.org/dyn/closer.cgi/nutch/>. > > Have a great weekend. > > Best > lewismc > (on behalf of the Apache Nutch community) >