svn commit: r451648 - /lucene/nutch/trunk/src/plugin/lib-http/src/java/org/apache/nutch/protocol/http/api/HttpBase.java

2006-09-30 Thread pkosiorowski
Author: pkosiorowski Date: Sat Sep 30 12:35:36 2006 New Revision: 451648 URL: http://svn.apache.org/viewvc?view=rev&rev=451648 Log: NUTCH-374: when http.content.limit is set to -1 and Response.CONTENT_ENCODING is gzip or x-gzip, it cannot fetch anything. (King Kong) Modified: lucene/nutch

svn commit: r392572 - /lucene/nutch/trunk/default.properties

2006-04-08 Thread pkosiorowski
Author: pkosiorowski Date: Sat Apr 8 11:30:44 2006 New Revision: 392572 URL: http://svn.apache.org/viewcvs?rev=392572&view=rev Log: Year updated Modified: lucene/nutch/trunk/default.properties Modified: lucene/nutch/trunk/default.properties URL: http://svn.apache.org/viewcvs/lucene/nutch

svn commit: r390463 - in /lucene/nutch/branches/branch-0.7: site/index.html site/index.pdf site/tutorial8.html site/tutorial8.pdf src/site/src/documentation/content/xdocs/index.xml

2006-03-31 Thread pkosiorowski
Author: pkosiorowski Date: Fri Mar 31 09:16:44 2006 New Revision: 390463 URL: http://svn.apache.org/viewcvs?rev=390463&view=rev Log: 0.7.2 news added Modified: lucene/nutch/branches/branch-0.7/site/index.html lucene/nutch/branches/branch-0.7/site/index.pdf lucene/nutch/branches/branch

svn commit: r390479 - /lucene/nutch/tags/release-0.7.2/

2006-03-31 Thread pkosiorowski
Author: pkosiorowski Date: Fri Mar 31 11:16:14 2006 New Revision: 390479 URL: http://svn.apache.org/viewcvs?rev=390479&view=rev Log: Nutch 0.7.2 release. Added: lucene/nutch/tags/release-0.7.2/ - copied from r390478, lucene/nutch/branches/branch-0.7/

svn commit: r390158 - in /lucene/nutch/branches/branch-0.7: CHANGES.txt conf/nutch-default.xml default.properties

2006-03-30 Thread pkosiorowski
Author: pkosiorowski Date: Thu Mar 30 07:16:00 2006 New Revision: 390158 URL: http://svn.apache.org/viewcvs?rev=390158&view=rev Log: Updated version numbers for 0.7.2 release Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt lucene/nutch/branches/branch-0.7/conf/nutch-default.xml

svn commit: r390012 - /lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/tutorial8.xml

2006-03-29 Thread pkosiorowski
Author: pkosiorowski Date: Wed Mar 29 23:04:53 2006 New Revision: 390012 URL: http://svn.apache.org/viewcvs?rev=390012&view=rev Log: DmozParser package name fixed Modified: lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/tutorial8.xml Modified: lucene/nutch

svn commit: r388745 - in /lucene/nutch/branches/branch-0.7: ./ src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/

2006-03-25 Thread pkosiorowski
Author: pkosiorowski Date: Sat Mar 25 03:37:59 2006 New Revision: 388745 URL: http://svn.apache.org/viewcvs?rev=388745&view=rev Log: NUTCH-239 - I changed httpclient to use javax.net.ssl instead of com.sun.net.ssl. (Jake Vanderdray) Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt

svn commit: r388815 - in /lucene/nutch/branches/branch-0.7: CHANGES.txt src/java/org/apache/nutch/db/WebDBWriter.java

2006-03-25 Thread pkosiorowski
Author: pkosiorowski Date: Sat Mar 25 12:22:10 2006 New Revision: 388815 URL: http://svn.apache.org/viewcvs?rev=388815&view=rev Log: NUTCH-117 - Crawl crashes with java.io.IOException: already exists: C:\nutch\crawl.intranet\oct18\db\webdb.new\pagesByURL. Modified: lucene/nutch/branches

svn commit: r386418 - in /lucene/nutch/branches/branch-0.7: CHANGES.txt src/plugin/protocol-httpclient/lib/commons-httpclient-3.0-rc2.jar src/plugin/protocol-httpclient/lib/commons-httpclient-3.0.jar

2006-03-16 Thread pkosiorowski
Author: pkosiorowski Date: Thu Mar 16 11:04:37 2006 New Revision: 386418 URL: http://svn.apache.org/viewcvs?rev=386418&view=rev Log: commons-httpclient upgraded to version 3.0 Added: lucene/nutch/branches/branch-0.7/src/plugin/protocol-httpclient/lib/commons-httpclient-3.0.jar (with props

svn commit: r384594 - /lucene/nutch/branches/branch-0.7/site/doap.rdf

2006-03-09 Thread pkosiorowski
Author: pkosiorowski Date: Thu Mar 9 12:06:48 2006 New Revision: 384594 URL: http://svn.apache.org/viewcvs?rev=384594&view=rev Log: Added DOAP file. Added: lucene/nutch/branches/branch-0.7/site/doap.rdf Added: lucene/nutch/branches/branch-0.7/site/doap.rdf URL: http://svn.apache.org

svn commit: r383656 - in /lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs: mailing_lists.xml tabs.xml

2006-03-06 Thread pkosiorowski
Author: pkosiorowski Date: Mon Mar 6 12:43:39 2006 New Revision: 383656 URL: http://svn.apache.org/viewcvs?rev=383656&view=rev Log: Added mailing list archives and forrest 0.7 compatibility Modified: lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/mailing_lists.xml

svn commit: r383665 [3/3] - in /lucene/nutch/branches/branch-0.7/site: ./ images/ skin/ skin/images/ skin/translations/

2006-03-06 Thread pkosiorowski
Modified: lucene/nutch/branches/branch-0.7/site/tutorial.pdf URL: http://svn.apache.org/viewcvs/lucene/nutch/branches/branch-0.7/site/tutorial.pdf?rev=383665&r1=383664&r2=383665&view=diff

svn commit: r378513 - /lucene/nutch/branches/branch-0.7/CHANGES.txt

2006-02-17 Thread pkosiorowski
Author: pkosiorowski Date: Fri Feb 17 05:58:05 2006 New Revision: 378513 URL: http://svn.apache.org/viewcvs?rev=378513&view=rev Log: Fixed JUnit test failing due to changes in www.nutch.org. Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt Modified: lucene/nutch/branches/branch-0.7

svn commit: r356880 - /lucene/nutch/trunk/src/webapps/jobtracker/jobdetails.jsp

2005-12-14 Thread pkosiorowski
Author: pkosiorowski Date: Wed Dec 14 13:30:41 2005 New Revision: 356880 URL: http://svn.apache.org/viewcvs?rev=356880&view=rev Log: NUTCH-141: Invalid title tag in jsp page. Fix by Marko Bauhardt. Modified: lucene/nutch/trunk/src/webapps/jobtracker/jobdetails.jsp Modified: lucene/nutch

svn commit: r312935 - /lucene/nutch/trunk/src/java/org/apache/nutch/searcher/NutchBean.java

2005-10-11 Thread pkosiorowski
Author: pkosiorowski Date: Tue Oct 11 12:01:09 2005 New Revision: 312935 URL: http://svn.apache.org/viewcvs?rev=312935&view=rev Log: NutchBean code cleanup. Contributed by Stefan Groschupf. Modified: lucene/nutch/trunk/src/java/org/apache/nutch/searcher/NutchBean.java Modified: lucene/nutch

svn commit: r312936 - /lucene/nutch/branches/branch-0.7/src/java/org/apache/nutch/searcher/NutchBean.java

2005-10-11 Thread pkosiorowski
Author: pkosiorowski Date: Tue Oct 11 12:01:35 2005 New Revision: 312936 URL: http://svn.apache.org/viewcvs?rev=312936&view=rev Log: NutchBean code cleanup. Contributed by Stefan Groschupf. Modified: lucene/nutch/branches/branch-0.7/src/java/org/apache/nutch/searcher/NutchBean.java Modified

svn commit: r312943 - in /lucene/nutch/branches/branch-0.7: CHANGES.txt src/plugin/urlfilter-prefix/plugin.xml src/plugin/urlfilter-regex/plugin.xml

2005-10-11 Thread pkosiorowski
Author: pkosiorowski Date: Tue Oct 11 12:45:17 2005 New Revision: 312943 URL: http://svn.apache.org/viewcvs?rev=312943&view=rev Log: NUTCH-107 - Typo in plugin/urlfilter-*/plugin.xml. (Stephen Cross) Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt lucene/nutch/branches/branch-0.7

svn commit: r307036 - in /lucene/nutch/branches/branch-0.7: CHANGES.txt conf/nutch-default.xml default.properties

2005-10-07 Thread pkosiorowski
Author: pkosiorowski Date: Fri Oct 7 00:02:18 2005 New Revision: 307036 URL: http://svn.apache.org/viewcvs?rev=307036&view=rev Log: Modified version number - prepared for next maintenance release if we need it Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt lucene/nutch

svn commit: r293022 - in /lucene/nutch/trunk/site: about.html bot.html credits.html faq.html faq.pdf i18n.html index.html index.pdf issue_tracking.html linkmap.html mailing_lists.html tutorial.html version_control.html

2005-10-01 Thread pkosiorowski
Author: pkosiorowski Date: Sat Oct 1 13:13:10 2005 New Revision: 293022 URL: http://svn.apache.org/viewcvs?rev=293022&view=rev Log: Documentation regenerated for 0.7.1 release Removed: lucene/nutch/trunk/site/faq.html lucene/nutch/trunk/site/faq.pdf Modified: lucene/nutch/trunk/site

svn commit: r292837 - /lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/site.xml

2005-09-30 Thread pkosiorowski
Author: pkosiorowski Date: Fri Sep 30 11:51:43 2005 New Revision: 292837 URL: http://svn.apache.org/viewcvs?rev=292837&view=rev Log: Updated FAQ link to point to Wiki Modified: lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/site.xml Modified: lucene/nutch

svn commit: r292838 - /lucene/nutch/trunk/src/site/src/documentation/content/xdocs/site.xml

2005-09-30 Thread pkosiorowski
Author: pkosiorowski Date: Fri Sep 30 12:07:04 2005 New Revision: 292838 URL: http://svn.apache.org/viewcvs?rev=292838&view=rev Log: Updated FAQ link to point to Wiki Modified: lucene/nutch/trunk/src/site/src/documentation/content/xdocs/site.xml Modified: lucene/nutch/trunk/src/site/src

svn commit: r292021 - /lucene/nutch/branches/branch-0.7/CHANGES.txt

2005-09-27 Thread pkosiorowski
Author: pkosiorowski Date: Tue Sep 27 12:34:28 2005 New Revision: 292021 URL: http://svn.apache.org/viewcvs?rev=292021&view=rev Log: CHANGES.txt updated with commit logs for 0.7.1 release. Modified: lucene/nutch/branches/branch-0.7/CHANGES.txt Modified: lucene/nutch/branches/branch-0.7

svn commit: r280392 - /lucene/nutch/tags/release-0.7/

2005-09-12 Thread pkosiorowski
Author: pkosiorowski Date: Mon Sep 12 11:50:42 2005 New Revision: 280392 URL: http://svn.apache.org/viewcvs?rev=280392&view=rev Log: Nutch 0.7 release. Added: lucene/nutch/tags/release-0.7/ - copied from r233155, lucene/nutch/trunk/

svn commit: r240097 - /lucene/nutch/branches/Release-0.7/

2005-08-25 Thread pkosiorowski
Author: pkosiorowski Date: Thu Aug 25 09:12:19 2005 New Revision: 240097 URL: http://svn.apache.org/viewcvs?rev=240097&view=rev Log: Nutch 0.7 release maintenance branch. Added: lucene/nutch/branches/Release-0.7/ - copied from r240096, lucene/nutch/tags/Release-0.7/

svn commit: r234193 - in /lucene/nutch/trunk/src/plugin/languageidentifier/src: java/org/apache/nutch/analysis/lang/LanguageIdentifier.java test/org/apache/nutch/analysis/lang/TestLanguageIdentifier.java

2005-08-21 Thread pkosiorowski
Author: pkosiorowski Date: Sun Aug 21 05:34:06 2005 New Revision: 234193 URL: http://svn.apache.org/viewcvs?rev=234193&view=rev Log: Fixed failing JUnit test on Windows. Modified: lucene/nutch/trunk/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/LanguageIdentifier.java

svn commit: r233368 - /lucene/nutch/trunk/src/plugin/build.xml

2005-08-18 Thread pkosiorowski
Author: pkosiorowski Date: Thu Aug 18 12:55:24 2005 New Revision: 233368 URL: http://svn.apache.org/viewcvs?rev=233368&view=rev Log: Bad target name used. Reported by Fuad Efendi. Modified: lucene/nutch/trunk/src/plugin/build.xml Modified: lucene/nutch/trunk/src/plugin/build.xml URL: http

svn commit: r233150 - /lucene/nutch/trunk/CHANGES.txt

2005-08-17 Thread pkosiorowski
Author: pkosiorowski Date: Wed Aug 17 03:03:30 2005 New Revision: 233150 URL: http://svn.apache.org/viewcvs?rev=233150&view=rev Log: Updated release date. Modified: lucene/nutch/trunk/CHANGES.txt Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewcvs/lucene/nutch/trunk

svn commit: r233040 - in /lucene/nutch/trunk: site/index.html site/index.pdf src/site/src/documentation/content/xdocs/index.xml

2005-08-16 Thread pkosiorowski
Author: pkosiorowski Date: Tue Aug 16 12:33:42 2005 New Revision: 233040 URL: http://svn.apache.org/viewcvs?rev=233040&view=rev Log: Updated news on Nutch Website. Modified: lucene/nutch/trunk/site/index.html lucene/nutch/trunk/site/index.pdf lucene/nutch/trunk/src/site/src

svn commit: r230834 - in /lucene/nutch/trunk/src/site/src/documentation/content/xdocs: bot.xml i18n.xml

2005-08-08 Thread pkosiorowski
Author: pkosiorowski Date: Mon Aug 8 11:15:35 2005 New Revision: 230834 URL: http://svn.apache.org/viewcvs?rev=230834&view=rev Log: Changed URLs in docs to point to Apache. Modified: lucene/nutch/trunk/src/site/src/documentation/content/xdocs/bot.xml lucene/nutch/trunk/src/site/src

svn commit: r230867 - /lucene/nutch/trunk/conf/crawl-urlfilter.txt.template

2005-08-08 Thread pkosiorowski
Author: pkosiorowski Date: Mon Aug 8 12:48:17 2005 New Revision: 230867 URL: http://svn.apache.org/viewcvs?rev=230867&view=rev Log: Skipping png and pdf files. Modified: lucene/nutch/trunk/conf/crawl-urlfilter.txt.template Modified: lucene/nutch/trunk/conf/crawl-urlfilter.txt.template URL

svn commit: r230870 - /lucene/nutch/trunk/src/java/org/apache/nutch/tools/DistributedAnalysisTool.java

2005-08-08 Thread pkosiorowski
Author: pkosiorowski Date: Mon Aug 8 12:59:56 2005 New Revision: 230870 URL: http://svn.apache.org/viewcvs?rev=230870&view=rev Log: NUTCH-7. Relative links from identical (MD5) pages were treated incorrectly. Modified: lucene/nutch/trunk/src/java/org/apache/nutch/tools