Author: pkosiorowski
Date: Sat Sep 30 12:35:36 2006
New Revision: 451648
URL: http://svn.apache.org/viewvc?view=revrev=451648
Log:
NUTCH-374: when http.content.limit be set to -1 and Response.CONTENT_ENCODING
is gzip or x-gzip , it can not fetch any thing.(King Kong)
Modified:
lucene/nutch
Author: pkosiorowski
Date: Sat Apr 8 11:30:44 2006
New Revision: 392572
URL: http://svn.apache.org/viewcvs?rev=392572view=rev
Log:
Year updated
Modified:
lucene/nutch/trunk/default.properties
Modified: lucene/nutch/trunk/default.properties
URL:
http://svn.apache.org/viewcvs/lucene/nutch
Author: pkosiorowski
Date: Fri Mar 31 09:16:44 2006
New Revision: 390463
URL: http://svn.apache.org/viewcvs?rev=390463view=rev
Log:
0.7.2 news added
Modified:
lucene/nutch/branches/branch-0.7/site/index.html
lucene/nutch/branches/branch-0.7/site/index.pdf
lucene/nutch/branches/branch
Author: pkosiorowski
Date: Fri Mar 31 11:16:14 2006
New Revision: 390479
URL: http://svn.apache.org/viewcvs?rev=390479view=rev
Log:
Nutch 0.7.2 release.
Added:
lucene/nutch/tags/release-0.7.2/
- copied from r390478, lucene/nutch/branches/branch-0.7/
Author: pkosiorowski
Date: Thu Mar 30 07:16:00 2006
New Revision: 390158
URL: http://svn.apache.org/viewcvs?rev=390158view=rev
Log:
Updated version numbers for 0.7.2 release
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
lucene/nutch/branches/branch-0.7/conf/nutch-default.xml
Author: pkosiorowski
Date: Wed Mar 29 23:04:53 2006
New Revision: 390012
URL: http://svn.apache.org/viewcvs?rev=390012view=rev
Log:
DmozParser package name fixed
Modified:
lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/tutorial8.xml
Modified:
lucene/nutch
Author: pkosiorowski
Date: Sat Mar 25 03:37:59 2006
New Revision: 388745
URL: http://svn.apache.org/viewcvs?rev=388745view=rev
Log:
NUTCH-239 - I changed httpclient to use javax.net.ssl instead of
com.sun.net.ssl. (Jake Vanderdray)
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
Author: pkosiorowski
Date: Sat Mar 25 12:22:10 2006
New Revision: 388815
URL: http://svn.apache.org/viewcvs?rev=388815view=rev
Log:
NUTCH-117 - Crawl crashes with java.io.IOException: already exists:
C:\nutch\crawl.intranet\oct18\db\webdb.new\pagesByURL.
Modified:
lucene/nutch/branches
Author: pkosiorowski
Date: Thu Mar 16 11:04:37 2006
New Revision: 386418
URL: http://svn.apache.org/viewcvs?rev=386418view=rev
Log:
commons-httpclient upgraded to version 3.0
Added:
lucene/nutch/branches/branch-0.7/src/plugin/protocol-httpclient/lib/commons-httpclient-3.0.jar
(with props
Author: pkosiorowski
Date: Thu Mar 9 12:06:48 2006
New Revision: 384594
URL: http://svn.apache.org/viewcvs?rev=384594view=rev
Log:
Added DOAP file.
Added:
lucene/nutch/branches/branch-0.7/site/doap.rdf
Added: lucene/nutch/branches/branch-0.7/site/doap.rdf
URL:
http://svn.apache.org
Author: pkosiorowski
Date: Mon Mar 6 12:43:39 2006
New Revision: 383656
URL: http://svn.apache.org/viewcvs?rev=383656view=rev
Log:
Added mailing list archives and forrest 0.7 compatibility
Modified:
lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/mailing_lists.xml
Modified: lucene/nutch/branches/branch-0.7/site/tutorial.pdf
URL:
http://svn.apache.org/viewcvs/lucene/nutch/branches/branch-0.7/site/tutorial.pdf?rev=383665r1=383664r2=383665view=diff
==
---
Author: pkosiorowski
Date: Fri Feb 17 05:58:05 2006
New Revision: 378513
URL: http://svn.apache.org/viewcvs?rev=378513view=rev
Log:
Fixed JUnit test failing due to changes in www.nutch.org.
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
Modified: lucene/nutch/branches/branch-0.7
Author: pkosiorowski
Date: Wed Dec 14 13:30:41 2005
New Revision: 356880
URL: http://svn.apache.org/viewcvs?rev=356880view=rev
Log:
NUTCH-141: Invalid title tag in jsp page. Fix by Marko Bauhardt.
Modified:
lucene/nutch/trunk/src/webapps/jobtracker/jobdetails.jsp
Modified: lucene/nutch
Author: pkosiorowski
Date: Tue Oct 11 12:01:09 2005
New Revision: 312935
URL: http://svn.apache.org/viewcvs?rev=312935view=rev
Log:
NutchBean code cleanup. Contributed by Stefan Groschupf.
Modified:
lucene/nutch/trunk/src/java/org/apache/nutch/searcher/NutchBean.java
Modified: lucene/nutch
Author: pkosiorowski
Date: Tue Oct 11 12:01:35 2005
New Revision: 312936
URL: http://svn.apache.org/viewcvs?rev=312936view=rev
Log:
NutchBean code cleanup. Contributed by Stefan Groschupf.
Modified:
lucene/nutch/branches/branch-0.7/src/java/org/apache/nutch/searcher/NutchBean.java
Modified
Author: pkosiorowski
Date: Tue Oct 11 12:45:17 2005
New Revision: 312943
URL: http://svn.apache.org/viewcvs?rev=312943view=rev
Log:
NUTCH-107 - Typo in plugin/urlfilter-*/plugin.xml. (Stephen Cross)
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
lucene/nutch/branches/branch-0.7
Author: pkosiorowski
Date: Fri Oct 7 00:02:18 2005
New Revision: 307036
URL: http://svn.apache.org/viewcvs?rev=307036view=rev
Log:
Modifed version number - prepared for next maintenenace release if we would
need it
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
lucene/nutch
Author: pkosiorowski
Date: Sat Oct 1 13:13:10 2005
New Revision: 293022
URL: http://svn.apache.org/viewcvs?rev=293022view=rev
Log:
Documentation regenerated for 0.7.1 release
Removed:
lucene/nutch/trunk/site/faq.html
lucene/nutch/trunk/site/faq.pdf
Modified:
lucene/nutch/trunk/site
Author: pkosiorowski
Date: Fri Sep 30 11:51:43 2005
New Revision: 292837
URL: http://svn.apache.org/viewcvs?rev=292837view=rev
Log:
Updated FAQ link to point to Wiki
Modified:
lucene/nutch/branches/branch-0.7/src/site/src/documentation/content/xdocs/site.xml
Modified:
lucene/nutch
Author: pkosiorowski
Date: Fri Sep 30 12:07:04 2005
New Revision: 292838
URL: http://svn.apache.org/viewcvs?rev=292838view=rev
Log:
Updated faq link to point to Wiki
Modified:
lucene/nutch/trunk/src/site/src/documentation/content/xdocs/site.xml
Modified: lucene/nutch/trunk/src/site/src
Author: pkosiorowski
Date: Tue Sep 27 12:34:28 2005
New Revision: 292021
URL: http://svn.apache.org/viewcvs?rev=292021view=rev
Log:
CHANGES.txt updated with commit logs for 0.7.1 release.
Modified:
lucene/nutch/branches/branch-0.7/CHANGES.txt
Modified: lucene/nutch/branches/branch-0.7
Author: pkosiorowski
Date: Mon Sep 12 11:50:42 2005
New Revision: 280392
URL: http://svn.apache.org/viewcvs?rev=280392view=rev
Log:
Nutch 0.7 release.
Added:
lucene/nutch/tags/release-0.7/
- copied from r233155, lucene/nutch/trunk/
Author: pkosiorowski
Date: Thu Aug 25 09:12:19 2005
New Revision: 240097
URL: http://svn.apache.org/viewcvs?rev=240097view=rev
Log:
Nutch 0.7 release maintenance branch.
Added:
lucene/nutch/branches/Release-0.7/
- copied from r240096, lucene/nutch/tags/Release-0.7/
Author: pkosiorowski
Date: Sun Aug 21 05:34:06 2005
New Revision: 234193
URL: http://svn.apache.org/viewcvs?rev=234193view=rev
Log:
Fixed failing JUnit test on Windows.
Modified:
lucene/nutch/trunk/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/LanguageIdentifier.java
Author: pkosiorowski
Date: Thu Aug 18 12:55:24 2005
New Revision: 233368
URL: http://svn.apache.org/viewcvs?rev=233368view=rev
Log:
Bad target name used. Reported by Fuad Efendi.
Modified:
lucene/nutch/trunk/src/plugin/build.xml
Modified: lucene/nutch/trunk/src/plugin/build.xml
URL:
http
Author: pkosiorowski
Date: Wed Aug 17 03:03:30 2005
New Revision: 233150
URL: http://svn.apache.org/viewcvs?rev=233150view=rev
Log:
Updated release date.
Modified:
lucene/nutch/trunk/CHANGES.txt
Modified: lucene/nutch/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewcvs/lucene/nutch/trunk
Author: pkosiorowski
Date: Tue Aug 16 12:33:42 2005
New Revision: 233040
URL: http://svn.apache.org/viewcvs?rev=233040view=rev
Log:
Updated news on Nutch Website.
Modified:
lucene/nutch/trunk/site/index.html
lucene/nutch/trunk/site/index.pdf
lucene/nutch/trunk/src/site/src
Author: pkosiorowski
Date: Mon Aug 8 11:15:35 2005
New Revision: 230834
URL: http://svn.apache.org/viewcvs?rev=230834view=rev
Log:
Changed URLs in docs to point to Apache.
Modified:
lucene/nutch/trunk/src/site/src/documentation/content/xdocs/bot.xml
lucene/nutch/trunk/src/site/src
Author: pkosiorowski
Date: Mon Aug 8 12:48:17 2005
New Revision: 230867
URL: http://svn.apache.org/viewcvs?rev=230867view=rev
Log:
Skipping png and pdf files.
Modified:
lucene/nutch/trunk/conf/crawl-urlfilter.txt.template
Modified: lucene/nutch/trunk/conf/crawl-urlfilter.txt.template
URL
Author: pkosiorowski
Date: Mon Aug 8 12:59:56 2005
New Revision: 230870
URL: http://svn.apache.org/viewcvs?rev=230870view=rev
Log:
NUTCH-7. Relative links from identical(MD5) pages were treated incorrectly.
Modified:
lucene/nutch/trunk/src/java/org/apache/nutch/tools
31 matches
Mail list logo