[jira] [Commented] (NUTCH-1854) ./bin/crawl fails with a parsing fetcher

2015-04-14 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495687#comment-14495687 ] Asitang Mishra commented on NUTCH-1854: --- okay done Lewis.. > ./bin/crawl fails with

[jira] [Updated] (NUTCH-1854) ./bin/crawl fails with a parsing fetcher

2015-04-14 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Asitang Mishra updated NUTCH-1854: -- Attachment: NUTCH-1854ver4.patch Added NUTCH-1854ver4.patch : formatted the NUTCH-1854ver3.patch

[Nutch Wiki] Update of "FrontPage" by ChrisMattmann

2015-04-14 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by ChrisMattmann: https://wiki.apache.org/nutch/FrontPage?action=diff&rev1=296&rev2=297 Comment: - whitelist tutorial * [[NutchMavenSupport|Usin

Re: Review Request 33112: NUTCH-1927: Create a whitelist of IPs/hostnames to allow skipping of RobotRules parsing

2015-04-14 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33112/ --- (Updated April 15, 2015, 3:56 a.m.) Review request for nutch. Bugs: NUTCH-192

[jira] [Commented] (NUTCH-1927) Create a whitelist of IPs/hostnames to allow skipping of RobotRules parsing

2015-04-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495620#comment-14495620 ] Chris A. Mattmann commented on NUTCH-1927: -- let me know what you guys think. Test

[jira] [Updated] (NUTCH-1927) Create a whitelist of IPs/hostnames to allow skipping of RobotRules parsing

2015-04-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-1927: - Attachment: NUTCH-1927.Mattmann.041415.patch.txt - updated patch addresses comments from L

[jira] [Updated] (NUTCH-1985) Adding a main() method to the MimeTypeIndexingFilter

2015-04-14 Thread Jorge Luis Betancourt Gonzalez (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Luis Betancourt Gonzalez updated NUTCH-1985: -- Attachment: NUTCH-1985.patch > Adding a main() method to the Mim

[jira] [Created] (NUTCH-1985) Adding a main() method to the MimeTypeIndexingFilter

2015-04-14 Thread Jorge Luis Betancourt Gonzalez (JIRA)
Jorge Luis Betancourt Gonzalez created NUTCH-1985: - Summary: Adding a main() method to the MimeTypeIndexingFilter Key: NUTCH-1985 URL: https://issues.apache.org/jira/browse/NUTCH-1985 P

[jira] [Commented] (NUTCH-1854) ./bin/crawl fails with a parsing fetcher

2015-04-14 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14494778#comment-14494778 ] Lewis John McGibbney commented on NUTCH-1854: - [~asitang] can you please use t

[Nutch Wiki] Update of "SumanSaurabh/GSoC2015Nutch" by SumanSaurabh

2015-04-14 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "SumanSaurabh/GSoC2015Nutch" page has been changed by SumanSaurabh: https://wiki.apache.org/nutch/SumanSaurabh/GSoC2015Nutch?action=diff&rev1=3&rev2=4 . {{{ + }}} + + . De

[Nutch Wiki] Update of "SumanSaurabh/GSoC2015Nutch" by SumanSaurabh

2015-04-14 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "SumanSaurabh/GSoC2015Nutch" page has been changed by SumanSaurabh: https://wiki.apache.org/nutch/SumanSaurabh/GSoC2015Nutch?action=diff&rev1=2&rev2=3 . . '''1.2) Workspace Set

[jira] [Issue Comment Deleted] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-04-14 Thread Jeroen Vlek (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeroen Vlek updated NUTCH-1946: --- Comment: was deleted (was: Sorry, I'm a bit confused: Is any more action required on my part for the