[jira] [Commented] (NUTCH-1029) Readdb throws EOFException

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062489#comment-13062489 ] Markus Jelsma commented on NUTCH-1029: -- It seems this error is caused by the _SUC

[jira] [Created] (NUTCH-1036) Solr jobs should increment counters in Reporter

2011-07-09 Thread Markus Jelsma (JIRA)
Solr jobs should increment counters in Reporter --- Key: NUTCH-1036 URL: https://issues.apache.org/jira/browse/NUTCH-1036 Project: Nutch Issue Type: Improvement Components: indexer

[jira] [Updated] (NUTCH-1029) Readdb throws EOFException

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1029: - Attachment: NUTCH-1029-1.4-1.patch The assumption was correct. Here's a patch for 1.4 that disabl

[jira] [Updated] (NUTCH-1029) Readdb throws EOFException

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1029: - Priority: Critical (was: Major) Patch Info: [Patch Available] > Readdb throws EOFException

[jira] [Issue Comment Edited] (NUTCH-1029) Readdb throws EOFException

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062630#comment-13062630 ] Markus Jelsma edited comment on NUTCH-1029 at 7/9/11 9:22 PM: --

[jira] [Created] (NUTCH-1037) Deduplicate anchors before indexing

2011-07-09 Thread Markus Jelsma (JIRA)
Deduplicate anchors before indexing --- Key: NUTCH-1037 URL: https://issues.apache.org/jira/browse/NUTCH-1037 Project: Nutch Issue Type: Improvement Reporter: Markus Jelsma Assignee: Ma

[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967)

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062654#comment-13062654 ] Markus Jelsma commented on NUTCH-937: - I agree, marking it as a fix for the Hadoop vers

[jira] [Issue Comment Edited] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDU

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062654#comment-13062654 ] Markus Jelsma edited comment on NUTCH-937 at 7/9/11 11:12 PM: --

[jira] [Commented] (NUTCH-1030) WebgraphDB program requires manually added directories

2011-07-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062655#comment-13062655 ] Markus Jelsma commented on NUTCH-1030: -- Objections? If not, I'll commit this within t

Build failed in Jenkins: Nutch-trunk #1541

2011-07-09 Thread Apache Jenkins Server
See -- [...truncated 985 lines...] A src/plugin/subcollection/src/java/org/apache/nutch/collection A src/plugin/subcollection/src/java/org/apache/nutch/collection/Subcollection.java A