[jira] [Commented] (NUTCH-1057) Make fetcher thread time out configurable

2011-08-24 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090081#comment-13090081 ] Julien Nioche commented on NUTCH-1057: -- Haven't you committed it already? Or do you

[jira] [Commented] (NUTCH-1024) Dynamically set fetchInterval by MIME-type

2011-08-24 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090082#comment-13090082 ] Julien Nioche commented on NUTCH-1024: -- Do you mind if we wait a bit? I'd like to

[jira] [Commented] (NUTCH-1024) Dynamically set fetchInterval by MIME-type

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090105#comment-13090105 ] Markus Jelsma commented on NUTCH-1024: -- Sure but what do you mean by info from

[jira] [Commented] (NUTCH-1057) Make fetcher thread time out configurable

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090106#comment-13090106 ] Markus Jelsma commented on NUTCH-1057: -- Sorry, wrong issue! Make fetcher thread

[jira] [Commented] (NUTCH-1024) Dynamically set fetchInterval by MIME-type

2011-08-24 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090115#comment-13090115 ] Julien Nioche commented on NUTCH-1024: -- There is a JIRA issue for 2.0

[jira] [Created] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Marek Bachmann (JIRA)
LinkDb (invertlinks) should inform the user when it ignores internal links -- Key: NUTCH-1090 URL: https://issues.apache.org/jira/browse/NUTCH-1090 Project: Nutch Issue

[jira] [Commented] (NUTCH-408) Plugin development documentation

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090242#comment-13090242 ] Lewis John McGibbney commented on NUTCH-408: Any objections to closing this

[jira] [Commented] (NUTCH-408) Plugin development documentation

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090247#comment-13090247 ] Markus Jelsma commented on NUTCH-408: - +1 close Plugin development documentation

[jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090257#comment-13090257 ] Markus Jelsma commented on NUTCH-1090: -- You can patch o.a.n.crawl.LinkDB.configure()

[jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Marek Bachmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090260#comment-13090260 ] Marek Bachmann commented on NUTCH-1090: --- Then I did it right. Thanks LinkDb

[jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090261#comment-13090261 ] Markus Jelsma commented on NUTCH-1090: -- Looking at it i feel writing in the invert

[jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Marek Bachmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090264#comment-13090264 ] Marek Bachmann commented on NUTCH-1090: --- Ok, I thought so too. But I was unsure that

Re: [jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Markus Jelsma
Yes, the job object is created there. The can then be read like in the configure method. On Wednesday 24 August 2011 16:40:29 Marek Bachmann (JIRA) wrote: [ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.p

[jira] [Commented] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090267#comment-13090267 ] Markus Jelsma commented on NUTCH-1090: -- Yes, the job object is created there. The can

[jira] [Issue Comment Edited] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090267#comment-13090267 ] Markus Jelsma edited comment on NUTCH-1090 at 8/24/11 2:48 PM:

how to use Nutch 1.3 as a single job jar on newer Hadoop releases

2011-08-24 Thread Ferdy Galema
Hi, Compiling Nutch 1.3 with patch NUTCH-993 (newest patch) and configuring mapreduce.job.jar.unpack.pattern and plugin.folders according to issue NUTCH-937 still won't allow me to run the stand-alone job jar. What else should I patch/configure in order to do so? The command I use is hadoop

[jira] [Updated] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Marek Bachmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marek Bachmann updated NUTCH-1090: -- Attachment: (was: LinkDb.patch) LinkDb (invertlinks) should inform the user when it

[jira] [Updated] (NUTCH-1090) LinkDb (invertlinks) should inform the user when it ignores internal links

2011-08-24 Thread Marek Bachmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marek Bachmann updated NUTCH-1090: -- Attachment: LinkDb.patch Inserted a {{LOG.info}} command in the {{invert}} method when

Re: how to use Nutch 1.3 as a single job jar on newer Hadoop releases

2011-08-24 Thread Julien Nioche
Make sure you specify the params in runtime/deploy/conf unless you rebuild the job file with 'ant job' On 24 August 2011 16:09, Ferdy Galema ferdy.gal...@kalooga.com wrote: Hi, Compiling Nutch 1.3 with patch NUTCH-993 (newest patch) and configuring mapreduce.job.jar.unpack.**pattern and

[jira] [Resolved] (NUTCH-408) Plugin development documentation

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-408. Resolution: Fixed Fix Version/s: 2.0 1.4 This issue was

[jira] [Closed] (NUTCH-408) Plugin development documentation

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-408. -- Plugin development documentation Key:

[jira] [Resolved] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1056. - Resolution: Fixed This issue has been addressed which subsequently means that

[jira] [Closed] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1056. --- Write a new plugin example for inclusion on the wiki

[jira] [Updated] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1078: Attachment: NUTCH-1078-branch-1.4-20110824-v2.patch The attached patch replaces

[jira] [Created] (NUTCH-1091) Remove commons logging dependency from Nutch branch.

2011-08-24 Thread Lewis John McGibbney (JIRA)
Remove commons logging dependency from Nutch branch. Key: NUTCH-1091 URL: https://issues.apache.org/jira/browse/NUTCH-1091 Project: Nutch Issue Type: Improvement Components:

[jira] [Created] (NUTCH-1092) overhaul FAQ's and publish to Nutch site

2011-08-24 Thread Lewis John McGibbney (JIRA)
overhaul FAQ's and publish to Nutch site Key: NUTCH-1092 URL: https://issues.apache.org/jira/browse/NUTCH-1092 Project: Nutch Issue Type: Sub-task Components: documentation Affects

[jira] [Created] (NUTCH-1093) create core documentation

2011-08-24 Thread Lewis John McGibbney (JIRA)
create core documentation - Key: NUTCH-1093 URL: https://issues.apache.org/jira/browse/NUTCH-1093 Project: Nutch Issue Type: Sub-task Components: documentation Affects Versions: 1.4, 2.0

[jira] [Created] (NUTCH-1094) create comprehensive documentation for Nutch 2.0 trunk

2011-08-24 Thread Lewis John McGibbney (JIRA)
create comprehensive documentation for Nutch 2.0 trunk -- Key: NUTCH-1094 URL: https://issues.apache.org/jira/browse/NUTCH-1094 Project: Nutch Issue Type: Sub-task Components:

[jira] [Updated] (NUTCH-1093) create core documentation

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1093: Affects Version/s: (was: 2.0) Fix Version/s: (was: 2.0) create

[jira] [Created] (NUTCH-1095) remove i18n from Nutch site to archive and legacy secton of wiki

2011-08-24 Thread Lewis John McGibbney (JIRA)
remove i18n from Nutch site to archive and legacy secton of wiki Key: NUTCH-1095 URL: https://issues.apache.org/jira/browse/NUTCH-1095 Project: Nutch Issue Type: Task

[jira] [Commented] (NUTCH-1095) remove i18n from Nutch site to archive and legacy secton of wiki

2011-08-24 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090477#comment-13090477 ] Julien Nioche commented on NUTCH-1095: -- +1 thanks! remove i18n from Nutch site to

[Nutch Wiki] Trivial Update of Archive and Legacy by LewisJohnMcgibbney

2011-08-24 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The Archive and Legacy page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Archive%20and%20Legacy?action=diffrev1=14rev2=15 === General Information === *

[Nutch Wiki] Trivial Update of Nutch_i18n by LewisJohnMcgibbney

2011-08-24 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The Nutch_i18n page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Nutch_i18n New page: = Nutch_i18n = TableOfContents(3) The Nutch search pages are easy to

[jira] [Commented] (NUTCH-1095) remove i18n from Nutch site to archive and legacy secton of wiki

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13090514#comment-13090514 ] Lewis John McGibbney commented on NUTCH-1095: - committed at revision 1161287.

[jira] [Updated] (NUTCH-940) static field plugin

2011-08-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-940: --- Attachment: NUTCH-940-branch-1.4-20110824.patch Final patch for review including

[jira] [Commented] (NUTCH-940) static field plugin

2011-08-24 Thread Lewis John McGibbney (JIRA)
: New Feature Components: indexer Affects Versions: 1.3, 2.0 Reporter: Claudio Martella Assignee: Lewis John McGibbney Priority: Minor Fix For: 1.4, 2.0 Attachments: NUTCH-940-branch-1.4-20110824.patch, index-static.diff, index

[jira] [Commented] (NUTCH-940) static field plugin

2011-08-24 Thread Markus Jelsma (JIRA)
Affects Versions: 1.3, 2.0 Reporter: Claudio Martella Assignee: Lewis John McGibbney Priority: Minor Fix For: 1.4, 2.0 Attachments: NUTCH-940-branch-1.4-20110824.patch, index-static.diff, index-static.diff, static-field.diff, static

[jira] [Issue Comment Edited] (NUTCH-940) static field plugin

2011-08-24 Thread Markus Jelsma (JIRA)
McGibbney Priority: Minor Fix For: 1.4, 2.0 Attachments: NUTCH-940-branch-1.4-20110824.patch, index-static.diff, index-static.diff, static-field.diff, static-field.tar.gz A simple plugin called at indexing that adds fields with static data. You can specify

Build failed in Jenkins: Nutch-trunk #1584

2011-08-24 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/1584/ -- Started by timer Building remotely on solaris1 hudson.util.IOException2: remote file operation failed: https://builds.apache.org/job/Nutch-trunk/ws/ at hudson.remoting.Channel@44eaeac0:solaris1