Build failed in Jenkins: Nutch-trunk #1482

2011-05-09 Thread Apache Jenkins Server
See -- Started by timer Building remotely on solaris1 hudson.util.IOException2: remote file operation failed: at hudson.remoting.Channel@17cde38

[jira] [Resolved] (NUTCH-996) Indexer adds solr.commit.size+1 docs

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-996. Resolution: Fixed. Committed for trunk in rev. 1101279 and for 1.3 in 1101280. Commit.size might be

[jira] [Issue Comment Edited] (NUTCH-994) Fine tune Solr schema

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030969#comment-13030969 ] Markus Jelsma edited comment on NUTCH-994 at 5/10/11 12:35 AM:

[jira] [Issue Comment Edited] (NUTCH-994) Fine tune Solr schema

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030969#comment-13030969 ] Markus Jelsma edited comment on NUTCH-994 at 5/10/11 12:34 AM:

[jira] [Updated] (NUTCH-994) Fine tune Solr schema

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-994: Patch Info: [Patch Available] > Fine tune Solr schema > - > > Ke

[jira] [Updated] (NUTCH-994) Fine tune Solr schema

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-994: Attachment: NUTCH-994-all.patch This patch changes: * non-analyzed field types to their Trie-based

[jira] [Assigned] (NUTCH-994) Fine tune Solr schema

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reassigned NUTCH-994: --- Assignee: Markus Jelsma > Fine tune Solr schema > - > > Ke

[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967)

2011-05-09 Thread Viksit Gaur (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030946#comment-13030946 ] Viksit Gaur commented on NUTCH-937: Has this workaround been reviewed by anyone and vali

Re: found a nutch bug

2011-05-09 Thread Julien Nioche
Hi, could you please open a JIRA with a description of the problem and attach a patch generated against branch-1.3 with 'svn diff'? Thanks 2011/5/9 ldk_5370 > hi, > > I found a bug in class org.apache.nutch.protocol.http.HttpResponse, > HttpResponse cannot get all html content for som

[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed

2011-05-09 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030660#comment-13030660 ] Markus Jelsma commented on NUTCH-585: Thanks for mentioning Wim. This patch can be use

Re: Return value of jobs

2011-05-09 Thread Julien Nioche
Hi Markus, > Currently the various Nutch jobs return 0 or -1 resp. indicating success or > failure. It would be convenient to have certain jobs return the number of > processed items instead of zero to make it a lot easier for shell scripts > to > fetch useful statistics. > > What would be an arg

[jira] [Commented] (NUTCH-887) Delegate parsing of feeds to Tika

2011-05-09 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030639#comment-13030639 ] Julien Nioche commented on NUTCH-887: This issue is about parse-feeds and it requires