[jira] [Updated] (NUTCH-1491) UTF-8 non-character codepoints in title

2012-11-05 Thread Nathan Gass (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nathan Gass updated NUTCH-1491: --- Attachment: patch > UTF-8 non-character codepoints in title > ---

[jira] [Created] (NUTCH-1491) UTF-8 non-character codepoints in title

2012-11-05 Thread Nathan Gass (JIRA)
Nathan Gass created NUTCH-1491: -- Summary: UTF-8 non-character codepoints in title Key: NUTCH-1491 URL: https://issues.apache.org/jira/browse/NUTCH-1491 Project: Nutch Issue Type: Bug C

Build failed in Jenkins: Nutch-trunk #2006

2012-11-05 Thread Apache Jenkins Server
See -- Started by timer Building remotely on solaris1 in workspace hudson.util.IOException2: remote file operation failed:

Build failed in Jenkins: Nutch-nutchgora #396

2012-11-05 Thread Apache Jenkins Server
See -- Started by timer Building remotely on solaris1 in workspace hudson.util.IOException2: remote file operation failed:

[jira] [Commented] (NUTCH-1457) Nutch2 Refactor the update process so that fetched items are only processed once

2012-11-05 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13491124#comment-13491124 ] Alexander Kingson commented on NUTCH-1457: -- Can we use batchId in update command

[jira] [Commented] (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls

2012-11-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490678#comment-13490678 ] Lewis John McGibbney commented on NUTCH-747: Excellent Julien. I thought this w

[jira] [Resolved] (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls

2012-11-05 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-747. - Resolution: Implemented This has been made possible since thanks to : - Metadata injection (https

[jira] [Updated] (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls

2012-11-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-747: --- Description: Hi. the following two patches supports + inject metadatas to url's into a

[jira] [Commented] (NUTCH-1490) Data Truncation exceptions when using mysql

2012-11-05 Thread Nathan Gass (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490636#comment-13490636 ] Nathan Gass commented on NUTCH-1490: Additionally the given maximum length for urls an

[jira] [Updated] (NUTCH-1490) Data Truncation exceptions when using mysql

2012-11-05 Thread Nathan Gass (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nathan Gass updated NUTCH-1490: --- Attachment: patch The actual length values I used are somewhat arbitrary. They need to be large enou

[jira] [Commented] (NUTCH-1489) elasticindex should report the indexed documents like solrindex does

2012-11-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490629#comment-13490629 ] Lewis John McGibbney commented on NUTCH-1489: - Hi Rogério, when I came back to

[jira] [Commented] (NUTCH-1473) Column length too big for column 'text' (max = 21845); use BLOB or TEXT instead

2012-11-05 Thread Nathan Gass (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490613#comment-13490613 ] Nathan Gass commented on NUTCH-1473: I opened a new issue NUTCH-1490. TEXT seems to b

[jira] [Created] (NUTCH-1490) Data Truncation exceptions when using mysql

2012-11-05 Thread Nathan Gass (JIRA)
Nathan Gass created NUTCH-1490: -- Summary: Data Truncation exceptions when using mysql Key: NUTCH-1490 URL: https://issues.apache.org/jira/browse/NUTCH-1490 Project: Nutch Issue Type: Bug Aff