[jira] [Commented] (NUTCH-1047) Pluggable indexing backends

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583011#comment-13583011 ] Tejas Patil commented on NUTCH-1047: Hi Julien, One small change in Java class will b

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583013#comment-13583013 ] Tejas Patil commented on NUTCH-1031: Hey Ken, A gentle reminder for releasing CC.

[jira] [Commented] (NUTCH-1534) cassandra/hector exception: InvalidRequestException(why:column name must not be empty)

2013-02-21 Thread Roland (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583080#comment-13583080 ] Roland commented on NUTCH-1534: --- Seems to be not the case, I had breakpoints in :557 trigger

[jira] [Commented] (NUTCH-1047) Pluggable indexing backends

2013-02-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583111#comment-13583111 ] Julien Nioche commented on NUTCH-1047: -- Tejas, The CleaningJob is backend-neutral an

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583340#comment-13583340 ] Lewis John McGibbney commented on NUTCH-1031: - Hi Tejas. We released it ;) Re

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583662#comment-13583662 ] Tejas Patil commented on NUTCH-1031: Hi Lewis, I should have checked on the main page

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13583664#comment-13583664 ] Tejas Patil commented on NUTCH-1031: @Dev: I am planning to commit this change in comi

[jira] [Assigned] (NUTCH-1529) Port nutch-mongdb-parser to trunk

2013-02-21 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lufeng reassigned NUTCH-1529: - Assignee: lufeng > Port nutch-mongdb-parser to trunk > - > >

Build failed in Jenkins: Nutch-nutchgora #503

2013-02-21 Thread Apache Jenkins Server
See -- [...truncated 3351 lines...] deploy: copy-generated-lib: test: [echo] Testing plugin: protocol-file [junit] Running org.apache.nutch.protocol.file.TestProtocolFile [junit] Tests run:

[jira] [Commented] (NUTCH-1521) CrawlDbFilter pass null url to urlNormailzers

2013-02-21 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13584043#comment-13584043 ] lufeng commented on NUTCH-1521: --- Hi Tejas Yes, you are right. It seems that DbUpdateMapper

[jira] [Commented] (NUTCH-1373) Implement consistent execution of normalising and filtering in Generator

2013-02-21 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13584062#comment-13584062 ] lufeng commented on NUTCH-1373: --- Hi Lewis Do you mean we can put URLNormalizers in Generato