[jira] [Commented] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823182#comment-13823182 ] Hudson commented on NUTCH-656: SUCCESS: Integrated in Nutch-trunk #2421 (See [https://builds

[jira] [Commented] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base

2013-11-14 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823183#comment-13823183 ] Hudson commented on NUTCH-1621: SUCCESS: Integrated in Nutch-trunk #2421 (See [https://bui

[jira] [Commented] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base

2013-11-14 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823179#comment-13823179 ] Hudson commented on NUTCH-1621: FAILURE: Integrated in Nutch-nutchgora #819 (See [https://

Build failed in Jenkins: Nutch-nutchgora #819

2013-11-14 Thread Apache Jenkins Server
See Changes: [jnioche] Removed all in one Crawl class (NUTCH-1621) [...truncated 1071 lines...] A src/java/org/apache/nutch/plugin/ExtensionPoint.java A src/java/org/apache/nutc

[jira] [Updated] (NUTCH-1646) IndexerMapReduce to consider DB status

2013-11-14 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1646: Attachment: NUTCH-1646-3.patch New patch: applies after changes for NUTCH-656, removed counte

[jira] [Commented] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823019#comment-13823019 ] Sebastian Nagel commented on NUTCH-656: Well done! Run successful test crawl with de

[jira] [Updated] (NUTCH-1668) Remove package org.apache.nutch.indexer.solr

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1668: Attachment: NUTCH-1668.patch Patch which removes the indexer.solr subpackage and deprecates the c

All in one Crawl class

2013-11-14 Thread Julien Nioche
See https://issues.apache.org/jira/browse/NUTCH-1621. The all-in-one Crawl class has now been removed from both trunk and 2.x. I will update the Wiki pages over the next couple of days to reflect this change. As of the next releases of Nutch, the crawl script will have to be used instead. It works just as well

[jira] [Resolved] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1621. Resolution: Fixed. Trunk: Committed revision 1541885. 2.x: Committed revision 1541886. I will

[jira] [Created] (NUTCH-1668) Remove package org.apache.nutch.indexer.solr

2013-11-14 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-1668: Summary: Remove package org.apache.nutch.indexer.solr Key: NUTCH-1668 URL: https://issues.apache.org/jira/browse/NUTCH-1668 Project: Nutch Issue Type: Task

[jira] [Resolved] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-656. Resolution: Fixed. Committed revision 1541883. Committed with a few minor changes compared to the

[jira] [Updated] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-656: Attachment: NUTCH-656.v3.patch correct attachment > DeleteDuplicates based on crawlDB only

[jira] [Updated] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-656: Attachment: (was: NUTCH-656.v3.patch) > DeleteDuplicates based on crawlDB only

[jira] [Updated] (NUTCH-656) DeleteDuplicates based on crawlDB only

2013-11-14 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-656: Attachment: NUTCH-656.v3.patch Thanks for your comments Seb. This new patch addresses some of the is

[jira] [Updated] (NUTCH-1667) Updatedb always ignore batchId

2013-11-14 Thread Nguyen Manh Tien (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nguyen Manh Tien updated NUTCH-1667: Attachment: NUTCH-1556-batchId.patch > Updatedb always ignore batchId

[jira] [Created] (NUTCH-1667) Updatedb always ignore batchId

2013-11-14 Thread Nguyen Manh Tien (JIRA)
Nguyen Manh Tien created NUTCH-1667: Summary: Updatedb always ignore batchId Key: NUTCH-1667 URL: https://issues.apache.org/jira/browse/NUTCH-1667 Project: Nutch Issue Type: Bug Affect