Re: [ANNONCEMENT] Apache Nutch 1.8 Release

2014-03-17 Thread Markus Jelsma
Thanks lewis!Lewis John Mcgibbney lewis.mcgibb...@gmail.com schreef:Good Evening, The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.8. Apache Nutch is a highly extensible and scalable open source web crawler software project. Stemming from Apache Lucene,

[jira] [Created] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-17 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1738: --- Summary: Expose number of URLs generated per batch in GeneratorJob Key: NUTCH-1738 URL: https://issues.apache.org/jira/browse/NUTCH-1738 Project: Nutch

[jira] [Commented] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13937642#comment-13937642 ] Lewis John McGibbney commented on NUTCH-1738: - This concept could also be

[jira] [Assigned] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1738: --- Assignee: Lewis John McGibbney Expose number of URLs generated per batch in

Re: How do I customize Nutch to cater to existing SOLR schema

2014-03-17 Thread tripiy
Hi Lajos, Appreciate ur help in providing the patch which would definitely improve the usability of the product. For now we have resolved the unique field issue using the following changes to solrindex-mapping.xml: field dest=_uniqueid source=url/ copyField source=url dest=_uniqueid/ For

[jira] [Resolved] (NUTCH-1671) indexchecker to add digest field

2014-03-17 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-1671. Resolution: Fixed Committed to trunk r1578616 and 2.x r1578620. indexchecker to add

[jira] [Commented] (NUTCH-1671) indexchecker to add digest field

2014-03-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938653#comment-13938653 ] Hudson commented on NUTCH-1671: --- SUCCESS: Integrated in Nutch-nutchgora #957 (See

[jira] [Commented] (NUTCH-1671) indexchecker to add digest field

2014-03-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938662#comment-13938662 ] Hudson commented on NUTCH-1671: --- SUCCESS: Integrated in Nutch-trunk #2568 (See

[GitHub] nutch pull request: Patch for fixing coding bug

2014-03-17 Thread ysc
Github user ysc closed the pull request at: https://github.com/apache/nutch/pull/2 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled