[jira] [Updated] (NUTCH-1382) Adding support for EmbeddedSolrServer to SolrIndexer

2012-06-08 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emre Çelikten updated NUTCH-1382: - Attachment: embeddedsolrserver.patch > Adding support for EmbeddedSolrServer to SolrIndexer >

[jira] [Created] (NUTCH-1382) Adding support for EmbeddedSolrServer to SolrIndexer

2012-06-08 Thread JIRA
Emre Çelikten created NUTCH-1382: Summary: Adding support for EmbeddedSolrServer to SolrIndexer Key: NUTCH-1382 URL: https://issues.apache.org/jira/browse/NUTCH-1382 Project: Nutch Issue Type

VOTE Apache Nutch 2.0 RC1

2012-06-08 Thread lewis john mcgibbney
Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc/nutch-2.0 The release candidate is a src.zip, bin.zip, src.tar.gz and bin.tar.gz archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc1 Further, a st

[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney

2012-06-08 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "Release_HOWTO" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Release_HOWTO?action=diff&rev1=14&rev2=15 1. Remove the maven-ant-tasks jar from t

[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney

2012-06-08 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "Release_HOWTO" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Release_HOWTO?action=diff&rev1=13&rev2=14 1. Update version numbers (from X.Y-dev to

[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization

2012-06-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291789#comment-13291789 ] Lewis John McGibbney commented on NUTCH-1352: - Markus feel free to commit this

[jira] [Closed] (NUTCH-1361) Fix mishandling of malformed urls in generator job

2012-06-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1361. --- Resolution: Fixed > Fix mishandling of malformed urls in generator job > ---

[jira] [Commented] (NUTCH-1320) IndexChecker and ParseChecker choke on IDN's

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291683#comment-13291683 ] Hudson commented on NUTCH-1320: --- Integrated in Nutch-trunk #1865 (See [https://builds.apach

[jira] [Commented] (NUTCH-1336) Optionally not index db_notmodified pages

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291686#comment-13291686 ] Hudson commented on NUTCH-1336: --- Integrated in Nutch-trunk #1865 (See [https://builds.apach

[jira] [Commented] (NUTCH-1346) Follow outlinks to ignore external

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291684#comment-13291684 ] Hudson commented on NUTCH-1346: --- Integrated in Nutch-trunk #1865 (See [https://builds.apach

[jira] [Commented] (NUTCH-1351) DomainStatistics to aggregate by TLD

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291685#comment-13291685 ] Hudson commented on NUTCH-1351: --- Integrated in Nutch-trunk #1865 (See [https://builds.apach

[jira] [Commented] (NUTCH-1381) Allow to override default subcollection field name

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291682#comment-13291682 ] Hudson commented on NUTCH-1381: --- Integrated in Nutch-trunk #1865 (See [https://builds.apach

[jira] [Commented] (NUTCH-1336) Optionally not index db_notmodified pages

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291628#comment-13291628 ] Hudson commented on NUTCH-1336: --- Integrated in nutch-trunk-maven #302 (See [https://builds.

[jira] [Commented] (NUTCH-1262) Map `duplicating` content-types to a single type

2012-06-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291602#comment-13291602 ] Markus Jelsma commented on NUTCH-1262: -- I'll commit this one in the next few days unl

[jira] [Commented] (NUTCH-1024) Dynamically set fetchInterval by MIME-type

2012-06-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291601#comment-13291601 ] Markus Jelsma commented on NUTCH-1024: -- I'll commit this one in the next few days unl

[jira] [Resolved] (NUTCH-1336) Optionally not index db_notmodified pages

2012-06-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1336. -- Resolution: Fixed Committed for 1.6 in rev. 1347909. > Optionally not index db

[jira] [Commented] (NUTCH-1346) Follow outlinks to ignore external

2012-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291596#comment-13291596 ] Hudson commented on NUTCH-1346: --- Integrated in nutch-trunk-maven #301 (See [https://builds.

[jira] [Resolved] (NUTCH-1346) Follow outlinks to ignore external

2012-06-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1346. -- Resolution: Fixed Committed for 1.6 in rev. 1347897. > Follow outlinks to igno