Jenkins build is back to normal : Nutch » Nutch-trunk #64

2021-12-17 Thread Apache Jenkins Server
See

[jira] [Created] (NUTCH-2920) Implement a indexer-opensearch plugin

2021-12-17 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2920: --- Summary: Implement a indexer-opensearch plugin Key: NUTCH-2920 URL: https://issues.apache.org/jira/browse/NUTCH-2920 Project: Nutch Issue

[jira] [Commented] (NUTCH-2919) Upgrade to Tika 2.2.0

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461771#comment-17461771 ] ASF GitHub Bot commented on NUTCH-2919: --- lewismc opened a new pull request #717: URL:

[GitHub] [nutch] lewismc opened a new pull request #717: NUTCH-2919 Upgrade to Tika 2.2.0

2021-12-17 Thread GitBox
lewismc opened a new pull request #717: URL: https://github.com/apache/nutch/pull/717 This PR addresses [NUTCH-2919](https://issues.apache.org/jira/browse/NUTCH-2919) I'm performing some tests. -- This is an automated message from the Apache Git Service. To respond to the message,

[jira] [Commented] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461770#comment-17461770 ] Hudson commented on NUTCH-2449: --- ABORTED: Integrated in Jenkins build Nutch » Nutch-trunk #63 (See

[jira] [Resolved] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2449. - Resolution: Fixed > Usage of Tika LanguageIdentifier in language-identifier

[jira] [Commented] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461766#comment-17461766 ] ASF GitHub Bot commented on NUTCH-2449: --- lewismc merged pull request #716: URL:

[GitHub] [nutch] lewismc merged pull request #716: NUTCH-2449 Replace Tika LanguageIdentifier in language-identifier

2021-12-17 Thread GitBox
lewismc merged pull request #716: URL: https://github.com/apache/nutch/pull/716 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461637#comment-17461637 ] Lewis John McGibbney commented on NUTCH-2278: - Out of curiosity [~Fengtan] are you still

[jira] [Comment Edited] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461635#comment-17461635 ] Lewis John McGibbney edited comment on NUTCH-2278 at 12/17/21, 7:48 PM:

[jira] [Commented] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461635#comment-17461635 ] Lewis John McGibbney commented on NUTCH-2278: - [~snagel] wdyt about this? > Handle alpha-2

[jira] [Commented] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461632#comment-17461632 ] ASF GitHub Bot commented on NUTCH-2449: --- lewismc commented on pull request #233: URL:

[jira] [Commented] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461631#comment-17461631 ] ASF GitHub Bot commented on NUTCH-2449: --- lewismc closed pull request #233: URL:

[jira] [Commented] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461630#comment-17461630 ] ASF GitHub Bot commented on NUTCH-2449: --- lewismc opened a new pull request #716: URL:

[GitHub] [nutch] lewismc commented on pull request #233: NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier

2021-12-17 Thread GitBox
lewismc commented on pull request #233: URL: https://github.com/apache/nutch/pull/233#issuecomment-996990713 Closing this PR off now in place of https://github.com/apache/nutch/pull/716 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [nutch] lewismc closed pull request #233: NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier

2021-12-17 Thread GitBox
lewismc closed pull request #233: URL: https://github.com/apache/nutch/pull/233 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [nutch] lewismc opened a new pull request #716: NUTCH-2449 Replace Tika LanguageIdentifier in language-identifier

2021-12-17 Thread GitBox
lewismc opened a new pull request #716: URL: https://github.com/apache/nutch/pull/716 This PR finishes off and therefore supersedes https://github.com/apache/nutch/pull/233 in a post-[NUTCH-2891](https://issues.apache.org/jira/browse/NUTCH-2891)/ b0cbea5 world :) @sebastian-nagel

[jira] [Commented] (NUTCH-2919) Upgrade to Tika 2.2.0

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461620#comment-17461620 ] Lewis John McGibbney commented on NUTCH-2919: - The artifacts have not yet made maven central.

[jira] [Commented] (NUTCH-2807) SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps

2021-12-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461303#comment-17461303 ] Hudson commented on NUTCH-2807: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #62 (See

[jira] [Commented] (NUTCH-2918) Upgrade to log4j 2.16.0

2021-12-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461301#comment-17461301 ] Hudson commented on NUTCH-2918: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #62 (See

[jira] [Commented] (NUTCH-2914) nutch-default.xml: remove obsolete and unused properties

2021-12-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461302#comment-17461302 ] Hudson commented on NUTCH-2914: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #62 (See

[jira] [Commented] (NUTCH-2808) Document side effects of ignoring robots.txt

2021-12-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461300#comment-17461300 ] Hudson commented on NUTCH-2808: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #62 (See

Build failed in Jenkins: Nutch » Nutch-trunk #62

2021-12-17 Thread Apache Jenkins Server
See Changes: [Sebastian Nagel] NUTCH-2808 Document side effects of ignoring robots.txt [Sebastian Nagel] Update documentation of protocol-related properties in [github] NUTCH-2918 Upgrade to log4j 2.16.0

[jira] [Resolved] (NUTCH-2914) nutch-default.xml: remove obsolete and unused properties

2021-12-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2914. Resolution: Implemented > nutch-default.xml: remove obsolete and unused properties >

[jira] [Commented] (NUTCH-2914) nutch-default.xml: remove obsolete and unused properties

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461292#comment-17461292 ] ASF GitHub Bot commented on NUTCH-2914: --- sebastian-nagel merged pull request #709: URL:

[GitHub] [nutch] sebastian-nagel merged pull request #709: NUTCH-2914 nutch-default.xml: remove obsolete and unused properties

2021-12-17 Thread GitBox
sebastian-nagel merged pull request #709: URL: https://github.com/apache/nutch/pull/709 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Resolved] (NUTCH-2807) SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps

2021-12-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2807. Resolution: Implemented > SitemapProcessor to warn that ignoring robots.txt affects

[jira] [Commented] (NUTCH-2807) SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461288#comment-17461288 ] ASF GitHub Bot commented on NUTCH-2807: --- sebastian-nagel merged pull request #710: URL:

[GitHub] [nutch] sebastian-nagel merged pull request #710: NUTCH-2807 SitemapProcessor to warn that ignoring robots.txt affects …

2021-12-17 Thread GitBox
sebastian-nagel merged pull request #710: URL: https://github.com/apache/nutch/pull/710 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Resolved] (NUTCH-2808) Document side effects of ignoring robots.txt

2021-12-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2808. Resolution: Implemented > Document side effects of ignoring robots.txt >

[jira] [Commented] (NUTCH-2808) Document side effects of ignoring robots.txt

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461284#comment-17461284 ] ASF GitHub Bot commented on NUTCH-2808: --- sebastian-nagel merged pull request #711: URL:

[GitHub] [nutch] sebastian-nagel merged pull request #711: NUTCH-2808 Document side effects of ignoring robots.txt

2021-12-17 Thread GitBox
sebastian-nagel merged pull request #711: URL: https://github.com/apache/nutch/pull/711 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Resolved] (NUTCH-2918) Upgrade to log4j 2.16.0

2021-12-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2918. Resolution: Fixed > Upgrade to log4j 2.16.0 > --- > >

[jira] [Commented] (NUTCH-2918) Upgrade to log4j 2.16.0

2021-12-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461280#comment-17461280 ] ASF GitHub Bot commented on NUTCH-2918: --- sebastian-nagel merged pull request #715: URL:

[GitHub] [nutch] sebastian-nagel merged pull request #715: NUTCH-2918 Upgrade to log4j 2.16.0

2021-12-17 Thread GitBox
sebastian-nagel merged pull request #715: URL: https://github.com/apache/nutch/pull/715 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: