[jira] [Created] (NUTCH-3046) Use compact strings

2024-04-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3046: --- Summary: Use compact strings Key: NUTCH-3046 URL: https://issues.apache.org/jira/browse/NUTCH-3046 Project: Nutch Issue Type: Sub-task

[jira] [Created] (NUTCH-3045) Upgrade from Java 11 to 17

2024-04-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3045: --- Summary: Upgrade from Java 11 to 17 Key: NUTCH-3045 URL: https://issues.apache.org/jira/browse/NUTCH-3045 Project: Nutch Issue Type: Task

[ANNOUNCE] Apache Nutch 1.20 Release

2024-04-28 Thread lewis john mcgibbney
The Apache Nutch Project https://nutch.apache.org/download/ Please verify signatures using the KEYS file https://raw.githubusercontent.com/apache/nutch/master/KEYS when downloading the release. This release includes more than 60 bug fixes and improvements, the full list of changes can be seen in

[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails

2024-04-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17841682#comment-17841682 ] ASF GitHub Bot commented on NUTCH-3044: --- lewismc commented on PR #815: URL:

Re: [PR] NUTCH-3044 Generator: NPE when extracting the host part of a URL fails [nutch]

2024-04-28 Thread via GitHub
lewismc commented on PR #815: URL: https://github.com/apache/nutch/pull/815#issuecomment-2081564107 Excellent @sebastian-nagel +1 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters

2024-04-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17841681#comment-17841681 ] ASF GitHub Bot commented on NUTCH-3043: --- lewismc commented on PR #814: URL:

Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]

2024-04-28 Thread via GitHub
lewismc commented on PR #814: URL: https://github.com/apache/nutch/pull/814#issuecomment-2081563229 Excellent @sebastian-nagel  -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [DISCUSS] Consolidating Nutch Continuous Integration

2024-04-28 Thread Sebastian Nagel
Hi Lewis, > The Jenkins job used to be run nightly but > no longer is. It pulls nightly from git: https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/scmPollLog/ but a build is only run if there are new commits. The latest one: