Re: [VOTE] Apache Nutch 1.20 Release

2024-04-11 Thread Lewis John McGibbney
Hi Seb, On 2024/04/11 13:30:53 Sebastian Nagel wrote: > > https://github.com/sebastian-nagel/nutch-test-single-node-cluster/ I think we should make this into an integration test suite and run it as part of CI. I’ve been meaning and wanting to do this for the __longest__ time…! > > One

[jira] [Commented] (NUTCH-3040) Upgrade to Hadoop 3.4.0

2024-04-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17836191#comment-17836191 ] Tim Allison commented on NUTCH-3040: :cry-sob: This is great news! > Upgrade to Hadoop 3.4.0 >

[jira] [Created] (NUTCH-3040) Upgrade to Hadoop 3.4.0

2024-04-11 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3040: -- Summary: Upgrade to Hadoop 3.4.0 Key: NUTCH-3040 URL: https://issues.apache.org/jira/browse/NUTCH-3040 Project: Nutch Issue Type: Improvement

Re: [VOTE] Apache Nutch 1.20 Release

2024-04-11 Thread Sebastian Nagel
Hi Lewis, here's my +1 * signatures of release packages are valid * build from the source package successful, unit tests pass * tested a few Nutch tools in the binary package (local mode) * ran a sample crawl and tested many Nutch tools on a single-node cluster running Hadoop 3.4.0, see

[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs

2024-04-11 Thread Markus Jelsma (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17836133#comment-17836133 ] Markus Jelsma commented on NUTCH-3039: -- Thanks for spotting that! > Failure to handle ftp:// URLs >

[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs

2024-04-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17836126#comment-17836126 ] ASF GitHub Bot commented on NUTCH-3039: --- sebastian-nagel opened a new pull request, #812: URL:

[jira] [Assigned] (NUTCH-3039) Failure to handle ftp:// URLs

2024-04-11 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-3039: -- Assignee: Sebastian Nagel > Failure to handle ftp:// URLs >

[PR] NUTCH-3039 Failure to handle ftp:// URLs [nutch]

2024-04-11 Thread via GitHub
sebastian-nagel opened a new pull request, #812: URL: https://github.com/apache/nutch/pull/812 Pass ftp:// URLs to the standard JVM URLStreamHandler -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[jira] [Created] (NUTCH-3039) Failure to handle ftp:// URLs

2024-04-11 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3039: -- Summary: Failure to handle ftp:// URLs Key: NUTCH-3039 URL: https://issues.apache.org/jira/browse/NUTCH-3039 Project: Nutch Issue Type: Bug