[jira] [Created] (NUTCH-2949) Tasks of a multi-threaded map runner may fail because of slow creation of URL stream handlers

2022-05-19 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-2949: -- Summary: Tasks of a multi-threaded map runner may fail because of slow creation of URL stream handlers Key: NUTCH-2949 URL: https://issues.apache.org/jira/browse/NUTCH-2949

[GitHub] [nutch] sebastian-nagel commented on pull request #726: NUTCH-2936 Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode

2022-05-19 Thread GitBox
sebastian-nagel commented on PR #726: URL: https://github.com/apache/nutch/pull/726#issuecomment-1131460482 +1 Afaics, this PR addresses only code style, code conventions, Javadoc, etc. but does not change anything functionally. Maybe this should be reflected in the commit messages as well.

[jira] [Commented] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode

2022-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539430#comment-17539430 ] ASF GitHub Bot commented on NUTCH-2936: --- sebastian-nagel commented on PR #726: URL:

Final reminder: ApacheCon North America call for presentations closing soon

2022-05-19 Thread Rich Bowen
[Note: You're receiving this because you are subscribed to one or more Apache Software Foundation project mailing lists.] This is your final reminder that the Call for Presentations for ApacheCon North America 2022 will close at 00:01 GMT on Monday, May 23rd, 2022. Please don't wait! Get your talk

[GitHub] [nutch] sebastian-nagel merged pull request #728: NUTCH-2946 Fetcher: optionally slow down fetching from hosts with repeated exceptions

2022-05-19 Thread GitBox
sebastian-nagel merged PR #728: URL: https://github.com/apache/nutch/pull/728 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apac

[jira] [Commented] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions

2022-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539553#comment-17539553 ] ASF GitHub Bot commented on NUTCH-2946: --- sebastian-nagel merged PR #728: URL: https

[jira] [Resolved] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions

2022-05-19 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2946. Resolution: Implemented > Fetcher: optionally slow down fetching from hosts with repeated e

[GitHub] [nutch] sebastian-nagel commented on pull request #729: NUTCH-2947 Fetcher: keep state of empty fetch queues unless queue feeder is finished

2022-05-19 Thread GitBox
sebastian-nagel commented on PR #729: URL: https://github.com/apache/nutch/pull/729#issuecomment-1131710151 Updated to be based on master branch after merging NUTCH-2946/#728. The state of a queue is also preserved if `fetcher.exceptions.per.queue.delay` > 0.0 (in the discussion of NUTCH-29

[jira] [Commented] (NUTCH-2947) Fetcher: keep state of empty fetch queues unless queue feeder is finished

2022-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539564#comment-17539564 ] ASF GitHub Bot commented on NUTCH-2947: --- sebastian-nagel commented on PR #729: URL:

Jenkins build is back to normal : Nutch » Nutch-trunk #74

2022-05-19 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions

2022-05-19 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539600#comment-17539600 ] Hudson commented on NUTCH-2946: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #

[jira] [Created] (NUTCH-2950) UpdateHostDb: performance improvements

2022-05-19 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-2950: -- Summary: UpdateHostDb: performance improvements Key: NUTCH-2950 URL: https://issues.apache.org/jira/browse/NUTCH-2950 Project: Nutch Issue Type: Improvem

[GitHub] [nutch] sebastian-nagel opened a new pull request, #731: NUTCH-2950 UpdateHostDb: performance improvements

2022-05-19 Thread GitBox
sebastian-nagel opened a new pull request, #731: URL: https://github.com/apache/nutch/pull/731 (see NUTCH-2950 and commit messages)

[jira] [Commented] (NUTCH-2950) UpdateHostDb: performance improvements

2022-05-19 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539617#comment-17539617 ] Sebastian Nagel commented on NUTCH-2950: If desired I could also split the issue/

[jira] [Commented] (NUTCH-2950) UpdateHostDb: performance improvements

2022-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539616#comment-17539616 ] ASF GitHub Bot commented on NUTCH-2950: --- sebastian-nagel opened a new pull request,