sebastian-nagel opened a new pull request, #733: URL: https://github.com/apache/nutch/pull/733
Fixes to address the failing of Nutch jobs in (pseudo-)distributed mode. Implements: - caching of URLStreamHandlers per protocol to avoid that handlers are created anew - enforce routing of standard protocols (http, https, file, jar) to handlers implemented by the JVM - utilizes the URLStreamHandler cache - fixes NUTCH-2936 (verified in pseudo-distributed mode) Also: - code improvements in classes of the package "org.apache.nutch.plugin" - use `Class<?>` and remove suppressions of warnings - javadocs: fix typos - remove superfluous white space - autoformat using code style template - protocol-okhttp: initialize SSLContext not in a static code block (SSLContext is used to ignore SSL/TLS certificate verification): this was the initial fix for the needless warning in parsechecker even in local mode. This seems also fixed by the enforced routing of standard URLStreamHandlers, but I left it in, to avoid that all the testing in pseudo-distributed mode needs to be run again. Next week I will test the fixes in real distributed mode. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org