sebastian-nagel opened a new pull request, #733:
URL: https://github.com/apache/nutch/pull/733

   Fixes to address the failing of Nutch jobs in (pseudo-)distributed mode. 
Implements:
   - caching of URLStreamHandlers per protocol to avoid that handlers are 
created anew
   
   - enforce routing of standard protocols (http, https, file, jar) to handlers 
implemented by the JVM
     - utilizes the URLStreamHandler cache
     - fixes NUTCH-2936 (verified in pseudo-distributed mode)
   
   Also:
   - code improvements in classes of the package "org.apache.nutch.plugin"
     - use `Class<?>` and remove suppressions of warnings
     - javadocs: fix typos
     - remove superfluous white space
     - autoformat using code style template
   - protocol-okhttp: initialize SSLContext not in a static code block 
(SSLContext is used to ignore SSL/TLS certificate verification): this was the 
initial fix for the needless warning in parsechecker even in local mode. This 
seems also fixed by the enforced routing of standard URLStreamHandlers, but I 
left it in, to avoid that all the testing in pseudo-distributed mode needs to 
be run again.
   
   Next week I will test the fixes in real distributed mode.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to