Sebastian Nagel created NUTCH-3173:
--------------------------------------

             Summary: protocol-okhttp: store OkHttp's internal URL in response 
metadata
                 Key: NUTCH-3173
                 URL: https://issues.apache.org/jira/browse/NUTCH-3173
             Project: Nutch
          Issue Type: Improvement
          Components: plugin, protocol
    Affects Versions: 1.23
            Reporter: Sebastian Nagel
             Fix For: 1.23


OkHttp uses its 
[HttpUrl|https://square.github.io/okhttp/5.x/okhttp/okhttp3/-http-url/index.html]
 for HTTP requests. There are some differences between HttpURl and java.net.URL 
resp. java.net.URI. And the HttpUrl.parse may parse a URL string differently 
than Java's URL class.

It would be good to store the stringified HttpUrl in the response metadata, at 
least, if it differs from the original URL string. The 
[Request|https://square.github.io/okhttp/5.x/okhttp/okhttp3/-request/index.html]
 holds the HttpUrl object.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to