[
https://issues.apache.org/jira/browse/HTTPCORE-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17934271#comment-17934271
]
Oleg Kalnichevski edited comment on HTTPCORE-778 at 3/11/25 7:04 PM:
---------------------------------------------------------------------
> As per RFC3986, quite a bit more characters should not be encoded:
[~peterhalicky] What statement in RFC 3986 actually supports this assertion?
This section of the specification defines what characters are valid for
fragment component. There is nothing that I can see that makes encoding of
reserved characters illegal or not recommended.
Oleg
was (Author: olegk):
> As per RFC3986, quite a bit more characters should not be encoded:
[~peterhalicky] What statement in RFC 3986 actually supports this assertion?
This session of the specification defines what characters are valid for
fragment component. There is nothing that I can see that makes encoding of
reservesd characters illegal or not recommended.
Oleg
> URIBuilder uses incorrect encoding method for URI fragment
> ----------------------------------------------------------
>
> Key: HTTPCORE-778
> URL: https://issues.apache.org/jira/browse/HTTPCORE-778
> Project: HttpComponents HttpCore
> Issue Type: Bug
> Components: HttpCore
> Affects Versions: 5.3.3
> Reporter: Peter Halicky
> Priority: Major
>
> URI fragment is encoded in URIBuilder using:
> {code:java}
> PercentCodec.encode(sb, this.fragment, this.charset); {code}
> (line 401, end of buildString method)
> This encodes all characters except UNRESERVED using the percent-format.
> As per (obsoleted) RFC2396, URI fragment should use URIC safe-chars.
> As per RFC3986, quite a bit more characters should not be encoded:
> {code:java}
> pct-encoded = "%" HEXDIG HEXDIG
> unreserved = ALPHA / DIGIT / "-" / "." / "_" / "~"
> sub-delims = "!" / "$" / "&" / "'" / "(" / ")" / "*" / "+" / "," / ";" /
> "="
> pchar = unreserved / pct-encoded / sub-delims / ":" / "@"
> fragment = *( pchar / "/" / "?" ) {code}
> Note that URIBuilder in httpclient 4.5.13 conforms to at least the old
> RFC2396, as it uses URIC set of safe characters (i.e. this is in fact a
> regression).
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]