PAX header parser fails for non-ASCII values
--------------------------------------------

                 Key: COMPRESS-184
                 URL: https://issues.apache.org/jira/browse/COMPRESS-184
             Project: Commons Compress
          Issue Type: Bug
          Components: Archivers
    Affects Versions: 1.3
            Reporter: Stefan Bodewig
            Assignee: Stefan Bodewig
             Fix For: 1.4


The current logic parsing PAX extension headers fails if the number of bytes 
used to encode an entry is different from the number of characters - i.e. for 
any character outside of the ASCII range as the headers are UTF-8 encoded.  E.g.

{noformat}
11 path=รค
{noformat}

takes 11 bytes (one has to account for the trailing newline) for 10 characters 
and the parser fails with "Expected 3 chars, read 2"

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


Reply via email to