[ https://issues.apache.org/jira/browse/HADOOP-6097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768015#action_12768015 ]
Hudson commented on HADOOP-6097:
--------------------------------

Integrated in Hadoop-Common-trunk-Commit #64 (See [http://hudson.zones.apache.org/hudson/job/Hadoop-Common-trunk-Commit/64/])
    Fix Path conversion in makeQualified and reset LineReader byte count at the start of each block in Hadoop archives. Contributed by Ben Slusky, Tom White, and Mahadev Konar

> Multiple bugs w/ Hadoop archives
> --------------------------------
>
>                 Key: HADOOP-6097
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6097
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs
>    Affects Versions: 0.18.0, 0.18.1, 0.18.2, 0.18.3, 0.19.0, 0.19.1, 0.19.2, 0.20.0, 0.20.1
>            Reporter: Ben Slusky
>            Assignee: Ben Slusky
>             Fix For: 0.20.2
>
>         Attachments: HADOOP-6097-0.20.patch, HADOOP-6097-0.20.patch, HADOOP-6097-0.20.patch, HADOOP-6097-v2.patch, HADOOP-6097.patch
>
>
> Found and fixed several bugs involving Hadoop archives:
> - In makeQualified(), the sloppy conversion from Path to URI and back mangles
> the path if it contains an escape-worthy character.
> - It's possible that fileStatusInIndex() may have to read more than one
> segment of the index. The LineReader and count of bytes read need to be reset
> for each block.
> - har:// connections cannot be indexed by (scheme, authority, username) --
> the path is significant as well. Caching them in this way limits a hadoop
> client to opening one archive per filesystem. It seems to be safe not to
> cache them, since they wrap another connection that does the actual
> networking.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
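
The first bug in the description (a path mangled by a Path to URI round trip) can be illustrated with plain java.net.URI. This is only a sketch of the general failure mode, not the Hadoop code or the attached patch: the class name, method names, host and sample path are all hypothetical. It assumes the mangling comes from reading back the percent-encoded form of the path instead of the decoded form.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical illustration; none of these names come from Hadoop.
final class PathUriRoundTrip {

    // Builds a har:// style URI from a raw path. The multi-argument URI
    // constructor percent-encodes characters that are illegal in a path.
    static URI toUri(String rawPath) {
        try {
            return new URI("har", "host", rawPath, null);
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // Lossy round trip: returns the encoded path, so "my file" comes back
    // as "my%20file" and no longer names the same file.
    static String sloppyRoundTrip(String rawPath) {
        return toUri(rawPath).getRawPath();
    }

    // Faithful round trip: getPath() decodes the escapes again.
    static String carefulRoundTrip(String rawPath) {
        return toUri(rawPath).getPath();
    }
}
```

Calling sloppyRoundTrip("/user/ben/my file.har") yields a path containing %20, while carefulRoundTrip returns the input unchanged. Whether makeQualified() failed in exactly this way is not stated in the issue text; the patch itself is the authority.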