[ https://issues.apache.org/jira/browse/HADOOP-6307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782220#action_12782220 ]
Tsz Wo (Nicholas), SZE commented on HADOOP-6307: ------------------------------------------------ > SequenceFile.Reader actually do not need the file length. ... Thanks Chris and Arun pointing out that the file length (i.e. SequenceFile.Reader.end) cannot be removed. Otherwise, SequenceFile.Sorter won't work. I guess we have to introduce a new public constructor, which takes length as a parameter. So, that user applications could possibly pass the correct length when creating a new Reader. > Support reading on un-closed SequenceFile > ----------------------------------------- > > Key: HADOOP-6307 > URL: https://issues.apache.org/jira/browse/HADOOP-6307 > Project: Hadoop Common > Issue Type: Improvement > Components: io > Reporter: Tsz Wo (Nicholas), SZE > > When a SequenceFile.Reader is constructed, it calls > fs.getFileStatus(file).getLen(). However, fs.getFileStatus(file).getLen() > does not return the hflushed length for un-closed file since the Namenode > does not know the hflushed length. DFSClient have to ask a datanode for the > length last block which is being written; see also HDFS-570. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.