[ 
https://issues.apache.org/jira/browse/HADOOP-6307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782220#action_12782220
 ] 

Tsz Wo (Nicholas), SZE commented on HADOOP-6307:
------------------------------------------------

> SequenceFile.Reader actually do not need the file length. ...

Thanks Chris and Arun pointing out that the file length (i.e. 
SequenceFile.Reader.end) cannot be removed.  Otherwise, SequenceFile.Sorter 
won't work.

I guess we have to introduce a new public constructor, which takes length as a 
parameter.  So, that user applications could possibly pass the correct length 
when creating a new Reader.

> Support reading on un-closed SequenceFile
> -----------------------------------------
>
>                 Key: HADOOP-6307
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6307
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: io
>            Reporter: Tsz Wo (Nicholas), SZE
>
> When a SequenceFile.Reader is constructed, it calls 
> fs.getFileStatus(file).getLen().  However, fs.getFileStatus(file).getLen() 
> does not return the hflushed length for un-closed file since the Namenode 
> does not know the hflushed length.  DFSClient have to ask a datanode for the 
> length last block which is being written; see also HDFS-570.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to