Jan Filipiak created HDFS-8836:
----------------------------------

             Summary: Skip newline on empty files with getMerge -nl
                 Key: HDFS-8836
                 URL: https://issues.apache.org/jira/browse/HDFS-8836
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: hdfs-client
    Affects Versions: 2.7.1, 2.6.0
            Reporter: Jan Filipiak
            Priority: Trivial


Hello everyone,

I recently was in the need of using the new line option -nl with getMerge 
because the files I needed to merge simply didn't had one. I was merging all 
the files from one directory and unfortunately this directory also included 
empty files, which effectively led to multiple newlines append after some 
files. I needed to remove them manually afterwards.

In this situation it is maybe good to have another argument that allows
skipping empty files.

Thing one could try to implement this feature:

The call for IOUtils.copyBytes(in, out, getConf(), false); doesn't
return the number of bytes copied which would be convenient as one could
skip append the new line when 0 bytes where copied or one would check the file 
size before.

I posted this Idea on the mailing list 
http://mail-archives.apache.org/mod_mbox/hadoop-user/201507.mbox/%3C55B25140.3060005%40trivago.com%3E
 but I didn't really get many responses, so I thought I my try this way.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to