[ 
https://issues.apache.org/jira/browse/NIFI-3759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16019849#comment-16019849
 ] 

Mika Borner commented on NIFI-3759:
-----------------------------------

[~joewitt] File size matters when reading many files at the same time 
(http://blog.cloudera.com/blog/2009/02/the-small-files-problem).

Of course, one could write a post-write file merger job, but I think from an 
efficiency/simplicity point of view, it's better to write directly into the 
final format.

I understand there are certain concerns regarding safe writes, but also 
implementing an atomic file merger is not trivial.






> Enable Avro append for Put HDFS
> -------------------------------
>
>                 Key: NIFI-3759
>                 URL: https://issues.apache.org/jira/browse/NIFI-3759
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Core Framework
>    Affects Versions: 1.1.0, 1.1.1
>            Reporter: Jonas Hartwig
>            Priority: Minor
>
> It would be nice where Nifi already supports working with AVRO to enable hdfs 
> append working correctly with avro files coming from MergeContent processor.
> I suggest making a similar choice in as in MergeContent to be able to choose 
> a file format which applies file format specific logic when required.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to