[ 
https://issues.apache.org/jira/browse/NIFI-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14727384#comment-14727384
 ] 

Sean Busbey commented on NIFI-919:
----------------------------------

if it's for dealing with interface to systems that already deal with just bare 
records, then I think it makes sense. though I'd want to make clear that folks 
shouldn't be opting for it unless they have such a system.

one nice advantage of sticking with datafile is we can have a mode where we 
only split on block boundaries, so that we can just deal with nice big blobs of 
bytes. suppose that'll have to either allow for "approximately N records" or 
just be an optimization when the count of records in blocks allows for it.

> Support Splitting Avro Files
> ----------------------------
>
>                 Key: NIFI-919
>                 URL: https://issues.apache.org/jira/browse/NIFI-919
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Bryan Bende
>            Assignee: Bryan Bende
>            Priority: Minor
>             Fix For: 0.4.0
>
>
> Provide a processor that splits an Avro file into multiple smaller files. 
> Would be nice to have a configurable batch size so a user could produce 
> single record files and also multi-record files of smaller size than the 
> original. Also consider making the output format configurable, data file vs 
> bare record.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to