[ https://issues.apache.org/jira/browse/NIFI-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14727384#comment-14727384 ]
Sean Busbey commented on NIFI-919: ---------------------------------- if it's for dealing with interface to systems that already deal with just bare records, then I think it makes sense. though I'd want to make clear that folks shouldn't be opting for it unless they have such a system. one nice advantage of sticking with datafile is we can have a mode where we only split on block boundaries, so that we can just deal with nice big blobs of bytes. suppose that'll have to either allow for "approximately N records" or just be an optimization when the count of records in blocks allows for it. > Support Splitting Avro Files > ---------------------------- > > Key: NIFI-919 > URL: https://issues.apache.org/jira/browse/NIFI-919 > Project: Apache NiFi > Issue Type: New Feature > Reporter: Bryan Bende > Assignee: Bryan Bende > Priority: Minor > Fix For: 0.4.0 > > > Provide a processor that splits an Avro file into multiple smaller files. > Would be nice to have a configurable batch size so a user could produce > single record files and also multi-record files of smaller size than the > original. Also consider making the output format configurable, data file vs > bare record. -- This message was sent by Atlassian JIRA (v6.3.4#6332)