Thanks for the suggestion Juan. I tried to make that change and it didn’t seem to make a significant difference. The breakdown into bins was a little different - different numbers of flow files were merged together - but the end result was the same
-Tim > On Aug 28, 2018, at 11:11 AM, Juan Sequeiros <helloj...@gmail.com> wrote: > > Hi, > > For: > > Correlation Attribute Name: ${myFlowfileAttributeName} > Should be set to: myFlowfileAttributeName > > NOT ${myFlowfileAttibuteName) > > Hope that helps. > > On Tue, Aug 28, 2018 at 12:07 PM Tim Dean <tim.d...@gmail.com > <mailto:tim.d...@gmail.com>> wrote: > I have a flow that sends a large number of JSON files into a MergeContent > processor. The job of that processor is to combine all the incoming flow > files with a particular flow file attribute into a single flow file, creating > a JSON array containing each of the input flow files’ JSON. > > I have configured the MergeContent as processor as follows: > Merge Strategy: Bin-Packing Algorithm > Merge Format: Binary Concatenation > Correlation Attribute Name: ${myFlowfileAttributeName} > Minimum number of entries: 1 > Maximum number of entries: 5000 > Minimum group size: 0 B > Maximum group size: <no value set> > Max bin age: 30 min > Maximum number of bins: 10 > Delimiter strategy: Text > Header: [ > Footer: ] > Demarcator: , > > When I run data through this flow, I am seeing a large number of small-ish > merged flow files being sent to the merged relationship, I was expecting ALL > of the files for a given flow file attribute value to be binned together, but > they are not coming through that way. To give a example, I pushed through > data containing 262 input JSON files. Of these 262, 2 of them have a flow > file attribute value of ‘A’, 2 of them have a flow file attribute value of > ‘B’, and 258 have a flow file attribute of ‘C’. I was expecting the merged > relationship to deliver 3 flow files, one each for value A, B, and C. But.I > am seeing 24 flow files on the merged relationship, 1 for a value of A, 1 for > a value of B, and 22 of varying sizes with the value of C. > > Can someone help me understand what other criteria MergeContent might be > using to determine when to send along its merged flow files? > > Thanks > > > -- > Juan Carlos Sequeiros