Thanks for the suggestion Juan.

I tried to make that change and it didn’t seem to make a significant 
difference. The breakdown into bins was a little different - different numbers 
of flow files were merged together - but the end result was the same

-Tim

> On Aug 28, 2018, at 11:11 AM, Juan Sequeiros <helloj...@gmail.com> wrote:
> 
> Hi,
> 
> For:
> 
> Correlation Attribute Name: ${myFlowfileAttributeName}
> Should be set to: myFlowfileAttributeName
> 
> NOT ${myFlowfileAttibuteName)
> 
> Hope that helps.
> 
> On Tue, Aug 28, 2018 at 12:07 PM Tim Dean <tim.d...@gmail.com 
> <mailto:tim.d...@gmail.com>> wrote:
> I have a flow that sends a large number of JSON files into a MergeContent 
> processor. The job of that processor is to combine all the incoming flow 
> files with a particular flow file attribute into a single flow file, creating 
> a JSON array containing each of the input flow files’ JSON.
> 
> I have configured the MergeContent as processor as follows:
> Merge Strategy: Bin-Packing Algorithm
> Merge Format: Binary Concatenation
> Correlation Attribute Name: ${myFlowfileAttributeName}
> Minimum number of entries: 1
> Maximum number of entries: 5000
> Minimum group size: 0 B
> Maximum group size: <no value set>
> Max bin age: 30 min
> Maximum number of bins: 10
> Delimiter strategy: Text
> Header: [
> Footer: ]
> Demarcator: ,
> 
> When I run data through this flow, I am seeing a large number of small-ish 
> merged flow files being sent to the merged relationship, I was expecting ALL 
> of the files for a given flow file attribute value to be binned together, but 
> they are not coming through that way. To give a example, I pushed through 
> data containing 262 input JSON files. Of these 262, 2 of them have a flow 
> file attribute value of ‘A’, 2 of them have a flow file attribute value of 
> ‘B’, and 258 have a flow file attribute of ‘C’. I was expecting the merged 
> relationship to deliver 3 flow files, one each for value A, B, and C. But.I 
> am seeing 24 flow files on the merged relationship, 1 for a value of A, 1 for 
> a value of B, and 22 of varying sizes with the value of C.
> 
> Can someone help me understand what other criteria MergeContent might be 
> using to determine when to send along its merged flow files?
> 
> Thanks
> 
> 
> -- 
> Juan Carlos Sequeiros

Reply via email to