Hello, DetectDuplicate uses a DistributedMapCacheClientService which would be connecting to a DistributedMapCacheServer on one of your nodes.
So all nodes should be connecting to the same cache server which is where the information about previously seen data is stored. -Bryan On Tue, Apr 10, 2018 at 1:24 PM, Eric Ulicny <[email protected]> wrote: > Hello, > > We have a use case where we execute processors on all nodes but would like > to use the detect duplicate processor to ensure records are unique. We are > observing that we must run it on one node to truly detect duplicates. Is > there any way to merge flowfiles from all running executors? > > -Eric
