Hey Jeremy, 

Something linke this 
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.5.0/org.apache.nifi.processors.standard.DetectDuplicate/index.html
 
<https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.5.0/org.apache.nifi.processors.standard.DetectDuplicate/index.html>
 ? 


> On 15. Feb 2021, at 04:45, Jeremy Pemberton-Pigott <fuzzych...@gmail.com> 
> wrote:
> 
> Hi everyone, I'm wondering if there is a Detect Duplicate processor that can 
> read records from a flow file and as output gives just the non-duplicates 
> (can be single records or a group of non-duplicates would be better).  I want 
> to use a record reader to avoid splitting the json content into 10000s of 
> flow files to detect the duplicates.  Immediately after this flow is a record 
> reader/writer going to HBase.
> 
> Jeremy

Reply via email to