Re: Data anonymization in Nifi

2017-10-23 Thread Matt Burgess
Vyshali, The AnonymizeRecord processor does not yet exist, I just wrote up a Jira to track the addition of it possibly sometime in the future. For the scripted solution, you can add the location of the ARX JARs to the Module Directory property of ExecuteScript. If it is a flat directory of JARs

Re: Data anonymization in Nifi

2017-10-23 Thread Vyshali
Hi Matt, Thanks for the suggestion. It would be very much helpful if you can give the instruction on how to use the AnonymizeRecord processor. Please give some clarity on how to setup processor after downloading ARX jars I downloaded the jar from http://arx.deidentifier.org/downloads/

Nifi:how to handle with multiple flowfiles?

2017-10-23 Thread sally
I have request data which can be splitted into several flowfile for example request data for date 2017-02-02 can be splitted to four part 2017-02-02.1 ,2017-02-02.2 ( i use this partiotion because webservice uses paging and it can send only certain amount of data at a time (400 mb) and if

Re: Cron Job that runs HiveQL

2017-10-23 Thread Pierre Villard
Hi Daniel, You can add a GenerateFlowFile processor with the query you want to execute for the content of generated flow files, set the expected scheduling (cron scheduling), and connect it to a PutHiveQL that will execute the query from the content of the flow file. Does it make sense? Pierre

Cron Job that runs HiveQL

2017-10-23 Thread Daniel Müller
Is there a possibility to run a HiveQL processor, that runs just a "alter table abc concatenate" command like in a cron job, without having a pre- nor post-processor? I want to call the command above just in a regular time interval, without depending on other processes on my NiFi instance.