Re: initiating a machine learning script on a remote server

2020-06-25 Thread Darren Govoni
Quick answer is you could just execute a ssh command to execute on the remote machine. If you need flowfiles to go remote, nifi supports remote processor groups. Sent from my Verizon, Samsung Galaxy smartphone

initiating a machine learning script on a remote server

2020-06-25 Thread Mike Sofen
I've been prototyping various functionality on nifi, initially on a Windows laptop, now on a single GCP Linux instance (for now), using the more basic processors for files and databases. It's really a superb platform. What I now need to solve for is firing a python machine learning script

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread Andy LoPresto
The resulting flowfile will always have at least two attributes because the whole match is extracted as an attribute and every capture group is extracted as an attribute, and the expression must contain at least one capture group. What is the objective you are trying to accomplish? If you want

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread muhyid72
Hi Andy, Thank you for your quick answer and interest. Actually I tried that but there were still 2 attributes on the flow file. As far as I understand it is by design, I can't set just one attribute, it has at least 2. Am i right? Can I use Route Text Processor instead of Extract Text (I have

Re: Replacing a base64-encoded field in a JSON-document with its decoded/converted value

2020-06-25 Thread Andy LoPresto
Hi Bjørn, No, XML to JSON conversion is not an Expression Language feature. You’ll need to either get this data into a flowfile as the complete content to perform the conversion with existing built-in tools, or add that step to your Groovy script. With that additional requirement, I think

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread Andy LoPresto
The regex you’re using contains a capture group, and so the entire string is captured as one attribute, and then the contained capture groups are also extracted as attributes. You can set the property “Include Capture Group 0” to false to remove one of them. The others are provided as expected.

Re: NiFi 1.11 - non-heap size

2020-06-25 Thread Valentina Ivanova
I see. Thanks again! Valentina From: Joe Witt Sent: Thursday, 25 June 2020 17:54 To: users@nifi.apache.org Subject: Re: NiFi 1.11 - non-heap size Since the max is undefined if more memory is needed the size will grow and the current usage may ebb and flow thus

Re: NiFi 1.11 - non-heap size

2020-06-25 Thread Joe Witt
Since the max is undefined if more memory is needed the size will grow and the current usage may ebb and flow thus leaving a difference which will/should likely always be relatively small. But things look good/healthy in terms of mem on that system as of now On Thu, Jun 25, 2020 at 8:51 AM

Re: NiFi 1.11 - non-heap size

2020-06-25 Thread Valentina Ivanova
Hi Joe, Thanks for the quick reply! What about the used and free non-heap - free non-heap is 14 MB only while used is 224MB? Thanks Valentina From: Joe Witt Sent: Thursday, 25 June 2020 17:35 To: users@nifi.apache.org Subject: Re: NiFi 1.11 - non-heap size

Re: NiFi 1.11 - non-heap size

2020-06-25 Thread Joe Witt
-1 in that case means it is on a system or configuration for which the JVM cannot get that answer. Nothing to worry about. On Thu, Jun 25, 2020 at 8:33 AM Valentina Ivanova wrote: > Hello again! > > I have been looking at the system diagnostics and noticed the values for > non-heap size

NiFi 1.11 - non-heap size

2020-06-25 Thread Valentina Ivanova
Hello again! I have been looking at the system diagnostics and noticed the values for non-heap size (screenshot attached). I am running NiFi 1.11 with openjdk version 1.8.0_181. It seems I have too little non-heap memory free and the max value is set to -1 (which seems strange). Shall I be

Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread muhyid72
Dear All I need an information about Flow Files Attribute of Extract Text Processor. My flow is that; 1. Getting IIS Log files from Azure Blob Storage 2. Splitting each IIS Log File to line by line with Split Text Processor. 2.1. Line Split Count:1 2.2. Maximum Fragment Size: No value set 2.3.

Re: NiFi 1.11 - High repository storage usage

2020-06-25 Thread Valentina Ivanova
Hi Wesley & Harald, Thanks for the quick replies! @Wesley C. Dias de Oliveira I have recently increased these settings due to other reasons, so that might be the primary cause. @Harald You are right, they are on the same drive. I assumed that usage is relative

AW: NiFi 1.11 - High repository storage usage

2020-06-25 Thread Dobbernack, Harald (Key-Work)
* When I check the size of the respective folders on the disk they don't even add up to 12,24 GB shown on the screenshot. I'm guessing you have other stuff on the drive or partition as well and not only the repositories. Von: Dobbernack, Harald (Key-Work) Gesendet: Donnerstag, 25. Juni

AW: NiFi 1.11 - High repository storage usage

2020-06-25 Thread Dobbernack, Harald (Key-Work)
Hi, presumably all three respositories are on the same partition or drive? I believe the screenshot view you posted shows what the OS reports as the space usage of the whole partition/drive on which the repositories are sitting. Best practice would be to place each repository on its own

Re: NiFi 1.11 - High repository storage usage

2020-06-25 Thread Wesley C. Dias de Oliveira
Hi, Valentina. I've experienced the same issue on an old installation. In my case, it was related to memory usage. The system begins to dump things to the disk when there's no available memory. Have you checked the memory( params? # JVM memory settings java.arg.2=-Xms512m java.arg.3=-Xmx512m

NiFi 1.11 - High repository storage usage

2020-06-25 Thread Valentina Ivanova
Hello! I see (screenshot attached) quite high (86%) storage usage for all three Flow File, Content & Provenance Repositories in the System Diagnostics. When I check the size of the respective folders on the disk they don't even add up to 12,24 GB shown on the screenshot. I also have the

SV: Replacing a base64-encoded field in a JSON-document with its decoded/converted value

2020-06-25 Thread Myklebust , Bjørn Magnar
Thanks Andy. The XML-content is around 5 kB-ish. But I also need to convert the XML to JSON before replacing it back into the original JSON-file. Can this be done with e.g a ConvertAttribute before the ReplaceText? Thanks, Bjørn Fra: Andy LoPresto Sendt: onsdag 24. juni 2020 17:24 Til:

Re: Indications in the UI of which cluster node hosts a “stuck” thread?

2020-06-25 Thread James McMahon
This does help, thank you Matt. And I like your suggestion. It would be more at our fingertips if as we hover over the thread count on the processor, the distribution across all cluster nodes is presented in a popup. I wonder if project leads would consider this helpful improvement? I can now see