[
https://issues.apache.org/jira/browse/NIFI-14095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18043965#comment-18043965
]
ASF subversion and git services commented on NIFI-14095:
--------------------------------------------------------
Commit 5d56f553b04dfdfae5d7d48c48de48efe6b4fdbf in nifi's branch
refs/heads/main from Michael W Moser
[ https://gitbox.apache.org/repos/asf?p=nifi.git;h=5d56f553b0 ]
NIFI-14095 Updated GetFile default Polling Interval to 30 seconds (#10618)
Signed-off-by: David Handermann <[email protected]>
> GetFile - "KeepSourceFile" set to true can fill up content repository
> ---------------------------------------------------------------------
>
> Key: NIFI-14095
> URL: https://issues.apache.org/jira/browse/NIFI-14095
> Project: Apache NiFi
> Issue Type: Improvement
> Components: Configuration
> Affects Versions: 2.0.0, 1.28.1
> Reporter: Filip Maretić
> Assignee: Michael W Moser
> Priority: Major
> Labels: GetFile, ListFile
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> Just setting the *KeepSourceFile* property to *true* can cause continuous
> ingestion of files into NiFi. If the file is big (e.g. 20 GB) this can cause
> the content repository (e.g. size of 400 GB) to be filled in an instant. This
> renders the NiFi node unusable and a cleanup is needed. There is no reason
> for this to happen, the flow should at least have enough time to process a
> chunk of such a huge file before attempting to load the same file again.
> A quick solution would be just to change the default Polling Interval to "10
> sec"
> -@DefaultSchedule(strategy = SchedulingStrategy.TIMER_DRIVEN, period = "1
> min")-
> -This is anyway present on the ListFile processor, so why not to add it here
> also? if the user really wants to set this to 0 seconds I guess he should be
> aware of the consequences.-
--
This message was sent by Atlassian Jira
(v8.20.10#820010)