Is it safe to manually remove some of the older files in the repository to avoid our disk from filling up?
On Wed, Jun 15, 2016 at 4:55 PM, Ricky Saltzer <ri...@cloudera.com> wrote: > Just a reminder, I just today noticed the "archive.enabled" option was > false and changed it to true. > > $ find . -type f -ls | grep archive | wc -l > 0 > > > > On Wed, Jun 15, 2016 at 4:53 PM, Mark Payne <marka...@hotmail.com> wrote: > >> OK, thanks. It doesn't appear that it believes there is anything to >> reclaim. >> >> Can you try going to your content repository and running: >> >> find . -type f -ls | grep archive >> >> Curious as to how much data it has archived. >> >> > On Jun 15, 2016, at 4:48 PM, Ricky Saltzer <ri...@cloudera.com> wrote: >> > >> > Oh sorry! Trying again >> > >> > [1] >> > >> https://gist.githubusercontent.com/rickysaltzer/b00196a3881c052df9b38b418722cd02/raw/279a1bc8c60530426732eb7b653de1f3f74574e2/gistfile1.txt >> > >> > >> > On Wed, Jun 15, 2016 at 4:38 PM, Ricky Saltzer <ri...@cloudera.com> >> wrote: >> > >> >> I should also mention, I just realized that our worker nodes are on >> 0.5.1, >> >> and for some reason I missed updating the master from 0.4.0. I'm sure >> that >> >> is not helping. >> >> >> >> On Wed, Jun 15, 2016 at 4:36 PM, Ricky Saltzer <ri...@cloudera.com> >> wrote: >> >> >> >>> Looks like the threads are parked and waiting [1] >> >>> >> >>> [1] >> >>> >> http://github.mtv.cloudera.com/gist/ricky/7a5d89f2eeba58e2206d/raw/0e2b446ca049a8b5f27298c700ac709772d2847c/gistfile1.txt >> >>> >> >>> On Wed, Jun 15, 2016 at 4:33 PM, Joe Witt <joe.w...@gmail.com> wrote: >> >>> >> >>>> thanks Ricky - then please take a look at mark's note as that is >> >>>> probably more relevant to your case. >> >>>> >> >>>> On Wed, Jun 15, 2016 at 4:32 PM, Ricky Saltzer <ri...@cloudera.com> >> >>>> wrote: >> >>>>> Hey Joe - >> >>>>> >> >>>>> The NiFi web UI currently reads as: >> >>>>> >> >>>>> Active threads: 3 >> >>>>> Queued: 10,173 / 0 bytes >> >>>>> Connected nodes: 2 / 2 >> >>>>> Stats last refreshed: 13:31:28 PDT >> >>>>> >> >>>>> >> >>>>> On Wed, Jun 15, 2016 at 4:29 PM, Joe Witt <joe.w...@gmail.com> >> wrote: >> >>>>> >> >>>>>> And the data remains? If so that is an interesting data point I >> >>>>>> think. So to mark's point how much data do you have queued up >> >>>>>> actively in the flow then on that nodes? Number of objects you >> >>>>>> mention is 3273 files corresponding to 825GB in the content >> >>>>>> repository. Does NiFi see those 825GB worth of data as being in >> the >> >>>>>> flow/queued up? And then if that is the case are we talking about >> a >> >>>>>> roughly 1TB repo and so the reported value seems correct and this >> is >> >>>>>> simply a case of queueing near to the limit your system can hold? >> >>>>>> >> >>>>>> On Wed, Jun 15, 2016 at 4:24 PM, Ricky Saltzer <ri...@cloudera.com >> > >> >>>> wrote: >> >>>>>>> I have two nodes in clustered mode. I have the other node that >> isn't >> >>>>>>> filling up as my primary. I've actually already restarted nifi on >> >>>> the >> >>>>>> node >> >>>>>>> which has the large repository a few times. >> >>>>>>> >> >>>>>>> On Wed, Jun 15, 2016 at 4:22 PM, Joe Witt <joe.w...@gmail.com> >> >>>> wrote: >> >>>>>>> >> >>>>>>>> Ricky, >> >>>>>>>> >> >>>>>>>> If you restart nifi and then find that it cleans those things up >> I >> >>>>>>>> believe then it is related to the defects corrected in the >> 0.5/0.6 >> >>>>>>>> timeframe. >> >>>>>>>> >> >>>>>>>> Is restarting an option for you at this time. You agree mark? >> >>>>>>>> >> >>>>>>>> Thanks >> >>>>>>>> Joe >> >>>>>>>> >> >>>>>>>> On Wed, Jun 15, 2016 at 4:21 PM, Ricky Saltzer < >> ri...@cloudera.com >> >>>>> >> >>>>>> wrote: >> >>>>>>>>> Hey Mark - >> >>>>>>>>> >> >>>>>>>>> Thanks for the quick reply! This is our production system so >> it's >> >>>>>>>>> unfortunately running 0.4.0. There are currently 3273 files, >> >>>> with some >> >>>>>>>>> files dating back to May 18th. The content repository itself is >> >>>> 825G. >> >>>>>>>>> >> >>>>>>>>> Ricky >> >>>>>>>>> >> >>>>>>>>> On Wed, Jun 15, 2016 at 4:17 PM, Mark Payne < >> >>>> marka...@hotmail.com> >> >>>>>>>> wrote: >> >>>>>>>>> >> >>>>>>>>>> Hey Ricky >> >>>>>>>>>> >> >>>>>>>>>> The reclaim process is pretty much continuous. What version of >> >>>> NiFi >> >>>>>> are >> >>>>>>>>>> you running? >> >>>>>>>>>> I know there was an issue with this a while back that caused it >> >>>> not >> >>>>>> to >> >>>>>>>>>> cleanup properly. >> >>>>>>>>>> >> >>>>>>>>>> Also, how much data & how many FlowFiles do you have queued up >> >>>> in >> >>>>>> your >> >>>>>>>>>> flow? >> >>>>>>>>>> Data won't be archived or reclaimed if in the flow. >> >>>>>>>>>> >> >>>>>>>>>> Thanks >> >>>>>>>>>> -Mark >> >>>>>>>>>> >> >>>>>>>>>> >> >>>>>>>>>> >> >>>>>>>>>>> On Jun 15, 2016, at 4:04 PM, Ricky Saltzer < >> >>>> ri...@cloudera.com> >> >>>>>>>> wrote: >> >>>>>>>>>>> >> >>>>>>>>>>> Hey guys - >> >>>>>>>>>>> >> >>>>>>>>>>> I recently discovered I didn't have my "archive.enabled" >> >>>> option >> >>>>>> set to >> >>>>>>>>>> true >> >>>>>>>>>>> after my disk filled up to 95%. I enabled it and then set the >> >>>>>>>> retention >> >>>>>>>>>>> period to 12 hours and 50% (default values). However, after >> >>>>>> restarting >> >>>>>>>>>>> NiFi, I am not seeing any disk space reclaimed. >> >>>>>>>>>>> >> >>>>>>>>>>> I'm curious, is the reclaiming process periodic or continuous? >> >>>>>>>>>>> >> >>>>>>>>>>> --- >> >>>>>>>>>>> ricky >> >>>>>>>>>> >> >>>>>>>>>> >> >>>>>>>>> >> >>>>>>>>> >> >>>>>>>>> -- >> >>>>>>>>> Ricky Saltzer >> >>>>>>>>> http://www.cloudera.com >> >>>>>>>> >> >>>>>>> >> >>>>>>> >> >>>>>>> >> >>>>>>> -- >> >>>>>>> Ricky Saltzer >> >>>>>>> http://www.cloudera.com >> >>>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> -- >> >>>>> Ricky Saltzer >> >>>>> http://www.cloudera.com >> >>>> >> >>> >> >>> >> >>> >> >>> -- >> >>> Ricky Saltzer >> >>> http://www.cloudera.com >> >>> >> >>> >> >> >> >> >> >> -- >> >> Ricky Saltzer >> >> http://www.cloudera.com >> >> >> >> >> > >> > >> > -- >> > Ricky Saltzer >> > http://www.cloudera.com >> >> > > > -- > Ricky Saltzer > http://www.cloudera.com > > -- Ricky Saltzer http://www.cloudera.com