Re: Flink memory leak

2017-11-17 Thread Piotr Nowojski
gt;>>>>>>>> Ebru: could you explain in a little more details how does >>>>>>>>> your Job(s) >>>>>>>>>> look like? Could you post some code? If you are just using >>>>>>>>> maps and >>>

Re: Flink memory leak

2017-11-14 Thread Piotr Nowojski
4:43, Javier Lopez >>>>>>> mailto:javier.lo...@zalando.de>> >>>>>>>> wrote: >>>>>>>> Hi, >>>>>>>> We have been facing a similar problem. We have tried some >>>>>>> different >>>>>>&g

Re: Flink memory leak

2017-11-14 Thread Flavio Pompermaier
ase share > > that code? > > If > there is no stateful operation at all, it's strange where > > the list > > state instances are coming from. > On Tue, Nov 7, 2017 at 2:35 PM, ebru > > > > wrote: > Hi Ufuk, > We don’t explicitly define any state

Re: Flink memory leak

2017-11-14 Thread Piotr Nowojski
;> that Flavio has, we restart the taskmanagers once they reach >>>>>> a >>>>>>> memory threshold. We created a small test to remove all of >>>>>> our >>>>>>> dependencies and leave only flink native libraries. This >&

Re: Flink memory leak

2017-11-10 Thread Piotr Nowojski
ilters there shouldn’t be any network transfers involved, >>>>>>>>> aside >>>>>>>>>> from Source and Sink functions. >>>>>>>>>> Piotrek >>>>>>>>>> On 8 Nov 2017, at 12:54,

Re: Flink memory leak

2017-11-10 Thread ÇETİNKAYA EBRU ÇETİNKAYA EBRU
use case, you have to manually call `clear()` on the state instance in order to release the managed state. Best, Ufuk On Tue, Nov 7, 2017 at 12:43 PM, ebru wrote: Begin forwarded message: From: ebru Subject: Re: Flink memory leak Date: 7 November 2017 at 14:09:17 GMT+3 To: Ufuk Celebi Hi Ufuk, Ther

Re: Flink memory leak

2017-11-10 Thread Piotr Nowojski
>>> Kien, but it didn't work. We have a workaround similar to >>>>>> the one >>>>>>> that Flavio has, we restart the taskmanagers once they reach >>>>>> a >>>>>>> memory threshold. We created a small test to

Re: Flink memory leak

2017-11-10 Thread Piotr Nowojski
;>>> per node. We have one job that uses 56 slots, and we cannot >>>> execute >>>>> that job 5 times in a row because the whole cluster dies. If >>>> you >>>>> want, we can publish our test job. >>>>> Regards, >>>>

Re: Flink memory leak

2017-11-10 Thread Piotr Nowojski
gt; On 8. Nov 2017, at 10:25, Flavio Pompermaier >> >>> wrote: >>> We also have the same problem in production. At the moment >> the >>> solution is to restart the entire Flink cluster after every >> job.. >>> We've tried to reproduce this problem wi

Re: Flink memory leak

2017-11-09 Thread Piotr Nowojski
& @Piotr Could you please have a look at this? You >>>> both >>>>>>>>> recently worked on the network stack and might be most >>>> familiar with >>>>>>>>> this. >>>>>>>>> On 8. Nov 2017, at 1

Re: Flink memory leak

2017-11-09 Thread ÇETİNKAYA EBRU ÇETİNKAYA EBRU
Ufuk On Tue, Nov 7, 2017 at 12:43 PM, ebru wrote: Begin forwarded message: From: ebru Subject: Re: Flink memory leak Date: 7 November 2017 at 14:09:17 GMT+3 To: Ufuk Celebi Hi Ufuk, There are there snapshots of htop output. 1. snapshot is initial state. 2. snapshot is after submitted one job.

Re: Flink memory leak

2017-11-08 Thread Piotr Nowojski
t;>>> <mailto:pomperma...@okkam.it>> >> >>>>> wrote: >> >>>>> >> >>>>> We also have the same problem in production. At the moment the >> >>>>> solution is to restart the entire Flink cluster after e

Re: Flink memory leak

2017-11-08 Thread Piotr Nowojski
ak are > >>>>> correlated.. > >>>>> > >>>>> Best, > >>>>> Flavio > >>>>> > >>>>> On Wed, Nov 8, 2017 at 9:51 AM, ÇETİNKAYA EBRU ÇETİNKAYA EBRU > >>>>> mailto:b20926...@cs.hacettepe

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
t;>> On 2017-11-07 16:53, Ufuk Celebi wrote: >>>>> Do you use any windowing? If yes, could you please share that code? >>>>> If >>>>> there is no stateful operation at all, it's strange where the list >>>>> state instances a

Re: Flink memory leak

2017-11-08 Thread Piotr Nowojski
state instances are coming from. >> >> On Tue, Nov 7, 2017 at 2:35 PM, ebru > <mailto:b20926...@cs.hacettepe.edu.tr>> >> wrote: >> Hi Ufuk, >> >> We don’t explicitly define any state descriptor. We only use map >> and filters >> operato

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
>>>> Do you use any windowing? If yes, could you please share that code? >>>> If >>>> there is no stateful operation at all, it's strange where the list >>>> state instances are coming from. >>>> >>>> On Tue, Nov 7, 201

Re: Flink memory leak

2017-11-08 Thread Piotr Nowojski
@apache.org>> wrote: > > Hey Ebru, the memory usage might be increasing as long as a job is > running. > This is expected (also in the case of multiple running jobs). The > screenshots are not helpful in that regard. :-( > > What kind of stateful operations are you using

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
’t explicitly define any state descriptor. We only use map >>> and filters >>> operator. We thought that gc handle clearing the flink’s internal >>> states. >>> So how can we manage the memory if it is always increasing? >>> >>> - Ebru >>

Re: Flink memory leak

2017-11-08 Thread ÇETİNKAYA EBRU ÇETİNKAYA EBRU
state. Best, Ufuk On Tue, Nov 7, 2017 at 12:43 PM, ebru wrote: Begin forwarded message: From: ebru Subject: Re: Flink memory leak Date: 7 November 2017 at 14:09:17 GMT+3 To: Ufuk Celebi Hi Ufuk, There are there snapshots of htop output. 1. snapshot is initial state. 2. snapshot is af

Re: Flink memory leak

2017-11-08 Thread Piotr Nowojski
.@apache.org>> wrote: >>> >>> Hey Ebru, the memory usage might be increasing as long as a job is running. >>> This is expected (also in the case of multiple running jobs). The >>> screenshots are not helpful in that regard. :-( >>> >>> What kind of stateful

Re: Flink memory leak

2017-11-08 Thread ebru
. >> This is expected (also in the case of multiple running jobs). The >> screenshots are not helpful in that regard. :-( >> >> What kind of stateful operations are you using? Depending on your use case, >> you have to manually call `clear()` on the state instance in orde

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
gt;>> screenshots are not helpful in that regard. :-( >>>> >>>> What kind of stateful operations are you using? Depending on your use >>>> case, >>>> you have to manually call `clear()` on the state instance in order to >>>> release the m

Re: Flink memory leak

2017-11-08 Thread Aljoscha Krettek
ave to manually call `clear()` on the state instance in order to > release the managed state. > > Best, > > Ufuk > > On Tue, Nov 7, 2017 at 12:43 PM, ebru <mailto:b20926...@cs.hacettepe.edu.tr>> wrote: > > > > Begin forwarded message: > > From: ebr

Re: Flink memory leak

2017-11-08 Thread Flavio Pompermaier
ations are you using? Depending on your use >>> case, >>> you have to manually call `clear()` on the state instance in order to >>> release the managed state. >>> >>> Best, >>> >>> Ufuk >>> >>> On Tue, Nov 7, 2017 at 12

Re: Flink memory leak

2017-11-07 Thread Aljoscha Krettek
managed state. >> >> Best, >> >> Ufuk >> >> On Tue, Nov 7, 2017 at 12:43 PM, ebru wrote: >>> >>> >>> >>> Begin forwarded message: >>> >>> From: ebru >>> Subject: Re: Flink memory leak >>

Re: Flink memory leak

2017-11-07 Thread Ufuk Celebi
nce in order to > release the managed state. > > Best, > > Ufuk > > On Tue, Nov 7, 2017 at 12:43 PM, ebru wrote: >> >> >> >> Begin forwarded message: >> >> From: ebru >> Subject: Re: Flink memory leak >> Date: 7 November 2017 at 1

Re: Flink memory leak

2017-11-07 Thread ebru
to manually call `clear()` on the state instance in order to release > the managed state. > > Best, > > Ufuk > > On Tue, Nov 7, 2017 at 12:43 PM, ebru <mailto:b20926...@cs.hacettepe.edu.tr>> wrote: > > >> Begin forwarded message: >> >> Fro

Re: Flink memory leak

2017-11-07 Thread Ufuk Celebi
Hey Ebru, let me pull in Aljoscha (CC'd) who might have an idea what's causing this. Since multiple jobs are running, it will be hard to understand to which job the state descriptors from the heap snapshot belong to. - Is it possible to isolate the problem and reproduce the behaviour with only a

Flink memory leak

2017-11-07 Thread ÇETİNKAYA EBRU ÇETİNKAYA EBRU
Hi, We are using Flink 1.3.1 in production, we have one job manager and 3 task managers in standalone mode. Recently, we've noticed that we have memory related problems. We use docker container to serve Flink cluster. We have 300 slots and 20 jobs are running with parallelism of 10. Also the