Re: Buffer stats when Back Pressure is high

2019-01-08 Thread Gagan Agrawal
Thanks Timo for suggested solution. Will go with idea of artificial key for our use case. Gagan On Mon, Jan 7, 2019 at 10:21 PM Timo Walther wrote: > Hi Gagan, > > a typical solution to such a problem is to introduce an artifical key > (enrichment id + some additional suffix), you can then keyB

Re: Buffer stats when Back Pressure is high

2019-01-07 Thread Timo Walther
Hi Gagan, a typical solution to such a problem is to introduce an artifical key (enrichment id + some additional suffix), you can then keyBy on this artificial key and thus spread the workload more evenly. Of course you need to make sure that records of the second stream are duplicated to all

Re: Buffer stats when Back Pressure is high

2019-01-07 Thread Gagan Agrawal
Flink Version is 1.7. Thanks Zhijiang for your pointer. Initially I was checking only for few. However I just checked for all and found couple of them having queue length of 40+ which seems to be due to skewness in data. Is there any general guide lines on how to handle skewed data? In my case I am

Re: Buffer stats when Back Pressure is high

2019-01-07 Thread zhijiang
Hi Gagan, What flink version do you use? And have you checked the buffers.inputQueueLength for all the related parallelism (connected with A) of B? It may exist the scenario that only one parallelim B is full of inqueue buffers which back pressure A, and the input queue for other parallelism B

Buffer stats when Back Pressure is high

2019-01-06 Thread Gagan Agrawal
Hi, I want to understand does any of buffer stats help in debugging / validating that downstream operator is performing slow when Back Pressure is high? Say I have A -> B operators and A shows High Back Pressure which indicates something wrong or not performing well on B side which is slowing down