Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-02-07 Thread Rui Fan
nism. I will update the FLIP and propose names > for > > > the metrics. > > > > > > Kind regards, > > > Emre > > > > > > On 23/01/2024, 10:31, "Krzysztof Dziołak" kdzio...@live.com> > kdzio...@live.com <mailto:kdzio...@li

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-02-01 Thread Kartoglu, Emre
f configuration would you propose to > > maintain for this feature? Would On/off switch and/or aggregated period > > length be configurable? Should we capture the toggles in the FLIP ? > > 2. Metrics - are we planning to emit the skew metric via metric reporters > > mechanism. S

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-31 Thread Rui Fan
. > > > > > > > > > > > > > > Hi Emre, > > > > > > Thank you for driving this proposal. I've got two questions about the > > extensions to the proposal that are not captured in the FLIP. > > > > > > > > > >

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-31 Thread Kartoglu, Emre
.uk.inva>>LID> > Sent: Monday, January 15, 2024 4:59 PM > To: dev@flink.apache.org <mailto:dev@flink.apache.org> > <mailto:dev@flink.apache.org <mailto:dev@flink.apache.org>> < > dev@flink.apache.org <mailto:dev@flink.apache.org> > <mailto:d

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-31 Thread Rui Fan
chanism. Should we capture proposed metric schema in the FLIP ? > > > Kind regards, > Krzysztof > > > > From: Kartoglu, Emre kar...@amazon.co.uk.inva>LID> > Sent: Monday, January 15, 2024 4:59 PM > To: dev@flink.apache.org <m

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-23 Thread Kartoglu, Emre
: dev@flink.apache.org <mailto:dev@flink.apache.org> mailto:dev@flink.apache.org>> Subject: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard Hello, I’m opening this thread to discuss a FLIP[1] to make data skew more visible on Flink Dashboard. Data skew is currently not

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-23 Thread Krzysztof Dziołak
, January 15, 2024 4:59 PM To: dev@flink.apache.org Subject: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard Hello, I’m opening this thread to discuss a FLIP[1] to make data skew more visible on Flink Dashboard. Data skew is currently not as visible as it should be. Users have

Re: Re:[DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-16 Thread Kartoglu, Emre
Hi Xuyang, Thanks for the feedback! Please find my response below. > 1. How will the colors of vertics with high data skew scores be unified with > existing backpressure and high busyness colors on the UI? Users should be able to distinguish at a glance which vertics in the entire job graph is

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-16 Thread Kartoglu, Emre
Hi Rui, Thanks for the feedback. Please find my response below: > The number_of_records_received_by_each_subtask is the total received records, > right? No it's not the total. I understand why this is confusing. I had initially wanted to name it "the list of number of records received by each

Re: [DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-15 Thread Rui Fan
Thanks Emre for driving this proposal! It's very useful for troubleshooting. I have a question: The number_of_records_received_by_each_subtask is the total received records, right? I'm not sure whether we should check data skew based on the latest duration period. In the production, I found

Re:[DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-15 Thread Xuyang
Hi, Emre. In large-scale production jobs, the phenomenon of data skew often occurs. Having an metric on the UI that reflects data skew without the need for manual inspection of each vertex by clicking on them would be quite cool. This could help users quickly identify problematic nodes,

[DISCUSS] FLIP-418: Show data skew score on Flink Dashboard

2024-01-15 Thread Kartoglu, Emre
Hello, I’m opening this thread to discuss a FLIP[1] to make data skew more visible on Flink Dashboard. Data skew is currently not as visible as it should be. Users have to click each operator and check how much data each sub-task is processing and compare the sub-tasks against each other.