Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-04 Thread Yadong Xie
Hi Gary Kurt, and Jark I am canceling the vote and restart it since the POC has some changes from the initial one. All the changes are following the proposal in this mail thread. please vote again in the new thread, thanks Jark Wu 于2020年3月4日周三 下午12:13写道: > +1 from my side. > > Best, > Jark >

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Jark Wu
+1 from my side. Best, Jark On Wed, 4 Mar 2020 at 11:39, Kurt Young wrote: > LGTM now, +1 from my side. > > Best, > Kurt > > > On Wed, Mar 4, 2020 at 12:27 AM Gary Yao wrote: > >> Hi Yadong, >> >> Thank you for updating the wiki page. >> >> Only one minor suggestion – I would change: >> >>

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Kurt Young
LGTM now, +1 from my side. Best, Kurt On Wed, Mar 4, 2020 at 12:27 AM Gary Yao wrote: > Hi Yadong, > > Thank you for updating the wiki page. > > Only one minor suggestion – I would change: > > > If show-history is true return the information of attempt. > > to > > > If show-history is

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Gary Yao
Hi Yadong, Thank you for updating the wiki page. Only one minor suggestion – I would change: > If show-history is true return the information of attempt. to > If show-history is true, information for all attempts including previous ones will be returned That being said, FLIP-100

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Yadong Xie
Hi all The rest API part has been updated with Gary and Till's suggestions here is the link: https://cwiki.apache.org/confluence/display/FLINK/FLIP-100%3A+Add+Attempt+Information Yadong Xie 于2020年3月3日周二 下午9:14写道: > Hi Chesnay > > most discussions in this vote are about the more feature/demo

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Yadong Xie
Hi Chesnay most discussions in this vote are about the more feature/demo request in POC or discussion about response format, the main proposal the web UI part which is not changed and the discussion about the response is converging, the response format discussion could happen either here or at

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-03 Thread Chesnay Schepler
I suggest to cancel this vote. Several discussion items have been brought up during the vote, some of which are still unresolved, others which resulted in changes to the proposal. My conclusion is that this proposal needs more discussions. On 20/02/2020 10:46, Yadong Xie wrote: Hi all I

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-02 Thread Yadong Xie
> * Why are we duplicating the subtask index in the objects that are stored in the attempts-time-info array? I thought that all objects in the same array share the same subtask index. yes, I agree with you that subtask could be removed in the `attempts-time-info` > * Are we confident that the

Re: [VOTE] FLIP-100: Add Attempt Information

2020-03-02 Thread Gary Yao
Hi Yadong, Thanks for driving this FLIP. I have a few questions/remarks: * Why are we duplicating the subtask index in the objects that are stored in the attempts-time-info array? I thought that all objects in the same array share the same subtask index. * Are we confident that the

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-28 Thread Yadong Xie
Hi everyone A subtask filter and an attempt id filter was added to the attempt timeline page now check the POC here: http://101.132.122.69:8081/web/#/job/9d651769488466d33e7a607e85203543/timeline FLIP doc was also updated:

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-27 Thread Till Rohrmann
I think your proposal is better because then we don't have to bump the REST API version. Cheers, Till On Thu, Feb 27, 2020 at 8:02 AM Yadong Xie wrote: > Hi Till > > We keep response a flattened SubtaskTimeInfo and an array of > SubtaskTimeInfo to keep the restAPI backward compatible, since

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-26 Thread Yadong Xie
Hi Till We keep response a flattened SubtaskTimeInfo and an array of SubtaskTimeInfo to keep the restAPI backward compatible, since Flink users may still need the API('/jobs/{jobId}/vertices/{vertexId}/subtasktimes') to get the flattened SubtaskTimeInfo. If we want to get an array of

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-26 Thread Till Rohrmann
Fair enough. If this should become a problem we could introduce it later as well. What about changing the SubtasksTimeInfo response type into an array of SubtaskTimeInfo? At the moment SubtasksTimeInfo contains a flattened SubtaskTimeInfo and and array of SubtaskTimeInfo for the previous

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-26 Thread Yadong Xie
Hi Till Thanks for your comments. > I have a comment concerning the SubtasksTimesHandler It would be much easier for the frontend to handle a large amount of data if we have a rest API parameter filter, but in my opinion, the attempt list data is not large enough that we have to rely on the

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-25 Thread Till Rohrmann
Hi Yadong, thanks for creating this FLIP. I like the idea to make the web-ui information richer wrt to subtask attempt information. I have a comment concerning the SubtasksTimesHandler: Should we change the response type SubtasksTimeInfo so that it simply contains an array of SubtaskTimeInfo?

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-25 Thread Benchao Li
Hi Yadong, Thanks for the updating. LGTM now. +1 (non-binding) Yadong Xie 于2020年2月25日周二 下午4:41写道: > Hi Kurt > > There will be no differences between batch jobs and stream jobs in > subtask-attempt level in the UI > The only differences are in the vertex timeline, I have added a screenshot >

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-25 Thread Yadong Xie
Hi Kurt There will be no differences between batch jobs and stream jobs in subtask-attempt level in the UI The only differences are in the vertex timeline, I have added a screenshot of the batch job in the FLIP-100 since the batch job will disappear from the list after it finished soon. here is

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-24 Thread Yadong Xie
Hi Jark and Benchao The link to taks manager log was added to the attempt timeline now, here is the new POC url check it here: http://101.132.122.69:8081/web/#/job/9d651769488466d33e7a607e85203543/timeline thanks for your suggestions! lining jing 于2020年2月24日周一 下午5:16写道: > I agree with Jark.

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-24 Thread lining jing
I agree with Jark. The user can find the previous taskmanager according to the log link in the history of the attempt. The reason for adding attempts' information to the timeline is that for some reason, the job failover failed or took a long time. This problem is challenging to locate. As the

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-23 Thread Benchao Li
I agree with Jark. It's helpful to add a TM logs link for each attempt. Usually there are multiple TMs per host, so only host name cannot determine the real TM. Jark Wu 于2020年2月21日周五 下午2:06写道: > Thanks Yadong, > > I think one of the purpose to find the failed attempts is that we want to > look

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-20 Thread Jark Wu
Thanks Yadong, I think one of the purpose to find the failed attempts is that we want to look into the logs in the failed node (node may change across attempts). However, from the POC, I didn't find a log link on each attempts. Maybe we can add such feature? Best, Jark On Fri, 21 Feb 2020 at

Re: [VOTE] FLIP-100: Add Attempt Information

2020-02-20 Thread Kurt Young
Hi Yadong, Thanks for the proposal, it's a useful feature, especially for batch jobs. But according to the examples you gave, I can't tell whether i got required information from that. Can you replace the demo job to a more complex batch job and then we can see some differences of start/stop time

[VOTE] FLIP-100: Add Attempt Information

2020-02-20 Thread Yadong Xie
Hi all I want to start the vote for FLIP-100, which proposes to add attempt information inside subtask and timeline in web UI. To help everyone better understand the proposal, we spent some efforts on making an online POC Timeline Attempt (click the vertex timeline to see the differences):