Re: How can I prove ....
Hi Stephan,

This is from the dashboard. Total parallelism is set to 1024.
259 tasks per TM. All say Running, but I get a *.out log on the beam4 server only (bottom of the servers list).
Does this mean that all nodes are engaged in processing the data?
Why do these encircled columns show 0's in their data exchange report?

Thanks+regards,
Amir-

From: Stephan Ewen <se...@apache.org>
To: user@flink.apache.org; amir bahmanyari <amirto...@yahoo.com>
Cc: Felix Dreissig <f...@f30.me>
Sent: Monday, September 26, 2016 2:18 AM
Subject: Re: How can I prove

> You do not need to create any JSON. Just click on "Running Jobs" in the UI, and then on the job. The parallelism is shown as a number in the boxes of the graph.
Re: How can I prove ....
You do not need to create any JSON. Just click on "Running Jobs" in the UI, and then on the job. The parallelism is shown as a number in the boxes of the graph.

On Sat, Sep 24, 2016 at 6:28 PM, amir bahmanyari <amirto...@yahoo.com> wrote:

> Thanks Felix.
> Interesting. I tried to create the JSON, but it didn't work according to the sample code I found in the docs.
> There is a way to get the same JSON from the command line. Is there an example?
> Thanks+regards
> Amir-
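For readers who want the same number without opening the UI, here is a minimal Python sketch. It assumes the job description is already available as JSON containing a `vertices` list whose entries carry `name` and `parallelism`; those field names, the sample document, and the helper functions are illustrative assumptions, not something confirmed in this thread, so check them against the output of your own Flink version.

```python
import json

# Hypothetical sample shaped like a job-details document. The field names
# ("vertices", "name", "parallelism") are assumptions made for this sketch.
SAMPLE_JOB_JSON = """
{
  "name": "example-job",
  "vertices": [
    {"name": "Source: Kafka", "parallelism": 448},
    {"name": "ParDo", "parallelism": 448},
    {"name": "Sink", "parallelism": 1}
  ]
}
"""


def operator_parallelism(job_json: str) -> dict:
    """Map each operator (vertex) name to its parallelism."""
    job = json.loads(job_json)
    return {v["name"]: v["parallelism"] for v in job.get("vertices", [])}


def below_expected(parallelism_by_op: dict, expected: int) -> list:
    """Return the names of operators running below the expected parallelism."""
    return sorted(name for name, p in parallelism_by_op.items() if p < expected)


if __name__ == "__main__":
    par = operator_parallelism(SAMPLE_JOB_JSON)
    for name, p in par.items():
        print(f"{name}: {p}")
    print("below 448:", below_expected(par, 448))
```

An operator listed by `below_expected` would be the first place to look when fewer slots are in use than the configured parallelism suggests.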
Re: How can I prove ....
Thanks Felix.
Interesting. I tried to create the JSON, but it didn't work according to the sample code I found in the docs.
There is a way to get the same JSON from the command line. Is there an example?

Thanks+regards
Amir-

From: Felix Dreissig <f...@f30.me>
To: amir bahmanyari <amirto...@yahoo.com>
Cc: user@flink.apache.org
Sent: Saturday, September 24, 2016 8:18 AM
Subject: Re: How can I prove

> I guess Stephan is referring to the parallelism of single operators as displayed in the operator graph, see e.g. https://ci.apache.org/projects/flink/flink-docs-release-0.10/page/img/webclient_plan_view.png .
Re: How can I prove ....
Hi Amir,

On 23 Sep 2016, at 19:57, amir bahmanyari <amirto...@yahoo.com> wrote:
> Currently running with 512 slots all taken as indicated by the dashboard.
> Are we talking about this? Then yes based on no available slots, I assume I am at 512.

I guess Stephan is referring to the parallelism of single operators as displayed in the operator graph, see e.g. https://ci.apache.org/projects/flink/flink-docs-release-0.10/page/img/webclient_plan_view.png .

Regards,
Felix
Re: How can I prove ....
Are you sure you have the parallelism set to 448? You can see the parallelism of operators in the web UI.

On Fri, Sep 23, 2016 at 12:15 AM, amir bahmanyari <amirto...@yahoo.com> wrote:

> Hi again, and sorry to take your time, but I am puzzled by something I cannot explain.
> The parallelism is set to 448. There are 112 tasks per TM.
> Why is Flink NOT allocating ALL 448 slots? It allocates only 1/2 of them.
> I also bumped up the number of buffers to equal 2 GiB in each TM and see no difference :-(
> So I incremented my total slots to 448. The Kafka topic also has 448 partitions.
> Why am I having such bad luck with this!!!??? LOL!!
> Thanks for your attention Aljoscha.
>
> [image: Inline image]
>
> From: amir bahmanyari <amirto...@yahoo.com>
> To: Aljoscha Krettek <aljos...@apache.org>; User <user@flink.apache.org>
> Sent: Thursday, September 22, 2016 10:10 AM
> Subject: Re: How can I prove
>
> > Hi again, the following is from the dashboard while everything is supposedly running.
> > No real-time change in sent/received/# of records... but one node is definitely producing a *.out file...
> > And all TMs are reporting in their *.log files. The process will eventually end, but very slowly.
> > Thanks again Aljoscha.
> > [image: Inline image]
Re: How can I prove ....
Hi again, the following is from the dashboard while everything is supposedly running.
No real-time change in sent/received/# of records... but one node is definitely producing a *.out file...
And all TMs are reporting in their *.log files. The process will eventually end, but very slowly.
Thanks again Aljoscha.

From: amir bahmanyari <amirto...@yahoo.com>
To: Aljoscha Krettek <aljos...@apache.org>; User <user@flink.apache.org>
Sent: Thursday, September 22, 2016 9:16 AM
Subject: Re: How can I prove

> Thanks Aljoscha,
> That's why I am wondering about this. I don't see the send/receive columns change at all, just 0's all the time.
> The only thing that changes is the time stamp.
> Is this an indication that the nodes in the cluster are not participating in execution of the data?
> Thanks again.
> Amir-
Re: How can I prove ....
Thanks Aljoscha,
That's why I am wondering about this. I don't see the send/receive columns change at all, just 0's all the time.
The only thing that changes is the time stamp.
Is this an indication that the nodes in the cluster are not participating in execution of the data?
Thanks again.
Amir-

From: Aljoscha Krettek <aljos...@apache.org>
To: amir bahmanyari <amirto...@yahoo.com>; User <user@flink.apache.org>
Sent: Thursday, September 22, 2016 5:01 AM
Subject: Re: How can I prove

> Hi,
> depending on the data source you might not be able to stress CPU/MEM because the source might be too slow. As long as you see the numbers increasing in the Flink Dashboard for all operators you should be good.
>
> Cheers,
> Aljoscha
>
> On Thu, 22 Sep 2016 at 00:26 amir bahmanyari <amirto...@yahoo.com> wrote:
>
> > That all nodes in a Flink Cluster are involved simultaneously in processing the data?
> > Programmatically, graphically...
> > I need to stress CPU, MEM and all resources to their max.
> > How can I guarantee this is happening in Flink Cluster?
> > Out of 4 nodes, this is the highest resource usage I see from "top"...
> > Everything else is not even close...
> >
> > top - 22:22:45 up 41 days, 2:39, 1 user, load average: 1.76, 1.55, 1.28
> > Tasks: 344 total, 1 running, 343 sleeping, 0 stopped, 0 zombie
> > %Cpu(s): 5.4 us, 1.0 sy, 0.0 ni, 93.5 id, 0.1 wa, 0.0 hi, 0.0 si, 0.0 st
> > KiB Mem: 11551564+total, 65702020 used, 49813632 free, 115072 buffers
> > KiB Swap: 0 total, 0 used, 0 free. 3148420 cached Mem
> >
> > I am pretty sure I can push the FlinkRunner much further than this, and that's where true, realistic perf numbers start showing up.
> > Thanks+regards,
> > Amir-
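Aljoscha's check, that the numbers keep increasing for all operators, can also be done programmatically by comparing two snapshots of per-subtask counters taken a few seconds apart. The Python sketch below is only an illustration: the record layout (`host`, `records_out`), the host names, and the sample values are hypothetical, and how you collect the snapshots (dashboard export, metrics reporter, and so on) is left open.

```python
from collections import defaultdict


def records_by_host(subtasks):
    """Sum the emitted-record counters of all subtasks per host."""
    totals = defaultdict(int)
    for st in subtasks:
        totals[st["host"]] += st["records_out"]
    return dict(totals)


def idle_hosts(before, after):
    """Return hosts whose total record count did not grow between snapshots."""
    b = records_by_host(before)
    a = records_by_host(after)
    hosts = set(b) | set(a)
    return sorted(h for h in hosts if a.get(h, 0) - b.get(h, 0) <= 0)


# Hypothetical snapshots: host names and counter values are made up.
BEFORE = [
    {"host": "beam1", "records_out": 0},
    {"host": "beam2", "records_out": 0},
    {"host": "beam3", "records_out": 0},
    {"host": "beam4", "records_out": 100},
]
AFTER = [
    {"host": "beam1", "records_out": 0},
    {"host": "beam2", "records_out": 0},
    {"host": "beam3", "records_out": 0},
    {"host": "beam4", "records_out": 250},
]

if __name__ == "__main__":
    print("hosts with no progress:", idle_hosts(BEFORE, AFTER))
```

A non-empty result means some hosts hold running tasks that move no data, which matches the symptom of all tasks showing Running while the send/receive columns stay at 0.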
How can I prove ....
That all nodes in a Flink Cluster are involved simultaneously in processing the data?
Programmatically, graphically...
I need to stress CPU, MEM and all resources to their max.
How can I guarantee this is happening in Flink Cluster?
Out of 4 nodes, this is the highest resource usage I see from "top"...
Everything else is not even close...

top - 22:22:45 up 41 days, 2:39, 1 user, load average: 1.76, 1.55, 1.28
Tasks: 344 total, 1 running, 343 sleeping, 0 stopped, 0 zombie
%Cpu(s): 5.4 us, 1.0 sy, 0.0 ni, 93.5 id, 0.1 wa, 0.0 hi, 0.0 si, 0.0 st
KiB Mem: 11551564+total, 65702020 used, 49813632 free, 115072 buffers
KiB Swap: 0 total, 0 used, 0 free. 3148420 cached Mem

I am pretty sure I can push the FlinkRunner much further than this, and that's where true, realistic perf numbers start showing up.

Thanks+regards,
Amir-