Re: How can I prove ....

2016-09-30 Thread amir bahmanyari
Hi Stephan,This is from the dashboard. Total Parallelism is set = 1024.259 
tasks per TM. all say Running, but I get *.out log in beam4 server only (bottom 
of the servers list).Does this mean that all nodes are engaged in processing 
the data?Why are these encircled columns having 0's for their data exchange 
report?Thanks+regards,Amir-




  From: Stephan Ewen <se...@apache.org>
 To: user@flink.apache.org; amir bahmanyari <amirto...@yahoo.com> 
Cc: Felix Dreissig <f...@f30.me>
 Sent: Monday, September 26, 2016 2:18 AM
 Subject: Re: How can I prove 
   
You do not need to create any JSON.
Just click on "Running Jobs" in the UI, and then on the job. The parallelism is 
shown as a number in the boxes of the graph.
On Sat, Sep 24, 2016 at 6:28 PM, amir bahmanyari <amirto...@yahoo.com> wrote:

Thanks Felix.Interesting. I tried to create the JASON but didnt work  according 
to the sample code I found in docs.There is a way to get the same JASON from 
the command line.Is there an example?Thanks+regardsAmir-

  From: Felix Dreissig <f...@f30.me>
 To: amir bahmanyari <amirto...@yahoo.com> 
Cc: user@flink.apache.org
 Sent: Saturday, September 24, 2016 8:18 AM
 Subject: Re: How can I prove 
   
Hi Amir,

On 23 Sep 2016, at 19:57, amir bahmanyari <amirto...@yahoo.com> wrote:
> Currently running with 512 slots all taken as indicated by the dashboard.
> Are we talking about this? Then yes based on no available slots, I assume I 
> am at 512 .

I guess Stephan is referring to the parallelism of single operators as 
displayed in the operator graph, see e.g. https://ci.apache.org/ 
projects/flink/flink-docs- release-0.10/page/img/ webclient_plan_view.png .

Regards,
Felix

   



   

Re: How can I prove ....

2016-09-26 Thread Stephan Ewen
You do not need to create any JSON.

Just click on "Running Jobs" in the UI, and then on the job. The
parallelism is shown as a number in the boxes of the graph.

On Sat, Sep 24, 2016 at 6:28 PM, amir bahmanyari <amirto...@yahoo.com>
wrote:

> Thanks Felix.
> Interesting. I tried to create the JASON but didnt work  according to the
> sample code I found in docs.
> There is a way to get the same JASON from the command line.
> Is there an example?
> Thanks+regards
> Amir-
>
>
> --
> *From:* Felix Dreissig <f...@f30.me>
> *To:* amir bahmanyari <amirto...@yahoo.com>
> *Cc:* user@flink.apache.org
> *Sent:* Saturday, September 24, 2016 8:18 AM
>
> *Subject:* Re: How can I prove 
>
> Hi Amir,
>
> On 23 Sep 2016, at 19:57, amir bahmanyari <amirto...@yahoo.com> wrote:
> > Currently running with 512 slots all taken as indicated by the dashboard.
> > Are we talking about this? Then yes based on no available slots, I
> assume I am at 512 .
>
>
> I guess Stephan is referring to the parallelism of single operators as
> displayed in the operator graph, see e.g. https://ci.apache.org/
> projects/flink/flink-docs-release-0.10/page/img/webclient_plan_view.png .
>
> Regards,
> Felix
>
>


Re: How can I prove ....

2016-09-24 Thread amir bahmanyari
Thanks Felix.Interesting. I tried to create the JASON but didnt work  according 
to the sample code I found in docs.There is a way to get the same JASON from 
the command line.Is there an example?Thanks+regardsAmir-

  From: Felix Dreissig <f...@f30.me>
 To: amir bahmanyari <amirto...@yahoo.com> 
Cc: user@flink.apache.org
 Sent: Saturday, September 24, 2016 8:18 AM
 Subject: Re: How can I prove 
   
Hi Amir,

On 23 Sep 2016, at 19:57, amir bahmanyari <amirto...@yahoo.com> wrote:
> Currently running with 512 slots all taken as indicated by the dashboard.
> Are we talking about this? Then yes based on no available slots, I assume I 
> am at 512 .

I guess Stephan is referring to the parallelism of single operators as 
displayed in the operator graph, see e.g. 
https://ci.apache.org/projects/flink/flink-docs-release-0.10/page/img/webclient_plan_view.png
 .

Regards,
Felix

   

Re: How can I prove ....

2016-09-24 Thread Felix Dreissig
Hi Amir,

On 23 Sep 2016, at 19:57, amir bahmanyari  wrote:
> Currently running with 512 slots all taken as indicated by the dashboard.
> Are we talking about this? Then yes based on no available slots, I assume I 
> am at 512 .

I guess Stephan is referring to the parallelism of single operators as 
displayed in the operator graph, see e.g. 
https://ci.apache.org/projects/flink/flink-docs-release-0.10/page/img/webclient_plan_view.png
 .

Regards,
Felix

Re: How can I prove ....

2016-09-23 Thread Stephan Ewen
Are you sure you have the parallelism set to 448? You can see the
parallelism of operators in the web UI.


On Fri, Sep 23, 2016 at 12:15 AM, amir bahmanyari <amirto...@yahoo.com>
wrote:

> Hi Again & sorry to take your time. But am puzzled by what I cannot
> explain why.
> The parallelism is set to 448. There are 112 tasks per TM.
> Why is Flink NOT allocating ALL 448 slots? It allocates only 1/2 of it.
> I also bumped up the # buffers to equate a 2GiB in each TM & see no
> difference :-(
> So I incremented my total-slots = 448. Kafka topic also has 448 partitions.
> Why am I having such a bad luck with this!!!??? LOL!!
> Thanks for your attention Aljoscha.
>
> [image: Inline image]
>
>
>
> --
> *From:* amir bahmanyari <amirto...@yahoo.com>
> *To:* Aljoscha Krettek <aljos...@apache.org>; User <user@flink.apache.org>
>
> *Sent:* Thursday, September 22, 2016 10:10 AM
>
> *Subject:* Re: How can I prove 
>
> Hi Again, following is from the dashboard while wverything is supposedlt
> running.
> No real-time change in send/received/#of records...but one node is
> definitely producing a *.out file...
> And all TMs are reporting in their *.log files. And the process will
> eventually end , but very slow.
> Thanks again Aljoscha.
> [image: Inline image]
>
>
>
> --
> *From:* amir bahmanyari <amirto...@yahoo.com>
> *To:* Aljoscha Krettek <aljos...@apache.org>; User <user@flink.apache.org>
>
> *Sent:* Thursday, September 22, 2016 9:16 AM
> *Subject:* Re: How can I prove 
>
> Thanks Aljoscha,
> Thats why I am wondering about this. I dont see send/receive columns
> change at alljust 0's all the time.
> The only thing that changes is time stamp.
> Is this an indication that the nodes in the cluster are not participating
> in execution of the data?
> Thanks again.
> Amir-
>
>
> --------------
> *From:* Aljoscha Krettek <aljos...@apache.org>
> *To:* amir bahmanyari <amirto...@yahoo.com>; User <user@flink.apache.org>
> *Sent:* Thursday, September 22, 2016 5:01 AM
> *Subject:* Re: How can I prove 
>
> Hi,
> depending on the data source you might not be able to stress CPU/MEM
> because the source might be to slow. As long as you see the numbers
> increasing in the Flink Dashboard for all operators you should be good.
>
> Cheers,
> Aljoscha
>
> On Thu, 22 Sep 2016 at 00:26 amir bahmanyari <amirto...@yahoo.com> wrote:
>
> That all nodes in a Flink Cluster are involved simultaneously in
> processing the data?
> Programmatically, graphically...
> I need to stress CPU , MEM and all resources to their max.
> How can I guarantee this is happening in Flink Cluster?
> Out of 4 nodes, this is the highest resource usage I see from "top"...
> Everything else is not even close...
> t
> op - 22:22:45 up 41 days,  2:39,  1 user,  load average: 1.76, 1.55, 1.28
> Tasks: 344 total,   1 running, 343 sleeping,   0 stopped,   0 zombie
> %Cpu(s):  5.4 us,  1.0 sy,  0.0 ni, 93.5 id,  0.1 wa,  0.0 hi,  0.0 si,
>  0.0 st
> KiB Mem:  11551564+total, 65702020 used, 49813632 free,   115072 buffers
> KiB Swap:0 total,0 used,0 free.  3148420 cached Mem
>
>
> I am pretty sure I can push FlinkRunner to way more extent than this
> And thats where true realistic perf numbers start showing up.
> Thanks+regards,
> Amir-
>
>
>
>
>
>
>
>


Re: How can I prove ....

2016-09-22 Thread amir bahmanyari
Hi Again, following is from the dashboard while wverything is supposedlt 
running.No real-time change in send/received/#of records...but one node is 
definitely producing a *.out file...And all TMs are reporting in their *.log 
files. And the process will eventually end , but very slow.Thanks again 
Aljoscha.



  From: amir bahmanyari <amirto...@yahoo.com>
 To: Aljoscha Krettek <aljos...@apache.org>; User <user@flink.apache.org> 
 Sent: Thursday, September 22, 2016 9:16 AM
 Subject: Re: How can I prove 
   
Thanks Aljoscha,Thats why I am wondering about this. I dont see send/receive 
columns change at alljust 0's all the time.The only thing that changes is 
time stamp.Is this an indication that the nodes in the cluster are not 
participating in execution of the data?Thanks again.Amir-

  From: Aljoscha Krettek <aljos...@apache.org>
 To: amir bahmanyari <amirto...@yahoo.com>; User <user@flink.apache.org> 
 Sent: Thursday, September 22, 2016 5:01 AM
 Subject: Re: How can I prove 
  
Hi,depending on the data source you might not be able to stress CPU/MEM because 
the source might be to slow. As long as you see the numbers increasing in the 
Flink Dashboard for all operators you should be good.
Cheers,Aljoscha
On Thu, 22 Sep 2016 at 00:26 amir bahmanyari <amirto...@yahoo.com> wrote:

That all nodes in a Flink Cluster are involved simultaneously in processing the 
data?Programmatically, graphically...I need to stress CPU , MEM and all 
resources to their max.How can I guarantee this is happening in Flink 
Cluster?Out of 4 nodes, this is the highest resource usage I see from 
"top"...Everything else is not even close...top - 22:22:45 up 41 days,  2:39,  
1 user,  load average: 1.76, 1.55, 1.28Tasks: 344 total,   1 running, 343 
sleeping,   0 stopped,   0 zombie%Cpu(s):  5.4 us,  1.0 sy,  0.0 ni, 93.5 id,  
0.1 wa,  0.0 hi,  0.0 si,  0.0 stKiB Mem:  11551564+total, 65702020 used, 
49813632 free,   115072 buffersKiB Swap:        0 total,        0 used,        
0 free.  3148420 cached Mem

I am pretty sure I can push FlinkRunner to way more extent than thisAnd 
thats where true realistic perf numbers start showing up.Thanks+regards,Amir-


   

   

Re: How can I prove ....

2016-09-22 Thread amir bahmanyari
Thanks Aljoscha,Thats why I am wondering about this. I dont see send/receive 
columns change at alljust 0's all the time.The only thing that changes is 
time stamp.Is this an indication that the nodes in the cluster are not 
participating in execution of the data?Thanks again.Amir-

  From: Aljoscha Krettek <aljos...@apache.org>
 To: amir bahmanyari <amirto...@yahoo.com>; User <user@flink.apache.org> 
 Sent: Thursday, September 22, 2016 5:01 AM
 Subject: Re: How can I prove 
   
Hi,depending on the data source you might not be able to stress CPU/MEM because 
the source might be to slow. As long as you see the numbers increasing in the 
Flink Dashboard for all operators you should be good.
Cheers,Aljoscha
On Thu, 22 Sep 2016 at 00:26 amir bahmanyari <amirto...@yahoo.com> wrote:

That all nodes in a Flink Cluster are involved simultaneously in processing the 
data?Programmatically, graphically...I need to stress CPU , MEM and all 
resources to their max.How can I guarantee this is happening in Flink 
Cluster?Out of 4 nodes, this is the highest resource usage I see from 
"top"...Everything else is not even close...top - 22:22:45 up 41 days,  2:39,  
1 user,  load average: 1.76, 1.55, 1.28Tasks: 344 total,   1 running, 343 
sleeping,   0 stopped,   0 zombie%Cpu(s):  5.4 us,  1.0 sy,  0.0 ni, 93.5 id,  
0.1 wa,  0.0 hi,  0.0 si,  0.0 stKiB Mem:  11551564+total, 65702020 used, 
49813632 free,   115072 buffersKiB Swap:        0 total,        0 used,        
0 free.  3148420 cached Mem

I am pretty sure I can push FlinkRunner to way more extent than thisAnd 
thats where true realistic perf numbers start showing up.Thanks+regards,Amir-


   

How can I prove ....

2016-09-21 Thread amir bahmanyari
That all nodes in a Flink Cluster are involved simultaneously in processing the 
data?Programmatically, graphically...I need to stress CPU , MEM and all 
resources to their max.How can I guarantee this is happening in Flink 
Cluster?Out of 4 nodes, this is the highest resource usage I see from 
"top"...Everything else is not even close...top - 22:22:45 up 41 days,  2:39,  
1 user,  load average: 1.76, 1.55, 1.28Tasks: 344 total,   1 running, 343 
sleeping,   0 stopped,   0 zombie%Cpu(s):  5.4 us,  1.0 sy,  0.0 ni, 93.5 id,  
0.1 wa,  0.0 hi,  0.0 si,  0.0 stKiB Mem:  11551564+total, 65702020 used, 
49813632 free,   115072 buffersKiB Swap:        0 total,        0 used,        
0 free.  3148420 cached Mem

I am pretty sure I can push FlinkRunner to way more extent than thisAnd 
thats where true realistic perf numbers start showing up.Thanks+regards,Amir-