[ 
https://issues.apache.org/jira/browse/FLINK-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14093215#comment-14093215
 ] 

Fabian Hueske commented on FLINK-1017:
--------------------------------------

Hi,

With {{setParallelism()}} you specify how many parallel instances of each 
operator are executed.
If you have a program like {{DataSource -> Map -> Reduce -> DataSink}} and set 
the default parallelism to 2, your program will be executed with 2 DataSource 
tasks, 2 Map tasks, 2 Reduce tasks, and 2 DataSink tasks. You can also set the 
parallelism of each operator individually.
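
To make the counting rule concrete, here is a small toy model in plain Java. 
It is not Flink code and uses no Flink classes: the class name 
{{ParallelismSketch}} and the method {{taskCounts}} are made up for this 
illustration. It only assumes that an operator without its own setting falls 
back to the default parallelism.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Toy model only: this is NOT the Flink API, just the counting rule.
public class ParallelismSketch {

    /**
     * Returns the number of parallel task instances per operator.
     * An operator listed in 'overrides' uses that value; every other
     * operator uses 'defaultParallelism'.
     */
    public static Map<String, Integer> taskCounts(String[] operators,
                                                  int defaultParallelism,
                                                  Map<String, Integer> overrides) {
        // LinkedHashMap keeps the operators in plan order.
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String op : operators) {
            counts.put(op, overrides.getOrDefault(op, defaultParallelism));
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] plan = {"DataSource", "Map", "Reduce", "DataSink"};

        // Default parallelism 2, no per-operator settings.
        // prints {DataSource=2, Map=2, Reduce=2, DataSink=2}
        System.out.println(taskCounts(plan, 2, Collections.<String, Integer>emptyMap()));

        // Same default, but the Map operator is individually set to 4.
        // prints {DataSource=2, Map=4, Reduce=2, DataSink=2}
        System.out.println(taskCounts(plan, 2, Collections.singletonMap("Map", 4)));
    }
}
```

In the second call only the Map operator deviates from the default, which 
corresponds to setting the parallelism of a single operator individually.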

Internally, parallel task instances are executed in separate threads. The 
mapping is not strictly one-to-one: some task instances are merged together 
and executed in the same thread, while others require more than one thread.


> Add setParallelism() to Java API documentation
> ----------------------------------------------
>
>                 Key: FLINK-1017
>                 URL: https://issues.apache.org/jira/browse/FLINK-1017
>             Project: Flink
>          Issue Type: Task
>          Components: Documentation
>    Affects Versions: 0.6-incubating, pre-apache-0.5
>            Reporter: Fabian Hueske
>            Assignee: Hung Chang
>            Priority: Minor
>              Labels: starter
>             Fix For: 0.6-incubating
>
>
> The Java API offers {{setParallelism()}} to control the degree of parallelism 
> for each operator. This feature is not documented and should be added.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
