[
https://issues.apache.org/jira/browse/FLINK-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14093215#comment-14093215
]
Fabian Hueske commented on FLINK-1017:
--------------------------------------
Hi,
with {{setParallelism()}} you specify how many parallel instances of each
operator are executed.
If you have a program like {{DataSource -> Map -> Reduce -> DataSink}} and set
the default parallelism to 2, your program will be executed with 2 DataSource
tasks, 2 Map tasks, 2 Reduce tasks, and 2 DataSink tasks. You can also set the
parallelism of each operator individually.
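As a rough illustration of that rule, here is a minimal sketch in plain Java. It does not use the Flink API. The class {{ParallelismSketch}} and the method {{taskCounts}} are hypothetical names made up for this example. The sketch only models the bookkeeping: every operator gets the default parallelism unless an individual value has been set for it.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model, NOT Flink code: it only computes how many parallel
// task instances each operator would get under the rule described above.
public class ParallelismSketch {

    // Returns operator name -> number of parallel task instances.
    // An entry in 'overrides' stands in for an individual per-operator setting;
    // operators without an entry fall back to the default parallelism.
    public static Map<String, Integer> taskCounts(List<String> operators,
                                                  int defaultParallelism,
                                                  Map<String, Integer> overrides) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String op : operators) {
            counts.put(op, overrides.getOrDefault(op, defaultParallelism));
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> program = Arrays.asList("DataSource", "Map", "Reduce", "DataSink");

        // Default parallelism 2, no individual settings: 2 tasks per operator.
        System.out.println(taskCounts(program, 2, Collections.<String, Integer>emptyMap()));

        // Same default, but Reduce is individually set to 4 (value chosen for illustration).
        System.out.println(taskCounts(program, 2, Collections.singletonMap("Reduce", 4)));
    }
}
```

With the default of 2 and no overrides, all four operators map to 2 task instances, which matches the example above. In the second call only Reduce changes.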
Internally, parallel task instances are executed in separate threads (some task
instances are merged together and executed in the same thread, some task
instances require more than one thread).
> Add setParallelism() to Java API documentation
> ----------------------------------------------
>
> Key: FLINK-1017
> URL: https://issues.apache.org/jira/browse/FLINK-1017
> Project: Flink
> Issue Type: Task
> Components: Documentation
> Affects Versions: 0.6-incubating, pre-apache-0.5
> Reporter: Fabian Hueske
> Assignee: Hung Chang
> Priority: Minor
> Labels: starter
> Fix For: 0.6-incubating
>
>
> The Java API offers {{setParallelism()}} to control the degree of parallelism
> for each operator. This feature is not documented and should be added.