[
https://issues.apache.org/jira/browse/FLINK-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14098118#comment-14098118
]
Hung Chang commented on FLINK-1017:
-----------------------------------
Thank you Fabian.
Could I ask about the general idea to distribute the parallelism at job level?
For example. After setting
int degreeOfParallelism = 3;
ExecutionEnvironment.setDefaultLocalParallelism(degreeOfParallelism);
final ExecutionEnvironment env =
ExecutionEnvironment.getExecutionEnvironment();
(My little sample
https://github.com/HungUnicorn/Test/blob/master/ParallelWordCount line 63)
It will retrieve the following result.
...
2> (action, 1)
2> (against, 1)
2> (and, 12)
1> (a, 5)
0> (of, 15)
....
Does it mean there are three threads 0,1,2 run DataSource -> Map -> Reduce ->
DataSink? Is it thread?
On the other hand, could I ask the idea to dispatch thread2 takes "action" &
"against" while thread1 takes "a", thread0 takes "of"?
Shall I read some documentation?
Thanks for your reading.
> Add setParallelism() to Java API documentation
> ----------------------------------------------
>
> Key: FLINK-1017
> URL: https://issues.apache.org/jira/browse/FLINK-1017
> Project: Flink
> Issue Type: Task
> Components: Documentation
> Affects Versions: 0.6-incubating, pre-apache-0.5
> Reporter: Fabian Hueske
> Assignee: Hung Chang
> Priority: Minor
> Labels: starter
> Fix For: 0.6-incubating
>
>
> The Java API offers {{setParallelism()}} to control the degree of parallelism
> for each operator. This feature is not documented and should be added.
--
This message was sent by Atlassian JIRA
(v6.2#6252)