[ 
https://issues.apache.org/jira/browse/FLINK-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14093667#comment-14093667
 ] 

Hung Chang commented on FLINK-1017:
-----------------------------------

Hi, Fabian,

Cool and very concrete explanation. 

Does it mean the program will decide the <tt>partition</tt> (parallelism = 2) 
considering some factors?
{(1) your data and the number of intermediate keys you are generating in your 
mappers and (2) the partitioner and distribution of map (combiner) output keys. 
from http://pig.apache.org/docs/r0.8.1/cookbook.html}

Could I borrow your explanation as the core idea to write the documentation?
Are there some materials recommended to read that help implement for seeing 
<tt>DataSource -&gt; Map -&gt; Reduce -&gt; DataSink</tt> being spilited? 

> Add setParallelism() to Java API documentation
> ----------------------------------------------
>
>                 Key: FLINK-1017
>                 URL: https://issues.apache.org/jira/browse/FLINK-1017
>             Project: Flink
>          Issue Type: Task
>          Components: Documentation
>    Affects Versions: 0.6-incubating, pre-apache-0.5
>            Reporter: Fabian Hueske
>            Assignee: Hung Chang
>            Priority: Minor
>              Labels: starter
>             Fix For: 0.6-incubating
>
>
> The Java API offers {{setParallelism()}} to control the degree of parallelism 
> for each operator. This feature is not documented and should be added.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to