Repository: incubator-samza
Updated Branches:
  refs/heads/master 41d17568d -> a73c4cbc9


SAMZA-370; document job.systemstreampartition.grouper.factory in config table


Project: http://git-wip-us.apache.org/repos/asf/incubator-samza/repo
Commit: http://git-wip-us.apache.org/repos/asf/incubator-samza/commit/a73c4cbc
Tree: http://git-wip-us.apache.org/repos/asf/incubator-samza/tree/a73c4cbc
Diff: http://git-wip-us.apache.org/repos/asf/incubator-samza/diff/a73c4cbc

Branch: refs/heads/master
Commit: a73c4cbc9a1198774d9720e82d6eb515eb9ad8a2
Parents: 41d1756
Author: Chris Riccomini <[email protected]>
Authored: Tue Aug 19 13:58:20 2014 -0700
Committer: Chris Riccomini <[email protected]>
Committed: Tue Aug 19 13:58:20 2014 -0700

----------------------------------------------------------------------
 .../versioned/jobs/configuration-table.html          | 15 +++++++++++++++
 1 file changed, 15 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/incubator-samza/blob/a73c4cbc/docs/learn/documentation/versioned/jobs/configuration-table.html
----------------------------------------------------------------------
diff --git a/docs/learn/documentation/versioned/jobs/configuration-table.html 
b/docs/learn/documentation/versioned/jobs/configuration-table.html
index d7a5cf7..0c74167 100644
--- a/docs/learn/documentation/versioned/jobs/configuration-table.html
+++ b/docs/learn/documentation/versioned/jobs/configuration-table.html
@@ -188,6 +188,21 @@
                 </tr>
 
                 <tr>
+                    <td class="property" 
id="job-systemstreampartition-grouper-factory">job.systemstreampartition.grouper.factory</td>
+                    <td 
class="default">org.apache.samza.container.grouper.stream.GroupByPartitionFactory</td>
+                    <td class="description">
+                        A factory class that is used to determine how input 
SystemStreamPartitions are grouped together for processing in individual 
StreamTask instances. The factory must implement the 
SystemStreamPartitionGrouperFactory interface. Once this configuration is set, 
it can't be changed, since doing so could violate state semantics, and lead to 
a loss of data.
+
+                        <dl>
+                          
<dt><code>org.apache.samza.container.grouper.stream.GroupByPartitionFactory</code></dt>
+                          <dd>Groups input stream partitions according to 
their partition number. This grouping leads to a single StreamTask processing 
all messages for a single partition (e.g. partition 0) across all input streams 
that have a partition 0. Therefore, the default is that you get one StreamTask 
for all input partitions with the same partition number. Using this strategy, 
if two input streams have a partition 0, then messages from both partitions 
will be routed to a single StreamTask. This partitioning strategy is useful for 
joining and aggregating streams.</dt>
+                          
<dt><code>org.apache.samza.container.grouper.stream.GroupBySystemStreamPartitionFactory</code></dt>
+                          <dd>Assigns each SystemStreamPartition to its own 
unique StreamTask. The  GroupBySystemStreamPartitionFactory is useful in cases 
where you want increased parallelism (more containers), and don't care about 
co-locating partitions for grouping or joins, since it allows for a greater 
number of StreamTasks to be divided up amongst Samza containers.</dd>
+                        </dl>
+                    </td>
+                </tr>
+
+                <tr>
                     <th colspan="3" class="section" id="task"><a 
href="../api/overview.html">Task configuration</a></th>
                 </tr>
 

Reply via email to