This is an automated email from the ASF dual-hosted git repository. okislal pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/madlib.git
The following commit(s) were added to refs/heads/master by this push: new 8a612fe Add input params to last part of WCC impl 8a612fe is described below commit 8a612fe1bdb46d3319cb8895b4a6abf4e67c81c9 Author: Domino Valdano <dvald...@pivotal.io> AuthorDate: Tue Jul 9 17:04:57 2019 -0700 Add input params to last part of WCC impl Also, change name of algorithm from "Update oldupdate table" to "Update message table" to match rest of text. Closes: #420 --- doc/design/modules/graph.tex | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/doc/design/modules/graph.tex b/doc/design/modules/graph.tex index 7df2bc6..9e660e6 100644 --- a/doc/design/modules/graph.tex +++ b/doc/design/modules/graph.tex @@ -125,7 +125,7 @@ negative cycle in the graph. \begin{algorithm}[SSSP$(V,E,start)$] \label{alg:sssp} -\alginput{Vertex set $v$, edge set $E$, starting vertex $start$} +\alginput{Vertex set $V$, edge set $E$, starting vertex $start$} \algoutput{Distance and parent set for every vertex $cur$} \begin{algorithmic}[1] \State $toupdate(0) \set (start,0,start)$ @@ -600,7 +600,8 @@ WHERE newupdate.id = toupdate.id Finally, the $message$ table is updated with potential new component IDs for active vertices using the following query: -\begin{algorithm}[Update oldupdate table] \label{wcc:message} +\begin{algorithm}[Update message table$(toupdate, edge)$] +\label{alg:wcc:message} \begin{lstlisting} CREATE TEMP TABLE message AS SELECT id, MIN(component_id) AS component_id @@ -631,7 +632,7 @@ potential performance drawback in Greenplum systems. In general, we advise that the edge tables should be distributed by their source columns. However, in WCC, we use both source and destination columns of the edge table in JOIN clauses. In addition, we employ a GROUP BY clause using the column that did -not serve as the join key. Algorithm~\ref{wcc:message} shows that when $dest$ +not serve as the join key. Algorithm~\ref{alg:wcc:message} shows that when $dest$ is used for the JOIN clause, $src$ is renamed to $id$ to be used for GROUP BY and vice versa. This query forces multiple redistribute motions in the database which might cause performance degradation. To address this issue, we