[
https://issues.apache.org/jira/browse/TEZ-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011422#comment-14011422
]
Hitesh Shah commented on TEZ-800:
---------------------------------
Comments:
{code}
}
-
+
// inform the vertices
{code}
- whitespace added.
{code}
LOG.info("Inferring vertex: " + outVertex.getVertexName() + " to be "
+ + v.getParallelism() + " from 1-1 connection with vertex "
+ + v.getVertexName());
{code}
- log message is not clear. It will show up as "Inferring vertex: v1 to be
10 from 1-1 connection with v2". Maybe added parallelism/ numtasks or something
similar after 10 ? Or "Inferring vertex parallelism for v1 to be ... "
For checkAndInferOneToOneParallelism(), should this be done using a BFS/DFS
approach? The current approach probably needs some comments to clarify how many
loops it will need to converge. Also, there is no handling for vertices that
have multiple inbound edges of different types or even all inbounds being of
type 1:1. I am assuming some of those will be caught by the consistency check
that follows.
In the tests, it will be good to test with re-ordering of the vertices when
they are added to the DAG. i.e. if the DAG is v1->v2->v3, do dag.add(v3);
add(v2); add(v1).
Is there a reason why this was not caught by the AM itself and parallelism set
as needed in the state machines?
> One-one edge with parallelism -1 fails if source vertex parallelism is not -1
> as well
> -------------------------------------------------------------------------------------
>
> Key: TEZ-800
> URL: https://issues.apache.org/jira/browse/TEZ-800
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.3.0
> Reporter: Rohini Palaniswamy
> Assignee: Bikas Saha
> Attachments: TEZ-800.1.patch
>
>
> // no input initializers. At this moment, only other case is 1-1 edge
> // with uninitialized sources
> boolean hasOneToOneUninitedSource = false;
> for (Map.Entry<Vertex, Edge> entry :
> vertex.sourceVertices.entrySet()) {
> if (entry.getValue().getEdgeProperty().getDataMovementType() ==
> DataMovementType.ONE_TO_ONE) {
> if (entry.getKey().getTotalTasks() == -1) {
> hasOneToOneUninitedSource = true;
> break;
> }
> }
> }
> This checks for the source vertex which has the 1-1 edge to also have -1
> parallelism.
--
This message was sent by Atlassian JIRA
(v6.2#6252)