[
https://issues.apache.org/jira/browse/SPARK-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029576#comment-14029576
]
Colin Patrick McCabe commented on SPARK-2086:
---------------------------------------------
Hi [~GregOwen], I took a look at improving {{RDD#toDebugString}} (before I
noticed this JIRA). I can take this one if you like (if you haven't already
started on it.)
> Improve output of toDebugString to make shuffle boundaries more clear
> ---------------------------------------------------------------------
>
> Key: SPARK-2086
> URL: https://issues.apache.org/jira/browse/SPARK-2086
> Project: Spark
> Issue Type: Improvement
> Reporter: Patrick Wendell
> Assignee: Gregory Owen
> Priority: Minor
>
> It would be nice if the toDebugString method of an RDD did a better job of
> explaining where shuffle boundaries occur in the lineage graph. One way to do
> this would be to only indent the tree at a shuffle boundary instead of
> indenting it for every parent.
> We can determine when a shuffle boundary occurs based on the type of
> dependency seen in the RDD.
--
This message was sent by Atlassian JIRA
(v6.2#6252)