[ 
https://issues.apache.org/jira/browse/SPARK-5786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15839788#comment-15839788
 ] 

Hyukjin Kwon commented on SPARK-5786:
-------------------------------------

It seems they are documented, at least, in API docs, e.g., 
https://github.com/apache/spark/blob/4cb49412d1d7d10ffcc738475928c7de2bc59fd4/core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala#L55-L61

> Documentation of Narrow Dependencies
> ------------------------------------
>
>                 Key: SPARK-5786
>                 URL: https://issues.apache.org/jira/browse/SPARK-5786
>             Project: Spark
>          Issue Type: Improvement
>          Components: Documentation
>            Reporter: Imran Rashid
>
> Narrow dependencies can really improve job performance by skipping shuffles 
> entirely.  However aside from being mentioned in some early papers and during 
> some meetups, they aren't explained (or even mentioned) in the docs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to