Imran Rashid created SPARK-5786:
-----------------------------------

Summary: Documentation of Narrow Dependencies
Key: SPARK-5786
URL: https://issues.apache.org/jira/browse/SPARK-5786
Project: Spark
Issue Type: Improvement
Components: Documentation
Reporter: Imran Rashid

Narrow dependencies can significantly improve job performance by skipping shuffles entirely. However, aside from being mentioned in some early papers and at some meetups, they aren't explained (or even mentioned) in the docs.
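
To make the point about skipped shuffles concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark code: `partition_of`, `partition_by_key`, `join_co_partitioned`, and `NUM_PARTITIONS` are hypothetical names invented for this illustration. It assumes both datasets were already partitioned by the same partitioner with the same number of partitions. Under that assumption, output partition i of a join only needs partition i of each parent, so no records have to move between partitions.

```python
from collections import defaultdict

NUM_PARTITIONS = 4  # hypothetical partition count for this toy model


def partition_of(key, num_partitions=NUM_PARTITIONS):
    """Toy partitioner: maps an integer key to a partition index."""
    return key % num_partitions


def partition_by_key(records, num_partitions=NUM_PARTITIONS):
    """Place each (key, value) record in the partition the partitioner picks."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in records:
        partitions[partition_of(key, num_partitions)].append((key, value))
    return partitions


def join_co_partitioned(left_parts, right_parts):
    """Join two datasets that share a partitioner, one partition pair at a time.

    Output partition i reads only left_parts[i] and right_parts[i], so each
    parent partition feeds exactly one child partition (a narrow dependency).
    """
    joined = []
    for left, right in zip(left_parts, right_parts):
        # Index the right-hand partition by key for the local join.
        right_index = defaultdict(list)
        for key, value in right:
            right_index[key].append(value)
        joined.append(
            [(k, (lv, rv)) for k, lv in left for rv in right_index[k]]
        )
    return joined


users = partition_by_key([(1, "ann"), (2, "bob"), (5, "cy")])
orders = partition_by_key([(1, "book"), (5, "pen"), (5, "ink"), (6, "cup")])
result = join_co_partitioned(users, orders)
print(result[1])
```

In Spark itself, the analogous situation arises when two pair RDDs have been partitioned with the same `Partitioner` (for example through `partitionBy`) before being joined. The sketch only illustrates why the per-partition work can then proceed without redistributing data; it does not model Spark's scheduler or stage boundaries.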