Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/8280#issuecomment-132320966 Could you provide some more information about the map output ? The reducer locality should not kick in unless a certain map output location has more than 20% of the output data. How many map tasks were run and what were their output sizes ?
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org