[GitHub] spark issue #19634: [SPARK-22412][SQL] Fix incorrect comment in DataSourceSc...

vgankidi Wed, 08 Nov 2017 14:31:36 -0800

Github user vgankidi commented on the issue:

    https://github.com/apache/spark/pull/19634
  
    We will end up having fewer combined splits. That reduces the number of 
files that the job produces and also reduces the number of tasks in the 
downstream jobs. In some tests I have noticed about 10% reduction in the 
combined splits. However, the simple implementation of FFD has O(n^2) run time.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19634: [SPARK-22412][SQL] Fix incorrect comment in DataSourceSc...

Reply via email to