Github user ash211 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163464207
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -445,16 +445,25 @@ case class FileSourceScanExec(
Github user glentakahashi commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163427261
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -445,16 +445,25 @@ case class FileSourceScanExec(
Github user glentakahashi commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163426554
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -445,16 +445,25 @@ case class FileSourceScanExec(
Github user ash211 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163419745
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategySuite.scala
---
@@ -142,15 +142,16 @@ class
Github user ash211 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163424784
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -445,16 +445,25 @@ case class FileSourceScanExec(
Github user ash211 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163419675
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategySuite.scala
---
@@ -142,15 +142,16 @@ class
Github user ash211 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20372#discussion_r163424415
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -445,16 +445,25 @@ case class FileSourceScanExec(
GitHub user glentakahashi opened a pull request:
https://github.com/apache/spark/pull/20372
Improved block merging logic for partitions
## What changes were proposed in this pull request?
Change DataSourceScanExec so that when grouping blocks together into
partitions, also