Github user Achuth17 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21608#discussion_r208295767
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceUtils.scala
 ---
    @@ -49,4 +51,11 @@ object DataSourceUtils {
           }
         }
       }
    +
    +  // SPARK-15895: Metadata files (e.g. Parquet summary files) and 
temporary files should not be
    +  // counted as data files, so that they shouldn't participate partition 
discovery.
    +  private[sql] def isDataPath(path: Path): Boolean = {
    +    val name = path.getName
    +    !((name.startsWith("_") && !name.contains("=")) || 
name.startsWith("."))
    --- End diff --
    
    Should I use the earlier implementation with a simple if condition?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to