Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20403#discussion_r164287467

    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ---
    @@ -1043,11 +1043,11 @@ object SQLConf {

       val ARROW_EXECUTION_ENABLE =
         buildConf("spark.sql.execution.arrow.enabled")
    -      .internal()
    -      .doc("Make use of Apache Arrow for columnar data transfers. Currently available " +
    -        "for use with pyspark.sql.DataFrame.toPandas with the following data types: " +
    -        "StringType, BinaryType, BooleanType, DoubleType, FloatType, ByteType, IntegerType, " +
    -        "LongType, ShortType")
    +      .doc("When true, make use of Apache Arrow for columnar data transfers. Currently available " +
    +        "for use with pyspark.sql.DataFrame.toPandas, and " +
    +        "pyspark.sql.SparkSession.createDataFrame when its input is a Pandas DataFrame. " +
    +        "The following data types are unsupported: " +
    +        "MapType, ArrayType of TimestampType, and nested StructType.")
          .booleanConf
          .createWithDefault(false)
    --- End diff --

    `spark.sql.execution.arrow.maxRecordsPerBatch` is also mentioned in the doc change at #19575. Shall we also externalize it?
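For context, a minimal sketch of how a user would exercise this flag from PySpark once it is externalized. This is illustrative only, not part of the diff: the app name, column names, and data are made up, and it assumes Spark 2.3+ with `pandas` and `pyarrow` installed.

```python
# Minimal sketch (illustrative): opting in to Arrow-backed columnar transfers.
# Assumes Spark 2.3+ with pandas and pyarrow available; names/data are made up.
import pandas as pd
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("arrow-demo").getOrCreate()

# The externalized config under discussion: off by default, enabled per session.
spark.conf.set("spark.sql.execution.arrow.enabled", "true")

pdf = pd.DataFrame({"id": [1, 2, 3], "value": [0.1, 0.2, 0.3]})

# createDataFrame from a Pandas DataFrame goes through Arrow when the flag is
# on and all column types are supported (per the doc: MapType, ArrayType of
# TimestampType, and nested StructType are not).
df = spark.createDataFrame(pdf)

# toPandas uses Arrow under the same conditions.
result = df.toPandas()
print(result)
```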