Hyukjin Kwon created SPARK-39301: ------------------------------------ Summary: Levearge LocalRelation in createDataFrame with Arrow optimization Key: SPARK-39301 URL: https://issues.apache.org/jira/browse/SPARK-39301 Project: Spark Issue Type: Improvement Components: PySpark Affects Versions: 3.4.0 Reporter: Hyukjin Kwon
Currently, we use {{LogicalRDD}} that always creates an RDD. in Spark SQL, we have some nice optimization with {{LocalRelation}}. We should leverage this in {{createDataFrame}} in PySpark with Arrow optimization to boost the speed up. -- This message was sent by Atlassian Jira (v8.20.7#820007) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org