[ https://issues.apache.org/jira/browse/SPARK-35807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17383698#comment-17383698 ]
Apache Spark commented on SPARK-35807: -------------------------------------- User 'itholic' has created a pull request for this issue: https://github.com/apache/spark/pull/33379 > Deprecate the `num_files` argument > ---------------------------------- > > Key: SPARK-35807 > URL: https://issues.apache.org/jira/browse/SPARK-35807 > Project: Spark > Issue Type: Sub-task > Components: PySpark > Affects Versions: 3.2.0 > Reporter: Haejoon Lee > Priority: Major > > We should deprecate the num_files argument in [DataFrame.to_csv > |https://koalas.readthedocs.io/en/latest/reference/api/databricks.koalas.DataFrame.to_csv.html]and > > [DataFrame.to_json|https://koalas.readthedocs.io/en/latest/reference/api/databricks.koalas.DataFrame.to_json.html]. > Because the behavior of num_files is not actually specify the number of > files, but it specifies the number of partition. > So we should encourage users to use > [DataFrame.spark.repartition|https://koalas.readthedocs.io/en/latest/reference/api/databricks.koalas.DataFrame.spark.repartition.html] > instead in the warning message. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org