[ 
https://issues.apache.org/jira/browse/SPARK-41553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bjørn Jørgensen updated SPARK-41553:
------------------------------------
    Description: 

num_files has been deprecated and might be removed in a future version. "
"Use DataFrame.spark.repartition instead.",

The num_files argument doesn't manage the number of files, but specifying the 
partition number.

  was:
Functions have this signature. 

 
def to_json(
(..)
num_files: Optional[int] = None,
 
 
.. note:: pandas-on-Spark writes JSON files into the directory, `path`, and 
writes
multiple `part-...` files in the directory when `path` is specified.
This behavior was inherited from Apache Spark. The number of files can
be controlled by `num_files`.
 
 
 
if num_files is not None:
warnings.warn(
"`num_files` has been deprecated and might be removed in a future version. "
"Use `DataFrame.spark.repartition` instead.",
FutureWarning,
)
 
 
I will change num_files to repartition


> Fix the documentation for num_files
> -----------------------------------
>
>                 Key: SPARK-41553
>                 URL: https://issues.apache.org/jira/browse/SPARK-41553
>             Project: Spark
>          Issue Type: Improvement
>          Components: Pandas API on Spark
>    Affects Versions: 3.4.0
>            Reporter: Bjørn Jørgensen
>            Priority: Major
>
> num_files has been deprecated and might be removed in a future version. "
> "Use DataFrame.spark.repartition instead.",
> The num_files argument doesn't manage the number of files, but specifying the 
> partition number.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to