jonkeane commented on a change in pull request #11681:
URL: https://github.com/apache/arrow/pull/11681#discussion_r748414843
##########
File path: r/R/parquet.R
##########
@@ -82,7 +82,7 @@ read_parquet <- function(file,
#' @param x `data.frame`, [RecordBatch], or [Table]
#' @param sink A string file path, URI, or [OutputStream], or path in a file
#' system (`SubTreeFileSystem`)
-#' @param chunk_size chunk size in number of rows. If NULL, the total number
of rows is used.
+#' @param chunk_size how many rows of data to write to disk at once. If NULL,
the total number of rows is used.
Review comment:
```suggestion
#' @param chunk_size how many rows of data to write to disk at once. This
directly corresponds to how many rows will be in each row group in parquet. If
NULL, a best guess will be made for optimal size (based on the number of
columns and number of rows), or the total number of rows is used if the data
has fewer than 250 million cells (rows x cols).
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]