paleolimbot commented on code in PR #33748:
URL: https://github.com/apache/arrow/pull/33748#discussion_r1073852493


##########
r/NEWS.md:
##########
@@ -19,6 +19,77 @@
 
 # arrow 10.0.1.9000
 
+## New features
+
+### Docs
+
+* A substantial reorganisation, rewrite of and addition to, many of the 
+  vignettes and README. (@djnavarro, #14514)  
+
+### Reading/writing data
+
+* New functions `open_csv_dataset()`, `open_ts_dataset()`, and 

Review Comment:
   ```suggestion
   * New functions `open_csv_dataset()`, `open_tsv_dataset()`, and 
   ```



##########
r/NEWS.md:
##########
@@ -19,6 +19,77 @@
 
 # arrow 10.0.1.9000
 
+## New features
+
+### Docs
+
+* A substantial reorganisation, rewrite of and addition to, many of the 
+  vignettes and README. (@djnavarro, #14514)  
+
+### Reading/writing data
+
+* New functions `open_csv_dataset()`, `open_ts_dataset()`, and 
+  `open_delim_dataset()` all wrap `open_dataset()`- they don't provide new 
+  functionality, but allow for readr-style options to be supplied, making it 
+  simpler to switch between individual file-reading and dataset 
+  functionality. (#33614)
+* User-defined null values can now be set when writing CSVs both as datasets 
+  and as individual files. (@wjones127, #14679)
+* The new `col_names` parameter allows specification of column names when 
+  opening a CSV dataset. (@wjones127, #14705)
+* The `parse_options`, `read_options`, and `convert_options` parameters for 
+  reading individual files and datasets can now be passed in as lists. (#15270)

Review Comment:
   Which functions does this apply to? `read_XXX_arrow()` and `open_XXX_dataset()`?



##########
r/NEWS.md:
##########
@@ -19,6 +19,77 @@
 
 # arrow 10.0.1.9000
 
+## New features
+
+### Docs
+
+* A substantial reorganisation, rewrite of and addition to, many of the 
+  vignettes and README. (@djnavarro, #14514)  
+
+### Reading/writing data
+
+* New functions `open_csv_dataset()`, `open_ts_dataset()`, and 
+  `open_delim_dataset()` all wrap `open_dataset()`- they don't provide new 
+  functionality, but allow for readr-style options to be supplied, making it 
+  simpler to switch between individual file-reading and dataset 
+  functionality. (#33614)
+* User-defined null values can now be set when writing CSVs both as datasets 
+  and as individual files. (@wjones127, #14679)
+* The new `col_names` parameter allows specification of column names when 
+  opening a CSV dataset. (@wjones127, #14705)
+* The `parse_options`, `read_options`, and `convert_options` parameters for 
+  reading individual files and datasets can now be passed in as lists. (#15270)
+
+### Function bindings
+
+The following functions can now be used in queries on Arrow objects:
+* `lubridate::with_tz()` and `lubridate::force_tz()` (@eitsupi, #14093)
+* `stringr::str_remove()` and `stringr::str_remove_all()` (#14644)
+
+### Installation
+
+* The package can now be installed offline using pre-downloaded binaries. 
+  (@pgramme, #14086)
+* The package can now automatically link to system installations of the AWS SDK
+  for C++. (@kou, #14235)
+
+### Other
+
+* New dplyr (1.1.0) function `join_by()` has been implemented for dplyr joins 
+  on Arrow objects (equality conditions only).  (#33664)
+* StructArray objects can now be created directly via `StructArray$create()`. 
+  (#14922)
+* curl timeout policy can now be configured for S3. (#15166)
+
+## Minor improvements and fixes
+
+* `map_batches()` now is lazy by default. (#14521)
+* Arrays of Decimal type objects can now be created directly and without 
+  casting. (#15211)
+* Calling `lubridate::as_datetime()` on Arrow objects now can handle time in 
+  sub-seconds. (@eitsupi, #13890)
+* `head()` can now be called after `as_record_batch_read()` without error. 
+  (#14518)
+* Fix for a bug in which `dplyr::right_join()` did not coalesce keys. (#15077)
+* Fix for a bug in output returned when multiple 
+  `dplyr::group_by()`/`dplyr::summarise()` calls are used. (#14905)
+* Fix for a bug in which `dplyr::summarize()` fails with division when divisor 
+  is a variable. (#14933)
+* Fix for a bug in which `as.Date()` fails going from `timestamp[us]` to
+  `timestamp[s]`. (#14935)
+* Fix for a bug in which creating an Array from an object bigger than 2^31 
+  results in an Array of length 0. (#14929)
+* Fix for a bug in which accents in file paths caused an error in 
+  `read_csv_arrow()`. (#14930)

Review Comment:
   ```suggestion
   * Fix for a bug in which error messages with non-ASCII characters
     were displayed incorrectly in non-UTF-8 locales. (#14930)
   ```



##########
r/NEWS.md:
##########
@@ -19,6 +19,77 @@
 
 # arrow 10.0.1.9000
 
+## New features
+
+### Docs
+
+* A substantial reorganisation, rewrite of and addition to, many of the 
+  vignettes and README. (@djnavarro, #14514)  
+
+### Reading/writing data
+
+* New functions `open_csv_dataset()`, `open_ts_dataset()`, and 
+  `open_delim_dataset()` all wrap `open_dataset()`- they don't provide new 
+  functionality, but allow for readr-style options to be supplied, making it 
+  simpler to switch between individual file-reading and dataset 
+  functionality. (#33614)
+* User-defined null values can now be set when writing CSVs both as datasets 
+  and as individual files. (@wjones127, #14679)
+* The new `col_names` parameter allows specification of column names when 
+  opening a CSV dataset. (@wjones127, #14705)
+* The `parse_options`, `read_options`, and `convert_options` parameters for 
+  reading individual files and datasets can now be passed in as lists. (#15270)
+
+### Function bindings
+
+The following functions can now be used in queries on Arrow objects:
+* `lubridate::with_tz()` and `lubridate::force_tz()` (@eitsupi, #14093)
+* `stringr::str_remove()` and `stringr::str_remove_all()` (#14644)
+
+### Installation
+
+* The package can now be installed offline using pre-downloaded binaries. 
+  (@pgramme, #14086)
+* The package can now automatically link to system installations of the AWS SDK
+  for C++. (@kou, #14235)
+
+### Other
+
+* New dplyr (1.1.0) function `join_by()` has been implemented for dplyr joins 
+  on Arrow objects (equality conditions only).  (#33664)
+* StructArray objects can now be created directly via `StructArray$create()`. 
+  (#14922)
+* curl timeout policy can now be configured for S3. (#15166)
+
+## Minor improvements and fixes
+
+* `map_batches()` now is lazy by default. (#14521)
+* Arrays of Decimal type objects can now be created directly and without 
+  casting. (#15211)

Review Comment:
   ```suggestion
   * Decimal arrays can now be created in `Array$create()` without
     casting. (#15211)
   ```



##########
r/NEWS.md:
##########
@@ -19,6 +19,77 @@
 
 # arrow 10.0.1.9000
 
+## New features
+
+### Docs
+
+* A substantial reorganisation, rewrite of and addition to, many of the 
+  vignettes and README. (@djnavarro, #14514)  
+
+### Reading/writing data
+
+* New functions `open_csv_dataset()`, `open_ts_dataset()`, and 
+  `open_delim_dataset()` all wrap `open_dataset()`- they don't provide new 
+  functionality, but allow for readr-style options to be supplied, making it 
+  simpler to switch between individual file-reading and dataset 
+  functionality. (#33614)
+* User-defined null values can now be set when writing CSVs both as datasets 
+  and as individual files. (@wjones127, #14679)
+* The new `col_names` parameter allows specification of column names when 
+  opening a CSV dataset. (@wjones127, #14705)
+* The `parse_options`, `read_options`, and `convert_options` parameters for 
+  reading individual files and datasets can now be passed in as lists. (#15270)
+
+### Function bindings
+
+The following functions can now be used in queries on Arrow objects:
+* `lubridate::with_tz()` and `lubridate::force_tz()` (@eitsupi, #14093)
+* `stringr::str_remove()` and `stringr::str_remove_all()` (#14644)
+
+### Installation
+
+* The package can now be installed offline using pre-downloaded binaries. 
+  (@pgramme, #14086)

Review Comment:
   ```suggestion
   * Improved offline installation using pre-downloaded binaries. 
     (@pgramme, #14086)
   ```



