[ 
https://issues.apache.org/jira/browse/ARROW-14902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dragoș Moldovan-Grünfeld reassigned ARROW-14902:
------------------------------------------------

    Assignee: Dragoș Moldovan-Grünfeld

> [R] Update write_csv_arrow() to support all args of readr::write_csv()
> ----------------------------------------------------------------------
>
>                 Key: ARROW-14902
>                 URL: https://issues.apache.org/jira/browse/ARROW-14902
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>            Reporter: Dragoș Moldovan-Grünfeld
>            Assignee: Dragoș Moldovan-Grünfeld
>            Priority: Major
>
> Currently (arrow version 6.0.1 and readr version 2.1.0) we only support a few 
> of the {{readr::write_csv()}} arguments. Once ARROW-13623 is fixed 
> {{write_csv_arrow()}} will error if the user passes unsupported {{readr}} 
> arguments. 
> The following arguments need CsvWriteOptions (see linked issues) in order to 
> be exposed to R users:
>  * {{{}na{}}}: string used for missing values. Defaults to {{{}NA{}}}. 
> Missing values are never quoted; strings with the same value as {{na}} will 
> always be quoted.
>  * {{{}append{}}}: boolean. If \{[FALSE}} will overwrite existing file. If 
> {{TRUE}} will append to existing file. In both cases, if the file doesn't 
> exist, a new file is created.
>  * {{{}quote{}}}: how to handle fields which contain characters that need to 
> be quoted:
>  ** {{{}needed{}}}: only quote fields which need them
>  ** {{{}all{}}}: quote all fields - I think this might be the implicit 
> default behaviour for `write_csv_arrow()`
>  ** {{{}none{}}}: never quote fields
>  * {{{}escape{}}}: the type of escape to use when quotes are in the data:
>  ** {{{}double{}}}: quotes are escaped by doubling them
>  ** {{{}backslash{}}}: quotes are escaped by a preceding backslash
>  ** {{{}none{}}}: quotes are not escaped
>  * {{{}eol{}}}: the end of line character to use. Most commonly either "\n" 
> for Unix style newlines, or "\r\n" for Windows style newlines.
>  
> Once these are enabled, update the signature of `write_csv_arrow()` and 
> compare written files. 
> From ARROW-13623 "I noticed we had a difference in quoting: readr doesn't 
> quote strings by default but we do." Once we have more control over quoting, 
> we could write some tests to make sure default behaviours between 
> {{write_csv_arrow() and {{readr::write_csv()}}}} match.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to