[ 
https://issues.apache.org/jira/browse/ARROW-18049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nicola Crane updated ARROW-18049:
---------------------------------
        Parent: ARROW-18181
    Issue Type: Sub-task  (was: Improvement)

> [R] Support column renaming in col_select argument to file reading functions
> ----------------------------------------------------------------------------
>
>                 Key: ARROW-18049
>                 URL: https://issues.apache.org/jira/browse/ARROW-18049
>             Project: Apache Arrow
>          Issue Type: Sub-task
>          Components: R
>            Reporter: Nicola Crane
>            Priority: Major
>              Labels: good-first-issue
>
> We should support the ability to rename columns when reading in data via the 
> CSV/Parquet/Feather/JSON file readers.
> We currently have an argument {{col_select}}, which allows users to choose 
> which columns to read in, but renaming doesn't work.  
> To implement this, we'd need to check if any columns have been renamed by 
> {{col_select}} and then updating the schema of the object being returned once 
> the file has been read.
> {code:r}
> library(readr)
> library(arrow)
> readr::read_csv(readr_example("mtcars.csv"), col_select = c(not_hp = hp))
> #> # A tibble: 32 × 1
> #>    not_hp
> #>     <dbl>
> #>  1    110
> #>  2    110
> #>  3     93
> #>  4    110
> #>  5    175
> #>  6    105
> #>  7    245
> #>  8     62
> #>  9     95
> #> 10    123
> #> # … with 22 more rows
> arrow::read_csv_arrow(readr_example("mtcars.csv"), col_select = c(not_hp = 
> hp))
> #> # A tibble: 32 × 1
> #>       hp
> #>    <int>
> #>  1   110
> #>  2   110
> #>  3    93
> #>  4   110
> #>  5   175
> #>  6   105
> #>  7   245
> #>  8    62
> #>  9    95
> #> 10   123
> #> # … with 22 more rows
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to