[ https://issues.apache.org/jira/browse/ARROW-12992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alessandro Molina updated ARROW-12992: -------------------------------------- Fix Version/s: (was: 5.0.0) 6.0.0 > [R] bindings for substr(), substring(), str_sub() > ------------------------------------------------- > > Key: ARROW-12992 > URL: https://issues.apache.org/jira/browse/ARROW-12992 > Project: Apache Arrow > Issue Type: New Feature > Components: R > Reporter: Neal Richardson > Assignee: Mauricio 'PachĂĄ' Vargas SepĂșlveda > Priority: Major > Labels: pull-request-available > Fix For: 6.0.0 > > Time Spent: 20m > Remaining Estimate: 0h > > Followup to ARROW-10557, which implemented the C++ > current state: > {code:r} > library(arrow) > library(dplyr) > library(stringr) > # get animal products, year 20919 > open_dataset( > "../cepii-datasets-arrow/parquet/baci_hs92", > partitioning = c("year", "reporter_iso") > ) %>% > filter( > year == 2019, > str_sub(product_code, 1, 2) == "01" > ) %>% > collect() > Error: Filter expression not supported for Arrow Datasets: > str_sub(product_code, 1, 2) == "01" > Call collect() first to pull data into R. > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)