[ https://issues.apache.org/jira/browse/ARROW-12992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mauricio 'PachĂĄ' Vargas SepĂșlveda updated ARROW-12992: ------------------------------------------------------ Description: Followup to ARROW-10557, which implemented the C++ current state: {code:r} library(arrow) library(dplyr) library(stringr) # get animal products, year 20919 open_dataset( "../cepii-datasets-arrow/parquet/baci_hs92", partitioning = c("year", "reporter_iso") ) %>% filter( year == 2019, str_sub(product_code, 1, 2) == "01" ) %>% collect() Error: Filter expression not supported for Arrow Datasets: str_sub(product_code, 1, 2) == "01" Call collect() first to pull data into R. {code} was:Followup to ARROW-10557, which implemented the C++ > [R] bindings for substr > ----------------------- > > Key: ARROW-12992 > URL: https://issues.apache.org/jira/browse/ARROW-12992 > Project: Apache Arrow > Issue Type: New Feature > Components: R > Reporter: Neal Richardson > Priority: Major > Fix For: 5.0.0 > > > Followup to ARROW-10557, which implemented the C++ > current state: > {code:r} > library(arrow) > library(dplyr) > library(stringr) > # get animal products, year 20919 > open_dataset( > "../cepii-datasets-arrow/parquet/baci_hs92", > partitioning = c("year", "reporter_iso") > ) %>% > filter( > year == 2019, > str_sub(product_code, 1, 2) == "01" > ) %>% > collect() > Error: Filter expression not supported for Arrow Datasets: > str_sub(product_code, 1, 2) == "01" > Call collect() first to pull data into R. > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)