mrverhoeven commented on issue #29896:
URL: https://github.com/apache/arrow/issues/29896#issuecomment-2721999332
Still reproducible on 17.0.0.1.
A useful workaround for anyone who finds themselves here is to either mutate
that column type or specify it in the filter call:
```
open_dataset(tf, schema = tf_reg) %>%
+ mutate(stations = as.character(stations)) %>%
+ filter(stations == "41") %>%
+ collect()
# A tibble: 12 × 5
lat long depth mag stations
<dbl> <dbl> <int> <dbl> <chr>
1 -20.4 182. 562 4.8 41
2 -23.4 180 553 4.7 41
3 -25.7 180. 434 4.6 41
4 -32.2 180. 422 4.6 41
5 -18.1 182. 618 4.6 41
6 -25.0 180. 470 4.8 41
7 -34.4 180. 201 4.4 41
8 -34.1 182. 75 4.7 41
9 -23.5 180. 539 5 41
10 -16.0 186. 143 4.6 41
11 -19.0 169. 242 5 41
12 -13.8 167. 68 4.7 41
open_dataset(tf, schema = tf_reg) %>%
+ filter(as.character(stations) == "41") %>%
+ collect()
# A tibble: 12 × 5
lat long depth mag stations
<dbl> <dbl> <int> <dbl> <chr>
1 -20.4 182. 562 4.8 41
2 -23.4 180 553 4.7 41
3 -25.7 180. 434 4.6 41
4 -32.2 180. 422 4.6 41
5 -18.1 182. 618 4.6 41
6 -25.0 180. 470 4.8 41
7 -34.4 180. 201 4.4 41
8 -34.1 182. 75 4.7 41
9 -23.5 180. 539 5 41
10 -16.0 186. 143 4.6 41
11 -19.0 169. 242 5 41
12 -13.8 167. 68 4.7 41
>
> packageVersion("arrow")
[1] ‘17.0.0.1’
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]