Re: [R] no data when reading directory of csvs

Neal Richardson Thu, 03 Sep 2020 11:56:25 -0700

Hi Gary,
I'm not sure what you mean by "default install" (though I am curious), but
CSV dataset support was added in 1.0.0 (
https://arrow.apache.org/docs/r/news/index.html#datasets), so it is
expected that format = "csv" would not work on 0.16.

As for why you're getting no data, you may want to upgrade to 1.0.1, which
was released last week. There were a couple of path-related issues that
were fixed, and it's possible that one of those is causing what you're
experiencing. If upgrading doesn't solve it, please open a JIRA and provide
some more specifics about the files, and we can try to get to the bottom of
it.

Neal

On Thu, Sep 3, 2020 at 11:47 AM Gary Clark <gclark...@gmail.com> wrote:

> Hi,
>
> I'm not sure if this is a bug or not, so I wanted to ask first and see if
> im doing something wrong. I am trying to use open_dataset to read a
> directory full of csv files. The code is:
>
> ```
> path = './some_dir/'
> dfs = open_dataset(path, format = 'csv')
> ```
> sessionInfo():
>
> R version 3.6.3 (2020-02-29)
> Platform: x86_64-pc-linux-gnu (64-bit)
> Running under: Ubuntu 18.04.4 LTS
>
>
> and my version of arrow: arrow_1.0.0
>
>
> The default install of arrow gave me arrow_0.16.0 so I had to manually 
> upgrade to 1.0.0. Additionally, the default version DID NOT accept *csv* as 
> an argument to format. After upgrading it seems to accept it now, however, 
> when I 'collect()' the data and perform an operation like 'count()' it 
> returns nothing, i.e an empty tibble. Am I missing something?
>
>
> Thanks for the help.
>
>
> --
> Gary Clark
> *Data Scientist & Data Engineer*
> *B.S. Mechanical Engineering, Howard University '13*
> +1 (717) 798-6916
> gclark...@gmail.com
>

Re: [R] no data when reading directory of csvs

Reply via email to