[jira] [Resolved] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme resolved ARROW-15072.
    Resolution: Fixed

> [R] Error: This build of the arrow package does not support Datasets
>
>                 Key: ARROW-15072
>                 URL: https://issues.apache.org/jira/browse/ARROW-15072
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Parquet, R
>    Affects Versions: 6.0.1
>         Environment: x86_64-pc-linux-gnu (64-bit) via rocker/docker
>                      rocker/r-base:4.1.2
>            Reporter: hu geme
>            Priority: Minor
>             Fix For: 6.0.1
>
>
> Hello,
> I would like to report a possible issue (or I did not grasp the documentation
> and I apologize in advance).
> I'm trying to use R with arrow on docker in {*}order to read parquet files
> from s3{*}:
>
> {code:java}
> FROM rocker/r-base:4.1.2
> # TO READ FROM S3
> RUN apt update -qq \
>     && apt install -t unstable -y --no-install-recommends \
>     libcurl4-openssl-dev
> ENV LIBARROW_MINIMAL false
> RUN apt update && \
>     apt install -y -V ca-certificates lsb-release wget && \
>     wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
>     apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
> RUN apt update && \
>     apt install -y -V -f \
>     libarrow-dev \
>     libarrow-dataset-dev \
>     libarrow-glib-dev \
>     libarrow-flight-dev \
>     libparquet-dev \
>     libparquet-glib-dev
> RUN install2.r --error \
>     arrow
> {code}
> That's the output of sessionInfo from the container running R:
>
> {code:java}
> sessionInfo()
> R version 4.1.2 (2021-11-01)
> Platform: x86_64-pc-linux-gnu (64-bit)
> Running under: Debian GNU/Linux 11 (bullseye)
>
> Matrix products: default
> BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=en_US.UTF-8
>  [9] LC_ADDRESS=en_US.UTF-8     LC_TELEPHONE=en_US.UTF-8
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] arrow_6.0.1 DBI_1.1.1
>
> loaded via a namespace (and not attached):
>  [1] tidyselect_1.1.1 bit_4.0.4          compiler_4.1.2 magrittr_2.0.1
>  [5] assertthat_0.2.1 R6_2.5.1           tools_4.1.2    glue_1.5.1
>  [9] bit64_4.0.5      vctrs_0.3.8        RJDBC_0.2-8    rlang_0.4.12
> [13] rJava_1.0-5      AWR.Athena_2.0.7-0 purrr_0.3.4
> {code}
> And as far as I understand, all requirements are fulfilled to use datasets:
> R version 4.1.2
> Platform: x86_64-pc-linux-gnu (64-bit)
> arrow_6.0.1
>
> {code:java}
> > .Machine$sizeof.pointer < 8
> [1] FALSE
> > getRversion() < "4.0.0"
> [1] FALSE
> > tolower(Sys.info()[["sysname"]]) == "windows"
> [1] FALSE
> >
> {code}
> Nevertheless I get
> Error: This build of the arrow package does not support Datasets
> in return when running
> {code:java}
> arrow::open_dataset(sources = path)
> {code}
> Appreciate any help!

--
This message was sent by Atlassian Jira (v8.20.1#820001)
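The wget URL in the Dockerfile of the report is assembled from two lsb_release calls, and the mail wrapping makes it hard to read. The following is only a sketch of how that URL is put together. The function name arrow_apt_source_url is hypothetical, the distribution id and codename are passed in by hand instead of coming from lsb_release, and "Debian" / "bullseye" are taken from the sessionInfo output as an example. Whether the resulting URL is the right package source for a given image is not checked here.

```shell
#!/bin/sh
# Hypothetical helper (not part of arrow or rocker): build the apt-source
# package URL in the same way as the wget line of the Dockerfile.
#   $1 = distribution id, e.g. the output of `lsb_release --id --short`
#   $2 = codename, e.g. the output of `lsb_release --codename --short`
arrow_apt_source_url() {
  # Lower-case the distribution id, as the Dockerfile does with tr.
  distro=$(printf '%s' "$1" | tr 'A-Z' 'a-z')
  printf 'https://apache.jfrog.io/artifactory/arrow/%s/apache-arrow-apt-source-latest-%s.deb\n' "$distro" "$2"
}

# Example values assumed from the sessionInfo output (Debian 11, bullseye).
url=$(arrow_apt_source_url Debian bullseye)
echo "$url"
```

Printing the URL before running wget makes a broken or unexpected value visible in the build log.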
[jira] [Comment Edited] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17459457#comment-17459457 ]

hu geme edited comment on ARROW-15072 at 12/14/21, 8:18 PM:

Hi [~willjones127],

thx a lot for putting your time into it.

Just a small feedback. Running your dockerfile on MAC OSX 11.6.1 results in

#7 451.8 make[2]: *** [CMakeFiles/awssdk_ep.dir/build.make:132: awssdk_ep-prefix/src/awssdk_ep-stamp/awssdk_ep-build] Error 1
#7 451.8 make[1]: *** [CMakeFiles/Makefile2:760: CMakeFiles/awssdk_ep.dir/all] Error 2
#7 451.8 gmake: *** [Makefile:160: all] Error 2
#7 451.8 Error building Arrow C++.
#7 457.3 - NOTE ---
#7 457.3 There was an issue preparing the Arrow C++ libraries.
#7 457.3 See [https://arrow.apache.org/docs/r/articles/install.html]
#7 457.3 -
#7 457.5 ERROR: configuration failed for package ‘arrow’
#7 457.6 * removing ‘/usr/local/lib/R/site-library/arrow’
#7 458.2
#7 458.2 The downloaded source packages are in
#7 458.2 ‘/tmp/downloaded_packages’
#7 458.2 Error: installation of package ‘arrow’ had non-zero exit status
#7 458.2 In addition: Warning message:
#7 458.2 In install.packages(pkgs, ...) :
#7 458.2 installation of package ‘arrow’ had non-zero exit status
#7 ERROR: executor failed running [/bin/sh -c install2.r --error arrow]: exit code: 1

However, the same runs without any issues on Ubuntu 18.04.6 LTS (GNU/Linux 5.4.0-1053-gcp x86_64).

After modifying the dockerfile as follows it seems to work on MAC OSX 11.6.1, though I do not know why :)

{code:java}
FROM rocker/r-base:4.1.2
# TO READ FROM S3
RUN apt update -qq \
    && apt install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev
ENV LIBARROW_MINIMAL false
ENV ARROW_DEV true
ENV LIBARROW_BINARY true
RUN R -e "install.packages('arrow', type = 'source')"
{code}

thx a lot! I will close that story, and my apologies for categorising it as a bug.
[jira] [Commented] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458786#comment-17458786 ]

hu geme commented on ARROW-15072:

{code:java}
> arrow_info()
Arrow package version: 6.0.1

Capabilities:
dataset    FALSE
parquet    FALSE
json       FALSE
s3         FALSE
utf8proc   TRUE
re2        TRUE
snappy     TRUE
gzip       TRUE
brotli     TRUE
zstd       TRUE
lz4        TRUE
lz4_frame  TRUE
lzo        FALSE
bz2        TRUE
jemalloc   TRUE
mimalloc   TRUE

To reinstall with more optional capabilities enabled, see
https://arrow.apache.org/docs/r/articles/install.html

Memory:
Allocator  jemalloc
Current    0 bytes
Max        0 bytes

Runtime:
SIMD Level           avx2
Detected SIMD Level  avx2

Build:
C++ Library Version  6.0.1
C++ Compiler         GNU
C++ Compiler Version 10.2.1
{code}

Is it safe to assume that LIBARROW_MINIMAL false should result in dataset TRUE, parquet TRUE, s3 TRUE? That is the whole point of using this awesome package.
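The capability list in the arrow_info() output above is what decides whether open_dataset() can work, so it can help to check it mechanically during an image build. What follows is a rough sketch, not anything shipped with arrow: missing_capabilities is a hypothetical helper, it assumes the output has one "name TRUE|FALSE" pair per line, and the sample input is a subset of the values reported in this comment.

```shell
#!/bin/sh
# Hypothetical helper (not part of arrow): read capability lines on stdin
# and print the names whose value is FALSE, one per line.
# Assumes each capability appears as "name TRUE" or "name FALSE" on its own line.
missing_capabilities() {
  awk '$2 == "FALSE" { print $1 }'
}

# Sample input: a subset of the capability list reported in this thread.
sample='dataset FALSE
parquet FALSE
json FALSE
s3 FALSE
utf8proc TRUE
snappy TRUE
lzo FALSE'

# Join the names with spaces for a one-line report.
missing=$(printf '%s\n' "$sample" | missing_capabilities | paste -s -d ' ' -)
echo "missing: $missing"
```

In a container one could try feeding it real output, for example from Rscript -e 'arrow::arrow_info()', provided the printed layout matches the assumption above. For the goal stated in the report (reading parquet files from s3), dataset, parquet and s3 are the entries that should not show up as missing.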
[jira] [Commented] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458777#comment-17458777 ]

hu geme commented on ARROW-15072:

Hi [~willjones127], [~jonkeane],

thank you for your comments. I have adjusted the Dockerfile and replaced one of the {{apt install}} commands, which might not have worked on your machine.

I tried your suggestion of adding ENV LIBARROW_MINIMAL false to the Dockerfile, which did not solve the problem of using Datasets. Additionally, I ran install_arrow(binary = FALSE, minimal = FALSE) in the container as well. However, I still was not able to use Datasets there; it resulted in the same error.

I guess I'm missing something here, though it does not seem self-explanatory to me :)

Best & Big Thx
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker in {*}order to read parquet files from s3{*}:

{code:java}
FROM rocker/r-base:4.1.2
# TO READ FROM S3
RUN apt update -qq \
    && apt install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev
ENV LIBARROW_MINIMAL false
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow
{code}
That's the output of sessionInfo from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8     LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats graphics grDevices utils datasets methods base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1 bit_4.0.4          compiler_4.1.2 magrittr_2.0.1
 [5] assertthat_0.2.1 R6_2.5.1           tools_4.1.2    glue_1.5.1
 [9] bit64_4.0.5      vctrs_0.3.8        RJDBC_0.2-8    rlang_0.4.12
[13] rJava_1.0-5      AWR.Athena_2.0.7-0 purrr_0.3.4
{code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
>
{code}
Nevertheless I get
Error: This build of the arrow package does not support Datasets
in return when running
{code:java}
arrow::open_dataset(sources = path)
{code}
Appreciate any help!
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
# TO READ FROM S3
RUN apt update -qq \
    && apt install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev
ENV LIBARROW_MINIMAL=false
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow{code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
# TO READ FROM S3
RUN apt update -qq \
    && apt install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev
ENV LIBARROW_MINIMAL=false
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow{code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
# TO READ FROM S3
RUN apt update -qq \
    && apt install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev
ENV LIBARROW_MINIMAL=false
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow{code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform:
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
    ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
[jira] [Updated] (ARROW-15072) [R] r - arrow on docker
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Summary: [R] r - arrow on docker  (was: [R] Error: This build of the arrow package does not support Datasets)

> [R] r - arrow on docker
>
>                 Key: ARROW-15072
>                 URL: https://issues.apache.org/jira/browse/ARROW-15072
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Parquet, R
>    Affects Versions: 6.0.1
>         Environment: x86_64-pc-linux-gnu (64-bit) via rocker/docker
> rocker/r-base:4.1.2
>            Reporter: hu geme
>            Priority: Minor
>             Fix For: 6.0.1
>
> Hello,
> I would like to report a possible issue (or I did not grasp the documentation
> and I apologize in advance).
> I'm trying to use R with arrow on docker:
>
> {code:java}
> FROM rocker/r-base:4.1.2
> RUN apt update && \
>     apt install -y -V ca-certificates lsb-release wget && \
>     wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
>     apt-get -y --no-install-recommends install \
>     ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
> RUN apt update && \
>     apt install -y -V -f \
>     libarrow-dev \
>     libarrow-dataset-dev \
>     libarrow-glib-dev \
>     libarrow-flight-dev \
>     libparquet-dev \
>     libparquet-glib-dev
> RUN install2.r --error \
>     arrow {code}
> This is the output of sessionInfo() from the container running R:
>
> {code:java}
> sessionInfo()
> R version 4.1.2 (2021-11-01)
> Platform: x86_64-pc-linux-gnu (64-bit)
> Running under: Debian GNU/Linux 11 (bullseye)
>
> Matrix products: default
> BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
>  [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
> [11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8
>
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
> [1] arrow_6.0.1 DBI_1.1.1
>
> loaded via a namespace (and not attached):
>  [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
>  [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
>  [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
> [13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
> And as far as I understand, all requirements are fulfilled to use datasets:
> R version 4.1.2
> Platform: x86_64-pc-linux-gnu (64-bit)
> arrow_6.0.1
>
> {code:java}
> > .Machine$sizeof.pointer < 8
> [1] FALSE
> > getRversion() < "4.0.0"
> [1] FALSE
> > tolower(Sys.info()[["sysname"]]) == "windows"
> [1] FALSE
> > {code}
> Nevertheless, I get
> Error: This build of the arrow package does not support Datasets
> in return when I run
> {code:java}
> arrow::open_dataset(sources = path) {code}
> Appreciate any help!

--
This message was sent by Atlassian Jira
(v8.20.1#820001)
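For anyone trying to reproduce the report, the following is a minimal sketch of a Dockerfile that asks for a full-featured build of the arrow R package and then fails the image build if Dataset support is missing. It is not the confirmed resolution of this ticket. The assumptions are: the extra `libssl-dev` package, the choice to let the R package build its bundled C++ library instead of using the apt `libarrow-*` packages, and that the base image has the compilers that build needs. `LIBARROW_MINIMAL` is the environment variable the arrow R package reads at install time, and `arrow_with_dataset()` and `arrow_info()` are functions exported by the package.

```dockerfile
# Sketch only, under the assumptions stated above; not the reporter's confirmed fix.
FROM rocker/r-base:4.1.2

# Headers needed when the bundled Arrow C++ library is built with S3 support.
# libssl-dev is an assumption; the report itself only installs libcurl4-openssl-dev.
RUN apt-get update -qq && \
    apt-get install -t unstable -y --no-install-recommends \
    libcurl4-openssl-dev \
    libssl-dev

# Request a full-featured (non-minimal) libarrow when the R package is installed.
ENV LIBARROW_MINIMAL=false

RUN install2.r --error arrow

# Print the capabilities of the installed build into the build log.
RUN Rscript -e 'arrow::arrow_info()'

# Stop the image build here if the package was built without Dataset support.
RUN Rscript -e 'stopifnot(arrow::arrow_with_dataset())'
```

Checking the capability at build time surfaces a feature-poor build immediately, rather than at the first `arrow::open_dataset()` call inside a running container.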
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Summary: [R] Error: This build of the arrow package does not support Datasets  (was: [R] r - arrow on docker )
[jira] [Updated] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Summary: [R] Error: This build of the arrow package does not support Datasets  (was: Error: This build of the arrow package does not support Datasets)
[jira] [Updated] (ARROW-15072) Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Description:
Hello,
I would like to report a possible issue (or I did not grasp the documentation and I apologize in advance).
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
    ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
> {code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
in return when I run
{code:java}
arrow::open_dataset(sources = path) {code}
Appreciate any help!

  was:
Hello,
I would like to report a possible issue.
I'm trying to use R with arrow on docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
    ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow {code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
And as far as I understand, all requirements are fulfilled to use datasets:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
[jira] [Updated] (ARROW-15072) Error: This build of the arrow package does not support Datasets
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hu geme updated ARROW-15072:
    Priority: Minor  (was: Trivial)

> Error: This build of the arrow package does not support Datasets
>
>                 Key: ARROW-15072
>                 URL: https://issues.apache.org/jira/browse/ARROW-15072
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Parquet, R
>    Affects Versions: 6.0.1
>         Environment: x86_64-pc-linux-gnu (64-bit) via rocker/docker
> rocker/r-base:4.1.2
>            Reporter: hu geme
>            Priority: Minor
>             Fix For: 6.0.1
>
> Hello,
> I would like to report a possible issue.
> I'm trying to use R with arrow on docker:
>
> {code:java}
> FROM rocker/r-base:4.1.2
> RUN apt update && \
>     apt install -y -V ca-certificates lsb-release wget && \
>     wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
>     apt-get -y --no-install-recommends install \
>     ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
> RUN apt update && \
>     apt install -y -V -f \
>     libarrow-dev \
>     libarrow-dataset-dev \
>     libarrow-glib-dev \
>     libarrow-flight-dev \
>     libparquet-dev \
>     libparquet-glib-dev
> RUN install2.r --error \
>     arrow {code}
> This is the output of sessionInfo() from the container running R:
>
> {code:java}
> sessionInfo()
> R version 4.1.2 (2021-11-01)
> Platform: x86_64-pc-linux-gnu (64-bit)
> Running under: Debian GNU/Linux 11 (bullseye)
>
> Matrix products: default
> BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8           LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8
>  [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8
> [11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8
>
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
> [1] arrow_6.0.1 DBI_1.1.1
>
> loaded via a namespace (and not attached):
>  [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
>  [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
>  [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
> [13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4 {code}
> And as far as I understand, all requirements are fulfilled to use datasets:
> R version 4.1.2
> Platform: x86_64-pc-linux-gnu (64-bit)
> arrow_6.0.1
>
> {code:java}
> > .Machine$sizeof.pointer < 8
> [1] FALSE
> > getRversion() < "4.0.0"
> [1] FALSE
> > tolower(Sys.info()[["sysname"]]) == "windows"
> [1] FALSE
> > {code}
> Nevertheless, I get
> Error: This build of the arrow package does not support Datasets
> in return when I run
> {code:java}
> arrow::open_dataset(sources = path) {code}
> Appreciate any help!
[jira] [Created] (ARROW-15072) Error: This build of the arrow package does not support Datasets
hu geme created ARROW-15072:
---

             Summary: Error: This build of the arrow package does not support Datasets
                 Key: ARROW-15072
                 URL: https://issues.apache.org/jira/browse/ARROW-15072
             Project: Apache Arrow
          Issue Type: Bug
          Components: Parquet, R
    Affects Versions: 6.0.1
         Environment: x86_64-pc-linux-gnu (64-bit) via rocker/docker
                      rocker/r-base:4.1.2
            Reporter: hu geme
             Fix For: 6.0.1

Hello,
I would like to report a possible issue.
I'm trying to use R with arrow on Docker:

{code:java}
FROM rocker/r-base:4.1.2
RUN apt update && \
    apt install -y -V ca-certificates lsb-release wget && \
    wget "https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb" && \
    apt-get -y --no-install-recommends install \
    ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
RUN apt update && \
    apt install -y -V -f \
    libarrow-dev \
    libarrow-dataset-dev \
    libarrow-glib-dev \
    libarrow-flight-dev \
    libparquet-dev \
    libparquet-glib-dev
RUN install2.r --error \
    arrow
{code}
This is the output of sessionInfo() from the container running R:

{code:java}
sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 11 (bullseye)

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.18.so

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=en_US.UTF-8
 [9] LC_ADDRESS=en_US.UTF-8     LC_TELEPHONE=en_US.UTF-8
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] arrow_6.0.1 DBI_1.1.1

loaded via a namespace (and not attached):
 [1] tidyselect_1.1.1   bit_4.0.4          compiler_4.1.2     magrittr_2.0.1
 [5] assertthat_0.2.1   R6_2.5.1           tools_4.1.2        glue_1.5.1
 [9] bit64_4.0.5        vctrs_0.3.8        RJDBC_0.2-8        rlang_0.4.12
[13] rJava_1.0-5        AWR.Athena_2.0.7-0 purrr_0.3.4
{code}
As far as I understand, all requirements for using datasets are fulfilled:
R version 4.1.2
Platform: x86_64-pc-linux-gnu (64-bit)
arrow_6.0.1

{code:java}
> .Machine$sizeof.pointer < 8
[1] FALSE
> getRversion() < "4.0.0"
[1] FALSE
> tolower(Sys.info()[["sysname"]]) == "windows"
[1] FALSE
{code}
Nevertheless, I get
Error: This build of the arrow package does not support Datasets
when running
{code:java}
arrow::open_dataset(sources = path)
{code}
Appreciate any help!

--
This message was sent by Atlassian Jira (v8.20.1#820001)