[GitHub] [arrow] letmaik opened a new issue, #14998: Support for writing binary column in stream writer in Parquet

2022-12-16 Thread GitBox
letmaik opened a new issue, #14998: URL: https://github.com/apache/arrow/issues/14998 ### Describe the enhancement requested There is no `operator<<` overload in Parquet's `stream_writer.h` that accepts `std::vector` or similar such that data is written with converted type "none". It

[GitHub] [arrow] raulcd opened a new issue, #14999: [Release][Archery] Update archery release changelog to support GitHub issues

2022-12-16 Thread GitBox
raulcd opened a new issue, #14999: URL: https://github.com/apache/arrow/issues/14999 ### Describe the enhancement requested The `archery release changelog` command is currently used on a couple of steps during the release. - https://github.com/apache/arrow/blob/master/dev/release/

[GitHub] [arrow] raulcd opened a new issue, #15000: [Release] Changelog.md on master branch has not been updated since 6.0.1

2022-12-16 Thread GitBox
raulcd opened a new issue, #15000: URL: https://github.com/apache/arrow/issues/15000 ### Describe the bug, including details regarding any error messages, version, and platform. While investigating https://github.com/apache/arrow/issues/14999 I've realised we never merge from the mai

[GitHub] [arrow] paleolimbot opened a new issue, #15001: [R] Map/list equality tests are failing only on the sanitizer nightlies

2022-12-16 Thread GitBox
paleolimbot opened a new issue, #15001: URL: https://github.com/apache/arrow/issues/15001 ### Describe the bug, including details regarding any error messages, version, and platform. I don't know why this is happening only on the sanitizer nightlies ([test-ubuntu-18.04-r-sanitizer](h

[GitHub] [arrow] raulcd opened a new issue, #15002: [Release][Archery] Update archery release cherry-pick to support GitHub issues

2022-12-16 Thread GitBox
raulcd opened a new issue, #15002: URL: https://github.com/apache/arrow/issues/15002 ### Describe the enhancement requested We use the `archery release cherry-pick` command during the Release in order to update the maintenance branch once it has been created in order to pick the indi

[GitHub] [arrow-adbc] lidavidm closed issue #243: [Python] Allow passing RecordBatch/Table of parameters in DBAPI-style API

2022-12-16 Thread GitBox
lidavidm closed issue #243: [Python] Allow passing RecordBatch/Table of parameters in DBAPI-style API URL: https://github.com/apache/arrow-adbc/issues/243 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [arrow] letmaik opened a new issue, #15004: Provide a `BinaryBuilder::AppendValues(const std::vector>&)` overload

2022-12-16 Thread GitBox
letmaik opened a new issue, #15004: URL: https://github.com/apache/arrow/issues/15004 ### Describe the enhancement requested In addition to the string vector overloads it would be great to support `std::vector` in `BinaryBuilder` for binary data. https://github.com/apache/arro

[GitHub] [arrow] zeroshade closed issue #14995: [Go] flightsql.SqlInfoSupportedUnions has incorrect value

2022-12-16 Thread GitBox
zeroshade closed issue #14995: [Go] flightsql.SqlInfoSupportedUnions has incorrect value URL: https://github.com/apache/arrow/issues/14995 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

[GitHub] [arrow] zeroshade opened a new issue, #15005: [Go] Add ability to Append Scalars to Array Builders

2022-12-16 Thread GitBox
zeroshade opened a new issue, #15005: URL: https://github.com/apache/arrow/issues/15005 ### Describe the enhancement requested As the compute package is continued to be built out, there will be a need to be able to append scalars to Array builders when handling a group of literals th

[GitHub] [arrow] kou opened a new issue, #15007: [CI][RPM] Key import failed

2022-12-16 Thread GitBox
kou opened a new issue, #15007: URL: https://github.com/apache/arrow/issues/15007 ### Describe the bug, including details regarding any error messages, version, and platform. almalinux-8-amd64: https://github.com/ursacomputing/crossbow/actions/runs/3711543280/jobs/6292083831#s

[GitHub] [arrow] mofeiatwork closed issue #12459: Any easy way to convert parquet to JSON and access nested structures?

2022-12-16 Thread GitBox
mofeiatwork closed issue #12459: Any easy way to convert parquet to JSON and access nested structures? URL: https://github.com/apache/arrow/issues/12459 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] eitsupi opened a new issue, #15009: [R] stringr 1.5.0 with str_like function is already released

2022-12-16 Thread GitBox
eitsupi opened a new issue, #15009: URL: https://github.com/apache/arrow/issues/15009 ### Describe the enhancement requested stringr 1.5.0 was released on 2022-12-02. So the following special handling of `str_like` is no longer necessary. https://github.com/apache/arrow/blob

[GitHub] [arrow] kou closed issue #15007: [CI][RPM] Key import failed

2022-12-17 Thread GitBox
kou closed issue #15007: [CI][RPM] Key import failed URL: https://github.com/apache/arrow/issues/15007 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: i

[GitHub] [arrow] kou opened a new issue, #15012: [Packaging][deb] Debian GNU/Linux bookworm provides Protobuf 3.21.11

2022-12-17 Thread GitBox
kou opened a new issue, #15012: URL: https://github.com/apache/arrow/issues/15012 ### Describe the enhancement requested https://packages.debian.org/search?keywords=libprotobuf-dev Flight requires Protobuf 3.15.0 or later. We can use system Protobuf for Flight now. We ne

[GitHub] [arrow] kou closed issue #15012: [Packaging][deb] Debian GNU/Linux bookworm provides Protobuf 3.21.11

2022-12-17 Thread GitBox
kou closed issue #15012: [Packaging][deb] Debian GNU/Linux bookworm provides Protobuf 3.21.11 URL: https://github.com/apache/arrow/issues/15012 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[GitHub] [arrow] h-vetinari opened a new issue, #15015: BUG: plasma tests failing in conda builds on linux

2022-12-17 Thread GitBox
h-vetinari opened a new issue, #15015: URL: https://github.com/apache/arrow/issues/15015 ### Describe the bug, including details regarding any error messages, version, and platform. #14832 switched on the test suite within the conda-builds, which itself synced with https://github.com

[GitHub] [arrow] h-vetinari opened a new issue, #15016: BUG: test_get_include failing in conda builds on unix

2022-12-17 Thread GitBox
h-vetinari opened a new issue, #15016: URL: https://github.com/apache/arrow/issues/15016 ### Describe the bug, including details regarding any error messages, version, and platform. #14832 switched on the test suite within the conda-builds, which itself synced with https://github.com

[GitHub] [arrow] h-vetinari opened a new issue, #15017: BUG: two failing tests related to ARROW_DEFAULT_MEMORY_POOL in conda builds on unix

2022-12-17 Thread GitBox
h-vetinari opened a new issue, #15017: URL: https://github.com/apache/arrow/issues/15017 ### Describe the bug, including details regarding any error messages, version, and platform. #14832 switched on the test suite within the conda-builds, which itself synced with https://github.com

[GitHub] [arrow] h-vetinari opened a new issue, #15018: BUG: segmentation faults in the presence of `sparse` optional dependency (within conda builds)

2022-12-17 Thread GitBox
h-vetinari opened a new issue, #15018: URL: https://github.com/apache/arrow/issues/15018 ### Describe the bug, including details regarding any error messages, version, and platform. #14832 switched on the test suite within the conda-builds, which itself synced with https://github.com

[GitHub] [arrow] h-vetinari opened a new issue, #15019: BUG: `libhdfs3` not getting picked up correctly when using conda

2022-12-17 Thread GitBox
h-vetinari opened a new issue, #15019: URL: https://github.com/apache/arrow/issues/15019 ### Describe the bug, including details regarding any error messages, version, and platform. #14832 switched on the test suite within the conda-builds, which itself synced with https://github.com

[GitHub] [arrow] thisisnic closed issue #15009: [R] stringr 1.5.0 with the str_like function is already released

2022-12-17 Thread GitBox
thisisnic closed issue #15009: [R] stringr 1.5.0 with the str_like function is already released URL: https://github.com/apache/arrow/issues/15009 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] Alias4D opened a new issue, #15020: wheel for aarch64 Linux android python 3.9

2022-12-18 Thread GitBox
Alias4D opened a new issue, #15020: URL: https://github.com/apache/arrow/issues/15020 I can't pip install pyarrow on Android aarch64 using Pydroid app? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] kou closed issue #14992: [Packaging] Make dev/release/binary-task.rb reusable from other project

2022-12-18 Thread GitBox
kou closed issue #14992: [Packaging] Make dev/release/binary-task.rb reusable from other project URL: https://github.com/apache/arrow/issues/14992 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] zzzzwj closed issue #14748: [C++][Parquet] column_reader.HasNext() throws an exception "Access violation executing location"

2022-12-18 Thread GitBox
wj closed issue #14748: [C++][Parquet] column_reader.HasNext() throws an exception "Access violation executing location" URL: https://github.com/apache/arrow/issues/14748 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and us

[GitHub] [arrow] kou opened a new issue, #15023: [CI][Packaging][Java] libgandiva_jni.dylib links to libz3.dylib not libz3.a

2022-12-19 Thread GitBox
kou opened a new issue, #15023: URL: https://github.com/apache/arrow/issues/15023 ### Describe the bug, including details regarding any error messages, version, and platform. https://github.com/ursacomputing/crossbow/actions/runs/3724023719/jobs/6315909543 ```text + archery

[GitHub] [arrow] kou opened a new issue, #15025: [CI][Python] `brew update --preinstall` is failed

2022-12-19 Thread GitBox
kou opened a new issue, #15025: URL: https://github.com/apache/arrow/issues/15025 ### Describe the bug, including details regarding any error messages, version, and platform. https://github.com/apache/arrow/actions/runs/3729475663/jobs/6325425516 ```text ==> Pouring pyt...@3

[GitHub] [arrow] JeromeLovy opened a new issue, #15027: Documented examples for pyarrow "filtering by expression" fail with TypeError

2022-12-19 Thread GitBox
JeromeLovy opened a new issue, #15027: URL: https://github.com/apache/arrow/issues/15027 ### Describe the usage question you have. Please include as many useful details as possible. Hello, I'm trying to perform "filtering by expressions" with pyarrow, by testing the code

[GitHub] [arrow-adbc] lidavidm opened a new issue, #250: [C] Add basic tutorial/examples

2022-12-19 Thread GitBox
lidavidm opened a new issue, #250: URL: https://github.com/apache/arrow-adbc/issues/250 For example, they could be used to test the APT packages in #249: https://github.com/apache/arrow-adbc/pull/249/files#diff-ac2b5aa681dbc4cf797d64277b3fa7b798a29ea53823d251898bc8c6c521876aR136 -- This i

[GitHub] [arrow] eitsupi opened a new issue, #15028: [R] `NOT_CRAN` should be `"true"` instead of `TRUE`

2022-12-19 Thread GitBox
eitsupi opened a new issue, #15028: URL: https://github.com/apache/arrow/issues/15028 ### Describe the bug, including details regarding any error messages, version, and platform. Currently on the installation manual it says the following, https://github.com/apache/arrow/blob/d

[GitHub] [arrow] zeroshade closed issue #14940: [Go][Parquet] writer.WriteBatch silently fail when using encryption

2022-12-19 Thread GitBox
zeroshade closed issue #14940: [Go][Parquet] writer.WriteBatch silently fail when using encryption URL: https://github.com/apache/arrow/issues/14940 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[GitHub] [arrow] zeroshade closed issue #15005: [Go] Add ability to Append Scalars to Array Builders

2022-12-19 Thread GitBox
zeroshade closed issue #15005: [Go] Add ability to Append Scalars to Array Builders URL: https://github.com/apache/arrow/issues/15005 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [arrow] zeroshade closed issue #14865: [Go] pqarrow.WriteArrowToColumn leaks memory from its memory.Allocator

2022-12-19 Thread GitBox
zeroshade closed issue #14865: [Go] pqarrow.WriteArrowToColumn leaks memory from its memory.Allocator URL: https://github.com/apache/arrow/issues/14865 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [arrow] Thomas-Hirsch opened a new issue, #15032: pyarrow won't cast date32 to date64

2022-12-19 Thread GitBox
Thomas-Hirsch opened a new issue, #15032: URL: https://github.com/apache/arrow/issues/15032 ### Describe the bug, including details regarding any error messages, version, and platform. I'm trying to build a tool to confirm that a given parquet file conforms to an expected schema with

[GitHub] [arrow] benzipperer opened a new issue, #15033: [R] feather round-trip support for named vectors in list columns

2022-12-19 Thread GitBox
benzipperer opened a new issue, #15033: URL: https://github.com/apache/arrow/issues/15033 ### Describe the bug, including details regarding any error messages, version, and platform. Are write_feather and read_feather supposed to support named vectors in list columns? Here they drop

[GitHub] [arrow] JeromeLovy closed issue #15027: Documented examples for pyarrow "filtering by expression" fail with TypeError

2022-12-19 Thread GitBox
JeromeLovy closed issue #15027: Documented examples for pyarrow "filtering by expression" fail with TypeError URL: https://github.com/apache/arrow/issues/15027 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] raulcd opened a new issue, #15035: [CI] Remove unsopported turbodbc integration test code

2022-12-19 Thread GitBox
raulcd opened a new issue, #15035: URL: https://github.com/apache/arrow/issues/15035 ### Describe the enhancement requested As discussed on this ML thread: https://lists.apache.org/thread/nqbbrnsdf86j0lj4j1rhbyxgs5o33p5w The turbodbc nightly integration tests have been disabled

[GitHub] [arrow] royassis opened a new issue, #15039: Rename arrow table column name in Arrow javascript

2022-12-19 Thread GitBox
royassis opened a new issue, #15039: URL: https://github.com/apache/arrow/issues/15039 ### Describe the usage question you have. Please include as many useful details as possible. Hey there I'm trying to rename a column name in an Arrow table using JS. is this possible ?

[GitHub] [arrow] assignUser opened a new issue, #15040: [R][CI] Arrow nightly binaries require aws sdk system dependency

2022-12-19 Thread GitBox
assignUser opened a new issue, #15040: URL: https://github.com/apache/arrow/issues/15040 ### Describe the bug, including details regarding any error messages, version, and platform. It seems that https://github.com/apache/arrow/pull/14235 introduced aws-sdk as a system dependency for

[GitHub] [arrow] kou closed issue #15035: [CI] Remove unsupported turbodbc integration test code

2022-12-19 Thread GitBox
kou closed issue #15035: [CI] Remove unsupported turbodbc integration test code URL: https://github.com/apache/arrow/issues/15035 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [arrow] kou closed issue #15025: [CI][C++] "brew update --preinstall" is failed

2022-12-19 Thread GitBox
kou closed issue #15025: [CI][C++] "brew update --preinstall" is failed URL: https://github.com/apache/arrow/issues/15025 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

[GitHub] [arrow] rustyconover opened a new issue, #15042: Column statistics incorrect for Dictionary Column in Parquet

2022-12-19 Thread GitBox
rustyconover opened a new issue, #15042: URL: https://github.com/apache/arrow/issues/15042 ### Describe the bug, including details regarding any error messages, version, and platform. When writing a column that is a `pa.dictionary(pa.int32(), pa.string())` to a Parquet file in

[GitHub] [arrow-adbc] kou opened a new issue, #251: [CI] Run ci/dev/run-rat.sh

2022-12-19 Thread GitBox
kou opened a new issue, #251: URL: https://github.com/apache/arrow-adbc/issues/251 If we don't run RAT in CI, we may get RAT error in release process. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above t

[GitHub] [arrow] amoeba opened a new issue, #15043: Change docstring for `decompressed_size` arg in pyarrow.decompress to reflect implementation

2022-12-19 Thread GitBox
amoeba opened a new issue, #15043: URL: https://github.com/apache/arrow/issues/15043 ### Describe the bug, including details regarding any error messages, version, and platform. The docstring for `decompressed_size` in `pyarrow.decompress` and the underlying `Codec.decompress` states

[GitHub] [arrow] kou closed issue #14943: [Python] pyarrow.get_libraries() returns libraries with wrong order

2022-12-19 Thread GitBox
kou closed issue #14943: [Python] pyarrow.get_libraries() returns libraries with wrong order URL: https://github.com/apache/arrow/issues/14943 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

[GitHub] [arrow-adbc] lidavidm closed issue #103: [deb] Add packages for Debian and Ubuntu

2022-12-19 Thread GitBox
lidavidm closed issue #103: [deb] Add packages for Debian and Ubuntu URL: https://github.com/apache/arrow-adbc/issues/103 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

[GitHub] [arrow-adbc] lidavidm opened a new issue, #252: [CI] Run packaging.yml workflow on PRs

2022-12-19 Thread GitBox
lidavidm opened a new issue, #252: URL: https://github.com/apache/arrow-adbc/issues/252 - Rework uploads so that they all happen once at the end (same as creating the GitHub release). Then we can also remove the PowerShell script used to upload artifacts on Windows. - Re-enable paralle

[GitHub] [arrow-adbc] kou opened a new issue, #253: [CI] Run packaging workflow for each PR

2022-12-19 Thread GitBox
kou opened a new issue, #253: URL: https://github.com/apache/arrow-adbc/issues/253 https://github.com/apache/arrow-adbc/pull/249#issuecomment-1358688665 > So I think we can turn it on, with maybe the following changes: > - Rework uploads so that they all happen once at the end (same

[GitHub] [arrow-adbc] kou closed issue #253: [CI] Run packaging workflow for each PR

2022-12-19 Thread GitBox
kou closed issue #253: [CI] Run packaging workflow for each PR URL: https://github.com/apache/arrow-adbc/issues/253 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscri

[GitHub] [arrow] LinGeLin opened a new issue, #15044: [c++][parquet][8.0.0] The coredump appears when constructed parquetfileReader

2022-12-19 Thread GitBox
LinGeLin opened a new issue, #15044: URL: https://github.com/apache/arrow/issues/15044 ### Describe the bug, including details regarding any error messages, version, and platform. arrow version 8.0.0 Not necessarily, It's been running for a while。 ` do { auto

[GitHub] [arrow] kou closed issue #14993: [CI][Conda] conda-linux-aarch64-* are failing

2022-12-19 Thread GitBox
kou closed issue #14993: [CI][Conda] conda-linux-aarch64-* are failing URL: https://github.com/apache/arrow/issues/14993 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsu

[GitHub] [arrow] h-vetinari opened a new issue, #15045: [CI][conda] don't build pyarrow in r-jobs

2022-12-19 Thread GitBox
h-vetinari opened a new issue, #15045: URL: https://github.com/apache/arrow/issues/15045 ### Describe the enhancement requested This came up in https://github.com/apache/arrow/pull/14832, see [here](https://github.com/apache/arrow/pull/14832#issuecomment-1341069549) and following com

[GitHub] [arrow] jorisvandenbossche opened a new issue, #15047: [Python] Drop pytz for timezone support (default to use datetime.timezone / zoneinfo)

2022-12-20 Thread GitBox
jorisvandenbossche opened a new issue, #15047: URL: https://github.com/apache/arrow/issues/15047 We already made pytz an optional dependency a while ago (ARROW-15580, https://github.com/apache/arrow/pull/12522), so you can now convert arrow timestamp with tz to python without having pytz. I

[GitHub] [arrow] lwhite1 opened a new issue, #15050: [Java] [Docs] Reorganize and update Java tutorial Memory page

2022-12-20 Thread GitBox
lwhite1 opened a new issue, #15050: URL: https://github.com/apache/arrow/issues/15050 ### Describe the enhancement requested Move selected text from the package JavaDoc to the rst/html documentation to avoid duplication and maintenance issues. Update docs to include mention of simpli

[GitHub] [arrow-adbc] lidavidm closed issue #251: [CI] Run ci/dev/run-rat.sh

2022-12-20 Thread GitBox
lidavidm closed issue #251: [CI] Run ci/dev/run-rat.sh URL: https://github.com/apache/arrow-adbc/issues/251 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-ma

[GitHub] [arrow] Anieway opened a new issue, #15052: [C++][Parquet] Getting only 0 when reading DELTA_BINARY_PACKED

2022-12-20 Thread GitBox
Anieway opened a new issue, #15052: URL: https://github.com/apache/arrow/issues/15052 ### Describe the bug, including details regarding any error messages, version, and platform. I have encountered a possible bug where when reading values one at a time with the low level API of parqu

[GitHub] [arrow] mroeschke opened a new issue, #15053: [Python] pyarrow.compute.utf8_center disagrees with str.center when number of needed padding characters is odd

2022-12-20 Thread GitBox
mroeschke opened a new issue, #15053: URL: https://github.com/apache/arrow/issues/15053 ### Describe the bug, including details regarding any error messages, version, and platform. ``` In [1]: import pyarrow as pa; import pyarrow.compute as pc In [2]: pa.__version__ Out[2

[GitHub] [arrow] kou opened a new issue, #15054: [CI][Python] wheel-manylinux2014-* sometimes crashed on pytest exit

2022-12-20 Thread GitBox
kou opened a new issue, #15054: URL: https://github.com/apache/arrow/issues/15054 ### Describe the bug, including details regarding any error messages, version, and platform. wheel-manylinux2014-cp38-amd64 2022-12-20 nighty: https://github.com/ursacomputing/crossbow/actions/ru

[GitHub] [arrow] kou closed issue #15023: [CI][Packaging][Java] libgandiva_jni.dylib links to libz3.dylib not libz3.a

2022-12-20 Thread GitBox
kou closed issue #15023: [CI][Packaging][Java] libgandiva_jni.dylib links to libz3.dylib not libz3.a URL: https://github.com/apache/arrow/issues/15023 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [arrow] lpiep opened a new issue, #15055: Aggregate Functions in R API

2022-12-20 Thread GitBox
lpiep opened a new issue, #15055: URL: https://github.com/apache/arrow/issues/15055 ### Describe the usage question you have. Please include as many useful details as possible. What aggregate functions are available in the R dplyr interface? Is there documentation of what aggreg

[GitHub] [arrow] cgostic opened a new issue, #15056: Deployed/published Rshiny app using arrow (R) to query an AWS-hosted dataset causes intermittent "stack imbalance", "segfault", "memory not mapped

2022-12-20 Thread GitBox
cgostic opened a new issue, #15056: URL: https://github.com/apache/arrow/issues/15056 ### Describe the bug, including details regarding any error messages, version, and platform. **Issue description:** I have an RShiny app that pulls data from a hive-partitioned dataset hosted

[GitHub] [arrow] lidavidm closed issue #15050: [Java] [Docs] Reorganize and update Java tutorial Memory page

2022-12-20 Thread GitBox
lidavidm closed issue #15050: [Java] [Docs] Reorganize and update Java tutorial Memory page URL: https://github.com/apache/arrow/issues/15050 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spec

[GitHub] [arrow-adbc] lidavidm closed issue #252: [CI] Run packaging.yml workflow on PRs

2022-12-20 Thread GitBox
lidavidm closed issue #252: [CI] Run packaging.yml workflow on PRs URL: https://github.com/apache/arrow-adbc/issues/252 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsub

[GitHub] [arrow-cookbook] raulcd opened a new issue, #285: [R] Fix matomo integration for R bookdown

2022-12-21 Thread GitBox
raulcd opened a new issue, #285: URL: https://github.com/apache/arrow-cookbook/issues/285 This PR https://github.com/apache/arrow-cookbook/pull/283 added the matomo integration to the cookbooks. After deploy we can see the matomo script on the different cookbooks and the index page but i

[GitHub] [arrow] Fokko opened a new issue, #15058: Native support for UUID

2022-12-21 Thread GitBox
Fokko opened a new issue, #15058: URL: https://github.com/apache/arrow/issues/15058 ### Describe the enhancement requested In Apache Iceberg we have support for the UUID type. I think it would be nice to also support this in (Py)Arrow natively instead of having an extension. ##

[GitHub] [arrow] westonpace opened a new issue, #15059: [C++] The new scan node should use values from fragment guarantees instead of loading them from disk

2022-12-21 Thread GitBox
westonpace opened a new issue, #15059: URL: https://github.com/apache/arrow/issues/15059 ### Describe the enhancement requested The main reason we need to do this is because the columns are not always going to be on the disk (right now the new scan node fails in this case). It's al

[GitHub] [arrow] assignUser closed issue #15028: [R][Docs] `NOT_CRAN` should be `"true"` instead of `TRUE` in R

2022-12-21 Thread GitBox
assignUser closed issue #15028: [R][Docs] `NOT_CRAN` should be `"true"` instead of `TRUE` in R URL: https://github.com/apache/arrow/issues/15028 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

[GitHub] [arrow] ddemaeyer opened a new issue, #15060: [JS] Missing support for LargeUtf8 datatype

2022-12-21 Thread GitBox
ddemaeyer opened a new issue, #15060: URL: https://github.com/apache/arrow/issues/15060 ### Describe the bug, including details regarding any error messages, version, and platform. Currently there is no support for the large_string or largeutf8 datatype in the javascript libraries.

[GitHub] [arrow] kou closed issue #15019: BUG: `libhdfs3` not getting picked up correctly when using conda

2022-12-21 Thread GitBox
kou closed issue #15019: BUG: `libhdfs3` not getting picked up correctly when using conda URL: https://github.com/apache/arrow/issues/15019 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specif

[GitHub] [arrow] EpsilonPrime opened a new issue, #15062: Remove unused kUnspecified enum value and simplify EnumParser.

2022-12-21 Thread GitBox
EpsilonPrime opened a new issue, #15062: URL: https://github.com/apache/arrow/issues/15062 ### Describe the bug, including details regarding any error messages, version, and platform. In engine/substrait/extension_set.cc EnumParser has special logic to ignore unspecified values which

[GitHub] [arrow] jorisvandenbossche closed issue #15043: [Python] [Docs] Change docstring for `decompressed_size` arg in pyarrow.decompress to reflect implementation

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #15043: [Python] [Docs] Change docstring for `decompressed_size` arg in pyarrow.decompress to reflect implementation URL: https://github.com/apache/arrow/issues/15043 -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] jorisvandenbossche opened a new issue, #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug

2022-12-22 Thread GitBox
jorisvandenbossche opened a new issue, #15064: URL: https://github.com/apache/arrow/issues/15064 ### Describe the bug, including details regarding any error messages, version, and platform. Nightly crossbow tests against dask are failing because of https://github.com/fsspec/filesyste

[GitHub] [arrow] jorisvandenbossche closed issue #14885: [Docs] Make changes to the New Contrib Guide (Jira -> GitHub)

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #14885: [Docs] Make changes to the New Contrib Guide (Jira -> GitHub) URL: https://github.com/apache/arrow/issues/14885 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [arrow] bkowshik opened a new issue, #15066: ModuleNotFoundError: No module named 'cmake'

2022-12-22 Thread GitBox
bkowshik opened a new issue, #15066: URL: https://github.com/apache/arrow/issues/15066 ### Describe the usage question you have. Please include as many useful details as possible. Installing `pyarrow` fails with the following error: - Mac OS Monterey - Chip: Apple M1 Pr

[GitHub] [arrow] jorisvandenbossche closed issue #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug URL: https://github.com/apache/arrow/issues/15064 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [arrow] jorisvandenbossche closed issue #14918: [Docs] Make changes to developers section of the docs (Jira -> GitHub)

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #14918: [Docs] Make changes to developers section of the docs (Jira -> GitHub) URL: https://github.com/apache/arrow/issues/14918 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abov

[GitHub] [arrow] chaubold opened a new issue, #15068: [Python] IPC serialization roundtrip of extension type with storage type pa.null() reconstructs invalid buffers

2022-12-22 Thread GitBox
chaubold opened a new issue, #15068: URL: https://github.com/apache/arrow/issues/15068 ### Describe the bug, including details regarding any error messages, version, and platform. Hi guys, I wanted to create a JIRA ticket, but apparently that is not allowed anymore for the pub

[GitHub] [arrow-adbc] lidavidm closed issue #104: [RPM] Add packages for Red Hat Enterprise Linux based distributions

2022-12-22 Thread GitBox
lidavidm closed issue #104: [RPM] Add packages for Red Hat Enterprise Linux based distributions URL: https://github.com/apache/arrow-adbc/issues/104 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[GitHub] [arrow] mhconradt opened a new issue, #15069: PyArrow Flight DoAction does not return results as available

2022-12-22 Thread GitBox
mhconradt opened a new issue, #15069: URL: https://github.com/apache/arrow/issues/15069 ### Describe the bug, including details regarding any error messages, version, and platform. In short, when using PyArrow Flight server and client, messages are received by the program using the c

[GitHub] [arrow] jorisvandenbossche opened a new issue, #15070: [Python][CI] Remove temporary skips for pandas 2.0 development version

2022-12-22 Thread GitBox
jorisvandenbossche opened a new issue, #15070: URL: https://github.com/apache/arrow/issues/15070 https://github.com/apache/arrow/pull/15048 added some skips for regressions in pandas main. Once those are fixed, we should remove the skips on our side. -- This is an automated message from t

[GitHub] [arrow] akshaysu12 opened a new issue, #15071: Error building Arrow C++ libraries on MacOS Ventura 13.1

2022-12-22 Thread GitBox
akshaysu12 opened a new issue, #15071: URL: https://github.com/apache/arrow/issues/15071 ### Describe the usage question you have. Please include as many useful details as possible. Trying to follow the python development guide to build arrow: https://arrow.apache.org/docs/devel

[GitHub] [arrow] EpsilonPrime opened a new issue, #15072: Separate round functionality into a separate module

2022-12-22 Thread GitBox
EpsilonPrime opened a new issue, #15072: URL: https://github.com/apache/arrow/issues/15072 ### Describe the enhancement requested 40% of scalar_arithmetic.cc is comprised of round related functionality in roughly 1200 of 3000 lines of code. This is a sufficient bulk of logic to sepa

[GitHub] [arrow] mapleFU opened a new issue, #15074: [Parquet][C++] Using int32_t to store page_ordinal

2022-12-22 Thread GitBox
mapleFU opened a new issue, #15074: URL: https://github.com/apache/arrow/issues/15074 ### Describe the enhancement requested When a Page can be well compressed in `PLAIN` format, if the estimate size is much more larger than compressed size, the Page can be very small. And a 512MB ro

[GitHub] [arrow] jorisvandenbossche closed issue #14975: [Python] Add Dataset.sort_by

2022-12-23 Thread GitBox
jorisvandenbossche closed issue #14975: [Python] Add Dataset.sort_by URL: https://github.com/apache/arrow/issues/14975 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubs

[GitHub] [arrow] arteconceito closed issue #14973: [JS] Using tableFromIPC().toString() results in invalid stringified JSON when containing dates

2022-12-23 Thread GitBox
arteconceito closed issue #14973: [JS] Using tableFromIPC().toString() results in invalid stringified JSON when containing dates URL: https://github.com/apache/arrow/issues/14973 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub an

[GitHub] [arrow-julia] jrevels opened a new issue, #367: inappropriately applied `Arrow.NullVector` optimization?

2022-12-23 Thread GitBox
jrevels opened a new issue, #367: URL: https://github.com/apache/arrow-julia/issues/367 It seems like `Union{ZonedDateTime,Missing}` (de)serialization can bork Arrow.jl in some circumstances, presumably to a misapplied `Arrow.NullVector` optimization (or code in that optimization assuming a

[GitHub] [arrow] 0x26res opened a new issue, #15078: add type and mask argument to pa.MapArray.from_array

2022-12-23 Thread GitBox
0x26res opened a new issue, #15078: URL: https://github.com/apache/arrow/issues/15078 ### Describe the enhancement requested I would like to be able to create MapArray, using the from_array function, and: 1. specify missing value (by that I mean the whole map is null), by passing

[GitHub] [arrow-adbc] lidavidm opened a new issue, #259: [Release] Linux packaging does not work in RC builds

2022-12-23 Thread GitBox
lidavidm opened a new issue, #259: URL: https://github.com/apache/arrow-adbc/issues/259 ``` Command failed with status (1): [docker pull ghcr.io/lidavidm/arrow-adbc/pa...] /home/runner/work/arrow-adbc/arrow-adbc/arrow/dev/tasks/linux-packages/package-task.rb:209:in `docker_pull'

[GitHub] [arrow] jrbourbeau opened a new issue, #15079: PyArrow fails to rountrip non-ns datetime64 with pandas 2.0

2022-12-23 Thread GitBox
jrbourbeau opened a new issue, #15079: URL: https://github.com/apache/arrow/issues/15079 ### Describe the bug, including details regarding any error messages, version, and platform. While adding support in Dask for the upcoming `pandas=2.0` release, some of our unit tests began faili

[GitHub] [arrow-julia] jrevels opened a new issue, #368: `ERROR: MethodError: no method matching zero(::Type{Union{Nothing, String}})` when trying to serialize `Union{Nothing,String}` fields

2022-12-23 Thread GitBox
jrevels opened a new issue, #368: URL: https://github.com/apache/arrow-julia/issues/368 MWE: ```jl julia> using Arrow julia> t = (; x = Union{Missing,NamedTuple{(:k,),Tuple{Union{Nothing,String[missing, (; k = "k")]) (x = Union{Missing, NamedTuple{(:k,), Tuple{Union

[GitHub] [arrow] kou opened a new issue, #15080: [CI][R] Binary package job for R 4.1 on Windows is failed with purrr 1.0.0

2022-12-23 Thread GitBox
kou opened a new issue, #15080: URL: https://github.com/apache/arrow/issues/15080 ### Describe the bug, including details regarding any error messages, version, and platform. With purrr 0.3.5 (success): https://github.com/ursacomputing/crossbow/actions/runs/3756216656/jobs/6382536442

[GitHub] [arrow] kou opened a new issue, #15081: [Release] Make dev/release/05-binary-upload.sh more reusable from other project

2022-12-23 Thread GitBox
kou opened a new issue, #15081: URL: https://github.com/apache/arrow/issues/15081 ### Describe the enhancement requested The current `dev/release/05-binary-upload.sh` can be used other project. For example, https://github.com/datafusion-contrib/datafusion-c/ uses it. But the cu

[GitHub] [arrow] kou closed issue #15081: [Release] Make dev/release/05-binary-upload.sh more reusable from other project

2022-12-23 Thread GitBox
kou closed issue #15081: [Release] Make dev/release/05-binary-upload.sh more reusable from other project URL: https://github.com/apache/arrow/issues/15081 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [arrow] kou closed issue #14968: [Python] `write_dataset(table, format='orc')`: segfault

2022-12-23 Thread GitBox
kou closed issue #14968: [Python] `write_dataset(table, format='orc')`: segfault URL: https://github.com/apache/arrow/issues/14968 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [arrow] heronshoes opened a new issue, #15084: [Ruby] Use common keys when keys.nil? in Table#join

2022-12-23 Thread GitBox
heronshoes opened a new issue, #15084: URL: https://github.com/apache/arrow/issues/15084 ### Describe the enhancement requested Target method Arrow::Table#join Target argument keys Proposed feature Implicit common key(s) will used when

[GitHub] [arrow] heronshoes opened a new issue, #15085: Add a method to return column names in Arrow::Table

2022-12-23 Thread GitBox
heronshoes opened a new issue, #15085: URL: https://github.com/apache/arrow/issues/15085 ### Describe the enhancement requested Target method Arrow::Table#keys or #names (tentative) Proposed feature Return an Array `columns(&name)`. Impact of

[GitHub] [arrow] lpiep closed issue #15055: Aggregate Functions in R API

2022-12-23 Thread GitBox
lpiep closed issue #15055: Aggregate Functions in R API URL: https://github.com/apache/arrow/issues/15055 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail

[GitHub] [arrow-adbc] lidavidm opened a new issue, #263: [Release] 02-source.sh should use GPG key specified in .env as well

2022-12-23 Thread GitBox
lidavidm opened a new issue, #263: URL: https://github.com/apache/arrow-adbc/issues/263 Also, it might be nice to have a single sourceable script to set up all the common utilities (.env, printing headers) -- This is an automated message from the Apache Git Service. To respond to the mess

[GitHub] [arrow-adbc] lidavidm closed issue #259: [Release] Linux packaging does not work in RC builds

2022-12-23 Thread GitBox
lidavidm closed issue #259: [Release] Linux packaging does not work in RC builds URL: https://github.com/apache/arrow-adbc/issues/259 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

[GitHub] [arrow-adbc] lidavidm closed issue #263: [Release] 02-source.sh should use GPG key specified in .env as well

2022-12-23 Thread GitBox
lidavidm closed issue #263: [Release] 02-source.sh should use GPG key specified in .env as well URL: https://github.com/apache/arrow-adbc/issues/263 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[GitHub] [arrow-adbc] lidavidm opened a new issue, #267: [CI] "All keyword arguments must be after positional arguments"

2022-12-24 Thread GitBox
lidavidm opened a new issue, #267: URL: https://github.com/apache/arrow-adbc/issues/267 ``` + meson setup build --buildtype=debugoptimized --default-library=both --libdir=/usr/lib64 --prefix=/usr -Dadbc_build_dir=/root/rpmbuild/BUILD/apache-arrow-adbc-0.1.0/glib/../c/driver_manager/build

<    4   5   6   7   8   9   10   11   12   13   >