[jira] [Created] (ARROW-17294) [Release] Update remove old artifacts release script
Krisztian Szucs created ARROW-17294: --- Summary: [Release] Update remove old artifacts release script Key: ARROW-17294 URL: https://issues.apache.org/jira/browse/ARROW-17294 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 10.0.0 I just executed the remove old artifacts release script which also removed the previously created three patch releases for 6.0.2, 7.0.1, 8.0.1. That's not desirable since those have just been released so I had to revert to an earlier revision. cc [~kou] [~assignUser] [~raulcd] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17260) [Release] Java jars verification pass despite that nothing has been uploaded
Krisztian Szucs created ARROW-17260: --- Summary: [Release] Java jars verification pass despite that nothing has been uploaded Key: ARROW-17260 URL: https://issues.apache.org/jira/browse/ARROW-17260 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Build do pass, despite that I forgot to upload the java binaries: https://github.com/ursacomputing/crossbow/runs/7587084181?check_suite_focus=true cc [~assignUser] [~raulcd] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17238) [Release] Turn off GCS testing during wheel verification
Krisztian Szucs created ARROW-17238: --- Summary: [Release] Turn off GCS testing during wheel verification Key: ARROW-17238 URL: https://issues.apache.org/jira/browse/ARROW-17238 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 9.0.0 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17233) [Crossbow] Outdated artifact patterns for certain linux jobs
Krisztian Szucs created ARROW-17233: --- Summary: [Crossbow] Outdated artifact patterns for certain linux jobs Key: ARROW-17233 URL: https://issues.apache.org/jira/browse/ARROW-17233 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs almalinux-8-arm64 and almalinux-9-arm64: {code} arrow-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-glib-devel-9.0.0-1.el8.aarch64.rpm [ OK] arrow-glib-doc-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-glib-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-glib-libs-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-libs-9.0.0-1.el8.aarch64.rpm [ OK] arrow-python-devel-9.0.0-1.el8.aarch64.rpm [ OK] arrow-python-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-python-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-python-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] {code} centos-7-amd64 {code} arrow-python-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-python-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] {code} centos-8-arm64 and centos-9-arm64: {code} arrow-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-flight-sql-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-flight-sql-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow-glib-devel-9.0.0-1.el8.aarch64.rpm [ OK] arrow-glib-doc-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-glib-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-glib-libs-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK] arrow9-libs-9.0.0-1.el8.aarch64.rpm [ OK] arrow-python-devel-9.0.0-1.el8.aarch64.rpm [ OK] arrow-python-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-python-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] arrow[0-9]+-python-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING] {code} ubuntu-bionic-amd64 / ubuntu-bionic-arm64: {code} libarrow-python-dev_9.0.0-1_[a-z0-9]+.deb [MISSING] libarrow-python-flight-dev_9.0.0-1_[a-z0-9]+.deb [MISSING] libarrow-python-flight900-dbgsym_9.0.0-1_[a-z0-9]+.d?deb [MISSING] libarrow-python-flight900_9.0.0-1_[a-z0-9]+.deb [MISSING]
[jira] [Created] (ARROW-17232) [Release] Missing R binary packages
Krisztian Szucs created ARROW-17232: --- Summary: [Release] Missing R binary packages Key: ARROW-17232 URL: https://issues.apache.org/jira/browse/ARROW-17232 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Seems like the binary upload script now expects some R binaries to upload, but the {{packaging}} crossbow task group doesn't contain any relevant tasks. I assume the {{r-binary-packages}} should be added to the {{packaging}} group. cc [~kou][~raulcd][~assignUser] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17227) [C++] Extend hash-join unit tests to cover both empty and length=0 batches
Krisztian Szucs created ARROW-17227: --- Summary: [C++] Extend hash-join unit tests to cover both empty and length=0 batches Key: ARROW-17227 URL: https://issues.apache.org/jira/browse/ARROW-17227 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Krisztian Szucs Assignee: Weston Pace Fix For: 9.0.0 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-16767) [Archery] Refactor archery.release submodule to its own subpackage
Krisztian Szucs created ARROW-16767: --- Summary: [Archery] Refactor archery.release submodule to its own subpackage Key: ARROW-16767 URL: https://issues.apache.org/jira/browse/ARROW-16767 Project: Apache Arrow Issue Type: Improvement Components: Archery Reporter: Krisztian Szucs Fix For: 9.0.0 -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16654) [Dev][Archery] Support cherry-picking for major releases
Krisztian Szucs created ARROW-16654: --- Summary: [Dev][Archery] Support cherry-picking for major releases Key: ARROW-16654 URL: https://issues.apache.org/jira/browse/ARROW-16654 Project: Apache Arrow Issue Type: New Feature Components: Archery, Developer Tools Reporter: Krisztian Szucs Fix For: 9.0.0 -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16589) [CI][Dev] Make tasks.yml easier to maintain
Krisztian Szucs created ARROW-16589: --- Summary: [CI][Dev] Make tasks.yml easier to maintain Key: ARROW-16589 URL: https://issues.apache.org/jira/browse/ARROW-16589 Project: Apache Arrow Issue Type: New Feature Components: Continuous Integration, Developer Tools Reporter: Krisztian Szucs I think {{dev/tasks/tasks.yml}} has reached its limits by using jinja2 templated yml. We should think about a better way to define crossbow jobs while: - keeping it readable - in a dialect which is natively supported by editors - while supporting tasks parametrization Just one idea is to use python files containing python objects, e.g.: {code} Task( name="wheel-macos-big-sur-cp38-arm64", ci="github", template="python-wheels/github.osx.arm64.yml", params=dict( arch="arm64", arrow_simd_level="DEFAULT", python_version="3.8", macos_deployment_target="11.0" ), artifacts=[ "pyarrow-{no_rc_version}-cp38-cp38-macosx_11_0_arm64.whl" ] ) {code} where {{Task}} would be the crossbow task class (which could be refactored to use pydantic or another alternative for less boilerplate). Of course porting to the tasks definitions to plain python could make the situation even worse by accessing too many scripting utilities. We could try a dynamic config language which sits between yaml and python like HCL. [~kou] what syntax would you be comfortable to work with? Do you have any alternatives we could use? cc [~amol-] [~raulcd] [~assignUser] -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16332) [Release] Java jars verification pass despite binaries not being uploaded
Krisztian Szucs created ARROW-16332: --- Summary: [Release] Java jars verification pass despite binaries not being uploaded Key: ARROW-16332 URL: https://issues.apache.org/jira/browse/ARROW-16332 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 9.0.0 See results at https://github.com/apache/arrow/pull/12991#issuecomment-1109525407 -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16315) [Python] Cython api test fails with allocation error on windows
Krisztian Szucs created ARROW-16315: --- Summary: [Python] Cython api test fails with allocation error on windows Key: ARROW-16315 URL: https://issues.apache.org/jira/browse/ARROW-16315 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Fix For: 9.0.0 Getting memory pool deallocation errors https://github.com/ursacomputing/crossbow/runs/6154173225?check_suite_focus=true#step:6:33401 -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16314) [Python][CI] Skip running cython tests in windows verification builds
Krisztian Szucs created ARROW-16314: --- Summary: [Python][CI] Skip running cython tests in windows verification builds Key: ARROW-16314 URL: https://issues.apache.org/jira/browse/ARROW-16314 Project: Apache Arrow Issue Type: Bug Components: Continuous Integration, Python Reporter: Krisztian Szucs Getting memory pool errors https://github.com/ursacomputing/crossbow/runs/6154173225?check_suite_focus=true#step:6:33401 -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16312) [C++][CI] Install tzdata in the windows verification builds
Krisztian Szucs created ARROW-16312: --- Summary: [C++][CI] Install tzdata in the windows verification builds Key: ARROW-16312 URL: https://issues.apache.org/jira/browse/ARROW-16312 Project: Apache Arrow Issue Type: Improvement Components: C++, Continuous Integration Reporter: Krisztian Szucs Fix For: 8.0.0 See build log https://github.com/ursacomputing/crossbow/runs/614860?check_suite_focus=true -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16301) [C#][CI] Fix docker configuration for .NET 6
Krisztian Szucs created ARROW-16301: --- Summary: [C#][CI] Fix docker configuration for .NET 6 Key: ARROW-16301 URL: https://issues.apache.org/jira/browse/ARROW-16301 Project: Apache Arrow Issue Type: Improvement Components: C#, Continuous Integration Reporter: Krisztian Szucs Fix For: 8.0.0 Forgot to update the docker setup in https://github.com/apache/arrow/commit/f275f50792fb80e1615427620fd32681ecf3e07a -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-16284) [Python][Packaging] Use delocate-fuse to create universal2 wheels
Krisztian Szucs created ARROW-16284: --- Summary: [Python][Packaging] Use delocate-fuse to create universal2 wheels Key: ARROW-16284 URL: https://issues.apache.org/jira/browse/ARROW-16284 Project: Apache Arrow Issue Type: Improvement Components: Packaging, Python Reporter: Krisztian Szucs Previously we used specific universal2 configurations for vcpkg to build the dependencies containing symbols for both architectures. This approach proved to be fragile to vcpkg changes making it hard to upgrade the vcpkg version. As an example https://github.com/apache/arrow/pull/12893 bumps the vcpkg version where absl has stopped compiling for two CMAKE_OSX_ARCHITECTURES, it has been already fixed in absl's upstream but that hasn't been released yet. The new approach uses multibuild's delocate to build the wheels for both arm64 and amd64 separately and fuse them in an upcoming step to a universal2 wheel (using {{lipo}} under the hood). -- This message was sent by Atlassian Jira (v8.20.7#820007)
[jira] [Created] (ARROW-15555) [Release] Post release version bumping script tries to push the release tag
Krisztian Szucs created ARROW-1: --- Summary: [Release] Post release version bumping script tries to push the release tag Key: ARROW-1 URL: https://issues.apache.org/jira/browse/ARROW-1 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 8.0.0 fatal: tag 'apache-arrow-7.0.0' already exists -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15504) [Python] Ensure to test ORC bindings
Krisztian Szucs created ARROW-15504: --- Summary: [Python] Ensure to test ORC bindings Key: ARROW-15504 URL: https://issues.apache.org/jira/browse/ARROW-15504 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Fix For: 8.0.0 See conversation https://github.com/apache/arrow/commit/f9f6fdbb7518c09b833cb6b78bc202008d28e865#r64854632 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15499) [Python] Fix import error in pyarrow._orc
Krisztian Szucs created ARROW-15499: --- Summary: [Python] Fix import error in pyarrow._orc Key: ARROW-15499 URL: https://issues.apache.org/jira/browse/ARROW-15499 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15486) [Relase][Java] Verify staged maven artifacts
Krisztian Szucs created ARROW-15486: --- Summary: [Relase][Java] Verify staged maven artifacts Key: ARROW-15486 URL: https://issues.apache.org/jira/browse/ARROW-15486 Project: Apache Arrow Issue Type: Bug Reporter: Krisztian Szucs We have two tests right now: 1. Execute {{mvn test}} from the source tarball's java directory testing the source https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L278 2. Verify the checksums and signatures of the uploaded maven artifacts https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L766 But we don't actually *test* the packages. We should add that to the verification scripts. cc [~kou] [~anthonylouis] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15485) [Release][Java] Fix java jars upload script
Krisztian Szucs created ARROW-15485: --- Summary: [Release][Java] Fix java jars upload script Key: ARROW-15485 URL: https://issues.apache.org/jira/browse/ARROW-15485 Project: Apache Arrow Issue Type: Bug Components: Developer Tools, Java Reporter: Krisztian Szucs Fix For: 8.0.0 Locally not existing files get uploaded to maven. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15483) [Release] Exercise source verification builds on a nightly basis
Krisztian Szucs created ARROW-15483: --- Summary: [Release] Exercise source verification builds on a nightly basis Key: ARROW-15483 URL: https://issues.apache.org/jira/browse/ARROW-15483 Project: Apache Arrow Issue Type: New Feature Components: Developer Tools Reporter: Krisztian Szucs Fix For: 8.0.0 We need to update the verification scripts to support specific git revisions without checking the signatures, then we can simply submit the verification tasks using crossbow. cc [~kou] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15456) [Release] Automatize source verification task submission
Krisztian Szucs created ARROW-15456: --- Summary: [Release] Automatize source verification task submission Key: ARROW-15456 URL: https://issues.apache.org/jira/browse/ARROW-15456 Project: Apache Arrow Issue Type: New Feature Components: Developer Tools Reporter: Krisztian Szucs The workflow would look like this: {code} git push -u apache release- git push -u apache release--rc git push -u apache apache-arrow- dev/release/02-source.sh dev/release/03-source-verify.sh {code} Where {{03-source-verify.sh}} would create a pull request and submit crossbow source verification tasks by either: a. placing a github comment triggering the comment bot b. calling crossbow locally then placing a comment to the PR using the same {{archery.crossbow.CommentReport}} class The resulting PR should look like this https://github.com/apache/arrow/pull/12262 Opinions @kou? -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15453) [Crossbow] Unable to parse github owner/repository pair
Krisztian Szucs created ARROW-15453: --- Summary: [Crossbow] Unable to parse github owner/repository pair Key: ARROW-15453 URL: https://issues.apache.org/jira/browse/ARROW-15453 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 8.0.0 See build log: https://github.com/ursacomputing/crossbow/runs/4939685651?check_suite_focus=true#step:12:118 Should support plain http urls, like 'https://github.com/ursacomputing/crossbow/' -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15450) [Python][Wheel] Flight test receives SIGKILL during in macOS tests
Krisztian Szucs created ARROW-15450: --- Summary: [Python][Wheel] Flight test receives SIGKILL during in macOS tests Key: ARROW-15450 URL: https://issues.apache.org/jira/browse/ARROW-15450 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 See build: https://github.com/ursacomputing/crossbow/runs/4928437869?check_suite_focus=true#step:4:2967 cc [~davidli] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15449) [Release] Add post-{num}-changelog.sh to update CHANGELOG.md
Krisztian Szucs created ARROW-15449: --- Summary: [Release] Add post-{num}-changelog.sh to update CHANGELOG.md Key: ARROW-15449 URL: https://issues.apache.org/jira/browse/ARROW-15449 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Krisztian Szucs Fix For: 8.0.0 See https://github.com/apache/arrow/pull/12235#discussion_r791194366 It's going to prevent issues like https://issues.apache.org/jira/browse/ARROW-13460 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15448) [C++] Use apache mirror system to download ORC's source
Krisztian Szucs created ARROW-15448: --- Summary: [C++] Use apache mirror system to download ORC's source Key: ARROW-15448 URL: https://issues.apache.org/jira/browse/ARROW-15448 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 By the recent switch to bundled ORC builds in the wheels has surfaced flaky download issues from apache dist which should be discouraged to use. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15447) [C++] ORC adapter fails to compile due to name conflict
Krisztian Szucs created ARROW-15447: --- Summary: [C++] ORC adapter fails to compile due to name conflict Key: ARROW-15447 URL: https://issues.apache.org/jira/browse/ARROW-15447 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Krisztian Szucs Fix For: 7.0.0 See build https://github.com/ursacomputing/crossbow/runs/4932765676?check_suite_focus=true#step:5:1191 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15442) [Python] GDB test cannot locate libarrow
Krisztian Szucs created ARROW-15442: --- Summary: [Python] GDB test cannot locate libarrow Key: ARROW-15442 URL: https://issues.apache.org/jira/browse/ARROW-15442 Project: Apache Arrow Issue Type: Bug Reporter: Krisztian Szucs See build https://github.com/ursacomputing/crossbow/runs/4930447399?check_suite_focus=true#step:5:16777 cc [~apitrou] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15436) [Release][Python] Disable verification of gdb tests on windows and a flaky test on apple M1
Krisztian Szucs created ARROW-15436: --- Summary: [Release][Python] Disable verification of gdb tests on windows and a flaky test on apple M1 Key: ARROW-15436 URL: https://issues.apache.org/jira/browse/ARROW-15436 Project: Apache Arrow Issue Type: Task Components: Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 8.0.0 See verification problems occured in https://github.com/apache/arrow/pull/12235 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15420) [Python] Sdist packaging build is failing due to missing GDB script
Krisztian Szucs created ARROW-15420: --- Summary: [Python] Sdist packaging build is failing due to missing GDB script Key: ARROW-15420 URL: https://issues.apache.org/jira/browse/ARROW-15420 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 See nightly build log https://github.com/ursacomputing/crossbow/runs/4911185725?check_suite_focus=true -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15417) [Python][Packaging] Windows wheels are crashing due to AWS SDK error
Krisztian Szucs created ARROW-15417: --- Summary: [Python][Packaging] Windows wheels are crashing due to AWS SDK error Key: ARROW-15417 URL: https://issues.apache.org/jira/browse/ARROW-15417 Project: Apache Arrow Issue Type: Bug Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 7.0.0 Sadly we have an unexpected crash during the windows wheel verification which needs to be investigated: https://github.com/apache/arrow/pull/12224#issuecomment-1018910642 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15416) [Python] Add option to skip gdb tests
Krisztian Szucs created ARROW-15416: --- Summary: [Python] Add option to skip gdb tests Key: ARROW-15416 URL: https://issues.apache.org/jira/browse/ARROW-15416 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 The newly added gdb feature tests are failing on macos M1 in the wheel verification builds due to not universal2 gdb binary: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2022-01-23-0-github-wheel-macos-big-sur-cp39-arm64 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15404) [Java][Packaging] Use bundled ORC for building java JNI jars
Krisztian Szucs created ARROW-15404: --- Summary: [Java][Packaging] Use bundled ORC for building java JNI jars Key: ARROW-15404 URL: https://issues.apache.org/jira/browse/ARROW-15404 Project: Apache Arrow Issue Type: Bug Components: Java, Packaging Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 Forgot to update the JNI files in https://github.com/apache/arrow/pull/12153 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15403) [Python] Fails to build python wheels due to depending on more recent ORC
Krisztian Szucs created ARROW-15403: --- Summary: [Python] Fails to build python wheels due to depending on more recent ORC Key: ARROW-15403 URL: https://issues.apache.org/jira/browse/ARROW-15403 Project: Apache Arrow Issue Type: Bug Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 See build log: https://github.com/ursacomputing/crossbow/runs/4894370329?check_suite_focus=true#step:6:1469 That API is available since https://issues.apache.org/jira/browse/ORC-984 but vcpkg doesn't ship any of the versions highlighted in the ticket. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15401) [Python] Gdb tests are failing on windows
Krisztian Szucs created ARROW-15401: --- Summary: [Python] Gdb tests are failing on windows Key: ARROW-15401 URL: https://issues.apache.org/jira/browse/ARROW-15401 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 See build https://github.com/ursacomputing/crossbow/runs/4889157090?check_suite_focus=true#step:5:31451 cc [~apitrou] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15400) [Go][CI] Exercise builds on arm machines
Krisztian Szucs created ARROW-15400: --- Summary: [Go][CI] Exercise builds on arm machines Key: ARROW-15400 URL: https://issues.apache.org/jira/browse/ARROW-15400 Project: Apache Arrow Issue Type: New Feature Components: Continuous Integration, Go Reporter: Krisztian Szucs Fix For: 8.0.0 Preferably on travis for pull requests and we can create an additional crossbow job to also test on apple M1 on a nightly basis. cc [~zeroshade] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15399) [Release][JS] Increase minimum NodeJS version to 16
Krisztian Szucs created ARROW-15399: --- Summary: [Release][JS] Increase minimum NodeJS version to 16 Key: ARROW-15399 URL: https://issues.apache.org/jira/browse/ARROW-15399 Project: Apache Arrow Issue Type: Task Components: JavaScript Reporter: Krisztian Szucs Fix For: 7.0.0 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15395) [Release][Ruby] Ruby verification fails on M1
Krisztian Szucs created ARROW-15395: --- Summary: [Release][Ruby] Ruby verification fails on M1 Key: ARROW-15395 URL: https://issues.apache.org/jira/browse/ARROW-15395 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 7.0.0 See build log https://github.com/ursacomputing/crossbow/runs/4883657307?check_suite_focus=true#step:4:8653 While this is not a blocker I may need to cut another release candidate meanwhile. cc [~kou] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15393) [Release][Crossbow] Fall back to 0 distance when generating scm version
Krisztian Szucs created ARROW-15393: --- Summary: [Release][Crossbow] Fall back to 0 distance when generating scm version Key: ARROW-15393 URL: https://issues.apache.org/jira/browse/ARROW-15393 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Fix For: 8.0.0 The generated SCM version number in the verification tasks is `8.0.0devNone` which raises an error from setup.py -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15392) [JS] Flaky javascript unittest
Krisztian Szucs created ARROW-15392: --- Summary: [JS] Flaky javascript unittest Key: ARROW-15392 URL: https://issues.apache.org/jira/browse/ARROW-15392 Project: Apache Arrow Issue Type: Bug Components: JavaScript Reporter: Krisztian Szucs See build log: https://github.com/ursacomputing/crossbow/runs/4871354453?check_suite_focus=true#step:5:8164 While the error is flaky it occurs pretty often. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15380) [Python][Release] NumPy ABI incompatibility during verification
Krisztian Szucs created ARROW-15380: --- Summary: [Python][Release] NumPy ABI incompatibility during verification Key: ARROW-15380 URL: https://issues.apache.org/jira/browse/ARROW-15380 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 See build https://github.com/ursacomputing/crossbow/runs/4871349353?check_suite_focus=true#step:5:12115 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15378) [C++][Release] GTest linking error during windows verification
Krisztian Szucs created ARROW-15378: --- Summary: [C++][Release] GTest linking error during windows verification Key: ARROW-15378 URL: https://issues.apache.org/jira/browse/ARROW-15378 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Krisztian Szucs Fix For: 7.0.0 See build https://github.com/ursacomputing/crossbow/runs/4871374560?check_suite_focus=true#step:5:1274 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15377) [JS][Release] JavaScript verification fails
Krisztian Szucs created ARROW-15377: --- Summary: [JS][Release] JavaScript verification fails Key: ARROW-15377 URL: https://issues.apache.org/jira/browse/ARROW-15377 Project: Apache Arrow Issue Type: Bug Components: JavaScript Reporter: Krisztian Szucs Fix For: 7.0.0 See build log https://github.com/ursacomputing/crossbow/runs/4871354453?check_suite_focus=true#step:5:8164 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15376) [Go][Release] Go verification fails
Krisztian Szucs created ARROW-15376: --- Summary: [Go][Release] Go verification fails Key: ARROW-15376 URL: https://issues.apache.org/jira/browse/ARROW-15376 Project: Apache Arrow Issue Type: Bug Components: Go Reporter: Krisztian Szucs Fix For: 7.0.0 See build error https://github.com/ursacomputing/crossbow/runs/4871355213?check_suite_focus=true#step:4:2703 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15372) [C++][Gandiva] Gandiva now depends on boost/crc.hpp which is missing from the trimmed boost archive
Krisztian Szucs created ARROW-15372: --- Summary: [C++][Gandiva] Gandiva now depends on boost/crc.hpp which is missing from the trimmed boost archive Key: ARROW-15372 URL: https://issues.apache.org/jira/browse/ARROW-15372 Project: Apache Arrow Issue Type: Bug Components: C++, C++ - Gandiva Affects Versions: 7.0.0 Reporter: Krisztian Szucs See build error https://github.com/ursacomputing/crossbow/runs/4871392838?check_suite_focus=true#step:5:11762 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15371) [Release] Missing libsqlite-dev from the verification docker images
Krisztian Szucs created ARROW-15371: --- Summary: [Release] Missing libsqlite-dev from the verification docker images Key: ARROW-15371 URL: https://issues.apache.org/jira/browse/ARROW-15371 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Krisztian Szucs See build error https://github.com/ursacomputing/crossbow/runs/4870407487?check_suite_focus=true#step:5:4852 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15355) [Docs] Trigger sphinx build on documentation changes
Krisztian Szucs created ARROW-15355: --- Summary: [Docs] Trigger sphinx build on documentation changes Key: ARROW-15355 URL: https://issues.apache.org/jira/browse/ARROW-15355 Project: Apache Arrow Issue Type: Improvement Components: Documentation Reporter: Krisztian Szucs Fix For: 7.0.0 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15133) [CI] Removing util_checkout.sh and util_cleanup.sh scripts
Krisztian Szucs created ARROW-15133: --- Summary: [CI] Removing util_checkout.sh and util_cleanup.sh scripts Key: ARROW-15133 URL: https://issues.apache.org/jira/browse/ARROW-15133 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 7.0.0 - ci/scripts/util_checkout.sh was used to checkout submodules because actions/checkout@v2 has removed support for that, but they have restored it since. - ci/scripts/util_cleanup.sh was used to free up disk space on github actions runners, because at that time it was limited to 7GB, from a recent run it looks like the linux runners now have 32GB free space so we can try to disable the cleanup step sparing almost a minute of build time -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks
Krisztian Szucs created ARROW-15006: --- Summary: [Python][Doc] Iteratively enable more numpydoc checks Key: ARROW-15006 URL: https://issues.apache.org/jira/browse/ARROW-15006 Project: Apache Arrow Issue Type: Improvement Components: Documentation, Python Reporter: Krisztian Szucs Asof https://github.com/apache/arrow/pull/7732 we're going to have a numpydoc check running on pull requests. There is a single rule enabled at the moment: PR01 Additional checks we can run: {code} ERROR_MSGS = { "GL01": "Docstring text (summary) should start in the line immediately " "after the opening quotes (not in the same line, or leaving a " "blank line in between)", "GL02": "Closing quotes should be placed in the line after the last text " "in the docstring (do not close the quotes in the same line as " "the text, or leave a blank line between the last text and the " "quotes)", "GL03": "Double line break found; please use only one blank line to " "separate sections or paragraphs, and do not leave blank lines " "at the end of docstrings", "GL05": 'Tabs found at the start of line "{line_with_tabs}", please use ' "whitespace only", "GL06": 'Found unknown section "{section}". Allowed sections are: ' "{allowed_sections}", "GL07": "Sections are in the wrong order. Correct order is: {correct_sections}", "GL08": "The object does not have a docstring", "GL09": "Deprecation warning should precede extended summary", "GL10": "reST directives {directives} must be followed by two colons", "SS01": "No summary found (a short summary in a single line should be " "present at the beginning of the docstring)", "SS02": "Summary does not start with a capital letter", "SS03": "Summary does not end with a period", "SS04": "Summary contains heading whitespaces", "SS05": "Summary must start with infinitive verb, not third person " '(e.g. use "Generate" instead of "Generates")', "SS06": "Summary should fit in a single line", "ES01": "No extended summary found", "PR01": "Parameters {missing_params} not documented", "PR02": "Unknown parameters {unknown_params}", "PR03": "Wrong parameters order. Actual: {actual_params}. " "Documented: {documented_params}", "PR04": 'Parameter "{param_name}" has no type', "PR05": 'Parameter "{param_name}" type should not finish with "."', "PR06": 'Parameter "{param_name}" type should use "{right_type}" instead ' 'of "{wrong_type}"', "PR07": 'Parameter "{param_name}" has no description', "PR08": 'Parameter "{param_name}" description should start with a ' "capital letter", "PR09": 'Parameter "{param_name}" description should finish with "."', "PR10": 'Parameter "{param_name}" requires a space before the colon ' "separating the parameter name and type", "RT01": "No Returns section found", "RT02": "The first line of the Returns section should contain only the " "type, unless multiple values are being returned", "RT03": "Return value has no description", "RT04": "Return value description should start with a capital letter", "RT05": 'Return value description should finish with "."', "YD01": "No Yields section found", "SA01": "See Also section not found", "SA02": "Missing period at end of description for See Also " '"{reference_name}" reference', "SA03": "Description should be capitalized for See Also " '"{reference_name}" reference', "SA04": 'Missing description for See Also "{reference_name}" reference', "EX01": "No examples section found", } {code} cc [~alenkaf] [~amol-] [~jorisvandenbossche] -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14996) [Python][Gandiva] Deprecate of hide make_projector and make_filter testing utilities
Krisztian Szucs created ARROW-14996: --- Summary: [Python][Gandiva] Deprecate of hide make_projector and make_filter testing utilities Key: ARROW-14996 URL: https://issues.apache.org/jira/browse/ARROW-14996 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs {{pyarrow.gandiva.{make_filter, make_projector}}} functions are only used from gandiva unittests. Additionally unexpected arguments can cause segmentations faults. We either should deprecate or hide these functions from the public API. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14995) [Doc][Python] Document missing arguments for pyarrow.flight objects
Krisztian Szucs created ARROW-14995: --- Summary: [Doc][Python] Document missing arguments for pyarrow.flight objects Key: ARROW-14995 URL: https://issues.apache.org/jira/browse/ARROW-14995 Project: Apache Arrow Issue Type: Improvement Components: Documentation, Python Reporter: Krisztian Szucs To see the list of undocumented arguments: 1. uncomment https://github.com/apache/arrow/pull/7732/files#diff-fafe69518755e93c6d34fd8d0b5e722a2dc23c30920223015b8a80faa0b98db8R249 2. execute {{archery numpydoc -a PR01}} -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14991) [Packaging][Python] Windows wheel builds are failing due to wrong vcpkg triplet name
Krisztian Szucs created ARROW-14991: --- Summary: [Packaging][Python] Windows wheel builds are failing due to wrong vcpkg triplet name Key: ARROW-14991 URL: https://issues.apache.org/jira/browse/ARROW-14991 Project: Apache Arrow Issue Type: Improvement Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 See build log https://github.com/ursacomputing/crossbow/runs/4426753814?check_suite_focus=true#step:7:192 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14968) [Python] Pin numpy build dependency using oldest-supported-numpy
Krisztian Szucs created ARROW-14968: --- Summary: [Python] Pin numpy build dependency using oldest-supported-numpy Key: ARROW-14968 URL: https://issues.apache.org/jira/browse/ARROW-14968 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14962) [CI] Fix minio installation on s390x
Krisztian Szucs created ARROW-14962: --- Summary: [CI] Fix minio installation on s390x Key: ARROW-14962 URL: https://issues.apache.org/jira/browse/ARROW-14962 Project: Apache Arrow Issue Type: Bug Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 7.0.0 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14932) [CI][Python] Prefer mamba over conda
Krisztian Szucs created ARROW-14932: --- Summary: [CI][Python] Prefer mamba over conda Key: ARROW-14932 URL: https://issues.apache.org/jira/browse/ARROW-14932 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration, Python Reporter: Krisztian Szucs Fix For: 7.0.0 Mamba should provide quicker docker image builds compared to conda. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14928) [Python][Packaging] Remove boost-filesystem vcpkg dependency from the wheel dockerfiles
Krisztian Szucs created ARROW-14928: --- Summary: [Python][Packaging] Remove boost-filesystem vcpkg dependency from the wheel dockerfiles Key: ARROW-14928 URL: https://issues.apache.org/jira/browse/ARROW-14928 Project: Apache Arrow Issue Type: Improvement Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 7.0.0 We don't build the C++ tests there so boost-filesystem can be omitted. See comment https://github.com/apache/arrow/pull/11569#discussion_r759270985 -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14879) [Python][Packaging] Remove manylinux2010 wheels
Krisztian Szucs created ARROW-14879: --- Summary: [Python][Packaging] Remove manylinux2010 wheels Key: ARROW-14879 URL: https://issues.apache.org/jira/browse/ARROW-14879 Project: Apache Arrow Issue Type: Improvement Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 7.0.0 More recent vcpkg is not compatible with older glibc shipped by manylinux2010 so we won't be able to regularly update the dependencies. Besides that manylinux2010 has reached EOL. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (ARROW-14587) [CI][Crossbow] Fetch a single crossbow branch instead of the full repo on Azure
Krisztian Szucs created ARROW-14587: --- Summary: [CI][Crossbow] Fetch a single crossbow branch instead of the full repo on Azure Key: ARROW-14587 URL: https://issues.apache.org/jira/browse/ARROW-14587 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 7.0.0 Since crossbow has a lot of references the checkout step can take a long time, see build https://dev.azure.com/ursacomputing/crossbow/_build/results?buildId=14952=logs=0da5d1d9-276d-5173-c4c4-9d4d4ed14fdb=5bbb8710-d4c1-5a8b-fc80-a388730cf6ac We should alter the azure crossbow template to explicitly check out the task's branch using {{ {{ task.branch }} }} jinja variable. See azure documentation: https://docs.microsoft.com/en-us/azure/devops/pipelines/repos/multi-repo-checkout?view=azure-devops#checking-out-a-specific-ref -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14512) [Java][Doc] JavaDoc errors while building the docs
Krisztian Szucs created ARROW-14512: --- Summary: [Java][Doc] JavaDoc errors while building the docs Key: ARROW-14512 URL: https://issues.apache.org/jira/browse/ARROW-14512 Project: Apache Arrow Issue Type: Improvement Components: Documentation, Java Reporter: Krisztian Szucs Fix For: 7.0.0 On JDK 11: https://github.com/apache/arrow/runs/4037920463?check_suite_focus=true#step:8:4913 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14505) [CI][Docs] Exercise documentation builds on the main branch
Krisztian Szucs created ARROW-14505: --- Summary: [CI][Docs] Exercise documentation builds on the main branch Key: ARROW-14505 URL: https://issues.apache.org/jira/browse/ARROW-14505 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration, Documentation Reporter: Krisztian Szucs Fix For: 7.0.0 We regularly have documentation build issues since the build has been disabled on github actions. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14499) [Docs] Version dropdown side-by-side with search box
Krisztian Szucs created ARROW-14499: --- Summary: [Docs] Version dropdown side-by-side with search box Key: ARROW-14499 URL: https://issues.apache.org/jira/browse/ARROW-14499 Project: Apache Arrow Issue Type: Improvement Components: Documentation Reporter: Krisztian Szucs Assignee: Joris Van den Bossche Fix For: 7.0.0, 6.0.1 Small follow-up on #11283 to improve the styling of the version dropdown. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14498) [Docs] Make it possible to regenerate older docs with additional patch(es)
Krisztian Szucs created ARROW-14498: --- Summary: [Docs] Make it possible to regenerate older docs with additional patch(es) Key: ARROW-14498 URL: https://issues.apache.org/jira/browse/ARROW-14498 Project: Apache Arrow Issue Type: Wish Components: Documentation Reporter: Krisztian Szucs Fix For: 7.0.0 We may need to regenerate older docs to include new changes, e.g. the new version dropdown feature. Since we need to regenerate the docs for the first time, it would be great if we could encapsulate this in a script. After applying the patch {{archery docker run ubuntu-docs}} should do the rest, similarly like we use in the post-release task https://github.com/apache/arrow/blob/master/dev/release/post-09-docs.sh ``` dev/release/generate-docs.sh dev/release/generate-docs.sh 6.0.0 # no patch required dev/release/generate-docs.sh 5.0.0 docs.patch dev/release/generate-docs.sh 4.0.0 docs.patch dev/release/generate-docs.sh 3.0.0 docs.patch # then deploy to asf-site ``` cc [~jorisvandenbossche] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14497) [Docs] Use relative internal links in the sphinx docs
Krisztian Szucs created ARROW-14497: --- Summary: [Docs] Use relative internal links in the sphinx docs Key: ARROW-14497 URL: https://issues.apache.org/jira/browse/ARROW-14497 Project: Apache Arrow Issue Type: Improvement Components: Documentation Reporter: Krisztian Szucs Fix For: 7.0.0 There are a lot of hardcoded urls referencing non-sphinx documentations across the generated HTML files, couple of examples: - https://arrow.apache.org/docs/r/ - https://arrow.apache.org/docs/js/ - https://arrow.apache.org/docs/c_glib/ - https://arrow.apache.org/docs/java/reference/ Using the new versioned docs the {{https://arrow.apache.org/docs/5.0/java/index.html}} links should point to {{https://arrow.apache.org/docs/5.0/java/reference/}} instead of {{https://arrow.apache.org/docs/java/reference/}} cc [~jorisvandenbossche] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14490) [Doc] Regenerate CHANGELOG.md to include all versions
Krisztian Szucs created ARROW-14490: --- Summary: [Doc] Regenerate CHANGELOG.md to include all versions Key: ARROW-14490 URL: https://issues.apache.org/jira/browse/ARROW-14490 Project: Apache Arrow Issue Type: Improvement Components: Documentation Reporter: Krisztian Szucs Fix For: 7.0.0 Since the move to release branches we haven't been updating the CHANGELOG.md file on the main branch so the versions are missing begining from release 3.0.0. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14489) [Rust][CI] Install stable rust toolchain in the integration docker image
Krisztian Szucs created ARROW-14489: --- Summary: [Rust][CI] Install stable rust toolchain in the integration docker image Key: ARROW-14489 URL: https://issues.apache.org/jira/browse/ARROW-14489 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration, Rust Reporter: Krisztian Szucs Fix For: 7.0.0 To enable the downstream rust pull request: https://github.com/apache/arrow-rs/pull/591 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14472) [Dev][Archery] Generate contribution statistics using archery
Krisztian Szucs created ARROW-14472: --- Summary: [Dev][Archery] Generate contribution statistics using archery Key: ARROW-14472 URL: https://issues.apache.org/jira/browse/ARROW-14472 Project: Apache Arrow Issue Type: Improvement Components: Archery, Developer Tools Reporter: Krisztian Szucs Currently we use a bash script to do that: https://github.com/apache/arrow/blob/master/dev/release/post-03-website.sh#L47-L67 Since the rust repository split, this logic needs to be extended. Additionally the scripts expects {{gnu date}} commands which is not available on macOS by default. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14468) [Python] Resolve parquet version deprecation warnings when compiling pyarrow
Krisztian Szucs created ARROW-14468: --- Summary: [Python] Resolve parquet version deprecation warnings when compiling pyarrow Key: ARROW-14468 URL: https://issues.apache.org/jira/browse/ARROW-14468 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs {code} /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp: In function ‘PyObject* __pyx_pf_7pyarrow_8_parquet_12FileMetaData_14format_version___get__(__pyx_obj_7pyarrow_8_parquet_FileMetaData*)’: /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:14168:36: warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection [-Wdeprecated-declarations] 14168 | case parquet::ParquetVersion::PARQUET_2_0: |^~~ In file included from /tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30, from /tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32, from /tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21, from /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734: /tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: declared here 44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or PARQUET_2_6 " | ^~~ /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:14168:36: warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection [-Wdeprecated-declarations] 14168 | case parquet::ParquetVersion::PARQUET_2_0: |^~~ In file included from /tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30, from /tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32, from /tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21, from /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734: /tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: declared here 44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or PARQUET_2_6 " | ^~~ /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp: In function ‘std::shared_ptr __pyx_f_7pyarrow_8_parquet__create_writer_properties(__pyx_opt_args_7pyarrow_8_parquet__create_writer_properties*)’: /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:23800:62: warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection [-Wdeprecated-declarations] 23800 | (void)(__pyx_v_props.version( parquet::ParquetVersion::PARQUET_2_0)); | ^~~ In file included from /tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30, from /tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32, from /tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21, from /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734: /tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: declared here 44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or PARQUET_2_6 " | ^~~ /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:23800:62: warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection [-Wdeprecated-declarations] 23800 | (void)(__pyx_v_props.version( parquet::ParquetVersion::PARQUET_2_0)); | ^~~ In file included from /tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30, from /tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32, from /tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21, from /tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734: /tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: declared here 44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or PARQUET_2_6 " | ^~~ {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14438) [CI] Don't cancel build on the main branch
Krisztian Szucs created ARROW-14438: --- Summary: [CI] Don't cancel build on the main branch Key: ARROW-14438 URL: https://issues.apache.org/jira/browse/ARROW-14438 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 7.0.0 When listing the commits from the master branch I often see a bunch of failing commits which are actually cancelled due to concurrency groups: https://github.com/apache/arrow/blob/master/.github/workflows/dev.yml#L26 While we should keep this feature for the pull requests we should disable it for branches. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14437) [Python] CSV test_cancellation unittests fail on Apple M1
Krisztian Szucs created ARROW-14437: --- Summary: [Python] CSV test_cancellation unittests fail on Apple M1 Key: ARROW-14437 URL: https://issues.apache.org/jira/browse/ARROW-14437 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 Perhaps M1 is too quick :) Most noticable when running the release verification tasks: https://github.com/apache/arrow/pull/11511 Failing builds: - https://github.com/ursacomputing/crossbow/runs/3969076907?check_suite_focus=true - https://github.com/ursacomputing/crossbow/runs/3974036108?check_suite_focus=true#step:5:2014 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14436) [C++] Disable color diagnostics when compiling with ccache
Krisztian Szucs created ARROW-14436: --- Summary: [C++] Disable color diagnostics when compiling with ccache Key: ARROW-14436 URL: https://issues.apache.org/jira/browse/ARROW-14436 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Krisztian Szucs Fix For: 7.0.0 Copied from https://github.com/apache/arrow/issues/11279 Steps to reproduce: Compile arrow_objlib with ccache, clang and CCACHE_DEBUG=1 CCACHE_LOGFILE=./ccache.log Find in ./ccache.log: Failed; falling back to running the real compiler Result: unsupported compiler option Dropping -fcolor-diagnostics fixes the issue. I suggest either opting into color diagnostics with WITH_COLOR_DIAGNOSTICS or adding a way to disable it via DISABLE_COLOR_DIAGNOSTICS. It would be good if this wouldn't be tied to ARROW_USE_CCACHE since its also relevant for: -DARROW_USE_CCACHE=OFF -DCMAKE_CXX_COMPILER_LAUNCHER=emscripten_ccache. I can open a PR if you tell me which way you prefer. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14435) [Release] Update verification scripts to check python 3.10 wheels
Krisztian Szucs created ARROW-14435: --- Summary: [Release] Update verification scripts to check python 3.10 wheels Key: ARROW-14435 URL: https://issues.apache.org/jira/browse/ARROW-14435 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 Python 3.10 should be available from conda now, so the verification scripts can check the new python 3.10 wheels. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14424) [Packaging][Python] Disable windows wheel testing for python 3.6
Krisztian Szucs created ARROW-14424: --- Summary: [Packaging][Python] Disable windows wheel testing for python 3.6 Key: ARROW-14424 URL: https://issues.apache.org/jira/browse/ARROW-14424 Project: Apache Arrow Issue Type: Bug Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 Two layers of the official python 3.6 windows image are not available for download. Docker pull returns with unexpected status resolving reader: 403 Forbidden. While this is a transient error, it blocks the release process. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14423) [Python] Fix version constraints in pyproject.toml
Krisztian Szucs created ARROW-14423: --- Summary: [Python] Fix version constraints in pyproject.toml Key: ARROW-14423 URL: https://issues.apache.org/jira/browse/ARROW-14423 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 Causes build error during packaging https://github.com/ursacomputing/crossbow/runs/3967169617?check_suite_focus=true#step:7:2185 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14411) [Release][Integration] Go integration tests fail for 6.0.0-RC1
Krisztian Szucs created ARROW-14411: --- Summary: [Release][Integration] Go integration tests fail for 6.0.0-RC1 Key: ARROW-14411 URL: https://issues.apache.org/jira/browse/ARROW-14411 Project: Apache Arrow Issue Type: Bug Components: Integration Reporter: Krisztian Szucs Only on linux interestingly: https://github.com/apache/arrow/pull/11487#issuecomment-947798453 Here is the build log https://github.com/ursacomputing/crossbow/runs/3955744317?check_suite_focus=true#step:6:55443 I wonder whether it was introduced with https://github.com/apache/arrow/commit/41529c76fe80d1fe8e60b52c0da3669c901a45bb The integration tests on the master branch are passing, so this migh be just a verification task issue. cc [~zeroshade] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14410) [Python][Packaging] Use numpy 1.21.3 to build python 3.10 wheels for macOS and windows
Krisztian Szucs created ARROW-14410: --- Summary: [Python][Packaging] Use numpy 1.21.3 to build python 3.10 wheels for macOS and windows Key: ARROW-14410 URL: https://issues.apache.org/jira/browse/ARROW-14410 Project: Apache Arrow Issue Type: New Feature Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 7.0.0 Numpy has just released new wheels for python 3.10 which we can now use to build wheels on macOS and windows. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14409) [Packaging][Python] Update the manylinux platform tags
Krisztian Szucs created ARROW-14409: --- Summary: [Packaging][Python] Update the manylinux platform tags Key: ARROW-14409 URL: https://issues.apache.org/jira/browse/ARROW-14409 Project: Apache Arrow Issue Type: New Feature Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 7.0.0 Newer versions {{wheel}} produces filenames with future-proof platform tags: {{manylinux_2_17_x86_64.manylinux2014_x86_64.whl}} instead of the previous {{manylinux2014_x86_64.whl}} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14408) [Packaging][Crossbow] Option for skipping artifact pattern validation
Krisztian Szucs created ARROW-14408: --- Summary: [Packaging][Crossbow] Option for skipping artifact pattern validation Key: ARROW-14408 URL: https://issues.apache.org/jira/browse/ARROW-14408 Project: Apache Arrow Issue Type: New Feature Components: Packaging Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 7.0.0 In certain cases we may want to skip artifact pattern validation to still download the produced artifacts despite that their names are slightly different from the expected patterns. For example the manylinux platform tags have changed with the more recent wheel library and we only noticed it after a successful packaging build for the release. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14398) [CI] Don't build doxygen docs in all of the conda builds
Krisztian Szucs created ARROW-14398: --- Summary: [CI] Don't build doxygen docs in all of the conda builds Key: ARROW-14398 URL: https://issues.apache.org/jira/browse/ARROW-14398 Project: Apache Arrow Issue Type: Improvement Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 7.0.0 We reuse the yml anchor to define the command for the conda docker builds: https://github.com/apache/arrow/blob/master/docker-compose.yml#L240 The {{true}} argument instruments the script to build the documentation. We should only enable it in the conda-cpp build which is exercised on all commits and disable for the rest of the builds. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14397) [C++] Fix valgrind error in test utility
Krisztian Szucs created ARROW-14397: --- Summary: [C++] Fix valgrind error in test utility Key: ARROW-14397 URL: https://issues.apache.org/jira/browse/ARROW-14397 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Krisztian Szucs See the latest nightly build error https://dev.azure.com/ursacomputing/crossbow/_build/results?buildId=14046=logs=0da5d1d9-276d-5173-c4c4-9d4d4ed14fdb=d9b15392-e4ce-5e4c-0c8c-b69645229181=3469 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14393) [C++] GTest linking errors during the source release verification
Krisztian Szucs created ARROW-14393: --- Summary: [C++] GTest linking errors during the source release verification Key: ARROW-14393 URL: https://issues.apache.org/jira/browse/ARROW-14393 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Krisztian Szucs Fix For: 6.0.0 https://github.com/ursacomputing/crossbow/runs/3949371326?check_suite_focus=true#step:6:1161 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14392) [C++] Bundled gRPC misses bundled Abseil include path
Krisztian Szucs created ARROW-14392: --- Summary: [C++] Bundled gRPC misses bundled Abseil include path Key: ARROW-14392 URL: https://issues.apache.org/jira/browse/ARROW-14392 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Krisztian Szucs Fix For: 6.0.0 {code} CMake Error in src/arrow/flight/CMakeLists.txt: Imported target "gRPC::grpc++" includes non-existent path "/tmp/arrow-6.0.0.v1qFD/apache-arrow-6.0.0/cpp/build/absl_ep-install/include" in its INTERFACE_INCLUDE_DIRECTORIES. Possible reasons include: * The path was deleted, renamed, or moved to another location. * An install or uninstall procedure did not complete successfully. * The installation package was faulty and references files it does not provide. CMake Error in src/arrow/flight/CMakeLists.txt: Imported target "gRPC::grpc++" includes non-existent path "/tmp/arrow-6.0.0.v1qFD/apache-arrow-6.0.0/cpp/build/absl_ep-install/include" in its INTERFACE_INCLUDE_DIRECTORIES. Possible reasons include: * The path was deleted, renamed, or moved to another location. * An install or uninstall procedure did not complete successfully. * The installation package was faulty and references files it does not provide. {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14388) [Python] Add unittests for converter arrays with pandas masks
Krisztian Szucs created ARROW-14388: --- Summary: [Python] Add unittests for converter arrays with pandas masks Key: ARROW-14388 URL: https://issues.apache.org/jira/browse/ARROW-14388 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Fix For: 7.0.0 Cover the changes in https://github.com/apache/arrow/pull/11465 cc [~amol-] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14381) [CI] Spark integration failures
Krisztian Szucs created ARROW-14381: --- Summary: [CI] Spark integration failures Key: ARROW-14381 URL: https://issues.apache.org/jira/browse/ARROW-14381 Project: Apache Arrow Issue Type: Bug Components: Continuous Integration Reporter: Krisztian Szucs Fix For: 6.0.0 Both spark-master and spark-3.0 nightly builds are failing: master: https://github.com/ursacomputing/crossbow/runs/3938861610#step:7:9237 branch-3.0: https://github.com/ursacomputing/crossbow/runs/3938887794#step:7:8917 We should also test against branch-3.2 cc [~bryanc] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14377) [Packaging][Python] Python 3.9 installation fails in macOS wheel build
Krisztian Szucs created ARROW-14377: --- Summary: [Packaging][Python] Python 3.9 installation fails in macOS wheel build Key: ARROW-14377 URL: https://issues.apache.org/jira/browse/ARROW-14377 Project: Apache Arrow Issue Type: Bug Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 Due to a trailing comma in the script https://github.com/ursacomputing/crossbow/runs/3938860251#step:8:19 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14373) [Packaging][Java] Missing LLVM dependency in the macOS java-jars build
Krisztian Szucs created ARROW-14373: --- Summary: [Packaging][Java] Missing LLVM dependency in the macOS java-jars build Key: ARROW-14373 URL: https://issues.apache.org/jira/browse/ARROW-14373 Project: Apache Arrow Issue Type: Bug Components: Java, Packaging Reporter: Krisztian Szucs Fix For: 7.0.0 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14372) [CI][C++][Python] Exercise builds on GCC 4.8
Krisztian Szucs created ARROW-14372: --- Summary: [CI][C++][Python] Exercise builds on GCC 4.8 Key: ARROW-14372 URL: https://issues.apache.org/jira/browse/ARROW-14372 Project: Apache Arrow Issue Type: New Feature Components: C++, Continuous Integration, Python Reporter: Krisztian Szucs Fix For: 7.0.0 Add a build to {{.github/workflows/python.yml}} to avoid issues like https://issues.apache.org/jira/browse/ARROW-14369 We may extend our docker-compose configuration to include CentOS 7/8 for testing C++ and Python. cc @kou -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14364) [CI][C++] Support LLVM 13
Krisztian Szucs created ARROW-14364: --- Summary: [CI][C++] Support LLVM 13 Key: ARROW-14364 URL: https://issues.apache.org/jira/browse/ARROW-14364 Project: Apache Arrow Issue Type: New Feature Components: C++, Continuous Integration Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 Major platforms have started to provide LLVM 13 packages which causes multiple build errors. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14363) [C++][Gandiva] LLVM 13 has deprecated CreateGEP and CreateLoad methods without explicit element type
Krisztian Szucs created ARROW-14363: --- Summary: [C++][Gandiva] LLVM 13 has deprecated CreateGEP and CreateLoad methods without explicit element type Key: ARROW-14363 URL: https://issues.apache.org/jira/browse/ARROW-14363 Project: Apache Arrow Issue Type: Bug Components: C++, C++ - Gandiva Reporter: Krisztian Szucs -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14361) [C++] Define a MAX default value for ARROW_SIMD_LEVEL
Krisztian Szucs created ARROW-14361: --- Summary: [C++] Define a MAX default value for ARROW_SIMD_LEVEL Key: ARROW-14361 URL: https://issues.apache.org/jira/browse/ARROW-14361 Project: Apache Arrow Issue Type: New Feature Components: C++ Reporter: Krisztian Szucs Fix For: 7.0.0 In order to enable {{ARROW_HAVE_NEON}} CMake flag on ARM architectures {{ARROW_SIMD_LEVEL}} option must be set to not {{"NONE"}}, see https://github.com/apache/arrow/blob/master/cpp/cmake_modules/SetupCxxFlags.cmake#L444 The default value for {{ARROW_SIMD_LEVEL}} is {{SSE4_2}} which is a bit misleading on ARM64, it should rather be {{NEON}} which is not listed as a valid option for {{ARROW_SIMD_LEVEL}}. We may have a {{"MAX"}} default value similarly to the {{ARROW_RUNTIME_SIMD_LEVEL}} option, see https://github.com/apache/arrow/blob/master/cpp/cmake_modules/DefineOptions.cmake#L115 Original github comment: https://github.com/apache/arrow/pull/11433#discussion_r729852835 cc [~yibocai] [~apitrou] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14343) [Packaging][Python] Enable NEON SIMD optimization for M1 wheels
Krisztian Szucs created ARROW-14343: --- Summary: [Packaging][Python] Enable NEON SIMD optimization for M1 wheels Key: ARROW-14343 URL: https://issues.apache.org/jira/browse/ARROW-14343 Project: Apache Arrow Issue Type: New Feature Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 6.0.0 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14312) [Python] Integer conversion failures with python 3.10
Krisztian Szucs created ARROW-14312: --- Summary: [Python] Integer conversion failures with python 3.10 Key: ARROW-14312 URL: https://issues.apache.org/jira/browse/ARROW-14312 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Krisztian Szucs We have conversion issues during testing the python wheels for 3.10: https://github.com/ursacomputing/crossbow/runs/3882292730?check_suite_focus=true#step:8:658 Some of the failures should be related to the removed {{__int__}} method: https://docs.python.org/3/whatsnew/3.10.html#removed cc [~apitrou] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14276) [Packaging] Dependency resolution issues in the nightly conda builds
Krisztian Szucs created ARROW-14276: --- Summary: [Packaging] Dependency resolution issues in the nightly conda builds Key: ARROW-14276 URL: https://issues.apache.org/jira/browse/ARROW-14276 Project: Apache Arrow Issue Type: New Feature Components: Packaging Reporter: Krisztian Szucs Fix For: 6.0.0 The majority of the conda nightly builds are failing due to dependency resolution problems: {code} - conda-linux-gcc-py37-arm64: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-arm64 - conda-linux-gcc-py37-cpu-r41: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-cpu-r41 - conda-linux-gcc-py37-cuda: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-cuda - conda-linux-gcc-py38-arm64: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-arm64 - conda-linux-gcc-py38-cpu: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-cpu - conda-linux-gcc-py38-cuda: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-cuda - conda-linux-gcc-py39-arm64: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-arm64 - conda-linux-gcc-py39-cpu: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-cpu - conda-linux-gcc-py39-cuda: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-cuda - conda-win-vs2017-py36-r40: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py36-r40 - conda-win-vs2017-py38: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py38 - conda-win-vs2017-py39: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py39 {code} I assume that we need to sync the recipes again with up to date pin files. cc @uwe -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-14217) [Python][CI] Add support for python 3.10
Krisztian Szucs created ARROW-14217: --- Summary: [Python][CI] Add support for python 3.10 Key: ARROW-14217 URL: https://issues.apache.org/jira/browse/ARROW-14217 Project: Apache Arrow Issue Type: New Feature Components: Continuous Integration, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 Python 3.10 has just been released, so exercise builds and ship packages for it. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13921) [Python][Packaging] Pin minimum setuptools version for the macos wheels
Krisztian Szucs created ARROW-13921: --- Summary: [Python][Packaging] Pin minimum setuptools version for the macos wheels Key: ARROW-13921 URL: https://issues.apache.org/jira/browse/ARROW-13921 Project: Apache Arrow Issue Type: Improvement Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 There was a bug in setuptools which caused the recent nightly failures: https://github.com/ursacomputing/crossbow/runs/3521607291#step:10:269 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13914) [C++][Python] Optimize type inference when converting from python values
Krisztian Szucs created ARROW-13914: --- Summary: [C++][Python] Optimize type inference when converting from python values Key: ARROW-13914 URL: https://issues.apache.org/jira/browse/ARROW-13914 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Krisztian Szucs Currently we use an extensive set of checks to infer arrow type from python sequences. Last time I checked using asv, the inference part had a significant overhead. We could try other approaches to speed-up the type inference, see comments: https://github.com/apache/arrow/pull/11076#discussion_r702808196 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13635) [Packaging][Python] Define --with-lg-page for jemalloc in the arm manylinux builds
Krisztian Szucs created ARROW-13635: --- Summary: [Packaging][Python] Define --with-lg-page for jemalloc in the arm manylinux builds Key: ARROW-13635 URL: https://issues.apache.org/jira/browse/ARROW-13635 Project: Apache Arrow Issue Type: Task Components: Packaging, Python Reporter: Krisztian Szucs Fix For: 6.0.0 Follow-up ticket for https://github.com/apache/arrow/issues/10929 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13557) [Packaging][Python] Skip test_cancellation test case on M1
Krisztian Szucs created ARROW-13557: --- Summary: [Packaging][Python] Skip test_cancellation test case on M1 Key: ARROW-13557 URL: https://issues.apache.org/jira/browse/ARROW-13557 Project: Apache Arrow Issue Type: Task Components: Packaging, Python Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 The nightly wheel packaging builds have started to fail: https://github.com/ursacomputing/crossbow/runs/3238359543 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13483) [Release][Dev] Port the release note generation script to archery
Krisztian Szucs created ARROW-13483: --- Summary: [Release][Dev] Port the release note generation script to archery Key: ARROW-13483 URL: https://issues.apache.org/jira/browse/ARROW-13483 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Krisztian Szucs Fix For: 6.0.0 Archery already have a couple of utilities to parse commits between revisions and access various metadata from git. Implementing it python would make it more portable (e.g. {{date}} function is different from {{GNU date}} on macOS). -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13478) [Release] Unnecessary rc-number argument for the version bumping post-release script
Krisztian Szucs created ARROW-13478: --- Summary: [Release] Unnecessary rc-number argument for the version bumping post-release script Key: ARROW-13478 URL: https://issues.apache.org/jira/browse/ARROW-13478 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-13477) [Release] Pass ARTIFACTORY_API_KEY to the upload script
Krisztian Szucs created ARROW-13477: --- Summary: [Release] Pass ARTIFACTORY_API_KEY to the upload script Key: ARROW-13477 URL: https://issues.apache.org/jira/browse/ARROW-13477 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Krisztian Szucs Assignee: Krisztian Szucs Fix For: 6.0.0 -- This message was sent by Atlassian Jira (v8.3.4#803005)