Script 'mail_helper' called by obssrc Hello community, here is the log from the commit of package apache-arrow for openSUSE:Factory checked in at 2025-09-29 16:32:07 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Comparing /work/SRC/openSUSE:Factory/apache-arrow (Old) and /work/SRC/openSUSE:Factory/.apache-arrow.new.11973 (New) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Package is "apache-arrow" Mon Sep 29 16:32:07 2025 rev:21 rq:1307560 version:21.0.0 Changes: -------- --- /work/SRC/openSUSE:Factory/apache-arrow/apache-arrow.changes 2025-06-14 16:18:41.032629268 +0200 +++ /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes 2025-09-29 16:34:20.959757155 +0200 @@ -1,0 +2,325 @@ +Fri Sep 26 16:52:42 UTC 2025 - Ben Greiner <[email protected]> + +- Remove boost::system dependency for Tumbleweed + * Add arrow-boost-system-1.89-boo1249599.patch + * gh#boostorg/system#132 + * boo#1249599 + +------------------------------------------------------------------- +Thu Sep 25 10:24:04 UTC 2025 - Ben Greiner <[email protected]> + +- Update to 21.0.0 + ## Bug Fixes + * GH-32276 - [C++][FlightRPC] Add option to align RecordBatch + buffers given to IPC reader (#44279) + * GH-35166 - [C++][Compute] Increase precision of decimals in sum + aggregates (#44184) + * GH-40756 - [C++] Remove dead Boost urls (#46452) + * GH-45532 - [C++] RunEndEncodedBuilder should clear dimensions + after a Finish() call (#45533) + * GH-45534 - [C++] Test: RunEndEncodeTableColumns should update + REE columns' schema types (#45535) + * GH-45608 - [C++][Flight] Fix compilation for clang (#46264) + * GH-45735 - [C++] Broken tests for extract_regex compute funcion + (#45900) + * GH-45853 - [C++][Dev] Fix Meson compilation issues in Docker + builds (#45858) + * GH-46011 - [C++] Hide DCHECK family from public headers + (#46015) + * GH-46025 - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT + for libarrow_cuda (#46030) + * GH-46052 - [C++][Benchmarking] Don't build grouper benchmark + without ARROW_COMPUTE=ON (#46053) + * GH-46070 - [C++] Remove duplicate storage_type in JsonExtension + (#46071) + * GH-46084 - [C++] Always use ARROW_VCPKG to detect vcpkg mode + (#46467) + * GH-46090 - [C++] Set default IPC option to enabled in Meson + (#46114) + * GH-46094 - [C++][Docs] Add note to RleDecoder::Get's doc + comment (#46874) + * GH-46146 - [C++] Merge metadata in SchemaBuidler::AddMetadata + (#46654) + * GH-46149 - [C++] Opening dataset fails with sshfs-3.7.3 due to + F_RDADVISE error (#46346) + * GH-46157 - [C++] Move test utility RunEndEncodeTableColumns + that uses REE to test_util_internal on acero instead of common + gtest_util (#46161) + * GH-46192 - [C++] Add substrait dep to third party download + script (#46191) + * GH-46197 - [C++] Tests use legacy timezones (#46201) + * GH-46214 - [C++] Improve S3 client initialization (#46723) + * GH-46224 - [C++][Acero] Fix the hang in asof join (#46300) + * GH-46231 - [C++][CMake] Fix arrow_bundled_dependencies to be + externally accessible by FetchContent (#46232) + * GH-46233 - [C++] Fix missing nested braces in QueuedTask + initialization (#46234) + * GH-46268 - [C++] Improve ArrayData docstrings (#46271) + * GH-46270 - [C++][Parquet] Clarify GeoStatistics docstring + (#46649) + * GH-46299 - [C++][Compute] Don't use static inline const for + default options (#46303) + * GH-46306 - [C++][Parquet] Should use LoadEnumSafe for geo enum + (#46307) + * GH-46314 - [C++][Parquet] Fix valgrind error when collecting + parameterized tests for MakeWKBPoint (#46320) + * GH-46326 - [C++][Parquet] Fix stack overflow in rapidjson value + comparison to integer (#46327) + * GH-46359 - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360) + * GH-46394 - [C++][R] gcc-UBSAN errors on CRAN (#46397) + * GH-46395 - [C++][Statistics] Use EqualOptions for min and max + in arrow::ArrayStatistics::Equals() (#46422) + * GH-46407 - [C++] Fix IPC serialization of sliced list arrays + (#46408) + * GH-46414 - [C++] Fix GCS filesystem getFileInfo method (#46416) + * GH-46417 - [C++][Parquet] Fix UB in LoadEnumSafe for + EdgeInterpolationAlgorithm (#46418) + * GH-46419 - [C++] Remove duplicate declaration and sync arg + names on acero test_util_internal functions (#45400) + * GH-46420 - [C++][Dataset] Fix DatasetWriter deadlock on + writting batch greater than max_rows_queued (#46139) + * GH-46424 - [C++][Parquet] Fix erroneous unit test skip (#46425) + * GH-46435 - [Parquet][C++] Fix uninitialized value in writer + test (#46533) + * GH-46478 - [C++] Implement recent JSON changes into Meson + configuration (#46479) + * GH-46481 - [C++][Python] Allow nullable schema in FlightInfo + (#46489) + * GH-46512 - [CI][C++] Install the llvm package explicitly on + MSYS2 (#46525) + * GH-46564 - [C++] Export ARROW_VCPKG in ArrowConfig.cmake + (#46565) + * GH-46576 - [C++] Suppress codecvt_utf8 deprecation warning + (#46622) + * GH-46589 - [C++] Fix utf8_is_digit to support full Unicode + digit range (#46590) + * GH-46599 - [C++][Doc][Parquet] Update supported types + documentation (#46620) + * GH-46611 - [Python][C++] Allow building float16 arrays without + numpy (#46618) + * GH-46623 - [C++][Compute] Fix the failure of large memory test + in arrow-compute-row-test (#46635) + * GH-46659 - [C++] Fix export of extension arrays with binary + view/string view storage (#46660) + * GH-46674 - [C++] Construct Array from ExtensionType Scalar + (#46675) + * GH-46684 - [C++] Fix Meson configuration issue on Windows + (#46685) + * GH-46704 - [C++] Fix OSS-Fuzz build failure (#46706) + * GH-46708 - [C++][Gandiva] Added zero return values for + castDECIMAL_utf8 (#46709) + * GH-46710 - [C++] Fix ownership and lifetime issues in Dataset + Writer (#46711) + * GH-46724 - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in + handling statistics (#46725) + * GH-46761 - [C++] Add executable detection on FreeBSD (#46759) + * GH-46764 - [C++][Gandiva] Fix wrong .bc depends (#46765) + * GH-46777 - [C++] Use SimplifyIsIn only when the value_set of + the expression is lower than a threshold (#46859) + * GH-46811 - [C++][Python] Fix crash on + FileReaderImpl::GetRecordBatchReader (#46931) + * GH-46827 - [C++] Update Meson Configuration for compute shared + lib (#46839) + * GH-46831 - [C++][R] Remove some pending references to CMake < + 3.25 (docs + minor CMake references) (#46834) + * GH-46841 - [C++][Gandiva] Fix date trunc edge case (#46842) + * GH-46863 - [CI][C++] Suppress a false positive UBSAN error in + AWS SDK for C++ (#46870) + * GH-46871 - [C++][Parquet] Restore implementation of 3 + arrow::FileReader::GetRecordBatchReader() functions (#46868) + * GH-46888 - [C++] Remove override of default buildtype in Meson + config (#46919) + * GH-46915 - [C++][Compute] Initialize Compute kernels on + benchmarks that require extra kernels (#46922) + * GH-46934 - [C++][Parquet] Trying to fix ub in AttachStatistics + (#46940) + * GH-46986 - [CI][C++] Fix a build error with C++20 (#46987) + * GH-46988 - [C++][Parquet] Fix FLBA DecodeArrow multiply + overflow (#46991) + * GH-46995 - [CI][R][C++] Use system memory allocator in + sanitizer jobs (#47007) + * GH-46998 - [C++] Fix mockfs.cc compiling error with C++23 + (#46999) + * GH-47015 - [CI][C++] Use mold on conda-cpp to work around + issues with GNU ld (#47028) + * GH-47033 - [C++][Compute] Never use custom gtest main with MSVC + (#47049) + * GH-47037 - [CI][C++] Fix Fedora 39 CI jobs (#47038) + ## New Features and Improvements + * GH-25025 - [C++] Move non core compute kernels into separate + shared library (#46261) + * GH-26818 - [C++][Python] Preserve order when writing dataset + multi-threaded (#44470) + * GH-36753 - [C++] Properly pretty-print and diff HalfFloatArrays + (#46857) + * GH-37027 - [C++] Add float16 kernels to if-else and + vector-replace functions (#46446) + * GH-37677 - [C++][FlightRPC] Allow FlightInfo.schema to be + nullable + * GH-37891 - [C++][Parquet] Refine several classes in Parquet + encryption (#46202) + * GH-37891 - [C++] Followup Buffer change to use sptr move + (#46027) + * GH-39294 - [C++][Python] DLPack on Tensor class (#42118) + * GH-40278 - [C++] Support casting string to duration in CSV + converter (#46035) + * GH-40343 - [C++] Move S3FileSystem to the registry (#41559) + * GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as + Large/View types directly (#46532) + * GH-43807 - [C++][Python] Add UUID extension type conversion + support to/from Parquet (#45866) + * GH-43891 - [C++][Parquet] Faster reading of + FIXED_LEN_BYTE_ARRAY data (#46886) + * GH-45028 - [C++][Compute] Allow cast to reorder struct fields + (#45246) + * GH-45083 - [C++] Add HalfFloat kernels for is_nan, is_inf, + is_finite, negate, negate_checked, sign (#46866) + * GH-45195 - [C++] Update bundled AWS SDK for C++ to 1.11.587 + (#45306) + * GH-45522 - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY + logical type implementations (#45459) + * GH-45664 - [C++] Allow + LargeString,LargeBinary,FixedSizeBinary,StringView and + BinaryView for RecordBatch::MakeStatisticsArray() (#46031) + * GH-45750 - [C++][Python][Parquet] Implement Content-Defined + Chunking for the Parquet writer (#45360) + * GH-45794 - [C++] Add array directory to Meson configuration + (#45795) + * GH-45796 - [C++] Add integration directory to Meson + configuration (#45797) + * GH-45798 - [C++] Add extension directory to Meson (#45799) + * GH-45800 - [C++] Implement util configuration in Meson (#45824) + * GH-45829 - [C++] Add compute directory to Meson configuration + (#45830) + * GH-45833 - [C++] Add JSON directory to Meson configuration + (#45834) + * GH-45865 - [C++] Create dedicated benchmark dependency in Meson + (#45909) + * GH-45908 - [C++][Docs] Rename and expose basic + {Array,...}FromJSON helpers as public APIs (#46180) + * GH-45957 - [C++][Python] Expose allow_delayed_open on + S3FileSystem (#46078) + * GH-45978 - [C++] Bump bundled mimalloc version (#45979) + * GH-45991 - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112) + * GH-45992 - [C++] Bump bundled utf8proc version to 2.10.0 + (#46032) + * GH-46091 - [C++] Use feature options in Meson configuration + (#46204) + * GH-46092 - [C++] Add filesystem related options to Meson + (#46101) + * GH-46104 - GH-45937: [C++][Parquet] Logical type definition for + variant + * GH-46115 - [C++] Implement compression libraries in Meson + (#46358) + * GH-46116 - [C++] Implement IPC directory in Meson (#46117) + * GH-46118 - [C++] Add tensor directory to Meson (#46119) + * GH-46132 - [C++][Parquet] Remove deprecated parquet APIs from + 19.0.0 (#46133) + * GH-46141 - [C++] Add flight directory to Meson configuration + (#46142) + * GH-46153 - [C++] Implement acero directory in Meson (#46154) + * GH-46155 - [C++] Implement Tensorflow directory in Meson + (#46156) + * GH-46163 - [C++] Add vendored directory to Meson (#46164) + * GH-46196 - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and + related logic (#46200) + * GH-46207 - [C++] Rename arrow::util::StringBuilder and move to + internal namespace (#46813) + * GH-46209 - [Documentation][C++][Compute] Add cpp developer + documentation for row table (#46210) + * GH-46215 - [C++][Docs] Add README for Meson subprojects + directory (#46216) + * GH-46217 - [C++][Parquet] Update the timestamp of + parquet::encryption::TwoLevelCacheWithExpiration correctly + (#46283) + * GH-46219 - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY + option (#46274) + * GH-46285 - [C++] Add support for Decimal32/64 and HalfFloat to + run_end_encode/run_end_decode (#46286) + * GH-46318 - [Docs][C++] Add Extension Array/Type documents + (#46319) + * GH-46321 - [C++][Doc] Better explain ArrayData IsValid and + GetNullCount (#46332) + * GH-46338 - [C++] Add compile step for Meson in cpp_build.sh + (#46339) + * GH-46367 - [C++] Prevent Meson from using git info if built as + subproject (#46368) + * GH-46386 - [C++] Ensure using our CMake packages not + Find*.cmake (#46387) + * GH-46388 - [C++] Check Snappy::snappy{,-static} in + FindSnappyAlt.cmake (#46389) + * GH-46396 - [C++][Documentation][Statistics] Revise the + documentation to clarify that arrow::ArrayStatistics is ignored + during arrow::Array comparisons (#46470) + * GH-46403 - [C++] Add support for limiting element size when + printing data (#46536) + * GH-46439 - [C++] Use result pattern for all FromJSONString + Helpers (#46696) + * GH-46439 - [C++] Rename internal Converter class in + from_string.cc (#46697) + * GH-46439 - [C++] Remove unneeded namespace prefix in + test_util_internal.h (#46695) + * GH-46444 - [Documentation][C++][Acero] Move internal Swiss + table doc into public C++ developer doc (#46445) + * GH-46459 - [C++] Make some arrow/util headers internal (#46721) + * GH-46462 - [C++][Parquet] Expose currently thrown + EncodedStatistics when checking is_stats_set (#46463) + * GH-46473 - [C++][Docs] Fix typos in decimal comments (#46474) + * GH-46475 - [Documentation][C++][Compute] Consolidate Acero + developer docs (#46476) + * GH-46477 - [C++] Use vendored flatbuffers in Meson + configuration (#46484) + * GH-46487 - [C++] Refactor lz4 from ExternalProject to + FetchContent (#46390) + * GH-46499 - [CI][Crossbow][C++] Use apache/arrow for Meson + (#46501) + * GH-46508 - [C++] Upgrade OpenTelemetry cpp to avoid build error + on recent Clang (#46509) + * GH-46522 - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver + (#40939) + * GH-46529 - [C++] Convert static inline type trait functions to + constexpr (#46559) + * GH-46537 - [Docs][C++] Add RunEndEncodedArray, FlatArray, and + PrimitiveArray API Docs (#46540) + * GH-46551 - [C++] Use std::string_view for type schema API + (#46553) + * GH-46633 - [Docs][C++][Python] Update CombineChunks + documentation to specify that binary columns can be combined + into multiple chunks (#46638) + * GH-46665 - [CI][Crossbow][C++] Use apache/arrow for Alpine + Linux (#46666) + * GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST + data as LargeList directly (#46678) + * GH-46679 - [C++][Meson] Use WrapDB entry for gflags instead of + CMake wrapper (#46680) + * GH-46683 - [C++][Python] Add utf8_zero_fill compute function + for sign-aware zero padding (#46815) + * GH-46714 - [C++] Use hidden symbol visibility in Meson ++++ 28 more lines (skipped) ++++ between /work/SRC/openSUSE:Factory/apache-arrow/apache-arrow.changes ++++ and /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes --- /work/SRC/openSUSE:Factory/apache-arrow/python-pyarrow.changes 2025-06-14 16:18:41.644654502 +0200 +++ /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/python-pyarrow.changes 2025-09-29 16:34:21.127764228 +0200 @@ -1,0 +2,112 @@ +Thu Sep 25 10:25:07 UTC 2025 - Ben Greiner <[email protected]> + +- Update to 21.0.0 + ## Bug Fixes + * GH-44366 - [Python][Acero] RecordBatch.filter on expression + raises error if result set is empty (#46057) + * GH-45292 - [Python] test_dtypes hypotesis test fails + sporadically (#46029) + * GH-46080 - [Python][Docs] Provide guidance for tzdata related + issues if installing with pip (#46591) + * GH-46121 - [Python] Add missing column_index argument to + ArrowReaderProperties::read_dictionary's Cython binding + (#46122) + * GH-46174 - [Python] Failing tests in python minimal builds + (#46175) + * GH-46238 - [Release][Python] Use array to avoid empty argument + in dev/release/post-11-python.sh (#46239) + * GH-46343 - [CI][Python] Remove workaround for gdb packaging + issue (#46848) + * GH-46344 - [CI][Python] Skip doctest for s3.get_file_info to + avoid bucket restrictions (#46345) + * GH-46355 - [Python] Fix table.to_struct_array with an empty + table (#46357) + * GH-46481 - [C++][Python] Allow nullable schema in FlightInfo + (#46489) + * GH-46516 - [CI][Python] Force Cython>3.1.1 for docs builds + (#46770) + * GH-46606 - [Python] Do not require numpy when normalizing slice + (#46732) + * GH-46611 - [Python][C++] Allow building float16 arrays without + numpy (#46618) + * GH-46729 - [Python] Allow constructing InMemoryDataset from + RecordBatchReader (#46731) + * GH-46811 - [C++][Python] Fix crash on + FileReaderImpl::GetRecordBatchReader (#46931) + ## New Features and Improvements + * GH-26818 - [C++][Python] Preserve order when writing dataset + multi-threaded (#44470) + * GH-38914 - [Python] Add + EncryptionConfiguration.uniform_encryption (#46347) + * GH-39294 - [C++][Python] DLPack on Tensor class (#42118) + * GH-40754 - [Python] Expose tls_ca_file_path to S3FileSystem + (#45881) + * GH-41496 - [Python][Azure][Docs] Turn on azure on debian-docs + (#46892) + * GH-41672 - [Python][Doc] Clarify docstring of + FixedSizeListArray.values that it ignores the offset (#46144) + * GH-42012 - [Python] Add Schema with_field or set_field method + (#46348) + * GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as + Large/View types directly (#46532) + * GH-43807 - [C++][Python] Add UUID extension type conversion + support to/from Parquet (#45866) + * GH-44500 - [Python][Parquet] Map Parquet logical types to Arrow + extension types by default (#46772) + * GH-44900 - [Python] Support explicit fsspec+{protocol} and + hf:// filesystem URIs (#45089) + * GH-45229 - [Python] Migrate from scipy.spmatrix to + scipy.sparray (#46423) + * GH-45229 - [Python] skip scipy.sparse roundtrip tests for + float16 (#46413) + * GH-45531 - [Python] Add the dim_names argument to + from_numpy_ndarray (#46170) + * GH-45619 - [Python] Use f-string instead of string.format + (#45629) + * GH-45653 - [Python] Scalar subclasses should implement Python + protocols (#45818) + * GH-45750 - [C++][Python][Parquet] Implement Content-Defined + Chunking for the Parquet writer (#45360) + * GH-45957 - [C++][Python] Expose allow_delayed_open on + S3FileSystem (#46078) + * GH-46019 - [Python] Raise TypeError on feather read_table if + columns is not a Sequence (#46038) + * GH-46054 - [Python][Packaging] Re-enable pandas on Windows + free-threaded wheel (#46109) + * GH-46058 - [Python] Run Python in AppVeyor outside of source + directory (#46059) + * GH-46130 - [Python] Remove use_legacy_format in favour of + setting IpcWriteOptions (#46131) + * GH-46198 - [Python] Remove deprecated PyExtensionType (#46199) + * GH-46222 - [Python] Allow to specify footer metadata when + opening IPC file for writing (#46354) + * GH-46349 - [Python] Move parquet definitions to + pyarrow/includes/libparquet.pxd (#46437) + * GH-46373 - [Python] Exercise fallback case on tests for + parquet.read_table in case dataset is not available (#46550) + * GH-46544 - [CI][Dev][Python] Use pre-commit for autopep8 + (#46552) + * GH-46545 - [CI][Dev][Python] Update pre-commit for cython-lint + (#46580) + * GH-46546 - [CI][Dev][Python] Use pre-commit for numpydoc + (#46595) + * GH-46572 - [Python] expose filter option to python for join + (#46566) + * GH-46633 - [Docs][C++][Python] Update CombineChunks + documentation to specify that binary columns can be combined + into multiple chunks (#46638) + * GH-46652 - [Python][Docs] Update language for row_group_size + parameter (#46653) + * GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST + data as LargeList directly (#46678) + * GH-46683 - [C++][Python] Add utf8_zero_fill compute function + for sign-aware zero padding (#46815) + * GH-46771 - [Python][C++] Implement pa.arange function to + generate array sequences (#46778) + * GH-46833 - [Python] Expose ConfigureManagedIdentityCredential + and ConfigureClientSecretCredential to AzureFileSystem on + PyArrow (#46837) + * GH-46959 - [Python][Packaging] Drop support for manylinux2014 + (#46965) + +------------------------------------------------------------------- Old: ---- apache-arrow-19.0.1-mimalloc-version.patch apache-arrow-20.0.0.tar.gz arrow-testing-20.0.0.tar.gz mimalloc-v2.0.9.tar.gz parquet-testing-20.0.0.tar.gz New: ---- apache-arrow-21.0.0.tar.gz arrow-boost-system-1.89-boo1249599.patch arrow-testing-21.0.0.tar.gz mimalloc-v2.2.4.tar.gz parquet-testing-21.0.0.tar.gz ----------(Old B)---------- Old:/work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes- 7.1 to 9 (#47026) /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes:- Drop apache-arrow-19.0.1-mimalloc-version.patch /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes- ----------(Old E)---------- ----------(New B)---------- New:/work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes-- Remove boost::system dependency for Tumbleweed /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes: * Add arrow-boost-system-1.89-boo1249599.patch /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow.changes- * gh#boostorg/system#132 ----------(New E)---------- ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Other differences: ------------------ ++++++ apache-arrow.spec ++++++ --- /var/tmp/diff_new_pack.naOGcG/_old 2025-09-29 16:34:21.823793531 +0200 +++ /var/tmp/diff_new_pack.naOGcG/_new 2025-09-29 16:34:21.827793699 +0200 @@ -1,7 +1,7 @@ # # spec file for package apache-arrow # -# Copyright (c) 2025 SUSE LLC +# Copyright (c) 2025 SUSE LLC and contributors # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -29,16 +29,15 @@ %define gccver 13 %endif -%define sonum 2000 +%define sonum 2100 # See git submodule /testing pointing to the correct revision -%define arrow_testing_commit d2a13712303498963395318a4eb42872e66aead7 +%define arrow_testing_commit fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06 # See git submodule /cpp/submodules/parquet-testing pointing to the correct revision %define parquet_testing_commit 18d17540097fca7c40be3d42c167e6bfad90763c # See cpp/thirdparty/versions.txt, replace by BuildRequires: pkgconfig(mimalloc) as soon as gh#apache/arrow#42211 is resolved -# mimalloc version bumped, see Patch100 -%define arrow_mimalloc_build_version v2.0.9 +%define arrow_mimalloc_build_version v2.2.4 Name: apache-arrow -Version: 20.0.0 +Version: 21.0.0 Release: 0 Summary: A development platform for in-memory data License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT @@ -49,7 +48,8 @@ Source1: https://github.com/apache/arrow-testing/archive/%{arrow_testing_commit}.tar.gz#/arrow-testing-%{version}.tar.gz Source2: https://github.com/apache/parquet-testing/archive/%{parquet_testing_commit}.tar.gz#/parquet-testing-%{version}.tar.gz Source3: https://github.com/microsoft/mimalloc/archive/%{arrow_mimalloc_build_version}.tar.gz#/mimalloc-%{arrow_mimalloc_build_version}.tar.gz -Patch100: apache-arrow-19.0.1-mimalloc-version.patch +# PATCH-FIX-OPENSUSE arrow-boost-system-1.89-boo1249599.patch gh#boostorg/system#132, boo#1249599 +Patch1: arrow-boost-system-1.89-boo1249599.patch BuildRequires: bison BuildRequires: cmake >= 3.25 BuildRequires: fdupes @@ -58,8 +58,11 @@ BuildRequires: libboost_context-devel BuildRequires: libboost_date_time-devel BuildRequires: libboost_filesystem-devel +BuildRequires: libboost_headers-devel BuildRequires: libboost_process-devel -BuildRequires: libboost_system-devel >= 1.64.0 +%if 0%{?suse_version} < 1699 +BuildRequires: libboost_system-devel +%endif %if %{with static} BuildRequires: libzstd-devel-static %endif @@ -67,27 +70,27 @@ BuildRequires: python-rpm-macros BuildRequires: python3-base BuildRequires: (cmake(lz4) >= 1.10 or (pkgconfig(liblz4) >= 1.8.3 with pkgconfig(liblz4) < 1.10)) -BuildRequires: cmake(Snappy) >= 1.1.7 +BuildRequires: cmake(Snappy) >= 1.2.2 BuildRequires: cmake(absl) BuildRequires: cmake(double-conversion) >= 3.1.5 BuildRequires: cmake(re2) BuildRequires: pkgconfig(RapidJSON) BuildRequires: pkgconfig(bzip2) >= 1.0.8 -BuildRequires: pkgconfig(gflags) >= 2.2.0 -BuildRequires: pkgconfig(grpc++) >= 1.20.0 -BuildRequires: pkgconfig(libbrotlicommon) >= 1.0.7 -BuildRequires: pkgconfig(libbrotlidec) >= 1.0.7 -BuildRequires: pkgconfig(libbrotlienc) >= 1.0.7 -BuildRequires: pkgconfig(libcares) >= 1.15.0 -BuildRequires: pkgconfig(libglog) >= 0.3.5 +BuildRequires: pkgconfig(gflags) >= 2.2.2 +BuildRequires: pkgconfig(grpc++) >= 1.46.3 +BuildRequires: pkgconfig(libbrotlicommon) >= 1.0.9 +BuildRequires: pkgconfig(libbrotlidec) >= 1.0.9 +BuildRequires: pkgconfig(libbrotlienc) >= 1.0.9 +BuildRequires: pkgconfig(libcares) >= 1.17.2 +BuildRequires: pkgconfig(libglog) >= 0.5.0 BuildRequires: pkgconfig(libopenssl) BuildRequires: pkgconfig(liburiparser) >= 0.9.3 -BuildRequires: pkgconfig(libutf8proc) -BuildRequires: pkgconfig(libzstd) >= 1.4.3 -BuildRequires: pkgconfig(protobuf) >= 3.7.1 -BuildRequires: pkgconfig(sqlite3) >= 3.45.2 -BuildRequires: pkgconfig(thrift) >= 0.11.0 -BuildRequires: pkgconfig(zlib) >= 1.2.11 +BuildRequires: pkgconfig(libutf8proc) >= 2.10.0 +BuildRequires: pkgconfig(libzstd) >= 1.5.7 +BuildRequires: pkgconfig(protobuf) >= 21.3 +BuildRequires: pkgconfig(sqlite3) +BuildRequires: pkgconfig(thrift) >= 0.22.0 +BuildRequires: pkgconfig(zlib) >= 1.3.1 %if %{with tests} BuildRequires: timezone BuildRequires: pkgconfig(gmock) >= 1.10 @@ -130,6 +133,20 @@ This package provides the shared library for the Acero streaming execution engine +%package -n libarrow_compute%{sonum} +Summary: Development platform for in-memory data - shared library +Group: System/Libraries + +%description -n libarrow_compute%{sonum} +Apache Arrow is a cross-language development platform for in-memory +data. It specifies a standardized language-independent columnar memory +format for flat and hierarchical data, organized for efficient +analytic operations on modern hardware. It also provides computational +libraries and zero-copy streaming messaging and interprocess +communication. + +This package provides the shared library for the C++ Compute module + %package -n libarrow_flight%{sonum} Summary: Development platform for in-memory data - shared library Group: System/Libraries @@ -191,6 +208,7 @@ Group: Development/Libraries/C and C++ Requires: libarrow%{sonum} = %{version} Requires: libarrow_acero%{sonum} = %{version} +Requires: libarrow_compute%{sonum} = %{version} Requires: libarrow_dataset%{sonum} = %{version} %if %{with flight} Requires: libarrow_flight%{sonum} = %{version} @@ -199,6 +217,7 @@ %if %{with static} Suggests: %{name}-devel-static = %{version} Suggests: %{name}-acero-devel-static = %{version} +Suggests: %{name}-compute-devel-static = %{version} Suggests: %{name}-dataset-devel-static = %{version} %if %{with flight} Suggests: %{name}-flight-devel-static = %{version} @@ -248,6 +267,21 @@ This package provides the static library for the Acero streaming execution engine +%package compute-devel-static +Summary: Development platform for in-memory data - development files +Group: Development/Libraries/C and C++ +Requires: %{name}-devel = %{version} + +%description compute-devel-static +Apache Arrow is a cross-language development platform for in-memory +data. It specifies a standardized language-independent columnar memory +format for flat and hierarchical data, organized for efficient +analytic operations on modern hardware. It also provides computational +libraries and zero-copy streaming messaging and interprocess +communication. + +This package provides the static library for the C++ Compute module + %package flight-devel-static Summary: Development platform for in-memory data - development files Group: Development/Libraries/C and C++ @@ -343,7 +377,9 @@ %prep %setup -q -n arrow-apache-arrow-%{version} -a1 -a2 -%autopatch -p1 +%if 0%{?suse_version} >= 1699 +%patch -P1 -p1 +%endif # https://github.com/protocolbuffers/protobuf/issues/12292 sed -i 's/find_package(Protobuf/find_package(Protobuf CONFIG/' cpp/cmake_modules/FindProtobufAlt.cmake @@ -458,6 +494,8 @@ %postun -n libarrow%{sonum} -p /sbin/ldconfig %post -n libarrow_acero%{sonum} -p /sbin/ldconfig %postun -n libarrow_acero%{sonum} -p /sbin/ldconfig +%post -n libarrow_compute%{sonum} -p /sbin/ldconfig +%postun -n libarrow_compute%{sonum} -p /sbin/ldconfig %if %{with flight} %post -n libarrow_flight%{sonum} -p /sbin/ldconfig %postun -n libarrow_flight%{sonum} -p /sbin/ldconfig @@ -482,6 +520,10 @@ %license LICENSE.txt NOTICE.txt %{_libdir}/libarrow_acero.so.* +%files -n libarrow_compute%{sonum} +%license LICENSE.txt NOTICE.txt +%{_libdir}/libarrow_compute.so.* + %if %{with flight} %files -n libarrow_flight%{sonum} %license LICENSE.txt NOTICE.txt @@ -507,6 +549,7 @@ %{_libdir}/cmake/Arrow* %{_libdir}/libarrow.so %{_libdir}/libarrow_acero.so +%{_libdir}/libarrow_compute.so %{_libdir}/libarrow_dataset.so %if %{with flight} %{_libdir}/libarrow_flight.so @@ -530,6 +573,10 @@ %license LICENSE.txt NOTICE.txt %{_libdir}/libarrow_acero.a +%files compute-devel-static +%license LICENSE.txt NOTICE.txt +%{_libdir}/libarrow_compute.a + %files dataset-devel-static %license LICENSE.txt NOTICE.txt %{_libdir}/libarrow_dataset.a ++++++ python-pyarrow.spec ++++++ --- /var/tmp/diff_new_pack.naOGcG/_old 2025-09-29 16:34:21.883796057 +0200 +++ /var/tmp/diff_new_pack.naOGcG/_new 2025-09-29 16:34:21.887796225 +0200 @@ -1,7 +1,7 @@ # # spec file for package python-pyarrow # -# Copyright (c) 2025 SUSE LLC +# Copyright (c) 2025 SUSE LLC and contributors # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -20,7 +20,7 @@ %bcond_with xsimd %define plainpython python # See git submodule /testing pointing to the correct revision -%define arrow_testing_commit d2a13712303498963395318a4eb42872e66aead7 +%define arrow_testing_commit fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06 # See git submodule /cpp/submodules/parquet-testing pointing to the correct revision %define parquet_testing_commit 18d17540097fca7c40be3d42c167e6bfad90763c %if %{suse_version} <= 1500 @@ -28,7 +28,7 @@ %define gccver 13 %endif Name: python-pyarrow -Version: 20.0.0 +Version: 21.0.0 Release: 0 Summary: Python library for Apache Arrow License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT @@ -39,7 +39,7 @@ Source2: parquet-testing-%{version}.tar.gz Source99: python-pyarrow.rpmlintrc BuildRequires: %{python_module Cython >= 3} -BuildRequires: %{python_module devel >= 3.8} +BuildRequires: %{python_module devel >= 3.9} BuildRequires: %{python_module numpy-devel >= 1.25} BuildRequires: %{python_module pip} BuildRequires: %{python_module setuptools_scm} ++++++ apache-arrow-20.0.0.tar.gz -> apache-arrow-21.0.0.tar.gz ++++++ /work/SRC/openSUSE:Factory/apache-arrow/apache-arrow-20.0.0.tar.gz /work/SRC/openSUSE:Factory/.apache-arrow.new.11973/apache-arrow-21.0.0.tar.gz differ: char 12, line 1 ++++++ arrow-boost-system-1.89-boo1249599.patch ++++++ We have to tell cmake that the openSUSE packager removed the boost::system target. The headers subpackage provides the necessary Boost:system header file. diff -ur --no-dereference arrow-apache-arrow-21.0.0.orig/cpp/cmake_modules/ThirdpartyToolchain.cmake arrow-apache-arrow-21.0.0/cpp/cmake_modules/ThirdpartyToolchain.cmake --- arrow-apache-arrow-21.0.0.orig/cpp/cmake_modules/ThirdpartyToolchain.cmake 2025-07-11 09:44:45.000000000 +0200 +++ arrow-apache-arrow-21.0.0/cpp/cmake_modules/ThirdpartyToolchain.cmake 2025-09-26 20:53:58.409119646 +0200 @@ -1259,7 +1259,7 @@ set(Boost_USE_STATIC_LIBS ON) endif() if(ARROW_BOOST_REQUIRE_LIBRARY) - set(ARROW_BOOST_COMPONENTS filesystem system) + set(ARROW_BOOST_COMPONENTS filesystem) if(ARROW_FLIGHT_SQL_ODBC AND MSVC) list(APPEND ARROW_BOOST_COMPONENTS locale) endif() diff -ur --no-dereference arrow-apache-arrow-21.0.0.orig/cpp/src/arrow/io/CMakeLists.txt arrow-apache-arrow-21.0.0/cpp/src/arrow/io/CMakeLists.txt --- arrow-apache-arrow-21.0.0.orig/cpp/src/arrow/io/CMakeLists.txt 2025-07-11 09:44:45.000000000 +0200 +++ arrow-apache-arrow-21.0.0/cpp/src/arrow/io/CMakeLists.txt 2025-09-26 20:53:51.229519926 +0200 @@ -30,7 +30,7 @@ EXTRA_LINK_LIBS arrow::hadoop Boost::filesystem - Boost::system) + Boost::headers) endif() add_arrow_test(memory_test PREFIX "arrow-io") ++++++ arrow-testing-20.0.0.tar.gz -> arrow-testing-21.0.0.tar.gz ++++++ Binary files old/arrow-testing-d2a13712303498963395318a4eb42872e66aead7/data/parquet/fuzzing/clusterfuzz-testcase-minimized-parquet-arrow-fuzz-5638988460785664 and new/arrow-testing-fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06/data/parquet/fuzzing/clusterfuzz-testcase-minimized-parquet-arrow-fuzz-5638988460785664 differ Binary files old/arrow-testing-d2a13712303498963395318a4eb42872e66aead7/data/parquet/fuzzing/clusterfuzz-testcase-minimized-parquet-arrow-fuzz-5843248142942208 and new/arrow-testing-fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06/data/parquet/fuzzing/clusterfuzz-testcase-minimized-parquet-arrow-fuzz-5843248142942208 differ Binary files old/arrow-testing-d2a13712303498963395318a4eb42872e66aead7/data/parquet/fuzzing/clusterfuzz-testcase-parquet-arrow-fuzz-4550303804620800 and new/arrow-testing-fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06/data/parquet/fuzzing/clusterfuzz-testcase-parquet-arrow-fuzz-4550303804620800 differ ++++++ mimalloc-v2.0.9.tar.gz -> mimalloc-v2.2.4.tar.gz ++++++ ++++ 52667 lines of diff (skipped) ++++++ parquet-testing-20.0.0.tar.gz -> parquet-testing-21.0.0.tar.gz ++++++ ++++ no output (probably identical)
