Messages by Thread
-
[PR] feat(format): add `constraint_expression` to GetObjects [arrow-adbc]
via GitHub
-
[PR] Move `ListLikeArray` to arrow-array to be shared with json writer and… [arrow-rs]
via GitHub
-
Re: [I] Support `ListView` in arrow-json [arrow-rs]
via GitHub
-
Re: [I] [Ruby] Add support for auto dependency install for red-arrow on macOS [arrow]
via GitHub
-
Re: [I] [CI] C++ extra jobs are executed with the `CI: Extra: R` label [arrow]
via GitHub
-
[PR] GH-49335: [CI] Don't run C++ extra jobs with the `CI: Extra: R` label [arrow]
via GitHub
-
Re: [I] Support for GCS requester pays [arrow-rs-object-store]
via GitHub
-
[PR] docs: fix minor issues in profiles and manifests docs [arrow-adbc]
via GitHub
-
Re: [PR] GH-49153: [C++] Remove deprecated APIs from v13.0.0 and v18.0.0 [arrow]
via GitHub
-
Re: [I] [R] parquet does not retain haven::tagged_na() [arrow]
via GitHub
-
Re: [I] [R] "Invalid metadata$r" warning [arrow]
via GitHub
-
Re: [I] [CI][R] test-r-alpine-linux-cran fails with segmentation fault [arrow]
via GitHub
-
[PR] GH-48334: Support reading encrypted bloom filters [arrow]
via GitHub
-
[PR] GH-48145: [R] Update to testthat 3.3.0 and use its expect_r6_class() [arrow]
via GitHub
-
Re: [I] [R] Add tests for filter() and arrange() with aggregation expressions [arrow]
via GitHub
-
Re: [PR] GH-48586: [Python][CI] Upload artifact to python-sdist job [arrow]
via GitHub
-
Re: [I] [Python][C++] Add Profile support to S3FileSystem [arrow]
via GitHub
-
[PR] GH-39600: [R] Add trademark attribution to pkgdown site footer [arrow]
via GitHub
-
[PR] GH-49330: [R] Update docs to reflect removal of OpenSSL 1.0 and 1.1 support [arrow]
via GitHub
-
Re: [I] [Python] Extract partition list from pyarrow.dataset.ParquetFileFragment object [arrow]
via GitHub
-
Re: [I] pytest-cython 0.4.0 RC available [arrow]
via GitHub
-
Re: [I] Check that YMM register saving is enabled before using AVX at runtime [arrow]
via GitHub
-
Re: [I] [C++] Add a type_singleton utility function [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Add SkipValues() to decoder, Refactor TypedColumnReader::Skip to use it. [arrow]
via GitHub
-
Re: [I] Use correct attribution in the footer of documentation pages (monorepo) [arrow]
via GitHub
-
Re: [I] [C++][Python] Add option to include partitioning columns in basename_template's filename [arrow]
via GitHub
-
Re: [I] [C++] bit_util TrailingBits can be made much faster [arrow]
via GitHub
-
Re: [I] [Python] Add timezone information when printing TimestampArray [arrow]
via GitHub
-
Re: [I] [C++][Python] DLPack implementation for Arrow Arrays (consuming) [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Support read by row ranges [arrow]
via GitHub
-
Re: [I] [R] Update NEWS.md for 23.0.1 [arrow]
via GitHub
-
Re: [I] [C++][Compute] Add ExecNode(s) for vectorized take with computed indices [arrow]
via GitHub
-
Re: [I] [Python][C++] Hex decoding strings/allow casting strings to UUIDs and vice-versa [arrow]
via GitHub
-
Re: [I] [R] Add `schema` argument to `write_parquet()` [arrow]
via GitHub
-
Re: [I] [R] pillar::glimpse takes too long [arrow]
via GitHub
-
Re: [I] [R] Add `verify-rc-source-r` job [arrow]
via GitHub
-
Re: [I] [C++][Gandiva] Add regexp_like like Oracle [arrow]
via GitHub
-
Re: [I] Add trademark symbol to Apache Arrow logo [arrow]
via GitHub
-
Re: [I] Floordiv compute kernel [arrow]
via GitHub
-
Re: [I] adding iterrow [arrow]
via GitHub
-
Re: [I] [R] Without invalid_row_handler in CSV Parsing Options [arrow]
via GitHub
-
Re: [I] [Archery] Allow running external repetitions of C++ micro-benchmarks [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Decoding: allow Boolean RecordReader get raw LSB bitmap [arrow]
via GitHub
-
Re: [I] [C++][Python] DLPack implementation for Arrow Arrays with CudaBuffers [arrow]
via GitHub
-
Re: [I] [C++] Deprecate Scalar::CastTo [arrow]
via GitHub
-
Re: [I] Add DNF, SQL, Compute expression support to dataset filters [arrow]
via GitHub
-
Re: [I] [Python] Clean up ExtensionType.__reduce__ [arrow]
via GitHub
-
Re: [I] [R] `RecordBatchReader$batches()` is very slow [arrow]
via GitHub
-
Re: [I] [Python][Parquet] Faster parquet partitioning scheme [arrow]
via GitHub
-
Re: [I] Consider merging static libarrow builds into universal binaries [arrow]
via GitHub
-
Re: [I] [C++][Gandiva] Refactor built-in stub functions to use `FunctionRegistry` for registration [arrow]
via GitHub
-
Re: [I] [C++] Consider adding Memory Sanitizer build [arrow]
via GitHub
-
Re: [I] [Docs] Warn against execution of arbitrary code in use of extension types [arrow]
via GitHub
-
Re: [I] [Documentation][Parquet] Reading parquet and memory mapping [arrow]
via GitHub
-
[PR] Add `SortBuilder` for `SortOptions` [arrow-rs]
via GitHub
-
Re: [I] [C++][FS][Azure] Add allow_container_deletion option [arrow]
via GitHub
-
Re: [I] [Python][Parquet] Support wiriting decimals as integers [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Reuse memory of passed array in ConvertToDecimal [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Allow computing distinct_count accross multiple ColumnChunkMetadata [arrow]
via GitHub
-
Re: [I] [C++][Python] DLPack on FixedShapeTensorArray/FixedShapeTensorScalar [arrow]
via GitHub
-
Re: [I] [C++][Parquet] support passing a RowRange to RecordBatchReader [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Parquet: support exact in Page/Row-Group level Statistics [arrow]
via GitHub
-
Re: [I] [C++][Parquet]: Add support for list view and large list view [arrow]
via GitHub
-
Re: [I] Allow read_csv invalid_row_handler to allow it to fix the row [arrow]
via GitHub
-
Re: [I] [Dev] Prompt whether an issue should be labeled as Breaking Change when merging [arrow]
via GitHub
-
Re: [I] [Integration] Track Rust-allocated memory [arrow]
via GitHub
-
Re: [I] Parquet dictionary filter pushdown slow and large files are created [arrow]
via GitHub
-
Re: [I] [Python] Add function/method to deepcopy a `pa.Table` [arrow]
via GitHub
-
Re: [I] [C++] [Gandiva] Optimize Gandiva Cache [arrow]
via GitHub
-
Re: [I] [R] Write metadata to parquet file as argument to `write_parquet()` [arrow]
via GitHub
-
Re: [I] [R] Should we relax version checking for system libarrow? [arrow]
via GitHub
-
Re: [I] [R][Documentation] Document add_filename on open_dataset help page [arrow]
via GitHub
-
Re: [I] Enable a smaller build of just libparquet [arrow]
via GitHub
-
Re: [I] [MATLAB] Add utility for validating `Array` indexing expressions [arrow]
via GitHub
-
Re: [I] [CI] Bump timeout on Integration pipeline [arrow]
via GitHub
-
Re: [I] How to implement numa-aware memory management? [arrow]
via GitHub
-
Re: [I] [C++][Acero] HashJoinSchema::init may don't need construct FieldVector [arrow]
via GitHub
-
Re: [I] [C++][Pyarrow] Add the ability to wrap/unwrap acero/compute objects [arrow]
via GitHub
-
Re: [I] [Python] Add is_nan, is_null, is_valid as operators to DNF filters [arrow]
via GitHub
-
Re: [I] [C++] Unify the sets of random generators in testing/random.h [arrow]
via GitHub
-
Re: [I] [Integration][Documentation] Document JSON test data for BinaryView and Utf8View [arrow]
via GitHub
-
Re: [I] [Parquet] Encoding configuration should be easier and more automated [arrow]
via GitHub
-
Re: [I] [C++] Update t-digest implementation [arrow]
via GitHub
-
Re: [I] add support for Fabric OneLake, it is already supported by Delta_rs [arrow]
via GitHub
-
Re: [I] [R] preserve hive partitions when opening along a path / path vector [arrow]
via GitHub
-
Re: [I] [Docs] Describe Flight RPC/Flight SQL integration testing [arrow]
via GitHub
-
Re: [I] [Integration] Time the integration tests and report durations [arrow]
via GitHub
-
Re: [I] Allow projection of schemas/structs [arrow]
via GitHub
-
Re: [I] [C++][Gandiva] Enhance random data generation [arrow]
via GitHub
-
Re: [I] [C++] Add filesystem stats [arrow]
via GitHub
-
Re: [I] [Docs] Update status.rst with new Flight features [arrow]
via GitHub
-
Re: [I] [C++][Compute] Support ChunkedArray sorting for dictionary type [arrow]
via GitHub
-
Re: [I] [C++][Pyarrow] Make a generic acero Option interface to instantiate custom nodes from pyarrow/other languages [arrow]
via GitHub
-
Re: [I] [MATLAB] Make `arrow.array.Array` inherit from `matlab.mixin.indexing.RedefinesParen` to supporting indexing semantics [arrow]
via GitHub
-
Re: [I] [Release] Make download_rc_binaries.py less frustrating [arrow]
via GitHub
-
Re: [I] [C++][Compute] Support Recordbatch sorting for dictionary type [arrow]
via GitHub
-
Re: [I] [Discuss][C++] Replace MemoTable with a SwissTable implementation [arrow]
via GitHub
-
Re: [I] [Integration] Test non-zero offsets in C Data Interface [arrow]
via GitHub
-
Re: [I] Support `ndim` and `shape` attributes on both `Array` and `Table` [arrow]
via GitHub
-
Re: [I] [C++][Parquet] parquet::arrow FileReader and FileReaderBuilder might multiple different memory pool [arrow]
via GitHub
-
Re: [I] Broadcasting version of `pyarrow.compute.list_slice` that accepts arrays of `start/stop/step` [arrow]
via GitHub
-
Re: [I] [R][Docs] Add section on debugging S3 in the R developer docs [arrow]
via GitHub
-
Re: [I] [R] Add a test that confirms that `install-arrow.R` is self contained [arrow]
via GitHub
-
Re: [I] [C++] Implement BinaryOverDictionaryArray [arrow]
via GitHub
-
Re: [I] [C++] Optimize Compact() of DictionaryArray for the dictionary of which only a slice is used [arrow]
via GitHub
-
Re: [I] [R] Map `format = "text` to `format = "csv"` in `write_dataset()` [arrow]
via GitHub
-
Re: [I] [R][Release] Add update-checksums.R to r/Makefile [arrow]
via GitHub
-
[PR] fix(flight/flightsql/driver): fix `time.Time` params [arrow-go]
via GitHub
-
Re: [PR] GH-35437: [C++] Remove obsolete TODO about DictionaryArray const& return types [arrow]
via GitHub
-
Re: [PR] GH-33450: [C++] Remove GlobalForkSafeMutex [arrow]
via GitHub
-
Re: [I] [C++] Simplify type_traits.h [arrow]
via GitHub
-
Re: [I] [Python] Expose CanCast and Schema::AreCompatible in Python [arrow]
via GitHub
-
Re: [I] [MATLAB] Provide way to expose underlying data representation of `arrow.array.Array`s [arrow]
via GitHub
-
Re: [I] [Python] pyarrow.array should special-case array.array objects [arrow]
via GitHub
-
Re: [I] Support for GROUPING SETS/CUBE/ROLLUP [arrow]
via GitHub
-
Re: [I] [R] Allow overriding the libarrow static lib version [arrow]
via GitHub
-
Re: [I] [R] expose `decimal_point` argument in CSVConvertOptions [arrow]
via GitHub
-
Re: [I] [Python][Interchange protocol] Export boolean columns as bit-packed values [arrow]
via GitHub
-
Re: [I] Support async CustomOpen in FileSource [arrow]
via GitHub
-
Re: [I] [R][CI] Re-enable centos binary test job in r-binary-packages [arrow]
via GitHub
-
Re: [I] [C++] Allow non destructive finalize method on aggregation kernels [arrow]
via GitHub
-
Re: [I] [R] Support joining using `NA` as join key [arrow]
via GitHub
-
Re: [I] [C++][FlightRPC] Memory tracking for arrow flight over grpc [arrow]
via GitHub
-
Re: [I] [C++] Concatenating a single array is a compaction utility [arrow]
via GitHub
-
Re: [I] [MATLAB] Consider renaming `field()`, `column()` and `chunk()` methods to `getField()`, `getColumn()`, and `getChunk()` [arrow]
via GitHub
-
Re: [I] [C++] S3 stress tests can fail to delete a temporary directory [arrow]
via GitHub
-
Re: [I] [C++] Add sorting and hashing fast paths for string view [arrow]
via GitHub
-
Re: [I] [C++][Compute] Checked arithmetic functions are slow-ish [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Dataset: Parquet Writer Supports user memory-pool [arrow]
via GitHub
-
Re: [I] [C++] Add software implementation of PDEP and reenable BMI2 code paths on all AVX2 CPUs [arrow]
via GitHub
-
Re: [I] [Docs][FlightRPC] Update documentation to note that `FlightInfo::schema` may be null [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Using BMI to implement filter pushdown [arrow]
via GitHub
-
Re: [I] [R] Add Way to Track and Expose User Defined Functions [arrow]
via GitHub
-
Re: [I] [FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements [arrow]
via GitHub
-
Re: [I] [Docs][FlightSQL] Clarify some questions about PollFlightInfo [arrow]
via GitHub
-
Re: [I] [C++] Add support for system static RE2 [arrow]
via GitHub
-
Re: [I] [C++][Parquet] add statistics for better estimating unencoded/uncompressed sizes and finer grained filtering [arrow]
via GitHub
-
Re: [I] [Python] Add documentation for Dataset FileWriteOptions classes [arrow]
via GitHub
-
Re: [I] [C++] Reduce template parameters for aggregate functions [arrow]
via GitHub
-
Re: [I] [C++][Flight] Improve error message when port cannot be bound with gRPC [arrow]
via GitHub
-
Re: [I] Add more boolean string maps to cpp/src/arrow/csv/options.cc [arrow]
via GitHub
-
Re: [I] [Python] Define a Dataset protocol based on Substrait and C Data Interface [arrow]
via GitHub
-
Re: [I] [C++] Improve adjoin_as_list for struct types [arrow]
via GitHub
-
Re: [I] [C++][Integration] Install executables for integration test and use it [arrow]
via GitHub
-
Re: [I] [Integration] Refactor datagen.py [arrow]
via GitHub
-
Re: [I] [C++] Stop plan early to improve hashjoin performance. [arrow]
via GitHub
-
Re: [I] Add more NULL mappings to arrow/cpp/src/arrow/csv /options.cc [arrow]
via GitHub
-
Re: [I] [Python] Expose additional Cython wrap/unwrap helpers [arrow]
via GitHub
-
Re: [I] [C++] Support Nested Loop Join node. [arrow]
via GitHub
-
Re: [I] [C++] clang-format result may be invalid for cpplint.py [arrow]
via GitHub
-
Re: [I] [Python] Expose StreamDecoder to pyarrow python API [arrow]
via GitHub
-
Re: [I] [Packaging][Release] Use Debian/RPM type Artifactory repositories instead of General type Artifactory repository [arrow]
via GitHub
-
Re: [I] [C++][FlightRPC][Python] Unit-test and expose TransportStatusDetail [arrow]
via GitHub
-
Re: [I] [C++][FlightRPC] Implement async versions of other metadata methods [arrow]
via GitHub
-
Re: [I] [C++] Implement REE support in ArrayFromJSON [arrow]
via GitHub
-
Re: [I] [C++] Support IS DISTINCT and IS NOT DISTINCT expression. [arrow]
via GitHub
-
Re: [I] [C++][Python] pyarrow.ChunkedArray.combine_chunks is slow [arrow]
via GitHub
-
Re: [I] [Python][CI] Enable warnings as errors in pytests for CI jobs [arrow]
via GitHub
-
Re: [I] [CI] Curate Crossbow nightly jobs [arrow]
via GitHub
-
Re: [I] [C++] IO: Can we support extra IO tag in RandomAccessFile? [arrow]
via GitHub
-
Re: [I] [Python] In pyarrow len(ListScalar()) may have performance issues [arrow]
via GitHub
-
Re: [I] [Doc] Enhancement the document for dataset and s3 [arrow]
via GitHub
-
Re: [I] [C++][Python] Add `CastOptions` to `csv.ConvertOptions` for usage in `read_csv`. [arrow]
via GitHub
-
Re: [I] [C++] Util: Compression supports a Compression/Decompression Context [arrow]
via GitHub
-
Re: [I] [C++] Add concrete ArraySpan classes [arrow]
via GitHub
-
Re: [I] [MATLAB] Vectorize the `field` method of `arrow.tabular.Schema` [arrow]
via GitHub
-
Re: [I] [R][C++] Add ability to trim whitespace to CSV reading options [arrow]
via GitHub
-
Re: [I] Missing kernels for ordering with struct types [arrow]
via GitHub
-
Re: [I] [Python] Provide pybind11 type casters [arrow]
via GitHub
-
Re: [I] [Format] Add wording for alternative layouts [arrow]
via GitHub
-
Re: [I] [R] Explicitly enumerate the `ParquetReaderProperties` and `ParquetArrowReaderProperties` arguments in `write_parquet()` [arrow]
via GitHub
-
Re: [I] [Python][Skyhook] pyarrow library not include the "SkyhookFileFormat" function [arrow]
via GitHub
-
Re: [I] Point empty buffers to kNonNullFiller [arrow]
via GitHub
-
Re: [I] [C++] Refactor scan_node to introduce support for scan tasks [arrow]
via GitHub
-
Re: [I] [C++][SIMD] Avoid one-definition-rule violation of `arrow::internal::BitmapWriter` without depending on `-O2` [arrow]
via GitHub
-
Re: [I] [C++] Support parquet field-id resolution in the substrait consumer [arrow]
via GitHub
-
Re: [I] [C++] [Acero] Enhance aggregate kernel API's for intermediate state [arrow]
via GitHub
-
Re: [I] [R] Any support for rolling windows functions? [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Add an asynchronous version of ReadRowGroup/ReadRowGroups [arrow]
via GitHub
-
Re: [I] [C++][Python] Remove integer compatibility for trunc, floor and ceil [arrow]
via GitHub
-
Re: [I] [Python] Instantiate `pa.Table` from a `Generator`/`Iterator` [arrow]
via GitHub
-
Re: [I] [R] Create a wrapper function around Dataset's `$files` method [arrow]
via GitHub
-
Re: [I] [C++] Provide way for extension array to provide it's own value pretty printer [arrow]
via GitHub