github
Thread
Date
Earlier messages
Messages by Date
2026/06/17
Re: [PR] Validate coerce int96 config 17498 [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: multiple columns in count distinct [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: add PostgreSQL EXCLUDE constraint parsing [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] Reduce Github Action Usage [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/17
[PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/17
Re: [PR] Skip loading Parquet page index when row-group statistics already prove it cannot prune [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [I] RAT check in Maven build scans temporary release/scratch directories, making release builds slow [datafusion-comet]
via GitHub
2026/06/17
[I] RAT check in Maven build scans temporary release/scratch directories, making release builds slow [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] feat: improve pythonic interface on date/time functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump prost from 0.14.3 to 0.14.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump prost from 0.14.3 to 0.14.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump pyo3-log from 0.13.3 to 0.13.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump pyo3-log from 0.13.3 to 0.13.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] chore: update rust dependencies [datafusion-python]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] chore: Test update object store to 0.14.0 [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
[PR] chore: Test update object store to 0.14.0 [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
[PR] Add Hotdata to the "known users" list in introduction.md [datafusion]
via GitHub
2026/06/17
Re: [I] perf: use aligned slice access in SparkUnsafeArray bulk append [datafusion-comet]
via GitHub
2026/06/17
[PR] chore: update rust dependencies [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): batch dependabot dependency updates [datafusion-python]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor: make scalar distance u64 and overflow aware [datafusion]
via GitHub
2026/06/17
[PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
[PR] Experimental: support parquet partition write [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [I] Expose Spark functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
[PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
[PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/17
[I] Unparser: support unparsing binary scalars [datafusion]
via GitHub
2026/06/17
[PR] Feat/unparser spaceship [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] RIGHT/FULL/NATURAL JOIN ... USING(k) does not coalesce the join key (returns NULL for right-only rows) [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
[PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor: thread SubqueryContext explicitly through physical planning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
[I] Unparser: support spaceship operator [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Refactor eliminate_outer_join null-rejection tracking to side-level state [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: add PostgreSQL EXCLUDE constraint parsing [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/17
Re: [I] [EPIC] TUI Improvements [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
[PR] implement map_agg [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
[I] Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
[PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
Re: [I] Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/17
[PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] Do not consume statement terminator in unparenthesized option lists [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.checkpoint()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
[I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
[I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] Remove sqllogictest fork swap from regenerate_sqlite_files.sh [datafusion]
via GitHub
2026/06/17
Re: [PR] Update expected results for duplicate column names fix [datafusion-testing]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
[PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
[PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
[I] DorisSQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[PR] Fix shared TopK early exit with global prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: graceful error for deeply nested expressions instead of stack overflow [datafusion]
via GitHub
2026/06/17
Re: [I] Improve shuffle (column) statistics [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/17
Re: [I] Decouple operator statistics propagation from traversal/caching via statistics_from_inputs [datafusion]
via GitHub
2026/06/17
[PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: block timestamp precision narrowing unwrap [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Consider column names' case when aliasing tables [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [I] Disallow `order by` within ordered-set aggregate functions argument lists [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Stage based fallback [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [I] Support more types for `approx_distinct` function [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
[PR] Return errors on string builder offset overflow in `replace` and `initcap` [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 6 updates [datafusion]
via GitHub
2026/06/16
Re: [PR] Fix optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/16
Re: [PR] Unify AVG group state conversion and filter handling across Spark and built-in accumulators [datafusion]
via GitHub
2026/06/16
[PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
[PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add OR pre-selection short-circuit [datafusion]
via GitHub
2026/06/16
Re: [PR] fix(sort): record output_batches, output_bytes and end_time for when not using merge sort [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] perf : experiment roaring bitmap for int32 anti and semi joins [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
Earlier messages