github
Messages by Date
2026/06/16
Re: [I] Disallow `order by` within ordered-set aggregate functions argument lists [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Stage based fallback [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [I] Support more types for `approx_distinct` function [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
[PR] Return errors on string builder offset overflow in `replace` and `initcap` [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 6 updates [datafusion]
via GitHub
2026/06/16
Re: [PR] Fix optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/16
Re: [PR] Unify AVG group state conversion and filter handling across Spark and built-in accumulators [datafusion]
via GitHub
2026/06/16
[PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
[PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add OR pre-selection short-circuit [datafusion]
via GitHub
2026/06/16
Re: [PR] fix(sort): record output_batches, output_bytes and end_time for when not using merge sort [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] perf : experiment roaring bitmap for int32 anti and semi joins [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] Skip loading Parquet page index when row-group statistics already prove it cannot prune [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Parallelize `infer_schema` [datafusion]
via GitHub
2026/06/16
Re: [PR] fix(unparser): Fix column alias rewriting for Filter nodes preserved by Inexact filter pushdown [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(cli): implement mmap based object store for local files [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): support CDC chunking options [datafusion]
via GitHub
2026/06/16
Re: [PR] Support arithmetic expressions in PruningPredicate for Parquet row gr… [datafusion]
via GitHub
2026/06/16
Re: [PR] Remove sqllogictest fork swap from regenerate_sqlite_files.sh [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/16
[I] Support more types for `approx_distinct` function [datafusion]
via GitHub
2026/06/16
[PR] feat: Add SQL planner, physical planner, and TableProvider hook for MERGE INTO [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/16
[I] Centralize `date_bin` per-row mapping for scalar and array inputs [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
[I] Unify list-like row decomposition for map construction [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/16
Re: [I] Port `ScalarSubqueryExpr` to use `try_to_proto` / `try_from_proto` [datafusion]
via GitHub
2026/06/16
Re: [PR] build(deps): bump pyjwt from 2.12.0 to 2.13.0 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump filelock from 3.18.0 to 3.20.3 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump setuptools from 75.8.0 to 78.1.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump pyjwt from 2.12.0 to 2.13.0 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump setuptools from 75.8.0 to 78.1.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump virtualenv from 20.31.2 to 20.36.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump virtualenv from 20.31.2 to 20.36.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump jinja2 from 3.1.5 to 3.1.6 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump filelock from 3.18.0 to 3.20.3 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump jinja2 from 3.1.5 to 3.1.6 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] docs: convert reStructuredText sources to MyST markdown [datafusion-python]
via GitHub
2026/06/16
[PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] minor: reuse ColumnarValue::into_array in map's expand_if_scalar [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/16
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/16
[PR] minor: reuse ColumnarValue::into_array in map's expand_if_scalar [datafusion]
via GitHub
2026/06/16
Re: [I] Decouple operator statistics propagation from traversal/caching via statistics_from_inputs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add Spark-compatible decimal division [datafusion]
via GitHub
2026/06/16
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] Cleanup redundant fields in `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
Re: [I] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Ballista 53.0.0 — `BatchCoalescer expects 0 columns` on TPCDS Q9 [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Ballista 53.0.0 — `BatchCoalescer expects 0 columns` on TPCDS Q9 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] fix: block timestamp precision narrowing unwrap [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Allow datafusion-ffi to opt out of proto parquet [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [I] support for reading data from stdin for `datafusion-cli` [datafusion]
via GitHub
2026/06/16
Re: [I] Make `LogicalPlan::Unnest` expression/rebuild contracts explicit [datafusion]
via GitHub
2026/06/16
Re: [PR] Make LogicalPlan::Unnest expression/rebuild contracts consistent [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: Spark-compatible HALF_UP rounding for round() on float types [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [D] reading from stdin in datafusion-cli? [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: classify unsupported format patterns as Unsupported in CometFromUnixTime [datafusion-comet]
via GitHub
2026/06/16
Re: [I] Improved performance for streaming grouping with single string columns [datafusion]
via GitHub
2026/06/16
Re: [I] Improved performance for streaming grouping with single string columns [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [PR] Make LogicalPlan::Unnest expression/rebuild contracts consistent [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
[PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Mark joins don't support null mark columns [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Allow datafusion-ffi to opt out of proto parquet [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] Reduce per-file metadata overhead for wide-schema parquet scans [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: use raw view access in do_append_val_inner and consolidate duplicated logic [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: use raw view access in do_append_val_inner and consolidate duplicated logic [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ColumnarValue::to_array_variant method (follow-up to #22784) [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: informational message channel + generic native-available hint [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: informational message channel + generic native-available hint [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [I] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [I] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] perf: Optimize semi-, anti-join implementation using existence probes [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add OR pre-selection short-circuit [datafusion]
via GitHub
2026/06/16
[I] Update PyO3 to 0.29 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] Fix leaf expression reconciliation [datafusion]
via GitHub
2026/06/16
Re: [I] optimize_projections fails on leaf-expression pushdown over a view with an unconsumed column [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] chore: start 0.18.0 development [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/16
Re: [I] Comet 0.17.0 Release [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: surface SparkArithmeticException(DIVIDE_BY_ZERO) for divide-by-zero in dispatched ScalaUDF path [datafusion-comet]
via GitHub
2026/06/16
[PR] Parser: fix exponential parse time on nested function-call arguments [datafusion-sqlparser-rs]
via GitHub
2026/06/16
Re: [PR] docs: update release_process for changelog [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] docs: [branch-0.17] 0.17.0 changelog [datafusion-comet]
via GitHub
2026/06/16
Re: [I] Comet 0.17.0 Release [datafusion-comet]
via GitHub
2026/06/16
[PR] docs: update release_process for changelog [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
[PR] docs: 0.17.0 changelog [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] chore: add branch protection to release branches, update release_process.md [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
Re: [PR] chore: [branch-0.17] update release_process.md [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Do not consume statement terminator in unparenthesized option lists [datafusion-sqlparser-rs]
via GitHub
2026/06/16
Re: [PR] Optionally split the Iceberg write/commit operator into separate writer and committer operations [datafusion-comet]
via GitHub
2026/06/16
[PR] chore: [branch0.17] Fix release process docs 0.17 [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat(tui): Show job failed status below the Jobs table [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] TUI: Show reason for failed jobs [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
[PR] chore: add branch protection to release branches, update release_process.md [datafusion-comet]
via GitHub
2026/06/16
Re: [I] Honour max_row_group_bytes in the parallel Parquet writer [datafusion]
via GitHub
2026/06/16
Re: [I] Honour max_row_group_bytes in the parallel Parquet writer [datafusion]
via GitHub
2026/06/16
[I] Honour max_row_group_bytes in the parallel Parquet writer [datafusion]
via GitHub
2026/06/16
Re: [PR] docs: [branch-0.17] generate release docs, update script [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] chore: fix generate-release-docs.sh for per-Spark-version doc layout [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] chore: [branch-0.17] change version from 0.17.0-SNAPSHOT to 0.17.0 [datafusion-comet]
via GitHub
2026/06/16
Re: [I] Ballista 53.0.0 — `BatchCoalescer expects 0 columns` on TPCDS Q9 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
[PR] chore: start 0.18.0 development [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat(tui): Show job failed status below the Jobs table [datafusion-ballista]
via GitHub
2026/06/16
[PR] chore: fix generate-release-docs.sh for per-Spark-version doc layout [datafusion-comet]
via GitHub
2026/06/16
[PR] docs: [branch-0.17] generate release docs, update script [datafusion-comet]
via GitHub
2026/06/16
[PR] slt: Add null-aware mark join tests for current behavior [datafusion]
via GitHub
2026/06/16
Re: [I] Hash join dynamic filters are unsafe with null-equal joins [datafusion]
via GitHub
2026/06/16
Re: [I] Hash join dynamic filters are unsafe with null-equal joins [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub