github
Messages by Date
2026/06/17
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] Do not consume statement terminator in unparenthesized option lists [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.checkpoint()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
[I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
[I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] Remove sqllogictest fork swap from regenerate_sqlite_files.sh [datafusion]
via GitHub
2026/06/17
Re: [PR] Update expected results for duplicate column names fix [datafusion-testing]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
[PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
[PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
[I] DorisSQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[PR] Fix shared TopK early exit with global prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: graceful error for deeply nested expressions instead of stack overflow [datafusion]
via GitHub
2026/06/17
Re: [I] Improve shuffle (column) statistics [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/17
Re: [I] Decouple operator statistics propagation from traversal/caching via statistics_from_inputs [datafusion]
via GitHub
2026/06/17
[PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] bench: add correlated-proxy case to the predicate_eval suite [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: block timestamp precision narrowing unwrap [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Consider column names' case when aliasing tables [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support binary and string types for concat UDFs [datafusion]
via GitHub
2026/06/16
Re: [I] Disallow `order by` within ordered-set aggregate functions argument lists [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Stage based fallback [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [I] Support more types for `approx_distinct` function [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
[PR] Return errors on string builder offset overflow in `replace` and `initcap` [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 6 updates [datafusion]
via GitHub
2026/06/16
Re: [PR] Fix optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/16
Re: [PR] Unify AVG group state conversion and filter handling across Spark and built-in accumulators [datafusion]
via GitHub
2026/06/16
[PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion-ballista]
via GitHub
2026/06/16
[PR] chore(deps): bump config from 0.15.23 to 0.15.24 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add OR pre-selection short-circuit [datafusion]
via GitHub
2026/06/16
Re: [PR] fix(sort): record output_batches, output_bytes and end_time for when not using merge sort [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] perf : experiment roaring bitmap for int32 anti and semi joins [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] Skip loading Parquet page index when row-group statistics already prove it cannot prune [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Parallelize `infer_schema` [datafusion]
via GitHub
2026/06/16
Re: [PR] fix(unparser): Fix column alias rewriting for Filter nodes preserved by Inexact filter pushdown [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(cli): implement mmap based object store for local files [datafusion]
via GitHub
2026/06/16
Re: [PR] feat(parquet): support CDC chunking options [datafusion]
via GitHub
2026/06/16
Re: [PR] Support arithmetic expressions in PruningPredicate for Parquet row gr… [datafusion]
via GitHub
2026/06/16
Re: [PR] Remove sqllogictest fork swap from regenerate_sqlite_files.sh [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/16
[I] Support more types for `approx_distinct` function [datafusion]
via GitHub
2026/06/16
[PR] feat: Add SQL planner, physical planner, and TableProvider hook for MERGE INTO [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/16
[I] Centralize `date_bin` per-row mapping for scalar and array inputs [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
[I] Unify list-like row decomposition for map construction [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/16
Re: [I] Port `ScalarSubqueryExpr` to use `try_to_proto` / `try_from_proto` [datafusion]
via GitHub
2026/06/16
Re: [PR] build(deps): bump pyjwt from 2.12.0 to 2.13.0 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump filelock from 3.18.0 to 3.20.3 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump setuptools from 75.8.0 to 78.1.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump pyjwt from 2.12.0 to 2.13.0 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump setuptools from 75.8.0 to 78.1.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump virtualenv from 20.31.2 to 20.36.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump virtualenv from 20.31.2 to 20.36.1 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump jinja2 from 3.1.5 to 3.1.6 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps): bump filelock from 3.18.0 to 3.20.3 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] build(deps-dev): bump jinja2 from 3.1.5 to 3.1.6 [datafusion-python]
via GitHub
2026/06/16
Re: [PR] docs: convert reStructuredText sources to MyST markdown [datafusion-python]
via GitHub
2026/06/16
[PR] refactor(approx_distinct): centralize grouped HLL type dispatch [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] minor: reuse ColumnarValue::into_array in map's expand_if_scalar [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/16
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/16
[PR] minor: reuse ColumnarValue::into_array in map's expand_if_scalar [datafusion]
via GitHub
2026/06/16
Re: [I] Decouple operator statistics propagation from traversal/caching via statistics_from_inputs [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add Spark-compatible decimal division [datafusion]
via GitHub
2026/06/16
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/06/16
Re: [PR] Cleanup redundant fields in `ListingOptions` [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Alternative table provider as spark data source [datafusion-java]
via GitHub
2026/06/16
Re: [I] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Ballista 53.0.0 — `BatchCoalescer expects 0 columns` on TPCDS Q9 [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Ballista 53.0.0 — `BatchCoalescer expects 0 columns` on TPCDS Q9 [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] fix: block timestamp precision narrowing unwrap [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [I] Unsafe comparison cast rewriting in ExprSimplifier silently produces wrong query results [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Allow datafusion-ffi to opt out of proto parquet [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [I] support for reading data from stdin for `datafusion-cli` [datafusion]
via GitHub
2026/06/16
Re: [I] Make `LogicalPlan::Unnest` expression/rebuild contracts explicit [datafusion]
via GitHub
2026/06/16
Re: [PR] Make LogicalPlan::Unnest expression/rebuild contracts consistent [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: Spark-compatible HALF_UP rounding for round() on float types [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [D] reading from stdin in datafusion-cli? [datafusion]
via GitHub
2026/06/16
Re: [PR] fix: classify unsupported format patterns as Unsupported in CometFromUnixTime [datafusion-comet]
via GitHub
2026/06/16
Re: [I] Improved performance for streaming grouping with single string columns [datafusion]
via GitHub
2026/06/16
Re: [I] Improved performance for streaming grouping with single string columns [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [PR] Make LogicalPlan::Unnest expression/rebuild contracts consistent [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
[PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/16
Re: [I] Mark joins don't support null mark columns [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Allow datafusion-ffi to opt out of proto parquet [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] Reduce per-file metadata overhead for wide-schema parquet scans [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: use raw view access in do_append_val_inner and consolidate duplicated logic [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: support reading from stdin in datafusion-cli [datafusion]
via GitHub
2026/06/16
Re: [PR] refactor: use raw view access in do_append_val_inner and consolidate duplicated logic [datafusion]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
2026/06/16
Re: [PR] Add ColumnarValue::to_array_variant method (follow-up to #22784) [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: informational message channel + generic native-available hint [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: informational message channel + generic native-available hint [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
2026/06/16
Re: [I] Update PyO3 to 0.29 and explore abi3t support [datafusion-python]
via GitHub
2026/06/16
Re: [I] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/16
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub