github
Messages by Date
2026/06/19
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/19
Re: [PR] ClickHouse: Support unparenthesized IN right-hand side [datafusion-sqlparser-rs]
via GitHub
2026/06/19
Re: [PR] Parse `ALTER USER` as a synonym for `ALTER ROLE` [datafusion-sqlparser-rs]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [I] Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/19
[PR] chore: add optional CI flow for parquet writes [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] chore: add optional CI flow for parquet writes [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/19
Re: [I] panic: `ProjectionExprs::project_statistics` index out of bounds [datafusion]
via GitHub
2026/06/19
[PR] test: gate hash-dependent approx_distinct tests behind not(force_hash_collisions) [datafusion]
via GitHub
2026/06/19
Re: [PR] Centralize DATE_BIN source scaling and binning through shared helper [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: centralize date_bin per-row mapping [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: Add Semi/Anti join to PiecewiseMergeJoin [datafusion]
via GitHub
2026/06/19
Re: [PR] feat(spark): add spark random functions [datafusion]
via GitHub
2026/06/19
Re: [PR] (Test) Advanced adaptive filter selectivity evaluation [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: support Spark-compatible `arrays_zip` function [datafusion]
via GitHub
2026/06/19
Re: [I] CreateArray with nullability-divergent children panics in native make_array [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: decline CreateArray with struct-nullability-divergent children [datafusion-comet]
via GitHub
2026/06/19
Re: [I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/19
[I] [Proposal] Scan I/O acceleration: node-local fragment cache, asynchronous prefetch, and cache-affinity scheduling [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] fix(shuffle): tolerate non-UTF-8 bytes in get_string (lossy decode) [datafusion-comet]
via GitHub
2026/06/19
Re: [I] native shuffle: get_string should not panic on non-UTF-8 bytes (use lossy decode) [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: Add Native Support for In-Memory Cache [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add Comet CachedBatchSerializer for native in-memory cache [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump cryptography from 46.0.5 to 46.0.7 [datafusion-sandbox]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump cryptography from 46.0.5 to 46.0.7 [datafusion-sandbox]
via GitHub
2026/06/19
[PR] chore(deps): bump cryptography from 46.0.5 to 48.0.1 [datafusion-sandbox]
via GitHub
2026/06/19
[PR] Add basic sql benchmark runner for running sql benchmarks [datafusion]
via GitHub
2026/06/19
Re: [PR] Add deprecation warnings for Expr passed to confirmed literal-only function arguments [datafusion-python]
via GitHub
2026/06/19
Re: [PR] perf: O(1) PlanDataInjector lookup by op kind [datafusion-comet]
via GitHub
2026/06/19
Re: [D] How does Comet compare to Gluten? Are there any plans to integrate with Gluten? [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
[I] Add `CONTRIBUTING.md` with link to the contributor guide [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] fix: gate non-default collations for Spark 4 datetime expressions [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: exclude release scratch dirs from RAT and license skill docs [datafusion-comet]
via GitHub
2026/06/19
Re: [I] RAT check in Maven build scans temporary release/scratch directories, making release builds slow [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [I] Support `MapType` for `ElementAt` [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
[PR] fix: gate non-default collations for Spark 4 datetime expressions [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] fix: preserve no-filter SMJ matches across pending outer batches [datafusion]
via GitHub
2026/06/19
Re: [PR] deps: Upgrade to DataFusion 54.0.0 [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] deps: Upgrade to DataFusion 54.0.0 [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: implement native empty2null spark inner function [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/19
Re: [PR] perf: Extend WindowTopN to support RANK [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: propagate nested cast errors [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 in /native [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump the all-other-cargo-deps group in /native with 2 updates [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat: implement native empty2null spark inner function [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump actions/checkout from 6 to 7 [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] refactor(physical-plan): externalize statistics traversal into StatisticsContext [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor(physical-plan): externalize statistics traversal into StatisticsContext [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/19
Re: [PR] test: correct feature gating of two datafusion-common tests [datafusion]
via GitHub
2026/06/19
Re: [PR] Prune implicit FD group keys in SQL aggregates [datafusion]
via GitHub
2026/06/19
[PR] refactor(physical-plan): externalize statistics traversal into StatisticsContext [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: materialize ConstantColumnVector on Comet's serialize/export paths [datafusion-comet]
via GitHub
2026/06/19
[PR] feat: support string to numeric coercion for arithmetic operators [datafusion]
via GitHub
2026/06/19
Re: [PR] fix(shuffle): tolerate non-UTF-8 bytes in get_string (lossy decode) [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: decline CreateArray with struct-nullability-divergent children [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix(shuffle): tolerate non-UTF-8 bytes in get_string (lossy decode) [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] Prune implicit FD group keys in SQL aggregates [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: preserve no-filter SMJ matches across pending outer batches [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: implement native empty2null spark inner function [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] Skip loading Parquet page index when row-group statistics already prove it cannot prune [datafusion]
via GitHub
2026/06/19
[PR] fix: preserve no-filter SMJ matches across pending outer batches [datafusion]
via GitHub
2026/06/19
[I] CI failure: fuzz_cases::join_fuzz::test_left_anti_join_1k [datafusion]
via GitHub
2026/06/19
Re: [I] CI failure: fuzz_cases::join_fuzz::test_left_anti_join_1k [datafusion]
via GitHub
2026/06/19
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: functional dependencies in JOIN [datafusion]
via GitHub
2026/06/19
[PR] feat: functional dependencies in JOIN [datafusion]
via GitHub
2026/06/19
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/19
Re: [I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/19
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/19
Re: [PR] Add any_value aggregate function [datafusion]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/19
Re: [PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/19
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/19
Re: [I] CometScanRule: decline native V1 scans on object_store-unsupported filesystem schemes (fall back to Spark) [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] fix: decline native V1 scans on object_store-unsupported filesystem schemes [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
[PR] fix: union coercion coerces to string instead of number [datafusion]
via GitHub
2026/06/19
[I] coalesce(Int, Utf8) coerces to Int [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/19
[PR] test: correct feature gating of two datafusion-common tests [datafusion]
via GitHub
2026/06/19
Re: [I] Update ClickBench benchmarks with DataFusion 54.0.0 (when released) [datafusion]
via GitHub
2026/06/19
Re: [I] Update ClickBench benchmarks with DataFusion 54.0.0 (when released) [datafusion]
via GitHub
2026/06/19
Re: [I] Update ClickBench benchmarks with DataFusion 54.0.0 (when released) [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/19
Re: [PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/19
[PR] Add any_value aggregate function [datafusion]
via GitHub
2026/06/19
[I] Support string to numeric coercion [datafusion]
via GitHub
2026/06/19
[PR] feat: string to numeric coercion [datafusion]
via GitHub
2026/06/19
[PR] chore(deps-dev): bump launch-editor from 2.10.0 to 2.14.1 in /datafusion/wasmtest/datafusion-wasm-app [datafusion-sandbox]
via GitHub
2026/06/19
Re: [I] Add `any_value` aggregate function [datafusion]
via GitHub
2026/06/19
[PR] Move cache tests into default_cache [datafusion]
via GitHub
2026/06/19
[PR] feat: subqueries in Project [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/19
[PR] feat: support nullary aggregate UDFs [datafusion]
via GitHub
2026/06/19
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: centralize date_bin per-row mapping [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [PR] chore(deps-dev): bump webpack-dev-server from 5.2.4 to 5.2.5 in /datafusion/wasmtest/datafusion-wasm-app [datafusion]
via GitHub
2026/06/19
Re: [I] Support projecting columns that do not exist in the table [datafusion]
via GitHub
2026/06/19
Re: [PR] Centralize DATE_BIN source scaling and binning through shared helper [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/19
Re: [I] Support file-level Parquet `RowSelection` [datafusion]
via GitHub
2026/06/19
Re: [PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/19
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/19
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/19
Re: [I] Emit warning with attached `Diagnostic` when doing `= NULL` [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/19
[PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 in /native [datafusion-comet]
via GitHub
2026/06/19
[PR] chore(deps): bump the all-other-cargo-deps group in /native with 2 updates [datafusion-comet]
via GitHub
2026/06/19
[PR] chore(deps): bump actions/checkout from 6 to 7 [datafusion-comet]
via GitHub
2026/06/19
Re: [PR] feat(unparser): support DISTINCT FROM operators in the MySQL dialect [datafusion]
via GitHub
2026/06/19
[I] Backport workflow-hardening fix (unpinned-uses) to `maint-16.x` [datafusion]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
Re: [PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/19
Re: [PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] feat(unparser): support DISTINCT FROM operators in the MySQL dialect [datafusion]
via GitHub
2026/06/19
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] chore(ci): Use Ubuntu ARM64 for some tests on Linux [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] chore(ci): Use Ubuntu ARM64 for some tests on Linux [datafusion-ballista]
via GitHub
2026/06/19
Re: [PR] chore(deps): bump actions/checkout from 6.0.3 to 7.0.0 [datafusion-ballista]
via GitHub
2026/06/18
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
2026/06/18
[I] [EPIC] Sort Pushdown: skip sorts and skip IO for ORDER BY / TopK queries [datafusion]
via GitHub
2026/06/18
Re: [I] Extend ColocatedJoinRule and BroadcastSmallSideRule to SortMergeJoinExec [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret small-width types for bitmap filters [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add direct-probe hash filter for large primitive lists [datafusion]
via GitHub
2026/06/18
[PR] IN LIST: unify bitmap filter implementations [datafusion]
via GitHub
2026/06/18
Re: [I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add branchless filter for small primitive lists [datafusion]
via GitHub
2026/06/18
[PR] refactor: centralize date_bin per-row mapping [datafusion]
via GitHub
2026/06/18
Re: [PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/18
Re: [I] Unparser: support unparsing binary scalars [datafusion]
via GitHub
2026/06/18
[PR] chore(deps): bump actions/checkout from 6.0.3 to 7.0.0 [datafusion-ballista]
via GitHub
2026/06/18
Re: [I] [Spark] SparkWidthBucket return_type is Int32, should be Int64 to match Spark [datafusion]
via GitHub
2026/06/18
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/18
Re: [I] pref: Use builtin compression for arrow ipc writer [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/18
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/18
Re: [I] array_union result ordering versus DataFusion is unverified [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] chore: fix `ConstantFolding` rule exclusion for benchmarks [datafusion-comet]
via GitHub
2026/06/18
Re: [I] chore: Remove invalid `spark.sql.optimizer.constantFolding.enabled` configuration from java benchmarks. [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] chore: add ordering tests for `array_union` [datafusion-comet]
via GitHub
2026/06/18
[PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/18
[PR] fix: round large UInt64 values without narrowing [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] fix: Correct array_contains behavior for Spark-style null semantics [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] perf: do not build parquet pruning predicates if no page index [datafusion]
via GitHub
2026/06/18
Re: [PR] Io dynamic [datafusion]
via GitHub
2026/06/18
Re: [PR] Add DecomposeAggregate optimizer to rewrite AVG as SUM/COUNT [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: initialize TopK dynamic filter threshold from parquet statistics [datafusion]
via GitHub
2026/06/18
[PR] Join avoid concat [datafusion]
via GitHub
2026/06/18
[I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/18
Re: [I] Natively support time-window grouping expressions: window, session_window, window_time [datafusion-comet]
via GitHub
2026/06/18
Re: [I] CI is broken [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
[PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [I] Snapshot tests in physical_optimizer are not deterministic across CPU-count environments [datafusion]
via GitHub
2026/06/18
Re: [I] Snapshot tests in physical_optimizer are not deterministic across CPU-count environments [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [I] OpenLineage support [datafusion]
via GitHub