github
Messages by Thread
[I] Expressions from left and right of a join can fail planning during substrait conversion if their name is the same [datafusion]
via GitHub
[I] Phase 2: make sort pushdown support exact mode, which can remove the sort and reorder files based on statistics [datafusion]
via GitHub
[PR] chore(deps): bump actions/cache from 4 to 5 [datafusion-comet]
via GitHub
[PR] chore(deps): bump actions/upload-artifact from 5 to 6 [datafusion-comet]
via GitHub
[PR] chore(deps): bump actions/download-artifact from 6 to 7 [datafusion-comet]
via GitHub
[PR] chore(deps): bump ctor from 0.6.1 to 0.6.3 [datafusion]
via GitHub
[PR] chore(deps): bump insta from 1.43.2 to 1.44.3 [datafusion]
via GitHub
[PR] chore(deps): bump clap from 4.5.50 to 4.5.53 [datafusion]
via GitHub
[PR] chore(deps): bump the proto group with 3 updates [datafusion]
via GitHub
Re: [PR] feat: support Spark-compatible abs math function part 2 - ANSI mode [datafusion]
via GitHub
[PR] refactor: refactor spark like function to use datafusion like [datafusion]
via GitHub
Re: [PR] refactor: refactor spark like function to use datafusion like [datafusion]
via GitHub
Re: [PR] refactor: refactor spark like function to use datafusion like [datafusion]
via GitHub
Re: [PR] refactor: refactor spark like function to use datafusion like [datafusion]
via GitHub
[PR] fix: derive Spark sha2 nullability and add tests [datafusion]
via GitHub
Re: [PR] fix: derive Spark sha2 nullability and add tests [datafusion]
via GitHub
Re: [PR] fix: derive Spark sha2 nullability and add tests [datafusion]
via GitHub
[I] bug: Median() encountered integer overflow and truncates integer results [datafusion]
via GitHub
[PR] Add placeholder type inference for CASE expressions [datafusion]
via GitHub
[I] [expr] Placeholder inference for CASE statements [datafusion]
via GitHub
Re: [I] [expr] Placeholder inference for CASE statements [datafusion]
via GitHub
Re: [I] [EPIC] Support `VARIANT` type for unstructured data [datafusion]
via GitHub
[PR] chore: Refactor string benchmarks [datafusion-comet]
via GitHub
Re: [PR] chore: Refactor string benchmarks (~10x reduction in LOC) [datafusion-comet]
via GitHub
Re: [PR] chore: Refactor string benchmarks (~10x reduction in LOC) [datafusion-comet]
via GitHub
[PR] fix: remove advertise_flight_sql_endpoint config from scheduler [datafusion-ballista]
via GitHub
[PR] build(deps): bump actions/upload-artifact from 4 to 6 [datafusion-python]
via GitHub
[PR] build(deps): bump actions/cache from 4 to 5 [datafusion-python]
via GitHub
[PR] build(deps): bump actions/download-artifact from 5 to 7 [datafusion-python]
via GitHub
Re: [PR] build(deps): bump actions/upload-artifact from 4 to 5 [datafusion-python]
via GitHub
Re: [PR] build(deps): bump actions/upload-artifact from 4 to 5 [datafusion-python]
via GitHub
Re: [PR] build(deps): bump actions/download-artifact from 5 to 6 [datafusion-python]
via GitHub
Re: [PR] build(deps): bump actions/download-artifact from 5 to 6 [datafusion-python]
via GitHub
[PR] feat: Proof-of-concept of AQE cost-based optimization [datafusion-comet]
via GitHub
Re: [PR] feat: Proof-of-concept of AQE cost-based optimization [datafusion-comet]
via GitHub
Re: [PR] feat: Proof-of-concept of AQE cost-based optimization [datafusion-comet]
via GitHub
Re: [PR] feat: Proof-of-concept of AQE cost-based optimization [datafusion-comet]
via GitHub
Re: [I] Optimize histogram bucket calculation based on `case` [datafusion]
via GitHub
Re: [I] How to write csv file to disk from an empty dataframe? [datafusion]
via GitHub
[PR] Store example data directly inside the datafusion-examples (#19141) [datafusion]
via GitHub
Re: [PR] Store example data directly inside the datafusion-examples (#19141) [datafusion]
via GitHub
[PR] Feat: map_from_entries [datafusion-comet]
via GitHub
[PR] Minor: remove unnecessary unit tests for fixed size binary [datafusion]
via GitHub
Re: [PR] Minor: remove unnecessary unit tests for fixed size binary [datafusion]
via GitHub
Re: [I] FixedSizeBinary should be an allowable input to base64 encoding [datafusion]
via GitHub
[PR] Minor: clean up titles and links in extending operators and optimizer pages [datafusion]
via GitHub
[PR] feat: Add `auto_explain` mode [datafusion]
via GitHub
Re: [PR] feat: Add `auto_explain` mode [datafusion]
via GitHub
Re: [PR] feat: Add `auto_explain` mode [datafusion]
via GitHub
[PR] Fix regression for negative-scale decimals in log [datafusion]
via GitHub
[I] pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [I] pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] Immutable parser (Tokenize borrow strings cont) [datafusion-sqlparser-rs]
via GitHub
[PR] fix: Disallows dropping duplicate keys when using full outer join [datafusion-python]
via GitHub
Re: [I] Support for filtered arrow datasets [datafusion]
via GitHub
Re: [I] Support for filtered arrow datasets [datafusion]
via GitHub
[PR] Move `newlines_in_values` from `FileScanConfig` to `CsvSource` [datafusion]
via GitHub
Re: [PR] Move `newlines_in_values` from `FileScanConfig` to `CsvSource` [datafusion]
via GitHub
[I] feat: plan-time SQL expression simplifying [datafusion]
via GitHub
[PR] feat: plan-time SQL expression simplifying [datafusion]
via GitHub
Re: [PR] feat: plan-time SQL expression simplifying [datafusion]
via GitHub
Re: [PR] feat: plan-time SQL expression simplifying [datafusion]
via GitHub
Re: [PR] feat: plan-time SQL expression simplifying [datafusion]
via GitHub
Re: [I] Avoid evaluating filters when they can be discarded purely from statistics [datafusion]
via GitHub
Re: [I] `datafusion.execution.time_zone` is not used for basic time zone inference [datafusion]
via GitHub
Re: [I] `datafusion.execution.time_zone` is not used for basic time zone inference [datafusion]
via GitHub
Re: [I] `datafusion.execution.time_zone` is not used for basic time zone inference [datafusion]
via GitHub
Re: [I] `datafusion.execution.time_zone` is not used for basic time zone inference [datafusion]
via GitHub
Re: [I] `datafusion.execution.time_zone` is not used for basic time zone inference [datafusion]
via GitHub
[I] Columnar shuffle with nested types is slower than Spark [datafusion-comet]
via GitHub
[PR] docs: Add documentation on running microbenchmark [WIP] [datafusion-comet]
via GitHub
[PR] chore: Add shuffle benchmark for deeply nested schemas [datafusion-comet]
via GitHub
Re: [PR] chore: Add shuffle benchmark for deeply nested schemas [datafusion-comet]
via GitHub
Re: [PR] chore: Add shuffle benchmark for deeply nested schemas [datafusion-comet]
via GitHub
[PR] feat: support_spark_4_cast_fix_tests [datafusion-comet]
via GitHub
Re: [PR] feat: support_spark_4_cast_fix_tests [datafusion-comet]
via GitHub
Re: [PR] feat: support_ansi-mode_aggregated_benchmarking [datafusion-comet]
via GitHub
Re: [PR] feat: support_ansi-mode_aggregated_benchmarking [datafusion-comet]
via GitHub
Re: [PR] feat: support_ansi-mode_aggregated_benchmarking [datafusion-comet]
via GitHub
Re: [PR] feat: support_ansi-mode_aggregated_benchmarking [datafusion-comet]
via GitHub
Re: [I] Move physical plan filter pushdown optimizer rule to avoid adding unnecessary nodes [datafusion]
via GitHub
Re: [I] Move physical plan filter pushdown optimizer rule to avoid adding unnecessary nodes [datafusion]
via GitHub
Re: [I] Avoid re-implementing expression simplification in pruning.rs [datafusion]
via GitHub
Re: [I] Avoid re-implementing expression simplification in pruning.rs [datafusion]
via GitHub
Re: [I] feature: implement PhysicalExpr const simplifier/evaluator [datafusion]
via GitHub
Re: [I] feature: implement PhysicalExpr const simplifier/evaluator [datafusion]
via GitHub
Re: [I] Push down entire hash table from HashJoinExec into scans [datafusion]
via GitHub
Re: [I] Push down entire hash table from HashJoinExec into scans [datafusion]
via GitHub
[PR] chore: enforce `clippy::allow_attributes` for ffi, optimizer and macros [datafusion]
via GitHub
Re: [PR] chore: enforce `clippy::allow_attributes` for ffi, optimizer and macros [datafusion]
via GitHub
Re: [PR] chore: enforce `clippy::allow_attributes` for ffi, optimizer and macros [datafusion]
via GitHub
Re: [PR] chore: enforce `clippy::allow_attributes` for ffi, optimizer and macros [datafusion]
via GitHub
[PR] Support ansi mode aggregated benchmarks [datafusion-comet]
via GitHub
Re: [PR] Support ansi mode aggregated benchmarks [datafusion-comet]
via GitHub
[PR] feat: Make shuffle writer buffer size configurable [WIP] [datafusion-comet]
via GitHub
Re: [PR] feat: Make shuffle writer buffer size configurable [WIP] [datafusion-comet]
via GitHub
Re: [PR] feat: Make shuffle writer buffer size configurable [datafusion-comet]
via GitHub
[PR] fix: Fix double counting of shuffle bytes written [WIP] [datafusion-comet]
via GitHub
Re: [PR] fix: Fix double counting of shuffle bytes written [WIP] [datafusion-comet]
via GitHub
Re: [PR] fix: Fix double counting of shuffle bytes written [WIP] [datafusion-comet]
via GitHub
Re: [PR] Refactor `power()` signature away from user defined [datafusion]
via GitHub
Re: [PR] Refactor `power()` signature away from user defined [datafusion]
via GitHub
[PR] fix: array to array cast [datafusion-comet]
via GitHub
Re: [PR] fix: array to array cast [datafusion-comet]
via GitHub
Re: [PR] Testing [datafusion]
via GitHub
[PR] chore: enforce clippy::allow_attributes for spark,sql,substrait [datafusion]
via GitHub
Re: [PR] chore: enforce clippy::allow_attributes for spark,sql,substrait [datafusion]
via GitHub
Re: [PR] chore: enforce clippy::allow_attributes for spark,sql,substrait [datafusion]
via GitHub
Re: [I] Attach `Diagnostic` to "invalid function argument types" error [datafusion]
via GitHub
[PR] Fix allow to skip middle optional named parameters [datafusion]
via GitHub
Re: [PR] Remove Whitespace Tokens from Parser [datafusion-sqlparser-rs]
via GitHub
Re: [PR] Remove Whitespace Tokens from Parser [datafusion-sqlparser-rs]
via GitHub
Re: [PR] Remove Whitespace Tokens from Parser [datafusion-sqlparser-rs]
via GitHub
[PR] Chore: refactor bit_not [datafusion-comet]
via GitHub
Re: [PR] Chore: refactor bit_not [datafusion-comet]
via GitHub
Re: [PR] Chore: refactor bit_not [datafusion-comet]
via GitHub
Re: [PR] Chore: refactor bit_not [datafusion-comet]
via GitHub
Re: [PR] Extract source comments [datafusion-sqlparser-rs]
via GitHub
Re: [PR] Extract source comments [datafusion-sqlparser-rs]
via GitHub
[PR] chore: fix return_field_from_args doc [datafusion]
via GitHub
Re: [PR] chore: fix return_field_from_args doc [datafusion]
via GitHub
Re: [PR] chore: fix return_field_from_args doc [datafusion]
via GitHub
[I] Sample code of return_field_from_args is misleading [datafusion]
via GitHub
Re: [I] Sample code of return_field_from_args is misleading [datafusion]
via GitHub
[PR] feat: implement runtime_env and execution_props for FFI_Session [datafusion]
via GitHub
Re: [PR] feat: implement runtime_env and execution_props for FFI_Session [datafusion]
via GitHub
Re: [PR] feat: implement runtime_env and execution_props for FFI_Session [datafusion]
via GitHub
[PR] Iceberg rest [datafusion-comet]
via GitHub
Re: [PR] feat: [iceberg] REST catalog support for CometNativeIcebergScan [datafusion-comet]
via GitHub
Re: [PR] feat: [iceberg] REST catalog support for CometNativeIcebergScan [datafusion-comet]
via GitHub
[I] Comet falls back to Spark for final hash aggregate in some cases when it could be supported [datafusion-comet]
via GitHub
[PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] Gene.bordegaray/2025/12/hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] feat: hash partitioning satisfies subset [datafusion]
via GitHub
Re: [PR] feat: hash partitioning satisfies subset [datafusion]
via GitHub
[PR] chore: Refactor hash aggregate planning [WIP] [datafusion-comet]
via GitHub
Re: [PR] chore: Refactor hash aggregate planning [WIP] [datafusion-comet]
via GitHub
Re: [PR] chore: Refactor hash aggregate planning [WIP] [datafusion-comet]
via GitHub
[I] [EPIC] Improve planning of hash aggregates [datafusion-comet]
via GitHub
[PR] fix: modify CometNativeScan to generate the file partitions without instantiating RDD [datafusion-comet]
via GitHub
Re: [PR] fix: modify CometNativeScan to generate the file partitions without instantiating RDD [datafusion-comet]
via GitHub
Re: [PR] fix: modify CometNativeScan to generate the file partitions without instantiating RDD [datafusion-comet]
via GitHub
Re: [I] feat: bucketed scan for native_datafusion Parquet scan [datafusion-comet]
via GitHub
[I] Support HDFS writes with Comet writer [datafusion-comet]
via GitHub
[I] Bloom filter intermediate aggregate buffers are not compatible between Spark and Comet [datafusion-comet]
via GitHub
Re: [I] Add push down sort to the source (table provider) [datafusion]
via GitHub
[PR] Do not convert pyarrow scalar values to plain python types when passing as `lit` [datafusion-python]
via GitHub
Re: [PR] Do not convert pyarrow scalar values to plain python types when passing as `lit` [datafusion-python]
via GitHub
Re: [PR] Do not convert pyarrow scalar values to plain python types when passing as `lit` [datafusion-python]
via GitHub
[PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
Re: [PR] fix: pow() with integer base and negative float exponent returns error [datafusion]
via GitHub
[PR] feat: Enable bucket pruning with native_datafusion by using EmptyExec when no files for this partition [datafusion-comet]
via GitHub
Re: [PR] feat: Enable bucket pruning with native_datafusion scans [datafusion-comet]
via GitHub
Re: [PR] feat: Enable bucket pruning with native_datafusion scans [datafusion-comet]
via GitHub
Re: [PR] feat: Enable bucket pruning with native_datafusion scans [datafusion-comet]
via GitHub
[PR] Add heap_size to statistics [datafusion]
via GitHub
Re: [PR] Add heap_size for statistics [datafusion]
via GitHub
Re: [PR] Add heap_size for statistics [datafusion]
via GitHub
Re: [PR] Add heap_size for statistics [datafusion]
via GitHub
Re: [PR] Add heap_size for statistics [datafusion]
via GitHub
Re: [PR] Add heap_size for statistics [datafusion]
via GitHub
[PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
Re: [PR] [TESTING] Test parquet filter pushdown with mask backed row selection [datafusion]
via GitHub
[PR] replace HashTableLookupExpr with lit(true) in proto serialization [datafusion]
via GitHub
Re: [PR] replace HashTableLookupExpr with lit(true) in proto serialization [datafusion]
via GitHub
Re: [PR] replace HashTableLookupExpr with lit(true) in proto serialization [datafusion]
via GitHub
Re: [PR] replace HashTableLookupExpr with lit(true) in proto serialization [datafusion]
via GitHub
Re: [PR] replace HashTableLookupExpr with lit(true) in proto serialization [datafusion]
via GitHub
[I] Should not convert HashAggregate if following shuffle is not converted [datafusion-comet]
via GitHub
Re: [I] Should not convert HashAggregate if following shuffle is not converted [datafusion-comet]
via GitHub
Re: [PR] Chore: refactor bit_count [datafusion-comet]
via GitHub
[PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub
Re: [PR] Add recursive protection on planner's `create_physical_expr` [datafusion]
via GitHub