Messages by Thread
-
Re: [I] RFC: What 3 level naming system should we use for catalog providers? [datafusion-python]
via GitHub
-
Re: [I] Update CI to use rust based tpc-h data generator [datafusion-python]
via GitHub
-
[I] bug: datafusion-spark format_string %t timestamp specifiers do not match Spark behavior [datafusion]
via GitHub
-
Re: [I] Update release documentation to use uv [datafusion-python]
via GitHub
-
Re: [I] Add remaining non-wrapped functions [datafusion-python]
via GitHub
-
[I] bug: datafusion-spark mod/pmod returns NaN instead of NULL for float division by zero [datafusion]
via GitHub
-
[PR] sql: render PostgreSQL array literals as ARRAY[...] in unparser [datafusion]
via GitHub
-
[I] bug: datafusion-spark array_repeat incorrectly returns NULL when element is NULL [datafusion]
via GitHub
-
[PR] Update datafusion-testing submodule to latest revision [datafusion]
via GitHub
-
[I] substring incompatible with spark for negative start index [datafusion-comet]
via GitHub
-
Re: [PR] feat: [iceberg] allow native Iceberg scans with non-identity transform residuals [datafusion-comet]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] Support multi-column aliases in SELECT items [datafusion-sqlparser-rs]
via GitHub
-
[I] bug: datafusion-spark substring returns wrong result for large negative start positions [datafusion]
via GitHub
-
Re: [I] Current shuffle format has too much overhead with default batch size [datafusion-comet]
via GitHub
-
Re: [I] Parquet column arrow type does not determine DataFusion column data_type [datafusion-python]
via GitHub
-
Re: [PR] Perf: Window topn optimisation [datafusion]
via GitHub
-
[PR] feat: add `with_metadata` scalar UDF to attach Arrow field metadata [datafusion]
via GitHub
-
Re: [I] Logical plan generation inconsistency [datafusion-python]
via GitHub
-
Re: [PR] feat: extend single ndv optimization to non-numeric types for equality predicates [datafusion]
via GitHub
-
[PR] fix: generate integer keys instead of floats in TPC-DS data [datafusion-benchmarks]
via GitHub
-
[PR] feat: add PySpark validation script for datafusion-spark .slt tests [datafusion]
via GitHub
-
Re: [I] lpad, rpad, translate should use codepoints, not graphemes [datafusion]
via GitHub
-
Re: [PR] feat: use byte-based target batch size for shuffle IPC blocks [datafusion-comet]
via GitHub
-
Re: [I] test_binary_string_functions fails locally [datafusion-python]
via GitHub
-
Re: [I] ParserError when "WITHIN GROUP" is specified in SELECT [datafusion-python]
via GitHub
-
Re: [PR] feat: add cast_to_type UDF for type-based casting [datafusion]
via GitHub
-
Re: [PR] fix: Use codepoints in `lpad`, `rpad`, `translate` [datafusion]
via GitHub
-
Re: [PR] Add more regexp_replace test coverage [datafusion]
via GitHub
-
Re: [I] fail to sql query if column contains capitalized letter [datafusion-python]
via GitHub
-
Re: [I] [EPIC] Expose all operators and expressions in Python [datafusion-python]
via GitHub
-
Re: [I] EPIC: Add all `SessionContext` and `DataFrame` methods to Python API [datafusion-python]
via GitHub
-
Re: [I] Missing docstring examples for `to_date`, `to_time`, and `to_local_time` in scalar temporal functions [datafusion-python]
via GitHub
-
Re: [PR] feat: add initial support for `array_exists` with lambda expression support [datafusion-comet]
via GitHub
-
Re: [PR] feat: replace custom shuffle block format with Arrow IPC streams [datafusion-comet]
via GitHub
-
Re: [PR] feat: Optimize ORDER BY by Pruning Functionally Redundant Sort Keys [datafusion]
via GitHub
-
Re: [I] Expose Spark functions [datafusion-python]
via GitHub
-
Re: [PR] feat: add array_exists with lambda expression support [datafusion-comet]
via GitHub
-
[I] Incorrect query results for GROUP BY with UNIQUE constraint [datafusion]
via GitHub
-
Re: [PR] feat: Add pluggable StatisticsRegistry for operator-level statistics propagation [datafusion]
via GitHub
-
Re: [PR] MySQL: Add support for `ORDER BY` on single-table `UPDATE` [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
-
Re: [PR] perf : Optimize count distinct [datafusion]
via GitHub