Messages by Thread
-
Re: [I] Make PyDataFrame.inner_df() public [datafusion-python]
via GitHub
-
Re: [I] Expand use of sql parsing string expressions in DataFrame [datafusion-python]
via GitHub
-
Re: [I] Support for automatic replacement scans [datafusion-python]
via GitHub
-
Re: [I] Separable Python and Rust components [datafusion-python]
via GitHub
-
Re: [I] Add `ctx = SessionContext()` to __init__ [datafusion-python]
via GitHub
-
Re: [I] Why uuid is only assigned for create_dataframe, not assigned for read_xxx [datafusion-python]
via GitHub
-
Re: [I] Enhance `__repr__` and `_repr_html_` with a note for additional rows [datafusion-python]
via GitHub
-
Re: [I] Improve release candidate numbering [datafusion-python]
via GitHub
-
Re: [I] how to use datafusion-contrib through the python bindings [datafusion-python]
via GitHub
-
Re: [I] Add `PyarrowScalarUDF` and convert `PyScalarUDF` to API recommended upstream [datafusion-python]
via GitHub
-
Re: [I] Add support for conversion of in memory tables to protobuf [datafusion-python]
via GitHub
-
Re: [I] Can't convert date_bin aggregated with count(*) to arrow if some windows contain null data [datafusion-python]
via GitHub
-
Re: [I] RFC: Re-work some DataFrame APIs [datafusion-python]
via GitHub
-
Re: [I] following sql reports `Error during planning: Unsupported operator in the subquery plan.` [datafusion-python]
via GitHub
-
Re: [PR] fix: Iceberg reflection for current() on TableOperations hierarchy [datafusion-comet]
via GitHub
-
Re: [I] ORDER BY is ignored when COPYing from a pyarrow table to a csv file [datafusion-python]
via GitHub
-
Re: [PR] refactor: extract sort pushdown logic from FileScanConfig into separate module [datafusion]
via GitHub
-
[PR] Add xml '...' TypedString support for PostgreSQL [datafusion-sqlparser-rs]
via GitHub
-
Re: [I] Weird behaviour with explain in SQL [datafusion-python]
via GitHub
-
Re: [PR] Introduce Morselizer API, rewrite `ParquetOpener` to `ParquetMorselizer` [datafusion]
via GitHub
-
Re: [I] Use pyarrow.substrait to execute scans on Pyarrow Datasets [datafusion-python]
via GitHub
-
Re: [I] `UPDATE ... ORDER BY ... LIMIT` support in MySQL Dialect [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] feat: Use single spill file for multiple partitions in native shuffle [datafusion-comet]
via GitHub
-
Re: [I] Load runtime configuration from environment vars, configuration file, or command line [datafusion-python]
via GitHub
-
Re: [PR] Migrate PhysicalExprAdapter to unified CastExpr and remove CastColumnExpr usage [datafusion]
via GitHub
-
[PR] perf: replace SMJ's join_filter_not_matched_map HashMap with Vec<FilterState> [datafusion]
via GitHub
-
[PR] feat: add AI skill to find and improve the Pythonic interface to functions [datafusion-python]
via GitHub
-
Re: [PR] [TEST] iterate morsels API [datafusion]
via GitHub
-
[I] bug: datafusion-spark string literals don't interpret escape sequences like Spark [datafusion]
via GitHub
-
Re: [I] RFC: What 3 level naming system should we use for catalog providers? [datafusion-python]
via GitHub
-
Re: [I] Update CI to use rust based tpc-h data generator [datafusion-python]
via GitHub
-
[I] bug: datafusion-spark format_string %t timestamp specifiers do not match Spark behavior [datafusion]
via GitHub
-
Re: [I] Update release documentation to use uv [datafusion-python]
via GitHub
-
Re: [I] Add remaining non-wrapped functions [datafusion-python]
via GitHub
-
[I] bug: datafusion-spark mod/pmod returns NaN instead of NULL for float division by zero [datafusion]
via GitHub
-
[PR] sql: render PostgreSQL array literals as ARRAY[...] in unparser [datafusion]
via GitHub
-
[I] bug: datafusion-spark array_repeat incorrectly returns NULL when element is NULL [datafusion]
via GitHub
-
[PR] Update datafusion-testing submodule to latest revision [datafusion]
via GitHub
-
[I] substring incompatible with spark for negative start index [datafusion-comet]
via GitHub
-
Re: [PR] feat: [iceberg] allow native Iceberg scans with non-identity transform residuals [datafusion-comet]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] perf: use DynComparator in sort-merge join (SMJ), microbenchmark queries up to 12% faster, TPC-H overall ~5% faster [datafusion]
via GitHub
-
Re: [PR] Support multi-column aliases in SELECT items [datafusion-sqlparser-rs]
via GitHub
-
[I] bug: datafusion-spark substring returns wrong result for large negative start positions [datafusion]
via GitHub
-
Re: [I] Current shuffle format has too much overhead with default batch size [datafusion-comet]
via GitHub
-
Re: [I] Parquet column arrow type does not determine DataFusion column data_type [datafusion-python]
via GitHub
-
Re: [PR] Perf: Window topn optimisation [datafusion]
via GitHub
-
[PR] feat: add `with_metadata` scalar UDF to attach Arrow field metadata [datafusion]
via GitHub
-
Re: [I] Logical plan generation inconsistency [datafusion-python]
via GitHub
-
Re: [PR] feat: extend single ndv optimization to non-numeric types for equality predicates [datafusion]
via GitHub
-
[PR] fix: generate integer keys instead of floats in TPC-DS data [datafusion-benchmarks]
via GitHub
-
[PR] feat: add PySpark validation script for datafusion-spark .slt tests [datafusion]
via GitHub