github
Thread
Date
Earlier messages
Messages by Thread
[I] Potential performance regression with `parquet 56.1.0` / data ranges [datafusion]
via GitHub
Re: [I] Upgrade to protobuf 3.15 so that we can use `optional` keyword [datafusion]
via GitHub
Re: [I] Upgrade to protobuf 3.15 so that we can use `optional` keyword [datafusion]
via GitHub
Re: [I] Consider using upstream arrow-avro reader [datafusion]
via GitHub
Re: [I] Consider using upstream arrow-avro reader [datafusion]
via GitHub
Re: [I] Typo in error message in substring [datafusion]
via GitHub
[PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Add explicit PMC/committers list to governance docs page [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
Re: [PR] Upgrade arrow/parquet to 56.1.0 [datafusion]
via GitHub
[PR] Add Table of content to past blogs [datafusion-site]
via GitHub
Re: [I] Use taiki-e/install-action in CI [datafusion]
via GitHub
[PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Use taiki-e/install-action and binstall in CI [datafusion]
via GitHub
Re: [PR] Implemented the `From` method for all clear variants in Statement [datafusion-sqlparser-rs]
via GitHub
Re: [PR] Implemented the `From` method for all clear variants in Statement [datafusion-sqlparser-rs]
via GitHub
[PR] fix error message typo [datafusion]
via GitHub
Re: [I] Update code to use new, non deprecated UnionExec API [datafusion]
via GitHub
Re: [I] Update code to use new, non deprecated UnionExec API [datafusion]
via GitHub
[I] Add an optimizer rule to remove empty inputs from union [datafusion]
via GitHub
Re: [I] Add an optimizer rule to remove empty inputs from union [datafusion]
via GitHub
[PR] chore(deps): bump substrait from 0.58.0 to 0.60.0 [datafusion]
via GitHub
Re: [PR] chore(deps): bump semver from 1.0.26 to 1.0.27 [datafusion]
via GitHub
[PR] Use `Display` formatting of `DataType`:s in error messagwe [datafusion]
via GitHub
Re: [PR] Use `Display` formatting of `DataType`:s in error messagwe [datafusion]
via GitHub
Re: [PR] Use `Display` formatting of `DataType`:s in error messagwe [datafusion]
via GitHub
Re: [PR] Use `Display` formatting of `DataType`:s in error messagwe [datafusion]
via GitHub
[PR] chore(deps): bump cc from 1.2.36 to 1.2.37 in /native [datafusion-comet]
via GitHub
[PR] Prevent exponential planning time for Window functions [datafusion]
via GitHub
Re: [I] ListingTable and FileScanConfig assume all files accessible via single ObjectStore instance [datafusion]
via GitHub
[PR] fix: Check broadcast plan of ReusedExchangeExec [datafusion-comet]
via GitHub
Re: [PR] fix: Check reused broadcast plan in non-AQE [datafusion-comet]
via GitHub
Re: [PR] fix: Check reused broadcast plan in non-AQE and make setNumPartitions thread safe [datafusion-comet]
via GitHub
Re: [I] More types in `try_*` [datafusion-comet]
via GitHub
[PR] perf: Improve NLJ for very small right side case [datafusion]
via GitHub
Re: [PR] perf: Improve NLJ for very small right side case [datafusion]
via GitHub
Re: [PR] perf: Improve NLJ for very small right side case [datafusion]
via GitHub
Re: [PR] perf: Improve NLJ for very small right side case [datafusion]
via GitHub
[I] Fix type hint to allow DataFusion PyCapsule provider into `udaf` function [datafusion-python]
via GitHub
[I] Incorrect NaN Comparison [datafusion-python]
via GitHub
Re: [I] Provide the option of displaying Explain Plans as JSON [datafusion]
via GitHub
Re: [I] Provide the option of displaying Explain Plans as JSON [datafusion]
via GitHub
Re: [I] Variance aggregation calculation does not work for a single item [datafusion]
via GitHub
Re: [I] Variance aggregation calculation does not work for a single item [datafusion]
via GitHub
Re: [PR] Perf: Optimize in memory sort [datafusion]
via GitHub
Re: [I] Add datafusion-cli to the workspace [datafusion]
via GitHub
Re: [I] Add datafusion-cli to the workspace [datafusion]
via GitHub
[PR] perf: Implement specialized aggregates for `COUNT(*)` and `COUNT(expr)` [datafusion-comet]
via GitHub
Re: [PR] perf: Implement specialized aggregates for `COUNT(*)` and `COUNT(expr)` [datafusion-comet]
via GitHub
Re: [PR] perf: Implement specialized aggregates for `COUNT(*)` and `COUNT(expr)` [datafusion-comet]
via GitHub
Re: [I] CometHashAggregate prefixed with ! in explain plan [datafusion-comet]
via GitHub
Re: [PR] minor: Update doc comments on type signature [datafusion]
via GitHub
Re: [PR] minor: Update doc comments on type signature [datafusion]
via GitHub
Re: [I] [EPIC] Ballista 2025/H2 Roadmap Proposal [datafusion-ballista]
via GitHub
Re: [I] Display of Lag With with_column uses autogenerated name and provided name [datafusion-python]
via GitHub
[PR] fix: remove redundant column when using window functions [datafusion-python]
via GitHub
Re: [PR] fix: remove redundant column when using window functions [datafusion-python]
via GitHub
[PR] Microbench [datafusion-comet]
via GitHub
Re: [PR] minor: Update TPC-DS microbenchmarks to remove "scan only" and "exec only" runs [datafusion-comet]
via GitHub
Re: [I] Incorrect NaN comparison [datafusion]
via GitHub
Re: [I] Incorrect NaN comparison [datafusion]
via GitHub
[PR] feat: support spark udf format_string [datafusion]
via GitHub
[I] [EPIC] Shuffle file execs improvement [datafusion-ballista]
via GitHub
[PR] `avg(distinct)` support for decimal types [datafusion]
via GitHub
Re: [PR] `avg(distinct)` support for decimal types [datafusion]
via GitHub
[I] Release DataFusion `50.0.0` (Nov 2025) [datafusion]
via GitHub
Re: [I] Release DataFusion `50.0.0` (Nov 2025) [datafusion]
via GitHub
Re: [I] Release DataFusion `51.0.0` (Nov 2025) [datafusion]
via GitHub
Re: [I] Release DataFusion `51.0.0` (Nov 2025) [datafusion]
via GitHub
Re: [I] Consider using gRPC streams + chunking to avoid message size limits [datafusion-ballista]
via GitHub
Re: [I] Consider using gRPC streams + chunking to avoid message size limits [datafusion-ballista]
via GitHub
Re: [PR] feat: optimize and unparse grouping [datafusion]
via GitHub
Re: [PR] feat: optimize and unparse grouping [datafusion]
via GitHub
[I] Native decimal 256 bit support for log [datafusion]
via GitHub
Re: [I] feat: support decimal for math functions: log [datafusion]
via GitHub
Re: [I] External sort failing with modest memory limit when writing parquet files [datafusion]
via GitHub
Re: [I] External sorting not working for (maybe only for string columns??) [datafusion]
via GitHub
Re: [I] CachedParquetFileReader should respect the metadata prefetch hint [datafusion]
via GitHub
Re: [I] Publish blog post for 0.10.0 release [datafusion-comet]
via GitHub
[PR] Implement arithmetic overflow error handling [datafusion]
via GitHub
Re: [PR] Implement arithmetic overflow error handling [datafusion]
via GitHub
Re: [PR] Implement arithmetic overflow error handling [datafusion]
via GitHub
[PR] docs: [branch-0.10] Update version number in branch-0.10 user guide [datafusion-comet]
via GitHub
[PR] docs: Publish 0.10.0 user guide [datafusion-comet]
via GitHub
Re: [I] Can't read a directory of CSV files: incorrect number of fields for line 1, expected 17 got 20 [datafusion]
via GitHub
[PR] feat: Support reading CSV files with inconsistent column counts [datafusion]
via GitHub
Re: [PR] feat: Support reading CSV files with inconsistent column counts [datafusion]
via GitHub
[PR] Add Comet 0.10.0 blog post draft [datafusion-site]
via GitHub
Re: [PR] Add Comet 0.10.0 blog post draft [datafusion-site]
via GitHub
Re: [PR] Add Comet 0.10.0 blog post draft [datafusion-site]
via GitHub
Re: [PR] feat: Use PartialSortExec when input data is sorted on prefix columns [datafusion]
via GitHub
Re: [PR] feat: Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
Re: [PR] feat: Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
Re: [PR] feat: Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
Re: [PR] feat: Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
Re: [PR] feat: Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
[PR] Adds micro-benchmark queries for existence joins [datafusion]
via GitHub
Re: [PR] Adds micro-benchmark queries for existence joins [datafusion]
via GitHub
Re: [PR] Adds micro-benchmark queries for existence joins [datafusion]
via GitHub
Re: [PR] Adds micro-benchmark queries for existence joins [datafusion]
via GitHub
[PR] fix: Prevent duplicate expressions in DynamicPhysicalExpr [datafusion]
via GitHub
[I] Detecting unused dependencies in CI [datafusion]
via GitHub
Re: [I] Detecting unused dependencies in CI [datafusion]
via GitHub
[I] Support timestamp formats on CsvReadOptions or Schema [datafusion]
via GitHub
Re: [I] Support timestamp formats on CsvReadOptions or Schema [datafusion]
via GitHub
[PR] feat: Validate explain format [datafusion]
via GitHub
Re: [PR] feat: Ensure explain format in config is valid [datafusion]
via GitHub
Re: [PR] feat: Ensure explain format in config is valid [datafusion]
via GitHub
Re: [PR] Implement timestamp_cast_dtype for SqliteDialect [datafusion]
via GitHub
[I] Support map types for aggregation [datafusion-comet]
via GitHub
Re: [I] Support map types for aggregation [datafusion-comet]
via GitHub
[I] Make Nested Loop Join more efficient for very small right input [datafusion]
via GitHub
Re: [I] Make Nested Loop Join more efficient for very small right input [datafusion]
via GitHub
[PR] Always run CI checks [datafusion]
via GitHub
Re: [I] Panic happens when adding a decimal256 to a float (SQLancer) [datafusion]
via GitHub
Re: [I] Release sqlparser-rs version `0.59.0` around 2025-09-15 [datafusion-sqlparser-rs]
via GitHub
Re: [I] Release sqlparser-rs version `0.59.0` around 2025-09-15 [datafusion-sqlparser-rs]
via GitHub
Re: [D] DISCUSSION: DataFusion Meetup in New York, NY, USA - Sep 15, 2025 [datafusion]
via GitHub
Re: [D] DISCUSSION: DataFusion Meetup in New York, NY, USA - Sep 15, 2025 [datafusion]
via GitHub
[PR] refactor: Scala hygiene - remove `scala.collection.JavaConverters` [datafusion-comet]
via GitHub
Re: [PR] refactor: Scala hygiene - remove `scala.collection.JavaConverters` [datafusion-comet]
via GitHub
Re: [PR] refactor: Scala hygiene - remove `scala.collection.JavaConverters` [datafusion-comet]
via GitHub
[PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
Re: [PR] fix: ignore non-existent columns when adding filter equivalence info in `FileScanConfig` [datafusion]
via GitHub
[PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
Re: [PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
Re: [PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
Re: [PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
Re: [PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
Re: [PR] Trying cargo machete to prune unused deps. [datafusion]
via GitHub
[I] API Suggestion match casing for isnan and is_null [datafusion-python]
via GitHub
[PR] [branch-50] fix: Implement AggregateUDFImpl::reverse_expr for StringAgg (#17165) (#17473) [datafusion]
via GitHub
Re: [PR] [branch-50] fix: Implement AggregateUDFImpl::reverse_expr for StringAgg (#17165) (#17473) [datafusion]
via GitHub
[I] Removing ad-hoc implementation of `encode_arrow_schema` [datafusion]
via GitHub
Re: [I] Removing ad-hoc implementation of `encode_arrow_schema` [datafusion]
via GitHub
[I] Implement CometInMemoryTableScanExec [datafusion-comet]
via GitHub
Re: [I] Implement CometInMemoryTableScanExec [datafusion-comet]
via GitHub
[I] [iceberg] Tracking PR for deleted rows support [datafusion-comet]
via GitHub
[I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
Re: [I] TPC-DS query #88 fails with disabled AQE [datafusion-comet]
via GitHub
[I] Numeric overflow should result in query error [datafusion]
via GitHub
Re: [I] Numeric overflow should result in query error [datafusion]
via GitHub
Re: [I] Numeric overflow should result in query error [datafusion]
via GitHub
Re: [I] Numeric overflow should result in query error [datafusion]
via GitHub
Re: [I] Numeric overflow should result in query error [datafusion]
via GitHub
Re: [I] Numeric overflow should result in query error [datafusion]
via GitHub
[PR] Using `encode_arrow_schema` from arrow-rs. [datafusion]
via GitHub
Re: [PR] Using `encode_arrow_schema` from arrow-rs. [datafusion]
via GitHub
[I] Consumer receives duplicate predicates when join mode is CollectLeft [datafusion]
via GitHub
Re: [I] Consumer receives duplicate bound predicates when join mode is CollectLeft [datafusion]
via GitHub
[PR] Update Bug issue template to use Bug issue type [datafusion]
via GitHub
Re: [PR] Update Bug issue template to use Bug issue type [datafusion]
via GitHub
[PR] Disable `required_status_checks` for now [datafusion]
via GitHub
Re: [PR] Disable `required_status_checks` for now [datafusion]
via GitHub
[PR] Introduce `avg_distinct()` and `sum_distinct()` functions to DataFrame API [datafusion]
via GitHub
Re: [PR] Introduce `avg_distinct()` and `sum_distinct()` functions to DataFrame API [datafusion]
via GitHub
[PR] fix: Change `OuterReferenceColumn` to contain the entire outer field to prevent metadata loss [datafusion]
via GitHub
Re: [PR] fix: Change `OuterReferenceColumn` to contain the entire outer field to prevent metadata loss [datafusion]
via GitHub
[PR] Support `CAST` from temporal to `Utf8View` [datafusion]
via GitHub
[I] Panic when cast from `DATE` to `TIMESTAMP` overflows [datafusion]
via GitHub
Re: [I] Panic when cast from `DATE` to `TIMESTAMP` overflows [datafusion]
via GitHub
[I] Cast from DATE to VARCHAR fails [datafusion]
via GitHub
Re: [I] Cast from DATE to VARCHAR fails [datafusion]
via GitHub
Re: [I] Cast from DATE to VARCHAR fails [datafusion]
via GitHub
Re: [I] Cast from DATE to VARCHAR fails [datafusion]
via GitHub
Re: [I] Performance of `distinct on (columns)` [datafusion]
via GitHub
[I] Investigate OpenDAL features [datafusion-comet]
via GitHub
[I] Improve Remote Shuffle Read Speed and Resource Utilisation [datafusion-ballista]
via GitHub
Earlier messages