Messages by Thread
-
[PR] perf: Elimiate SortExec on generate_series() [datafusion]
via GitHub
-
Re: [PR] perf: [shuffle] extend field-major processing to nested struct fields [datafusion-comet]
via GitHub
-
Re: [PR] 2PC and staging output [datafusion-comet]
via GitHub
-
Re: [PR] feat: aes encrypt support [datafusion-comet]
via GitHub
-
Re: [PR] fix: multi-insert with native writer in Spark 4.x (#3430) [datafusion-comet]
via GitHub
-
Re: [PR] chore(deps): bump lz4_flex from 0.12.0 to 0.12.1 [datafusion-sandbox]
via GitHub
-
Re: [PR] feat(spark): add `levenshtein` with optional threshold support [datafusion]
via GitHub
-
[I] native_datafusion (Spark 3.x): shim's ParquetSchemaConvert translation produces an extra SparkException cause-chain layer [datafusion-comet]
via GitHub
-
Re: [PR] Add function origin API to replace name-based function checks in the optimizer [datafusion]
via GitHub
-
[PR] docs: redesign sidebar UX with collapsible groups, in-place search, and section scoping [datafusion-comet]
via GitHub
-
[I] native_datafusion: tests asserting parquet-mr's permissive overflow/narrowing behavior cannot be made to pass [datafusion-comet]
via GitHub
-
[I] native_datafusion: BINARY column without DecimalLogicalTypeAnnotation should be rejected when read as DECIMAL [datafusion-comet]
via GitHub
-
[PR] feat: implement parse_url [datafusion-comet]
via GitHub
-
Re: [I] [Feature] Support Spark expression: current_time_zone [datafusion-comet]
via GitHub
-
[PR] feat: wire `factorial` and update wire skill [datafusion-comet]
via GitHub
-
[PR] Add SelectivityTracker adaptive filter cost model [datafusion]
via GitHub
-
[PR] Adaptive filter pushdown for the parquet scan [datafusion]
via GitHub
-
[PR] Per-conjunct pruning statistics for PruningPredicate [datafusion]
via GitHub
-
[PR] Add OptionalFilterPhysicalExpr wrapper + proto support [datafusion]
via GitHub
-
[I] panic: lead window function subtracts past i64::MIN with i64::MAX offset [datafusion]
via GitHub
-
[I] panic: interval analysis handle_overflow unreachable!() for DATE + INTERVAL near boundary [datafusion]
via GitHub
-
[I] panic: interval analysis Interval::cardinality adds past u64::MAX for BETWEEN spanning full i64 range [datafusion]
via GitHub
-
[PR] refactor: Update SortMergeJoin to use async spill abstractions and remove open_sync [datafusion]
via GitHub
-
[I] panic: deep binary expression in SELECT projection aborts with stack overflow [datafusion]
via GitHub
-
[I] panic: arrow_cast(NULL, 'FixedSizeBinary(-1)') panics with LayoutError [datafusion]
via GitHub
-
[I] panic: array_repeat capacity overflow on constant scalar with large count [datafusion]
via GitHub
-
[I] panic: array_resize capacity overflow with large target size [datafusion]
via GitHub
-
[PR] [INFRA] Set up default rulesets for default and release branches [datafusion-ray]
via GitHub
-
[PR] [INFRA] Set up default rulesets for default and release branches [datafusion-ballista]
via GitHub
-
[I] Optimize `arrays_zip` to avoid row-by-row copying in the perfect-zip case [datafusion]
via GitHub
-
[I] panic: nth_value window function negates i64::MIN [datafusion]
via GitHub
-
[I] panic: lag window function negates i64::MIN offset during execution [datafusion]
via GitHub
-
[I] panic: date_trunc unwraps out-of-range lower-bound timestamp truncation [datafusion]
via GitHub
-
[I] panic: to_timestamp overflows converting large Decimal128 to nanoseconds [datafusion]
via GitHub
-
[I] panic: array_repeat list path overflows inner element count multiplication [datafusion]
via GitHub
-
[I] panic: date_bin compute_distance subtracts past i64::MIN [datafusion]
via GitHub
-
[I] panic: repeat string array path overflows capacity calculation [datafusion]
via GitHub
-
[I] panic: date_bin overflows scaling extreme Time64(Microsecond) source [datafusion]
via GitHub
-
[I] panic: date_bin overflows subtracting extreme nanosecond timestamp origin [datafusion]
via GitHub
-
[I] panic: generate_series table function overflows when integer step passes i64::MAX [datafusion]
via GitHub
-
[I] panic: INSERT VALUES placeholder $0 underflows parameter index inference [datafusion]
via GitHub
-
[I] panic: lead window function negates i64::MIN offset [datafusion]
via GitHub
-
[I] panic: array_position underflows start_from at i64::MIN [datafusion]
via GitHub
-
[I] panic: date_bin overflows scaling extreme Time64(Microsecond) origin [datafusion]
via GitHub
-
[I] panic: array_repeat scalar path overflows total repeated-value count [datafusion]
via GitHub
-
[I] panic: date_bin overflows scaling extreme Timestamp(Second) source [datafusion]
via GitHub
-
[I] panic: date_trunc overflows converting extreme non-ns timestamps to nanoseconds [datafusion]
via GitHub
-
Re: [PR] feat: add input_file_name() for file-backed scans (plumbing PR) [datafusion]
via GitHub
-
[PR] feat: Support Spark expression: current_time_zone [datafusion-comet]
via GitHub
-
[I] Enable spark.comet.exec.localTableScan.enabled when running Spark SQL tests [datafusion-comet]
via GitHub
-
Re: [D] DISCUSSION: Boston Datafusion Meetup September 2026 [datafusion]
via GitHub
-
[PR] deps: bump arrow and parquet to 58.3.0, fix page_util.rs after API change [datafusion-comet]
via GitHub
-
[PR] docs: user guide + runnable examples for distributing expressions (4/4) [datafusion-python]
via GitHub
-
[PR] feat: per-session Python UDF inlining toggle + sender ctx + strict refusal (3/4) [datafusion-python]
via GitHub
-
[PR] feat: inline encoding for Python aggregate and window UDFs (2/4) [datafusion-python]
via GitHub
-
[PR] feat: pickle support for Expr via inline scalar UDF encoding (1/4) [datafusion-python]
via GitHub
-
[PR] Add expression partitioning enum variant [datafusion]
via GitHub
-
[PR] Validate arrow_cast time units [datafusion]
via GitHub
-
[PR] perf: reclaim capacity in take_n during OOM-triggered partial-aggregation emit [datafusion]
via GitHub
-
[PR] Handle EXECUTE without statement name [datafusion]
via GitHub
-
[PR] feat: support stateful CometUDFs [datafusion-comet]
via GitHub
-
[PR] feat: reuse tree visitor to display tree for `LogicalPlan` [datafusion]
via GitHub
-
[PR] Support distributed processing by pickling expressions [datafusion-python]
via GitHub
-
[I] Reduce channel fanout overhead in `RepartitionExec` [datafusion]
via GitHub
-
[PR] build(deps-dev): bump pytest from 8.3.4 to 9.0.3 [datafusion-python]
via GitHub
-
Re: [I] Serialize user defined functions and table providers via protobuf [datafusion-python]
via GitHub
-
[PR] [TUI] Add screenshots of the TUI application in README/cli.md [datafusion-ballista]
via GitHub
-
[PR] [TUI] Executor's id is not a numeric column. It should be center aligned [datafusion-ballista]
via GitHub
-
Re: [PR] fix: replace default-allow cast unwrap with closed-by-default allowlist (timestamp + integer widening) [datafusion]
via GitHub
-
[I] Add screenshots of the TUI app in README/cli.md [datafusion-ballista]
via GitHub
-
Re: [PR] perf: collect nested struct addresses once in field-major append [jvm shuffle / r2c] [datafusion-comet]
via GitHub
-
Re: [PR] fix: wait for spawned tokio task before releasing native plan [datafusion-comet]
via GitHub
-
Re: [PR] fix: drop JNI GlobalRef before detaching thread in memory pool errors [datafusion-comet]
via GitHub
-
Re: [PR] test: add Scala test coverage for spark.sql.optimizer.nestedSchemaPruning.enabled [datafusion-comet]
via GitHub
-
Re: [PR] feat: add base64 expression [datafusion-comet]
via GitHub
-
Re: [PR] ci: ignore dev/ changes except dev/diffs in workflows [datafusion-comet]
via GitHub
-
Re: [PR] feat: memory-budget-aware SortMergeJoin to ShuffledHashJoin rewrite [datafusion-comet]
via GitHub
-
[PR] Test semver1 [datafusion]
via GitHub
-
[I] native_datafusion: silent wrong-answer paths for integer-to-decimal Parquet conversions Spark rejects [datafusion-comet]
via GitHub
-
[I] native_datafusion: silent wrong-answer paths for decimal-to-decimal precision/scale narrowing Spark rejects [datafusion-comet]
via GitHub
-
[PR] [TUI] Add support for horizontal scrolling to the job/stage plan popups [datafusion-ballista]
via GitHub
-
[I] [TUI] Support horizontal scrolling in job/stage plan popups [datafusion-ballista]
via GitHub
-
[I] Reading arrow schemas from parquet files is expensive [datafusion]
via GitHub
-
[PR] Test semver [datafusion]
via GitHub
-
[PR] chore(deps): bump arrow from 58.2.0 to 58.3.0 in /native [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump parquet from 58.1.0 to 58.2.0 in /native [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump the all-other-cargo-deps group in /native with 3 updates [datafusion-comet]
via GitHub
-
[I] Follow-ups for stats-based RG / file reorder (#21956): multi-column, function-wrapped, compound exprs [datafusion]
via GitHub
-
[PR] feat(dataframe): expose withColumn and unnestColumns [datafusion-java]
via GitHub
-
Re: [D] DISCUSSION: DataFusion Meetup in China 2026 [datafusion]
via GitHub
-
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
-
[I] panic on EXEC() / EXECUTE() with no name: Option::unwrap() on None in statement.rs [datafusion]
via GitHub
-
[PR] Use latest release tag as semver-check baseline in CI [datafusion]
via GitHub
-
Re: [PR] In list range pruning [datafusion]
via GitHub
-
[PR] feat(dataframe): add writeCsv with CsvWriteOptions [datafusion-java]
via GitHub
-
[PR] chore(deps): bump config from 0.15.22 to 0.15.23 [datafusion-ballista]
via GitHub
-
[PR] Fix within group aggregates with unparser [datafusion]
via GitHub
-
[I] arrow_cast accepts invalid Time32(Microsecond)/Time32(Nanosecond)/Time64(Second)/Time64(Millisecond), panics on use [datafusion]
via GitHub
-
[I] panic: multiply overflow in generate_series(DATE, DATE, INTERVAL) for dates far from 1970 [datafusion]
via GitHub
-
[PR] feat(arrow): expose Arrow IPC reader via registerArrow and readArrow [datafusion-java]
via GitHub
-
[I] Semver CI should compare against last released API, not main [datafusion]
via GitHub
-
[PR] Refactor parquet row filter setup [datafusion]
via GitHub
-
[PR] Saturate scheduler job elapsed time [datafusion-ballista]
via GitHub
-
[I] DataFusion drops grouped MIN/MAX rows with NULL sort keys under ORDER BY + LIMIT [datafusion]
via GitHub
-
[PR] [codex] Saturate scheduler job elapsed time [datafusion-ballista]
via GitHub
-
[I] Make parquet metrics use lazy-registration pattern [datafusion]
via GitHub
-
[PR] feat(contrib): introduce contrib extension SPI [datafusion-comet]
via GitHub
-
Re: [PR] fix: propagate inner-field metadata through composite-type constructors [datafusion]
via GitHub