[
https://issues.apache.org/jira/browse/SPARK-56626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated SPARK-56626:
-----------------------------------
Labels: pull-request-available (was: )
> Introduce SupportsReportCatalogStatistics mixin for Table
> ---------------------------------------------------------
>
> Key: SPARK-56626
> URL: https://issues.apache.org/jira/browse/SPARK-56626
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 4.2.0
> Reporter: Anton Lykov
> Priority: Major
> Labels: pull-request-available
>
> Today DSv2 connectors can only report table-level statistics through
> `Scan.estimateStatistics()`, which conflates catalog-level (pre-filter) and
> scan-level (post-pushdown) stats and forces construction of a `ScanBuilder`
> just to read them —
> potentially triggering file listing or remote metadata round-trips. This
> proposes a new `@Evolving` `Table` mix-in `SupportsReportCatalogStatistics`
> that exposes table-wide, scan-independent statistics, analogous to DSv1's
> `CatalogStatistics`.
> It is complementary to `Scan.SupportsReportStatistics`; a table may
> implement both. See the PR for details.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]