Karuppayya created SPARK-53848:
----------------------------------

             Summary: Support for Sketch family in ThetaSketch Aggregates
                 Key: SPARK-53848
                 URL: https://issues.apache.org/jira/browse/SPARK-53848
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 4.1.0
            Reporter: Karuppayya


Theta sketch aggregate currently supports only quick select.

Consumers like Iceberg{^}[1][2]{^} might benefit will benefit from the sketch 
aggregate if has the ability to specify `ALPHA family`

[1] [Iceberg specification that quotes to use 
ALPHA|https://iceberg.apache.org/puffin-spec/#apache-datasketches-theta-v1-blob-type]

[2] [Custom implementation of theta sketch aggregates in 
Iceberg|https://github.com/apache/iceberg/blob/2f6e7e6371902bcb72f21deeaea8889d4768004e/spark/v3.5/spark/src/main/scala/org/apache/spark/sql/stats/ThetaSketchAgg.scala#L67
 that can be replaced with Spark Theta aggregates



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to