[jira] [Commented] (SPARK-36645) Aggregate (Min/Max/Count) push down for Parquet

2022-03-08 Thread Thomas Graves (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-36645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17503212#comment-17503212
 ] 

Thomas Graves commented on SPARK-36645:
---

Note it appears this only really pushes down count because:

Parquet Binary min/max could be truncated. We may get wrong result if we rely 
on parquet Binary min/max.

I'm going to update the title to reflect this.

> Aggregate (Min/Max/Count) push down for Parquet
> ---
>
> Key: SPARK-36645
> URL: https://issues.apache.org/jira/browse/SPARK-36645
> Project: Spark
>  Issue Type: Sub-task
>  Components: SQL
>Affects Versions: 3.3.0
>Reporter: Huaxin Gao
>Assignee: Huaxin Gao
>Priority: Major
> Fix For: 3.3.0
>
>
> Push down Aggregate (Min/Max/Count)  for Parquet for performance improvement



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-36645) Aggregate (Min/Max/Count) push down for Parquet

2021-10-20 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-36645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17432127#comment-17432127
 ] 

Apache Spark commented on SPARK-36645:
--

User 'huaxingao' has created a pull request for this issue:
https://github.com/apache/spark/pull/34346

> Aggregate (Min/Max/Count) push down for Parquet
> ---
>
> Key: SPARK-36645
> URL: https://issues.apache.org/jira/browse/SPARK-36645
> Project: Spark
>  Issue Type: Sub-task
>  Components: SQL
>Affects Versions: 3.3.0
>Reporter: Huaxin Gao
>Assignee: Huaxin Gao
>Priority: Major
> Fix For: 3.3.0
>
>
> Push down Aggregate (Min/Max/Count)  for Parquet for performance improvement



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-36645) Aggregate (Min/Max/Count) push down for Parquet

2021-09-01 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-36645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17408359#comment-17408359
 ] 

Apache Spark commented on SPARK-36645:
--

User 'huaxingao' has created a pull request for this issue:
https://github.com/apache/spark/pull/33639

> Aggregate (Min/Max/Count) push down for Parquet
> ---
>
> Key: SPARK-36645
> URL: https://issues.apache.org/jira/browse/SPARK-36645
> Project: Spark
>  Issue Type: Sub-task
>  Components: SQL
>Affects Versions: 3.3.0
>Reporter: Huaxin Gao
>Priority: Major
>
> Push down Aggregate (Min/Max/Count)  for Parquet for performance improvement



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org