[ 
https://issues.apache.org/jira/browse/SPARK-33628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

jinhai updated SPARK-33628:
---------------------------
    Description: 
When partitions are tracked by the catalog, all custom partition locations 
are computed, in particular for dynamic partition writes where the field 
staticPartitions is empty.
 The poor performance of the listPartitions method leaves the Driver 
unresponsive for a long time.

When reading 12253 partitions, the getPartitionsByNames method takes 2 seconds, 
while the getPartitions method takes 457 seconds, nearly 8 minutes.

!image-2020-12-02-16-57-43-619.png|width=812,height=56!

  was:
When partitions are tracked by the catalog, all custom partition locations 
are computed, in particular for dynamic partition writes where the field 
staticPartitions is empty.
 The poor performance of the listPartitions method leaves the Driver 
unresponsive for a long time.

When reading 12253 partitions, the getPartitionsByNames method takes 2 seconds, 
while the getPartitions method takes 457 seconds, nearly 8 minutes.

!image-2020-12-02-16-57-43-619.png!


> Use the Hive.getPartitionsByNames method instead of Hive.getPartitions in the 
> HiveClientImpl
> --------------------------------------------------------------------------------------------
>
>                 Key: SPARK-33628
>                 URL: https://issues.apache.org/jira/browse/SPARK-33628
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.0.0, 3.0.1
>            Reporter: jinhai
>            Priority: Major
>         Attachments: image-2020-12-02-16-57-43-619.png, 
> image-2020-12-03-14-38-19-221.png
>
>
> When partitions are tracked by the catalog, all custom partition locations 
> are computed, in particular for dynamic partition writes where the field 
> staticPartitions is empty.
>  The poor performance of the listPartitions method leaves the Driver 
> unresponsive for a long time.
> When reading 12253 partitions, the getPartitionsByNames method takes 2 seconds, 
> while the getPartitions method takes 457 seconds, nearly 8 minutes.
> !image-2020-12-02-16-57-43-619.png|width=812,height=56!
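
The access pattern proposed in the issue title, looking partitions up by 
name instead of listing every partition of the table, can be sketched 
roughly as below. This is only an illustration under stated assumptions: 
FakeMetastore, fetch_by_names and batch_size are hypothetical names invented 
for this sketch, they are not the Hive or Spark API, and the in-memory store 
says nothing about the timings reported above.

```python
from typing import Dict, Iterable, List


class FakeMetastore:
    """Hypothetical in-memory stand-in for a metastore client (not the Hive API)."""

    def __init__(self, partitions: Dict[str, str]) -> None:
        # Maps a partition name such as "dt=2020-12-01" to its location.
        self._partitions = dict(partitions)
        self.calls = 0  # number of simulated round trips

    def get_partitions_by_names(self, names: Iterable[str]) -> Dict[str, str]:
        # One simulated round trip that returns only the requested partitions.
        self.calls += 1
        return {n: self._partitions[n] for n in names if n in self._partitions}


def fetch_by_names(client: FakeMetastore, names: List[str],
                   batch_size: int = 1000) -> Dict[str, str]:
    """Look up the named partitions in fixed-size batches."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    found: Dict[str, str] = {}
    for start in range(0, len(names), batch_size):
        batch = names[start:start + batch_size]
        found.update(client.get_partitions_by_names(batch))
    return found


if __name__ == "__main__":
    store = FakeMetastore({
        f"dt=2020-12-{d:02d}": f"/warehouse/t/dt=2020-12-{d:02d}"
        for d in range(1, 31)
    })
    wanted = ["dt=2020-12-01", "dt=2020-12-02", "dt=2020-12-03"]
    result = fetch_by_names(store, wanted, batch_size=2)
    print(len(result), store.calls)  # prints "3 2"
```

Names that do not exist in the store are silently skipped here; whether a 
real client should skip them or fail is a design choice this sketch leaves 
open.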



