[ 
https://issues.apache.org/jira/browse/HIVE-17814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

anishek reassigned HIVE-17814:
------------------------------


> Reduce Memory footprint for large database bootstrap replication load 
> ----------------------------------------------------------------------
>
>                 Key: HIVE-17814
>                 URL: https://issues.apache.org/jira/browse/HIVE-17814
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2
>    Affects Versions: 3.0.0
>            Reporter: anishek
>            Assignee: anishek
>             Fix For: 3.0.0
>
>
> As part of HIVE-16896 we are doing dynamic Query Task generation for 
> bootstrap repl load. This was done since the number of tasks for large 
> databases will generate a very large graph with hundreds of thousands of 
> objects, this would put additional memory pressure on hive. 
> The execution hook's however still keep reference to the query plan which 
> gets dynamically modified and at the end of all task execution hive will have 
> the whole DAG in memory which is what we have to prevent, Additionally for 
> PostExecution Hive hooks we are additionally storing the TaskRunner objects 
> for each task that is executed. 
> We have to handle these issues to prevent excessive memory usage for 
> replication specifically bootstrap replication. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to