[ 
https://issues.apache.org/jira/browse/PIG-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13254455#comment-13254455
 ] 

Dmitriy V. Ryaboy commented on PIG-2653:
----------------------------------------

Reusing the physical plan is going to be a problem -- as Ashutosh mentioned, 
physical plan includes actual load / store locations. Plus things are set on 
the physical plan based on examining the actual data (things like parallelism).

Reusing the Logical Plan would be easier, as it does not change between 
executions. Anyone know how long the different compilation steps take relative 
to each other?

Don't know about metastore as storage for this. Too permanent. Something like 
memcache of hbase seems better, as both have built-in expiration mechanisms. 
Maybe a pluggable (off by default) plan caching strategy.

Would need a way to force bypassing the cache, or overwriting the cache.

Would also want to make pig build part of the cache key, so that different 
versions of pig can be used on the same cluster without setting up different 
caches.
                
> Precompile option in PIG (Ability to store the plan for queries which are run 
> multiple times)
> ---------------------------------------------------------------------------------------------
>
>                 Key: PIG-2653
>                 URL: https://issues.apache.org/jira/browse/PIG-2653
>             Project: Pig
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Rajesh Balamohan
>
> Based on the size of the PIG script, it takes 1 or 2 minutes in certain cases 
> for PIG compiler to create the MR plan. If the same script has to be run 
> later point in time, it has to go through this process again. 
> It would be nice, if PIG can store the result (execution plan) which can be 
> reused when the same script is run again. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to