[ 
https://issues.apache.org/jira/browse/PIG-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13477108#comment-13477108
 ] 

Bill Graham commented on PIG-2983:
----------------------------------

As a percentage of the execution time I'd think the time to build the plan 
would be small. How long is this taking for you?

For a unique signature see {{pig.logical.plan.signature}}, which is a hash of 
the logical plan. 
                
> Enable storing/loading an execution plan
> ----------------------------------------
>
>                 Key: PIG-2983
>                 URL: https://issues.apache.org/jira/browse/PIG-2983
>             Project: Pig
>          Issue Type: Wish
>          Components: impl
>            Reporter: Mathias Herberts
>
> I have no idea at the moment if this is feasible or not, but here is the idea.
> When running Pig in production it happens quite frequently that the building 
> of the plan takes a fair amount of time compared to the actual execution of 
> the job on the Hadoop cluster.
> For jobs which are run periodically without modification of the underlying 
> pig script it would be nice to be able to store the execution plan when it is 
> first built, then load that persisted plan for all other executions.
> A unique signature of the script could be computed, for example by removing 
> comments and blank lines and computing a digest, and the computed plan stored 
> under that name with a signature mechanism to ensure it's correct.
> This would speed up lots of executions of Pig.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to