[
https://issues.apache.org/jira/browse/PIG-3169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13637832#comment-13637832
]
Mark Wagner commented on PIG-3169:
----------------------------------
Thanks for taking a look :)
# That could be one examples. Others I was thinking of are distributed cache
jars, the result of an openIterator call (DUMP), and other user created
temporary files like the one that was causing the previous test failures. So I
think the deleteTempFiles call is still necessary. Those are all much smaller
relative to the intermediate data though.
# Yes, a set structure would probably be more appropriate for FileLocalizer.
I'll do that.
# That's what I wanted to do, but it seems the plan gets destroyed when it's
executed so I can't walk it afterwords and clean up while walking.
# My mistake. Those should be consistent.
> Remove intermediate data after a job finishes
> ---------------------------------------------
>
> Key: PIG-3169
> URL: https://issues.apache.org/jira/browse/PIG-3169
> Project: Pig
> Issue Type: Improvement
> Reporter: Mark Wagner
> Assignee: Mark Wagner
> Priority: Minor
> Fix For: 0.12
>
> Attachments: PIG-3169.1.patch, PIG-3169.2.patch, PIG-3169.3.patch,
> PIG-3169-hotfix.patch
>
>
> When using Grunt, intermediate data and distributed caches files are left in
> 'pig.temp.dir' until the session is closed. It would be nice to cleanup files
> as they are no longer needed.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira