[ 
https://issues.apache.org/jira/browse/PIG-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12989481#comment-12989481
 ] 

Allen Wittenauer commented on PIG-1838:
---------------------------------------

I have a few thoughts on a better/programmatic way for Pig to be better behaved 
without depending on users doing the right thing. One or more of these would 
probably work:

a) Redefine java.io.tmpdir itself after it gets the Hadoop property files loaded
b) In the pig wrapper script, parse mapred-site.xml and pull out the mapred tmp 
space
c) Override Java's createTempFile method to use Hadoop's tmp 
location/$TEMPDIR/$TMPDIR/$TEMP/some other value
d) Change the jar assembly such that it goes into a 
create->submit->delete->repeat pattern.  (From a casual glance, it appears to 
create all the jars at once rather than just when needed.)



> On a large farm, some pigs die of /tmp starvation
> -------------------------------------------------
>
>                 Key: PIG-1838
>                 URL: https://issues.apache.org/jira/browse/PIG-1838
>             Project: Pig
>          Issue Type: Wish
>          Components: impl
>    Affects Versions: 0.8.0
>            Reporter: Allen Wittenauer
>
> We're starting to issues where interactive/command line pig users blow up due 
> to so many large jar creations in /tmp. (In other words, pig execution prior 
> to the java.io.tmpdir fix that Hadoop makes can kick in.)  Pig should 
> probably not depend upon users being savvy enough to override java.io.tmpdir 
> on their own in these situations and/or a better steward of the space it does 
> use.  

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to