[ 
https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904727#comment-14904727
 ] 

ASF GitHub Bot commented on NUTCH-2111:
---------------------------------------

GitHub user kwhitehall opened a pull request:

    https://github.com/apache/nutch/pull/64

    fix for NUTCH-2111 contributed by kwhitehall

    Further investigation showed that changing the temporary path does not get 
rid of the tmp files that eat up space. Further, if a selenium grid is 
utilized, the location chosen on a given node may not be available on all 
nodes. As such, it is best to stay with the default /tmp location and handle 
deleting the files there instead. The patch submitted does this.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/kwhitehall/nutch NUTCH-2111

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/64.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #64
    
----
commit 5ef50e123a0e00eb5b55cdc44dc82c8be8fbf565
Author: Kim Whitehall <kim.d.whiteh...@jpl.nasa.gov>
Date:   2015-09-23T16:16:07Z

    fix for NUTCH-2111 contributed by kwhitehall

----


> Set temporary file location for selenium tmp files
> --------------------------------------------------
>
>                 Key: NUTCH-2111
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2111
>             Project: Nutch
>          Issue Type: Improvement
>          Components: plugin
>    Affects Versions: 1.10
>            Reporter: Kim Whitehall
>
> When using the selenium plug in (local mode or selenium grid), a large # tmp 
> files can be generated for each webdriver executed. The default location for 
> selenium is the /tmp library. Thus very quickly (and inadvertently) the 
> nutch-selenium interaction can lead to filesystem issues. 
> I propose to include a config in nutch-default.xml that allows users to 
> specify where they want the selenium tmp files to be written. 
>  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to