[ 
https://issues.apache.org/jira/browse/TINKERPOP3-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15033825#comment-15033825
 ] 

ASF GitHub Bot commented on TINKERPOP3-988:
-------------------------------------------

GitHub user dalaro opened a pull request:

    https://github.com/apache/incubator-tinkerpop/pull/164

    TINKERPOP3-988 Change GraphComputer executors

    This supersedes https://github.com/apache/incubator-tinkerpop/pull/154.
    
    This PR is similar to its predecessor, but with two differences:
    
    * This PR shuts down executors it creates in a finally block instead of 
leaking them
    * This PR covers both SparkGraphComputer and GiraphGraphComputer

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dalaro/incubator-tinkerpop TINKERPOP3-988-bis

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-tinkerpop/pull/164.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #164
    
----
commit 562d43af8f0063aa5a77d730480c4efa9b8b992f
Author: Dan LaRocque <[email protected]>
Date:   2015-12-01T13:23:42Z

    TINKERPOP3-988 Change GraphComputer executors
    
    Changed (Giraph|Spark)GraphComputer's submit methods to use a custom
    executor instead of ForkJoinPool.commonPool.
    
    These submit methods do very little in the calling thread; most of the
    work is done in an asynchronously-executed completable future.
    
    If the async task executes on ForkJoinPool.commonPool (the default if
    no executor is supplied), then two problems can arise:
    
    1. Loss of context classloader
    
    The context classloader of the thread that called submit is not
    necessarily the same as the context classloader common forkjoin pool
    threads. This matters because multiple bits of code reachable from
    submit's async task rely on the context classloader. SparkMemory is
    one; Hadoop's UserGroupInformation is another, depending on the
    credentials configuration (UGI is reached indirectly via
    FileSystem.get). This basically means that the caller has to use
    whatever context classloader is currently in use by the fork join
    common pool, or else bad things can happen, such as
    nonsensical-looking ClassCastExceptions.
    
    2. Inability to set context classloader
    
    When System.getSecurityManager() != null, the common forkjoin pool
    switches from its default worker thread factory implementation to a
    more restrictive alternative called
    InnocuousForkJoinWorkerThreadFactory. Threads created by this factory
    can't call setContextClassLoader. Attempting to do so throws a
    SecurityException. However, UserGroupInformation.newLoginContext must
    be able to call setContextClassLoader. It saves the CCL to a variable,
    does some work, then restores the CCL from a variable. This is
    impossible if the method throws a SecurityException. So, if a security
    manager is present in the VM, submit's async task can die in
    FileSystem.get -> UGI before any useful work even begins.
    
    This commit introduces shared logic around making and shutting down a
    Executors.newSingleThreadExecutor.  I considered pulling it up to
    AbstractHadoopGraphComputer, the common public base class of both
    Spark- and GiraphGraphComputer, but I couldn't see a way to do it that
    would preserve compatibility for a third-party computer implementation
    that extends the same base class.  That's why I put the common logic
    in a new static helper method instead.

----


> SparkGraphComputer.submit shouldn't use ForkJoinPool.commonPool
> ---------------------------------------------------------------
>
>                 Key: TINKERPOP3-988
>                 URL: https://issues.apache.org/jira/browse/TINKERPOP3-988
>             Project: TinkerPop 3
>          Issue Type: Improvement
>          Components: hadoop
>    Affects Versions: 3.1.0-incubating
>            Reporter: Dan LaRocque
>            Assignee: Dan LaRocque
>             Fix For: 3.1.1-incubating
>
>
> {{SparkGraphComputer.submit}} delegates most of its work to a closure that 
> executes on the common forkjoin pool.  The closure does a lot of stuff.  It 
> calls into both Spark and Hadoop.
> This approach has two problems:
> 1.  Inability to customize the context classloader used within the closure
> The context classloader of the thread that called {{submit}} is not 
> necessarily the same as the context classloader common forkjoin pool threads. 
>  This matters because multiple bits of code reachable from {{submit}}'s 
> closure rely on the context classloader.  SparkMemory is one; Hadoop's 
> UserGroupInformation is another, depending on the credentials configuration 
> (UGI is reached indirectly via {{FileSystem.get}}).  This basically means 
> that the caller has to use whatever context classloader is currently in use 
> by the fork join common pool, or else bad things can happen, such as 
> nonsensical-looking ClassCastExceptions.
> 2. Inability to override the context classloader inside the closure
> When {{System.getSecurityManager() != null}}, the common forkjoin pool 
> switches from its default worker thread factory implementation to a more 
> restrictive alternative called InnocuousForkJoinWorkerThreadFactory.  Threads 
> created by this factory can't call {{setContextClassLoader}}.  Attempting to 
> do so throws a SecurityException.  However, 
> UserGroupInformation.newLoginContext must be able to call 
> {{setContextClassLoader}}.  It saves the CCL to a variable, does some work, 
> then restores the CCL from a variable.  This is impossible if the method 
> throws a SecurityException.  So, if a security manager is present in the VM, 
> {{submit}}'s closure can die in {{FileSystem.get}} -> UGI before any useful 
> work even begins.
> I set the Affects Version to the version on which I observed it, but it might 
> affect earlier versions too.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to