Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20151#discussion_r160021803

    --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonWorkerFactory.scala ---
    @@ -34,17 +34,25 @@ private[spark] class PythonWorkerFactory(pythonExec: String, envVars: Map[String

       import PythonWorkerFactory._

    -  // Because forking processes from Java is expensive, we prefer to launch a single Python daemon
    -  // (pyspark/daemon.py) and tell it to fork new workers for our tasks. This daemon currently
    -  // only works on UNIX-based systems now because it uses signals for child management, so we can
    -  // also fall back to launching workers (pyspark/worker.py) directly.
    +  // Because forking processes from Java is expensive, we prefer to launch a single Python daemon,
    +  // pyspark/daemon.py (by default) and tell it to fork new workers for our tasks. This daemon
    +  // currently only works on UNIX-based systems now because it uses signals for child management,
    +  // so we can also fall back to launching workers, pyspark/worker.py (by default) directly.
       val useDaemon = {
         val useDaemonEnabled = SparkEnv.get.conf.getBoolean("spark.python.use.daemon", true)

         // This flag is ignored on Windows as it's unable to fork.
         !System.getProperty("os.name").startsWith("Windows") && useDaemonEnabled
       }

    +  // This configuration indicates the module to run the daemon to execute its Python workers.
    +  val daemonModule = SparkEnv.get.conf.get("spark.python.daemon.module", "pyspark.daemon")
    --- End diff --

    Ah, yup, that's true in general. But please let me stick to "module" here, since that is what the way we execute it (`python -m`) refers to:

    ```
    python --help
    ...
    -m mod : run library module as a script (terminates option list)
    ...
    ```
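    For illustration, a minimal sketch of how the two settings discussed above could be supplied through `SparkConf`. The module name `my_daemon` is hypothetical and only stands in for an importable Python module that speaks the same protocol as `pyspark.daemon`:

    ```
    import org.apache.spark.SparkConf

    // Sketch only: "my_daemon" is a made-up module name.
    // spark.python.use.daemon is read as a Boolean (and ignored on Windows),
    // while spark.python.daemon.module names the Python module launched via
    // `python -m` in place of the default pyspark.daemon.
    val conf = new SparkConf()
      .setAppName("daemon-module-example")
      .set("spark.python.use.daemon", "true")
      .set("spark.python.daemon.module", "my_daemon")
    ```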