[ 
https://issues.apache.org/jira/browse/SPARK-30002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384070#comment-17384070
 ] 

Manish Gupta commented on SPARK-30002:
--------------------------------------

We recently hit this use case: we wanted to run a Scala Spark job and a PySpark 
job in Spark local mode. Once the Scala Spark job had created a SparkContext, 
the PySpark job did not reuse it and failed while trying to create a new 
SparkContext of its own.

The fix suggested by [~skp33] worked for us.

[~hyukjin.kwon] can we consider this fix? We are using Spark 3.0.1.
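
For reference, a minimal sketch of how the change described in the issue below 
could be applied at runtime without rebuilding Spark, by patching 
SparkContext._initialize_context before the first SparkContext is created in 
the Python process (whether the fix was applied this way or via a patched 
Spark build is an assumption):
{code:python}
from pyspark import SparkContext

def _initialize_context(self, jconf):
    """Wrap the JVM's active SparkContext if one exists, else create one."""
    return self._jvm.JavaSparkContext(
        self._jvm.org.apache.spark.SparkContext.getOrCreate(jconf)
    )

# Must run before any SparkContext is constructed in this Python process.
SparkContext._initialize_context = _initialize_context
{code}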

> Reuse SparkSession in pyspark via Gateway
> -----------------------------------------
>
>                 Key: SPARK-30002
>                 URL: https://issues.apache.org/jira/browse/SPARK-30002
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark
>    Affects Versions: 3.1.0
>            Reporter: Kaushal Prajapati
>            Priority: Minor
>
> In PySpark, the SparkContext is created from the user's Spark configuration 
> (or the defaults) and is launched internally through a Py4J gateway.
> Say I have launched the Py4J gateway from another application; to communicate 
> with that same gateway, I have to set the configuration below:
>  
> {code:java}
> export PYSPARK_GATEWAY_PORT=12345
> export PYSPARK_GATEWAY_SECRET=***********************
> {code}
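>  
> As a minimal sketch (assuming the other application has already started the 
> Py4J gateway and its port and secret are known; the application name and the 
> values below are placeholders), a PySpark program then attaches to that JVM 
> instead of launching its own:
> {code:python}
> import os
> 
> # Placeholders: the real values come from the JVM application owning the gateway.
> os.environ["PYSPARK_GATEWAY_PORT"] = "12345"
> os.environ["PYSPARK_GATEWAY_SECRET"] = "<gateway-secret>"
> 
> from pyspark import SparkConf, SparkContext
> 
> # Connects to the existing gateway; the JavaSparkContext creation inside this
> # call is the step discussed next.
> sc = SparkContext(conf=SparkConf().setAppName("attach-to-existing-jvm"))
> {code}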
>  
> So when PySpark tries to create its own SparkContext after the gateway 
> communication has been set up, it does not check whether a SparkContext is 
> already available in the same JVM.
> Current code snippet:
>  
> {code:java}
> def _initialize_context(self, jconf):
>     """
>     Initialize SparkContext in function to allow subclass specific initialization
>     """
>     return self._jvm.JavaSparkContext(jconf)
> {code}
>  
>  
> After changing it to the following, it works fine for me.
> {code:java}
> def _initialize_context(self, jconf):
>     """
>     Initialize SparkContext in function to allow subclass specific initialization
>     """
>     return self._jvm.JavaSparkContext(
>         self._jvm.org.apache.spark.SparkContext.getOrCreate(jconf))
> {code}
>  
> It looks like a good use case for improvement.
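>  
> To illustrate the intended behaviour (a sketch, assuming the getOrCreate 
> change is in place; the application name is illustrative): if the shared JVM 
> already holds an active SparkContext, e.g. one created by a Scala job, the 
> PySpark context would wrap it instead of failing:
> {code:python}
> from pyspark import SparkConf, SparkContext
> 
> # Assumes PYSPARK_GATEWAY_PORT / PYSPARK_GATEWAY_SECRET point at a JVM whose
> # SparkContext is already running; with getOrCreate this call would wrap that
> # active context rather than try to create a second one.
> sc = SparkContext(conf=SparkConf().setAppName("pyspark-reuse"))
> print(sc.applicationId)  # same application as the JVM-side job
> {code}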


