Vsevolod Stepanov created SPARK-54530:
-----------------------------------------

             Summary: TaskContext support in Spark Connect Scala
                 Key: SPARK-54530
                 URL: https://issues.apache.org/jira/browse/SPARK-54530
             Project: Spark
          Issue Type: Improvement
          Components: Connect
    Affects Versions: 4.1.0
            Reporter: Vsevolod Stepanov


In 
[PySpark|https://github.com/apache/spark/blob/master/python/pyspark/taskcontext.py],
 it's possible to get TaskContext information such as local properties and task 
information from a UDF, in both Spark Classic and Spark Connect.

It's not possible to do in Spark Connect Scala Client, since Scala's 
TaskContext is purely a server-side component. It's located in the `core` 
module, and is coupled with Spark.

 

We should have TaskContext support in Spark Connect Scala client, which:
 * is binary compatible with TaskContext from Spark Core
 * Can be used in UDFs that are defined in Spark Connect to retrieve 
TaskContext information, similar to how it's done in PySpark

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to