[
https://issues.apache.org/jira/browse/SPARK-54452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun resolved SPARK-54452.
-----------------------------------
Fix Version/s: 4.1.0
Resolution: Fixed
Issue resolved by pull request 53156
[https://github.com/apache/spark/pull/53156]
> Fix empty response from SparkConnect server for spark.sql(...) inside
> FlowFunction
> ----------------------------------------------------------------------------------
>
> Key: SPARK-54452
> URL: https://issues.apache.org/jira/browse/SPARK-54452
> Project: Spark
> Issue Type: Sub-task
> Components: Declarative Pipelines
> Affects Versions: 4.1.0
> Reporter: Yuheng Chang
> Assignee: Yuheng Chang
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.1.0
>
>
> In PR SPARK-54020, we added support for {{spark.sql(...)}} inside a
> FlowFunction for SDP. For these calls, instead of eagerly executing the SQL,
> the Spark Connect server should return the raw logical plan to the client and
> defer execution to the flow function.
> However, in that PR we constructed the response object but forgot to actually
> return it to the Spark Connect client, so the client received an empty
> response.
> This went unnoticed in tests because, when the client sees an empty
> {{spark.sql(...)}} response, [it falls back to creating an empty DataFrame
> holding the raw logical
> plan|https://github.com/apache/spark/blob/master/python/pyspark/sql/connect/session.py#L829-L835],
> which happens to match the desired behavior. We should fix the bug by
> returning the proper response instead of relying on that implicit fallback.
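
To illustrate why the tests kept passing, here is a minimal toy sketch of the failure mode described above. It is not Spark code: SqlCommandResult, handle_sql_buggy, handle_sql_fixed, and client_sql are hypothetical names invented for this example, and the plan is modeled as a plain string. It only shows how a client-side fallback can mask a handler that builds a response but never sends it.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class SqlCommandResult:
    """Hypothetical stand-in for a server response carrying the raw logical plan."""
    relation: str


def handle_sql_buggy(query: str, send: Callable[[SqlCommandResult], None]) -> None:
    # Models the bug: the response object is built but never sent to the client.
    result = SqlCommandResult(relation=f"plan({query})")
    _ = result  # constructed, then dropped


def handle_sql_fixed(query: str, send: Callable[[SqlCommandResult], None]) -> None:
    # Models the fix: the same response object is actually returned to the client.
    result = SqlCommandResult(relation=f"plan({query})")
    send(result)


def client_sql(query: str, handler) -> Tuple[str, str]:
    """Toy client: use the server's response if present, else fall back locally."""
    responses: List[SqlCommandResult] = []
    handler(query, responses.append)
    if responses:
        return ("from_response", responses[0].relation)
    # Implicit fallback: wrap the raw plan locally when the response is empty.
    return ("fallback", f"plan({query})")


buggy = client_sql("SELECT 1", handle_sql_buggy)
fixed = client_sql("SELECT 1", handle_sql_fixed)
print(buggy)
print(fixed)
```

In this sketch both paths end up holding the same plan, so an assertion on the resulting plan alone cannot tell them apart. A test would have to check which path produced the result, or that the response is non-empty.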