[
https://issues.apache.org/jira/browse/SPARK-56084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yicong Huang updated SPARK-56084:
---------------------------------
Description:
Remove `ArrowStreamGroupUDFSerializer` and its child
`CogroupArrowUDFSerializer` from `serializers.py`.
These classes will be unused after:
- SPARK-55608 refactors `SQL_GROUPED_MAP_ARROW_UDF` /
`SQL_GROUPED_MAP_ARROW_ITER_UDF` to use `ArrowStreamGroupSerializer`
- SPARK-56312 refactors `SQL_COGROUPED_MAP_ARROW_UDF` to use
`ArrowStreamCoGroupSerializer`
Blocked by: SPARK-55608, SPARK-56312
was:
After the refactoring of SQL_WINDOW_AGG_ARROW_UDF (SPARK-56189),
ArrowStreamAggArrowUDFSerializer is no longer used by any eval type. Delete it.
Its parent ArrowStreamArrowUDFSerializer still has one child
(ArrowBatchUDFSerializer, used by SQL_ARROW_BATCHED_UDF) so cannot be deleted
yet.
Summary: Remove unused ArrowStreamGroupUDFSerializer and
CogroupArrowUDFSerializer (was: Delete ArrowStreamAggArrowUDFSerializer)
> Remove unused ArrowStreamGroupUDFSerializer and CogroupArrowUDFSerializer
> -------------------------------------------------------------------------
>
> Key: SPARK-56084
> URL: https://issues.apache.org/jira/browse/SPARK-56084
> Project: Spark
> Issue Type: Sub-task
> Components: PySpark
> Affects Versions: 4.2.0
> Reporter: Yicong Huang
> Priority: Major
>
> Remove `ArrowStreamGroupUDFSerializer` and its child
> `CogroupArrowUDFSerializer` from `serializers.py`.
> These classes will be unused after:
> - SPARK-55608 refactors `SQL_GROUPED_MAP_ARROW_UDF` /
> `SQL_GROUPED_MAP_ARROW_ITER_UDF` to use `ArrowStreamGroupSerializer`
> - SPARK-56312 refactors `SQL_COGROUPED_MAP_ARROW_UDF` to use
> `ArrowStreamCoGroupSerializer`
> Blocked by: SPARK-55608, SPARK-56312
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]