[
https://issues.apache.org/jira/browse/SPARK-55930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xi Lyu updated SPARK-55930:
---------------------------
Description:
We should replace character-length checks with UTF-8 byte size checks when
enforcing
the gRPC metadata size limit. Multi-byte characters (e.g. CJK) can cause
length-based checks to undercount the actual byte size sent on the wire,
which could cause message size limit exceed error.
was:
We should replace character-length checks with UTF-8 byte size checks when
enforcing
the gRPC metadata size limit. Multi-byte characters (e.g. CJK) can cause
length-based checks to undercount the actual byte size sent on the wire.
> Make ErrorUtils string size checks byte-aware
> ---------------------------------------------
>
> Key: SPARK-55930
> URL: https://issues.apache.org/jira/browse/SPARK-55930
> Project: Spark
> Issue Type: Bug
> Components: Connect
> Affects Versions: 4.1.1
> Reporter: Xi Lyu
> Priority: Major
>
> We should replace character-length checks with UTF-8 byte size checks when
> enforcing
> the gRPC metadata size limit. Multi-byte characters (e.g. CJK) can cause
> length-based checks to undercount the actual byte size sent on the wire,
> which could cause message size limit exceed error.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]