stefankandic commented on code in PR #46280: URL: https://github.com/apache/spark/pull/46280#discussion_r1598823433
##########
python/pyspark/sql/types.py:
##########
@@ -1702,9 +1791,16 @@ def _parse_datatype_json_string(json_string: str) -> DataType:
     return _parse_datatype_json_value(json.loads(json_string))
-def _parse_datatype_json_value(json_value: Union[dict, str]) -> DataType:
+def _parse_datatype_json_value(
+    json_value: Union[dict, str],
+    fieldPath: str = "",
+    collationsMap: Optional[Dict[str, str]] = None,
+) -> DataType:
     if not isinstance(json_value, dict):
         if json_value in _all_atomic_types.keys():
+            if collationsMap is not None and fieldPath in collationsMap:
+                collationName = collationsMap[fieldPath].split(".")[1]

Review Comment:
   In Spark we identify collations by name only and don't need the provider at all. That is why I asked whether the provider could be optional in the Delta protocol, but the issue in that case would be parsing the identifier: https://github.com/delta-io/delta/pull/3068#discussion_r1593742078
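
   To make the parsing concern concrete: `split(".")[1]` raises `IndexError` when the identifier has no dot, so an optional provider would need a fallback. A minimal sketch, assuming identifiers look like `provider.name` or a bare `name` with no dots of its own. `collation_name_from_identifier` is a hypothetical helper, not part of this PR, and the example identifiers are illustrative only:

```python
def collation_name_from_identifier(identifier: str) -> str:
    """Hypothetical helper: extract the collation name from an identifier.

    Assumes the identifier is either "provider.name" or a bare "name",
    and that a bare name contains no dots (an assumption, not something
    this PR or the Delta protocol guarantees).
    """
    parts = identifier.split(".")
    # With a provider prefix, keep the second component, as the diff above does.
    # Without one, the whole identifier is the name.
    return parts[1] if len(parts) > 1 else parts[0]


# Illustrative identifiers only.
with_provider = collation_name_from_identifier("icu.UNICODE_CI")
without_provider = collation_name_from_identifier("UNICODE_CI")
```

   Both calls return the same name here, but the fallback is only unambiguous as long as a bare name can never contain a dot itself.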