[ https://issues.apache.org/jira/browse/SPARK-40315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wenchen Fan reassigned SPARK-40315:
-----------------------------------

    Assignee: Carmen Kwan

> Non-deterministic hashCode() calculations for ArrayBasedMapData on equal objects
> --------------------------------------------------------------------------------
>
>                 Key: SPARK-40315
>                 URL: https://issues.apache.org/jira/browse/SPARK-40315
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 3.2.2
>            Reporter: Carmen Kwan
>            Assignee: Carmen Kwan
>            Priority: Major
>             Fix For: 3.1.4, 3.4.0, 3.3.1, 3.2.3
>
>
> There is no explicit `hashCode()` function override for the
> `ArrayBasedMapData` class. As a result, the `hashCode()` computed for
> `ArrayBasedMapData` can be different for two equal objects (objects with
> equal keys and values).
> This error is non-deterministic and hard to reproduce, as we don't control
> the default `hashCode()` function.
> We should override the `hashCode()` function so that it works exactly as we
> expect. We should also have an explicit `equals()` function for consistency
> with how `Literals` check for equality of `ArrayBasedMapData`.
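
The hashCode()/equals() relationship described in the issue can be illustrated with a minimal Java sketch. This is not Spark's code and not the fix that was merged: `MapDataSketch` and `SimpleMapData` are hypothetical names, and the class is only assumed to resemble a map stored as parallel key and value arrays. Without the two overrides, the JVM falls back to identity-based `Object.hashCode()`, so two instances with equal contents would usually hash differently.

```java
import java.util.Arrays;

public class MapDataSketch {

    /** Hypothetical stand-in for a map stored as parallel key/value arrays. */
    public static final class SimpleMapData {
        private final Object[] keys;
        private final Object[] values;

        public SimpleMapData(Object[] keys, Object[] values) {
            if (keys.length != values.length) {
                throw new IllegalArgumentException("keys and values must have the same length");
            }
            // Defensive copies so later changes to the caller's arrays cannot alter the hash.
            this.keys = keys.clone();
            this.values = values.clone();
        }

        // Content-based equality: same keys and same values, in the same order.
        @Override
        public boolean equals(Object other) {
            if (this == other) {
                return true;
            }
            if (!(other instanceof SimpleMapData)) {
                return false;
            }
            SimpleMapData that = (SimpleMapData) other;
            return Arrays.equals(keys, that.keys) && Arrays.equals(values, that.values);
        }

        // Derived from the same fields as equals(), so equal objects get equal hashes.
        @Override
        public int hashCode() {
            return 31 * Arrays.hashCode(keys) + Arrays.hashCode(values);
        }
    }

    public static void main(String[] args) {
        SimpleMapData a = new SimpleMapData(new Object[] {"k1", "k2"}, new Object[] {1, 2});
        SimpleMapData b = new SimpleMapData(new Object[] {"k1", "k2"}, new Object[] {1, 2});
        System.out.println(a.equals(b));                  // prints "true"
        System.out.println(a.hashCode() == b.hashCode()); // prints "true"
    }
}
```

Note that this sketch compares entries positionally, so two instances holding the same entries in a different order are not equal here. Whether that matches the semantics intended for `ArrayBasedMapData` is not stated in the issue.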