Jim Hughes created FLINK-39677:
----------------------------------

             Summary: ARRAY_SORT fails at runtime with "Comparison method 
violates its general contract!" on arrays with duplicate elements
                 Key: FLINK-39677
                 URL: https://issues.apache.org/jira/browse/FLINK-39677
             Project: Flink
          Issue Type: Bug
    Affects Versions: 2.2.1, 2.0.2, 2.1.2, 1.20.4, 2.3.0
            Reporter: Jim Hughes


*Title:* ARRAY_SORT fails at runtime with "Comparison method violates its 
general contract!" on arrays with duplicate elements

*Affects Versions:* all versions that include {{ArraySortFunction}}, 
including 1.20.4, 2.0.2, 2.1.2, 2.2.1 and 2.3.0

h3. Description

{{ArraySortFunction.ArraySortComparator.compare}} is built from a single SQL 
{{>}} evaluator and returns {{+1}} or {{-1}} based on the boolean result; it 
never returns {{0}}. For two equal elements {{a}} and {{b}}, both {{a > b}} and 
{{b > a}} are {{FALSE}}, so {{compare(a, b) == compare(b, a) == -1}}. This 
violates the {{java.util.Comparator}} contract, which requires 
{{signum(compare(a, b)) == -signum(compare(b, a))}} for all {{a}} and {{b}}.
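
To make the violation concrete, here is a minimal Java sketch. It is not the Flink code: {{GREATER}} is a hypothetical stand-in for the SQL {{>}} evaluator, and {{NEVER_ZERO}} and {{THREE_WAY}} are illustrative names. {{THREE_WAY}} shows one possible repair (evaluate {{>}} in both directions and return {{0}} when neither holds); the actual fix may look different.

```java
import java.util.Comparator;
import java.util.function.BiPredicate;

final class ComparatorContractSketch {
    // Hypothetical stand-in for the SQL ">" evaluator (not Flink's API).
    static final BiPredicate<Integer, Integer> GREATER = (a, b) -> a > b;

    // Mirrors the behaviour described above: only +1 or -1, never 0.
    static final Comparator<Integer> NEVER_ZERO =
            (a, b) -> GREATER.test(a, b) ? 1 : -1;

    // One possible repair: test ">" in both directions, 0 if neither holds.
    static final Comparator<Integer> THREE_WAY = (a, b) -> {
        if (GREATER.test(a, b)) {
            return 1;
        }
        if (GREATER.test(b, a)) {
            return -1;
        }
        return 0;
    };

    // True when signum(compare(a, b)) != -signum(compare(b, a)).
    static boolean violatesContract(Comparator<Integer> c, Integer a, Integer b) {
        return Integer.signum(c.compare(a, b)) != -Integer.signum(c.compare(b, a));
    }

    public static void main(String[] args) {
        System.out.println(violatesContract(NEVER_ZERO, 5, 5)); // true
        System.out.println(violatesContract(THREE_WAY, 5, 5));  // false
    }
}
```

The two-sided check costs a second {{>}} evaluation only when the first one is false, which is the case for equal elements and for {{a < b}}.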

TimSort can detect the contradiction once the array is large enough to take 
the merge path (32 or more elements, with duplicates), throwing:

{noformat}
java.lang.IllegalArgumentException: Comparison method violates its general contract!
    at java.util.TimSort.mergeHi(TimSort.java:903)
    at java.util.Arrays.sort(Arrays.java:1234)
    at org.apache.flink.table.runtime.functions.scalar.ArraySortFunction.eval(ArraySortFunction.java:91)
{noformat}
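
A standalone way to probe this outside Flink is sketched below. It is an assumption-laden illustration, not a test from the Flink code base: {{NEVER_ZERO}} imitates the comparator described above, and {{countContractViolations}} is a hypothetical helper that sorts random arrays containing many duplicates and counts how often {{Arrays.sort}} throws. Whether a particular array triggers the exception depends on its contents, so the sketch reports counts rather than assuming a failure; arrays shorter than 32 elements are sorted by binary insertion sort and never reach the merge code.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.Random;

final class ArraySortReproSketch {
    // Imitates the reported comparator: +1 or -1, never 0.
    static final Comparator<Integer> NEVER_ZERO = (a, b) -> a > b ? 1 : -1;

    // Sorts `trials` random arrays of the given length and returns how many
    // of them made Arrays.sort throw IllegalArgumentException.
    static int countContractViolations(int length, int trials, long seed) {
        Random random = new Random(seed);
        int failures = 0;
        for (int t = 0; t < trials; t++) {
            Integer[] data = new Integer[length];
            for (int i = 0; i < length; i++) {
                // Only four distinct values, so duplicates are guaranteed
                // for any length above four.
                data[i] = random.nextInt(4);
            }
            try {
                Arrays.sort(data, NEVER_ZERO);
            } catch (IllegalArgumentException e) {
                failures++;
            }
        }
        return failures;
    }

    public static void main(String[] args) {
        // Below the 32-element threshold no merge happens.
        System.out.println("length 31:  " + countContractViolations(31, 1000, 42L));
        // Above the threshold the merge path can detect the violation.
        System.out.println("length 256: " + countContractViolations(256, 1000, 42L));
    }
}
```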




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
