I'd like to echo a question that was asked earlier this year:

If we do a global sort of a dataframe (with two columns: col_1, col_2) by
(col_1, col_2/desc) and then dropDuplicates on col_1, will it retain the
first row of each sorted group? i.e. Will it return the row with the
greatest value of col_2 for each col_1 group?

Thanks,
Allen

Reply via email to