You are correct and we should document that.

Any suggestions on where we should document this? In DoubleType and
FloatType?

On Tuesday, February 16, 2016, Maciej Szymkiewicz <mszymkiew...@gmail.com>
wrote:

> I am not sure if I've missed something obvious, but as far as I can tell
> the DataFrame API doesn't provide clearly defined ordering rules, apart
> from NaN handling. Methods like DataFrame.sort and sql.functions like
> min / max provide only a general description. The discrepancy between
> functions.max (min) and GroupedData.max, where the latter supports only
> numeric types, makes the current situation even more confusing. With a
> growing number of orderable types, I believe that the documentation
> should clearly define ordering rules, including:
>
> - NULL behavior
> - collation
> - behavior on complex types (structs, arrays)
>
> While this information can be extracted from the source, it is not
> easily accessible, and without an explicit specification it is not clear
> whether the current behavior is contractual. It can also be confusing if
> a user expects an order that depends on the current locale (as in R).
>
> Best,
> Maciej
>
>
