[ 
https://issues.apache.org/jira/browse/ARROW-16234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17526659#comment-17526659
 ] 

Will Ayd commented on ARROW-16234:
----------------------------------

I think we also need to consider how to handle NULL. In my current design I was 
thinking we should delegate as much responsibility to the standard sorting 
behavior, but AFAICT there are only SortOptions to rank NULLs at the start or 
the end, not necessarily to ignore NULL altogether. If we want to completely 
remove NULL from being calculated in the ranking algorithm I wonder if we 
should try and work that up the class hierarchy a bit to to the same thing in 
general sorting

> [C++] Implement Rank Kernel
> ---------------------------
>
>                 Key: ARROW-16234
>                 URL: https://issues.apache.org/jira/browse/ARROW-16234
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Will Ayd
>            Assignee: Will Ayd
>            Priority: Minor
>              Labels: C++, good-second-issue, kernel, pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Didn't see this in the library already so apologies if overlooked, but I 
> think it would be nice to add a compute kernel for ranking. Here is a similar 
> function in pandas:
> [https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.rank.html]



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to