[ https://issues.apache.org/jira/browse/ARROW-16513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17534479#comment-17534479 ]
Weston Pace commented on ARROW-16513: ------------------------------------- Yes it is. Thanks. > [C++] Add a compute function to hash inputs > ------------------------------------------- > > Key: ARROW-16513 > URL: https://issues.apache.org/jira/browse/ARROW-16513 > Project: Apache Arrow > Issue Type: Bug > Components: C++ > Reporter: Weston Pace > Priority: Major > > We have a lot of internal logic for hashing inputs and it might be nice to > expose some of this to users (e.g. > https://stackoverflow.com/questions/72177022/how-to-get-hash-of-string-column-in-polars-or-pyarrow) > The `HashBatch` method in `key_hash.h` (not quite merged but close) is likely > to be the most performant. However, it does make some sacrifices on > uniqueness of hashes in the spirit of performance (so we should make sure to > document these). -- This message was sent by Atlassian Jira (v8.20.7#820007)