[ https://issues.apache.org/jira/browse/ARROW-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Benjamin Kietzman reassigned ARROW-1741: ---------------------------------------- Assignee: Benjamin Kietzman > [C++] Comparison function for DictionaryArray to determine if indices are > "compatible" > -------------------------------------------------------------------------------------- > > Key: ARROW-1741 > URL: https://issues.apache.org/jira/browse/ARROW-1741 > Project: Apache Arrow > Issue Type: New Feature > Components: C++ > Reporter: Wes McKinney > Assignee: Benjamin Kietzman > Priority: Major > Fix For: 0.15.0 > > > For example, if one array's dictionary is larger than the other, but the > overlapping beginning portion is the same, then the respective dictionary > indices correspond to the same values. Therefore, in analytics, one may > choose to drop the smaller dictionary in favor of the larger dictionary, and > this need not incur any computational overhead (beyond comparing the > dictionary prefixes -- there may be some way to engineer "dictionary lineage" > to make this comparison even cheaper) -- This message was sent by Atlassian Jira (v8.3.2#803003)