Wes McKinney created ARROW-1741: ----------------------------------- Summary: [C++] Comparison function for DictionaryArray to determine if indices are "compatible" Key: ARROW-1741 URL: https://issues.apache.org/jira/browse/ARROW-1741 Project: Apache Arrow Issue Type: New Feature Components: C++ Reporter: Wes McKinney
For example, if one array's dictionary is larger than the other, but the overlapping beginning portion is the same, then the respective dictionary indices correspond to the same values. Therefore, in analytics, one may choose to drop the smaller dictionary in favor of the larger dictionary, and this need not incur any computational overhead (beyond comparing the dictionary prefixes -- there may be some way to engineer "dictionary lineage" to make this comparison even cheaper) -- This message was sent by Atlassian JIRA (v6.4.14#64029)