Wes McKinney created ARROW-1741:
-----------------------------------

             Summary: [C++] Comparison function for DictionaryArray to 
determine if indices are "compatible"
                 Key: ARROW-1741
                 URL: https://issues.apache.org/jira/browse/ARROW-1741
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++
            Reporter: Wes McKinney


For example, if one array's dictionary is larger than the other, but the 
overlapping beginning portion is the same, then the respective dictionary 
indices correspond to the same values. Therefore, in analytics, one may choose 
to drop the smaller dictionary in favor of the larger dictionary, and this need 
not incur any computational overhead (beyond comparing the dictionary prefixes 
-- there may be some way to engineer "dictionary lineage" to make this 
comparison even cheaper)



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to