Wes McKinney created ARROW-5340: ----------------------------------- Summary: [C++] See if possible to deduplicate dictionaries in IPC streams in some way Key: ARROW-5340 URL: https://issues.apache.org/jira/browse/ARROW-5340 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Wes McKinney
As follow-on work to ARROW-3144, there are cases where a dictionary may be shared by multiple fields in a RecordBatch. The presumption of {{arrow::ipc::DictionaryMemo}} is that there is a 1-to-1 mapping between fields and dictionaries, and dictionary id assignment occurs prior to observing the dictionaries (to know whether or not they are used multiple times), so it may not be feasible, or at least not easy. -- This message was sent by Atlassian JIRA (v7.6.3#76005)