Will Jones created ARROW-17923:
----------------------------------

             Summary: [C++] Consider dictionary arrays for special fragment fields
                 Key: ARROW-17923
                 URL: https://issues.apache.org/jira/browse/ARROW-17923
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Will Jones

I noticed that in ARROW-15281 we made {{__filename}} a string column. In the common case, where every row of a fragment carries the same file path, this is inefficient once the column is materialized, because the full path is stored again for each row. If possible, it may be better to make these special fragment fields dictionary arrays.

As an example, [here|https://github.com/apache/arrow/pull/12826#issuecomment-1230745059] is a user report of 10x increased memory usage caused by accidentally including these special fragment columns.
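
To give a rough sense of the difference, here is a back-of-the-envelope sketch in plain Python. It does not call Arrow. It only models the two layouts under stated assumptions: a string array costs one int32 offset per row plus the concatenated UTF-8 bytes, and a dictionary array costs one int32 index per row plus a small string array of the distinct values. Validity bitmaps and buffer padding are ignored, and the file names and row counts are made up for illustration.

```python
def plain_string_bytes(values):
    """Estimate a utf8 array: (n + 1) int32 offsets plus the UTF-8 data."""
    offsets = 4 * (len(values) + 1)
    data = sum(len(v.encode("utf-8")) for v in values)
    return offsets + data


def dictionary_bytes(values):
    """Estimate a dictionary array: n int32 indices plus a utf8 dictionary.

    Assumes int32 indices; a narrower index type would be smaller still.
    """
    indices = 4 * len(values)
    return indices + plain_string_bytes(sorted(set(values)))


# Hypothetical dataset: 3 fragments (files) with 100,000 rows each.
paths = [f"data/part-{i}.parquet" for i in range(3)]
column = [p for p in paths for _ in range(100_000)]

plain = plain_string_bytes(column)
encoded = dictionary_bytes(column)
print(f"plain string column: {plain:,} bytes")
print(f"dictionary column:   {encoded:,} bytes")
print(f"ratio:               {plain / encoded:.1f}x")
```

In this model the gap grows with the path length, since the plain layout pays for the full path on every row while the dictionary layout pays for it once per fragment.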