[ https://issues.apache.org/jira/browse/ARROW-5207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rok Mihevc updated ARROW-5207:
------------------------------
    External issue URL: https://github.com/apache/arrow/issues/21682

> [Java] add APIs to support vector reuse
> ---------------------------------------
>
>                 Key: ARROW-5207
>                 URL: https://issues.apache.org/jira/browse/ARROW-5207
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Java
>            Reporter: Ji Liu
>            Assignee: Ji Liu
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> In some scenarios we would like a ValueVector to be reused in order to reduce
> creation overhead. This is very common in the shuffle stage, where there is no
> need to create a new ValueVector or realloc its buffers every time. Suppose the
> recordCount of the ValueVector and the capacity of its buffers are written to
> the stream; when we deserialize it, we can decide whether a realloc is needed
> simply by checking dataLength.
> My proposal is to add APIs to ValueVector that handle this logic. Otherwise,
> users who want to reuse vectors have to implement it themselves, which is not
> user-friendly.
> If you agree with this, I would like to take this ticket. Thanks



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
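
The reuse check described in the issue (realloc only when the incoming dataLength does not fit the existing buffer) can be illustrated with a minimal sketch. This is not the Arrow API and not the change proposed in the ticket: `ReusableBuffer`, `prepareForReuse`, and `load` are hypothetical names, and a plain `java.nio.ByteBuffer` stands in for a vector's data buffer.

```java
import java.nio.ByteBuffer;

/** Hypothetical stand-in for a vector's data buffer; not part of Arrow. */
final class ReusableBuffer {
    private ByteBuffer data;
    private int reallocCount = 0;

    ReusableBuffer(int initialCapacity) {
        data = ByteBuffer.allocate(initialCapacity);
    }

    /**
     * Prepares the buffer for an incoming payload of dataLength bytes.
     * Allocates a new buffer only if the payload does not fit the current one.
     * Returns true if a realloc happened.
     */
    boolean prepareForReuse(int dataLength) {
        if (dataLength > data.capacity()) {
            data = ByteBuffer.allocate(dataLength);
            reallocCount++;
            return true;
        }
        // Payload fits: reset position and limit, keep the existing memory.
        data.clear();
        return false;
    }

    /** Simulates deserializing one payload into the reused buffer. */
    void load(byte[] payload) {
        prepareForReuse(payload.length);
        data.put(payload);
        data.flip(); // switch to read mode; remaining() is now the payload size
    }

    int capacity() { return data.capacity(); }
    int reallocCount() { return reallocCount; }
    int readableBytes() { return data.remaining(); }
}
```

In a shuffle-style loop, one such buffer would be created up front and `load` called per incoming batch, so an allocation happens only when a batch is larger than any seen before.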