[ https://issues.apache.org/jira/browse/FLINK-32410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Flink Jira Bot updated FLINK-32410: ----------------------------------- Labels: pull-request-available stale-assigned (was: pull-request-available) I am the [Flink Jira Bot|https://github.com/apache/flink-jira-bot/] and I help the community manage its development. I see this issue is assigned but has not received an update in 30 days, so it has been labeled "stale-assigned". If you are still working on the issue, please remove the label and add a comment updating the community on your progress. If this issue is waiting on feedback, please consider this a reminder to the committer/reviewer. Flink is a very active project, and so we appreciate your patience. If you are no longer working on the issue, please unassign yourself so someone else may work on it. > Allocate hash-based collections with sufficient capacity for expected size > -------------------------------------------------------------------------- > > Key: FLINK-32410 > URL: https://issues.apache.org/jira/browse/FLINK-32410 > Project: Flink > Issue Type: Improvement > Reporter: Stefan Richter > Assignee: Stefan Richter > Priority: Major > Labels: pull-request-available, stale-assigned > Fix For: 1.18.0 > > > The JDK API to create hash-based collections for a certain capacity is > arguable misleading because it doesn't size the collections to "hold a > specific number of items" like you'd expect it would. Instead it sizes it to > hold load-factor% of the specified number. > For the common pattern to allocate a hash-based collection with the size of > expected elements to avoid rehashes, this means that a rehash is essentially > guaranteed. > We should introduce helper methods (similar to Guava's > `Maps.newHashMapWithExpectedSize(int)`) for allocations for expected size and > replace the direct constructor calls with those. -- This message was sent by Atlassian Jira (v8.20.10#820010)