[ https://issues.apache.org/jira/browse/BEAM-13541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Beam JIRA Bot updated BEAM-13541: --------------------------------- Priority: P3 (was: P2) > Use runtime information to improve CoGroupByKey caching > ------------------------------------------------------- > > Key: BEAM-13541 > URL: https://issues.apache.org/jira/browse/BEAM-13541 > Project: Beam > Issue Type: Improvement > Components: sdk-ideas > Reporter: Sunil Pedapudi > Priority: P3 > Labels: stale-P2 > Fix For: 2.36.0 > > Time Spent: 6.5h > Remaining Estimate: 0h > > Currently, CoGroupByKey creates UnionTables that are Flattened. The Flattened > output is processed by a GroupByKey to produce a CoGbkResult (via > ConstructCoGbkResultFn). > > Given the performance of CoGBK is greatly impacted based on the which > elements are cached in the (finitely sized) in-memory results, it would be > useful if CoGbkResult can use runtime information to prioritize which > elements are stored in-memory. -- This message was sent by Atlassian Jira (v8.20.7#820007)