[
https://issues.apache.org/jira/browse/BEAM-12204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anonymous updated BEAM-12204:
-----------------------------
Status: Triage Needed (was: Resolved)
> Portable Java caches entirety of iterable side inputs.
> ------------------------------------------------------
>
> Key: BEAM-12204
> URL: https://issues.apache.org/jira/browse/BEAM-12204
> Project: Beam
> Issue Type: Bug
> Components: java-fn-execution
> Reporter: Robert Bradshaw
> Assignee: Robert Bradshaw
> Priority: P2
> Fix For: 2.30.0
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> Though the data is read over the state channel in a paginated way, it is
> wrapped in LazyCachingIteratorToIterable which forces the entire iterable to
> be cached. On the other hand, not caching at all for re-iteration could be
> expensive. We should do what Python does and cache a bit (e.g. the first
> page) and still support re-iteration by re-requesting the data over the state
> channel.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)