[ 
https://issues.apache.org/jira/browse/BEAM-12204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Bradshaw updated BEAM-12204:
-----------------------------------
    Resolution: Fixed
        Status: Resolved  (was: Open)

> Portable Java caches entirety of iterable side inputs.
> ------------------------------------------------------
>
>                 Key: BEAM-12204
>                 URL: https://issues.apache.org/jira/browse/BEAM-12204
>             Project: Beam
>          Issue Type: Bug
>          Components: java-fn-execution
>            Reporter: Robert Bradshaw
>            Assignee: Robert Bradshaw
>            Priority: P2
>             Fix For: 2.30.0
>
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Though the data is read over the state channel in a paginated way, it is 
> wrapped in LazyCachingIteratorToIterable which forces the entire iterable to 
> be cached. On the other hand, not caching at all for re-iteration could be 
> expensive. We should do what Python does and cache a bit (e.g. the first 
> page) and still support re-iteration by re-requesting the data over the state 
> channel. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to