[ https://issues.apache.org/jira/browse/BEAM-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ismaël Mejía resolved BEAM-7989.
--------------------------------
Resolution: Fixed
Fix Version/s: 2.16.0
> SparkRunner CacheVisitor counts PCollections from SideInputs
> ------------------------------------------------------------
>
> Key: BEAM-7989
> URL: https://issues.apache.org/jira/browse/BEAM-7989
> Project: Beam
> Issue Type: Bug
> Components: runner-spark
> Affects Versions: 2.14.0
> Reporter: Kyle Winkelman
> Assignee: Kyle Winkelman
> Priority: Major
> Fix For: 2.16.0
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> The SparkRunner's CacheVisitor looks at all inputs of a
> TransformHierarchy.Node. Those inputs include the PCollections behind the
> PCollectionViews that are supplied as side inputs.
> The SparkRunner should not count these side-input occurrences, because the
> PCollections are not actually accessed at that point. They are accessed only
> when the CreatePCollectionView transform is processed.
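
For readers of the archive, a minimal sketch of the counting problem described above. It is not the actual CacheVisitor code or the committed fix: the class CacheCandidateSketch, its nested Node type, and the methods countConsumers and cacheCandidates are hypothetical stand-ins, PCollections are represented by plain strings, and it assumes that a PCollection becomes a cache candidate once it is counted as an input more than once.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical illustration only; this is not Beam's CacheVisitor. */
final class CacheCandidateSketch {

  /** Simplified, hypothetical stand-in for a TransformHierarchy.Node. */
  public static final class Node {
    /** Every input of the node, including PCollections that sit behind side-input views. */
    final List<String> allInputs;
    /** The subset of allInputs that is reachable only through a PCollectionView. */
    final Set<String> sideInputs;

    public Node(List<String> allInputs, Set<String> sideInputs) {
      this.allInputs = allInputs;
      this.sideInputs = sideInputs;
    }
  }

  /** Counts how often each PCollection appears as an input, optionally ignoring side inputs. */
  public static Map<String, Integer> countConsumers(List<Node> nodes, boolean skipSideInputs) {
    Map<String, Integer> counts = new HashMap<>();
    for (Node node : nodes) {
      for (String input : node.allInputs) {
        if (skipSideInputs && node.sideInputs.contains(input)) {
          continue; // the PCollection is not actually read by this node
        }
        counts.merge(input, 1, Integer::sum);
      }
    }
    return counts;
  }

  /** Assumed rule: a PCollection counted more than once is a candidate for caching. */
  public static Set<String> cacheCandidates(Map<String, Integer> counts) {
    Set<String> candidates = new TreeSet<>();
    for (Map.Entry<String, Integer> entry : counts.entrySet()) {
      if (entry.getValue() > 1) {
        candidates.add(entry.getKey());
      }
    }
    return candidates;
  }

  public static void main(String[] args) {
    List<Node> nodes = Arrays.asList(
        // CreatePCollectionView: the only place "lookup" is really accessed.
        new Node(Arrays.asList("lookup"), Collections.<String>emptySet()),
        // A ParDo with main input "main" and a side-input view built from "lookup".
        new Node(Arrays.asList("main", "lookup"), Collections.singleton("lookup")));

    System.out.println("counting side inputs: " + cacheCandidates(countConsumers(nodes, false)));
    System.out.println("skipping side inputs: " + cacheCandidates(countConsumers(nodes, true)));
  }
}
```

In this sketch, counting every input makes "lookup" look as if it were consumed twice, so it would be flagged for caching; skipping the side-input occurrence leaves only the single access by the CreatePCollectionView node.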
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)