[ 
https://issues.apache.org/jira/browse/BEAM-8804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Heejong Lee updated BEAM-8804:
------------------------------
    Description: Currently, Beam model doesn't have any information on the 
order of input/output PCollections from PTransforms. Therefore, PCollectionList 
needs to be converted to PCollectionTuple when it goes across the 
cross-language boundaries (or even in the same language, whenever it is 
converted between in-memory object and proto) and it's impossible to recreate 
PCollectionList from proto with the original order. The possible workaround is 
just to use PCollectionTuple with integer id (starting from 0 like indexes) 
instead of PCollectionList. In that case, we should first well-define how we 
generate proto from PCollectionList since each SDK uses a different convention. 
 (was: Currently, Beam model doesn't have any information on the order of 
output PCollections from PTransforms. So, PCollectionList needs to be converted 
to PCollectionTuple when it goes across the cross-language boundary (or even in 
the same language, when it is converted between in-memory object and proto).)

> PCollectionList support in cross-language transforms
> ----------------------------------------------------
>
>                 Key: BEAM-8804
>                 URL: https://issues.apache.org/jira/browse/BEAM-8804
>             Project: Beam
>          Issue Type: Improvement
>          Components: beam-model
>            Reporter: Heejong Lee
>            Assignee: Heejong Lee
>            Priority: Major
>
> Currently, Beam model doesn't have any information on the order of 
> input/output PCollections from PTransforms. Therefore, PCollectionList needs 
> to be converted to PCollectionTuple when it goes across the cross-language 
> boundaries (or even in the same language, whenever it is converted between 
> in-memory object and proto) and it's impossible to recreate PCollectionList 
> from proto with the original order. The possible workaround is just to use 
> PCollectionTuple with integer id (starting from 0 like indexes) instead of 
> PCollectionList. In that case, we should first well-define how we generate 
> proto from PCollectionList since each SDK uses a different convention.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to