[ https://issues.apache.org/jira/browse/BEAM-315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ivan Li updated BEAM-315: ------------------------- Attachment: UniPatNumToOrigNumFn.java UniPatNumToLineFn.java PacUniPatToCiteGroupFn.java NcUniPatToCiteGroupFn.java CiteGroupPatentNumberUpdateFnRunner.java CiteGroupPatentNumberUpdateDataFlowOptions.java > Flink Runner compares keys unencoded which may produce incorrect results > ------------------------------------------------------------------------ > > Key: BEAM-315 > URL: https://issues.apache.org/jira/browse/BEAM-315 > Project: Beam > Issue Type: Bug > Components: runner-flink > Affects Versions: 0.1.0-incubating, 0.2.0-incubating > Reporter: Pawel Szczur > Assignee: Aljoscha Krettek > Fix For: 0.3.0-incubating > > Attachments: CiteGroupPatentNumberUpdateDataFlowOptions.java, > CiteGroupPatentNumberUpdateFnRunner.java, CoGroupPipelineStringKey.java, > execution.log, execution_split.log, execution_split_sorted.log, > NcUniPatToCiteGroupFn.java, PacUniPatToCiteGroupFn.java, > UniPatNumToLineFn.java, UniPatNumToOrigNumFn.java > > > Same keys are processed multiple times. > A repo to reproduce the bug: > https://github.com/orian/cogroup-wrong-grouping > Discussion: > http://mail-archives.apache.org/mod_mbox/incubator-beam-user/201605.mbox/%3CCAB2uKkG2xHsWpLFUkYnt8eEzdxU%3DB_nu6crTwVi-ZuUpugxkPQ%40mail.gmail.com%3E > Notice: I haven't tested other runners (didn't manage to configure Spark). -- This message was sent by Atlassian JIRA (v6.3.15#6346)