[ https://issues.apache.org/jira/browse/DATAFU-31?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
jian wang reassigned DATAFU-31:
-------------------------------

    Assignee: jian wang

> bags.DistinctBy works incorrectly on string containing minuses
> --------------------------------------------------------------
>
>                 Key: DATAFU-31
>                 URL: https://issues.apache.org/jira/browse/DATAFU-31
>             Project: DataFu
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>            Reporter: Roman Borisov
>            Assignee: jian wang
>         Attachments: 0001-fix-issue-bags.DistinctBy-works-incorrectly.patch
>
>
> How to reproduce:
> Input:
> {(a-b,c), (a-b,d)}
> define distinct as DistinctBy('1')
> input = load 'input' as vs:bag{(v0:chararray,v1:chararray)};
> output = foreach input generate distinct(vs);
> dump output;
> expected: {(a-b,c), (a-b,d)}
> actual: {(a-b,c)}
> The bug is caused by the implementation based on splitting the tuple string
> by '-' to get tuple parts.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
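
For illustration only, here is a minimal Java sketch of the failure mode described in the report. It is not the DataFu source: the class DistinctBySketch and the methods naiveKey, fieldKey and distinctBy are hypothetical names, and the join-then-split scheme is an assumed simplification of "splitting the tuple string by '-'". Tuples are modeled as plain lists of strings.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class DistinctBySketch {

    // Assumed, simplified model of the bug: the tuple is flattened to a
    // '-'-delimited string and then split on '-' to recover its fields.
    static String naiveKey(List<String> tuple, int position) {
        String joined = String.join("-", tuple);  // ("a-b","c") becomes "a-b-c"
        String[] parts = joined.split("-");       // ["a", "b", "c"]: "a-b" is broken in two
        return parts[position];                   // position 1 now yields "b", not "c"
    }

    // Delimiter-free alternative: read the field directly from the tuple.
    static String fieldKey(List<String> tuple, int position) {
        return tuple.get(position);
    }

    // Keeps the first tuple seen for each distinct key.
    static List<List<String>> distinctBy(List<List<String>> bag, int position, boolean naive) {
        Set<String> seen = new LinkedHashSet<>();
        List<List<String>> result = new ArrayList<>();
        for (List<String> tuple : bag) {
            String key = naive ? naiveKey(tuple, position) : fieldKey(tuple, position);
            if (seen.add(key)) {
                result.add(tuple);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> bag = Arrays.asList(
                Arrays.asList("a-b", "c"),
                Arrays.asList("a-b", "d"));
        System.out.println(distinctBy(bag, 1, true));   // prints [[a-b, c]]
        System.out.println(distinctBy(bag, 1, false));  // prints [[a-b, c], [a-b, d]]
    }
}
```

Under this assumption both tuples produce the key "b" at position 1, so the second one is dropped, which matches the "actual" output in the report. Reading the field from the tuple itself avoids depending on any delimiter that may also occur in the data.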