[ https://issues.apache.org/jira/browse/DATAFU-31?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13902768#comment-13902768 ]
Matthew Hayes commented on DATAFU-31: ------------------------------------- +1 Looks good to me. Thanks for the fix! > bags.DistinctBy works incorrectly on string containing minuses > -------------------------------------------------------------- > > Key: DATAFU-31 > URL: https://issues.apache.org/jira/browse/DATAFU-31 > Project: DataFu > Issue Type: Bug > Affects Versions: 1.3.0 > Reporter: Roman Borisov > Assignee: jian wang > Attachments: 0001-fix-issue-bags.DistinctBy-works-incorrectly.patch > > > How to reproduce: > Input: > {(a-b,c), (a-b,d)} > define distinct as DistinctBy('1') > input = load 'input' as vs:bag{(v0:chararray,v1:chararray)}; > output = foreach input generate distinct(vs); > dump output; > expected: {(a-b,c), (a-b,d)} > actual: {(a-b,c)} > The bug is caused by the implementation based on splitting the tuple string > by '-' to get tuple parts. -- This message was sent by Atlassian JIRA (v6.1.5#6160)