[ https://issues.apache.org/jira/browse/DATAFU-31?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13904612#comment-13904612 ]
Matthew Hayes commented on DATAFU-31: ------------------------------------- Bag tests pass [testng] =============================================== [testng] Ant suite [testng] Total tests run: 29, Failures: 0, Skips: 0 [testng] =============================================== > bags.DistinctBy works incorrectly on string containing minuses > -------------------------------------------------------------- > > Key: DATAFU-31 > URL: https://issues.apache.org/jira/browse/DATAFU-31 > Project: DataFu > Issue Type: Bug > Reporter: Roman Borisov > Assignee: jian wang > Fix For: 1.3.0 > > Attachments: 0001-fix-issue-bags.DistinctBy-works-incorrectly.patch > > > How to reproduce: > Input: > {(a-b,c), (a-b,d)} > define distinct as DistinctBy('1') > input = load 'input' as vs:bag{(v0:chararray,v1:chararray)}; > output = foreach input generate distinct(vs); > dump output; > expected: {(a-b,c), (a-b,d)} > actual: {(a-b,c)} > The bug is caused by the implementation based on splitting the tuple string > by '-' to get tuple parts. -- This message was sent by Atlassian JIRA (v6.1.5#6160)