[ https://issues.apache.org/jira/browse/DATAFU-31?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matthew Hayes updated DATAFU-31:
--------------------------------
Affects Version/s: (was: 1.3.0)
> bags.DistinctBy works incorrectly on string containing minuses
> --------------------------------------------------------------
>
> Key: DATAFU-31
> URL: https://issues.apache.org/jira/browse/DATAFU-31
> Project: DataFu
> Issue Type: Bug
> Reporter: Roman Borisov
> Assignee: jian wang
> Fix For: 1.3.0
>
> Attachments: 0001-fix-issue-bags.DistinctBy-works-incorrectly.patch
>
>
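> How to reproduce:
> Input bag:
>   {(a-b,c), (a-b,d)}
> Script (aliases avoid the Pig reserved words distinct, input and output):
>   define DistinctByV1 datafu.pig.bags.DistinctBy('1');
>   data = load 'input' as (vs:bag{(v0:chararray,v1:chararray)});
>   result = foreach data generate DistinctByV1(vs);
>   dump result;
> Expected: {(a-b,c), (a-b,d)}
> Actual: {(a-b,c)}
> The bug is caused by the implementation splitting the tuple's string form
> on '-' to recover the tuple's fields, so a '-' inside a field value shifts
> the field positions and both tuples end up with the same key.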
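
To illustrate the failure mode described above, here is a minimal standalone Java sketch. It is not the DataFu source: the class name DistinctByKeySketch, its methods, and the use of plain string lists in place of Pig tuples are all hypothetical, and the assumption that fields are joined with '-' before being split again is taken only from the description in this report. It compares a key derived by splitting the flattened tuple with a key read directly from the field.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class DistinctByKeySketch {
    // Assumed delimiter, matching the '-' named in the report.
    static final String DELIMITER = "-";

    // Fragile approach: flatten the tuple to one string, split it again,
    // and read the key at the requested position.
    static String keyBySplitting(List<String> tuple, int index) {
        String flat = String.join(DELIMITER, tuple);
        return flat.split(DELIMITER)[index];
    }

    // Robust approach: read the key straight from the field.
    static String keyByField(List<String> tuple, int index) {
        return tuple.get(index);
    }

    // Keeps the first tuple seen for each distinct key.
    static List<List<String>> distinctBy(List<List<String>> bag, int index, boolean split) {
        Set<String> seen = new HashSet<>();
        List<List<String>> kept = new ArrayList<>();
        for (List<String> tuple : bag) {
            String key = split ? keyBySplitting(tuple, index) : keyByField(tuple, index);
            if (seen.add(key)) {
                kept.add(tuple);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<List<String>> bag = Arrays.asList(
            Arrays.asList("a-b", "c"),
            Arrays.asList("a-b", "d"));
        // Splitting turns "a-b-c" and "a-b-d" into [a, b, c] and [a, b, d],
        // so position 1 is "b" for both tuples and the second one is dropped.
        System.out.println(distinctBy(bag, 1, true));   // [[a-b, c]]
        // Reading the field directly yields the keys "c" and "d".
        System.out.println(distinctBy(bag, 1, false));  // [[a-b, c], [a-b, d]]
    }
}
```

Under these assumptions the split-based key reproduces the reported result {(a-b,c)}, while the field-based key gives the expected {(a-b,c), (a-b,d)}.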