[ https://issues.apache.org/jira/browse/DATAFU-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15149120#comment-15149120 ]
Matthew Hayes commented on DATAFU-114:
--------------------------------------
[~eyal], are you adding the project from the root of the DataFu repo, or are you
adding each sub-project individually? You need to add each project
separately: build-plugin, datafu-pig, etc. Is this your issue? I don't think
this was documented. Also, if you follow the instructions at
https://cwiki.apache.org/confluence/display/DATAFU/Contributing+to+Apache+DataFu
you should be able to generate a patch that RB accepts. This is what I do.
> Make FirstTupleFromBag implement Accumulator
> --------------------------------------------
>
> Key: DATAFU-114
> URL: https://issues.apache.org/jira/browse/DATAFU-114
> Project: DataFu
> Issue Type: Improvement
> Affects Versions: 1.3.0
> Environment: All
> Reporter: Eyal Allweil
> Assignee: Eyal Allweil
> Priority: Minor
> Labels: easyfix, newbie, performance
> Fix For: 1.3.1
>
> Attachments: FirstTupleFromBag.java
>
>
> FirstTupleFromBag only needs the first tuple from the bag, but because it
> doesn't implement Accumulator, the entire bag must be passed to it in
> memory. The fix is very minor and will let the UDF support large bags.
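
To illustrate the idea in the description above, here is a minimal,
self-contained sketch. It is not the attached FirstTupleFromBag.java and does
not use Pig's classes: BatchAccumulator is a hypothetical stand-in that only
mirrors the general shape of an accumulate / getValue / cleanup contract, and
plain Java values stand in for tuples. The point it tries to show is that once
the first element has been seen, later batches can be ignored, so the whole bag
never has to be held in memory at once.

```java
import java.util.Arrays;
import java.util.Collections;

public class FirstTupleSketch {

    // Hypothetical stand-in for an accumulator-style contract (an assumption
    // for this sketch, not Pig's actual interface): the framework feeds the
    // bag in batches, asks for the result, then resets the instance.
    interface BatchAccumulator<T> {
        void accumulate(Iterable<T> batch);
        T getValue();
        void cleanup();
    }

    // Keeps only the first element it ever sees; memory use is constant.
    static class FirstTuple<T> implements BatchAccumulator<T> {
        private T first;
        private boolean found;

        @Override
        public void accumulate(Iterable<T> batch) {
            if (found) {
                return; // first element already captured, skip this batch
            }
            for (T t : batch) {
                first = t;
                found = true;
                break; // only the first element of the batch is needed
            }
        }

        @Override
        public T getValue() {
            return first; // null if every batch was empty
        }

        @Override
        public void cleanup() {
            first = null;
            found = false;
        }
    }

    public static void main(String[] args) {
        FirstTuple<String> acc = new FirstTuple<>();
        acc.accumulate(Collections.<String>emptyList()); // empty batch, no effect
        acc.accumulate(Arrays.asList("a", "b"));
        acc.accumulate(Arrays.asList("c"));              // ignored
        System.out.println(acc.getValue());              // prints "a"
    }
}
```

In this sketch the early return in accumulate is what keeps the work per batch
small; a real UDF would presumably also keep its existing whole-bag code path
for callers that do not use the accumulator mode.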
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)