Eyal Allweil created DATAFU-165: ----------------------------------- Summary: Add collectLimitedList and dedupRandomN methods Key: DATAFU-165 URL: https://issues.apache.org/jira/browse/DATAFU-165 Project: DataFu Issue Type: New Feature Reporter: Eyal Allweil
This was opened as [Github PR #24|https://github.com/apache/datafu/pull/24] - it's additional functionality that's in the PayPal package that datafu-spark originated from. It breaks support for Spark 2.1.x, so announcing that we're doing that has to happen before this gets merged. -- This message was sent by Atlassian Jira (v8.20.10#820010)