Eyal Allweil created DATAFU-165:
-----------------------------------
Summary: Add collectLimitedList and dedupRandomN methods
Key: DATAFU-165
URL: https://issues.apache.org/jira/browse/DATAFU-165
Project: DataFu
Issue Type: New Feature
Reporter: Eyal Allweil
This was opened as [Github PR #24|https://github.com/apache/datafu/pull/24] -
it's additional functionality that's in the PayPal package that datafu-spark
originated from.
It breaks support for Spark 2.1.x, so announcing that we're doing that has to
happen before this gets merged.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)