Hi all, updating that a new contributor (Alon Hartanu) completed DATAFU-180
so I will start preparing the release. I will do this release myself since
it's been delayed for so long, but I'd like someone else to do the next
one, so start thinking if it's something you'd like to / be willing to do.

Eyal

On Tue, Feb 11, 2025 at 9:41 PM Eyal Allweil <e...@apache.org> wrote:

> Yes, that makes sense - any last minute takers? It's a good issue for new
> contributors.
>
> On 2025/02/11 06:45:17 Ohad Raviv wrote:
> > Hi,
> >
> > I think we should also include DATAFU-180 in the upcoming release, WDYT?
> >
> > Ohad.
> >
> > On Sun, 9 Feb 2025 at 10:50, Eyal Allweil <e...@apache.org> wrote:
> >
> > > Hi everyone,
> > >
> > > All three issues have been completed and merged - anything else worth
> > > waiting for or can we start a release?
> > >
> > > Cheers,
> > > Eyal
> > >
> > > On 2024/12/03 15:41:00 Eyal Allweil wrote:
> > > > Hi all,
> > > >
> > > > We've had a relatively inactive year, but now we have three
> issues/pull
> > > > requests that need to be reviewed. Once we merge them I think we can
> > > > release a new version, which will support Spark all the way to Spark
> > > 3.4.x
> > > > and bring us up to date with their releases. I'm writing a short
> > > > description here - anyone who can, please review them. If you don't
> have
> > > > time for an in-depth code review, checking the
> interface/documentation of
> > > > the two new methods is also important.
> > > >
> > > > The issues are:
> > > >
> > > > DATAFU-176 <https://issues.apache.org/jira/browse/DATAFU-176> - do
> > > > dedupTopN with combiner. This is like our dedupTopN, but uses the
> > > combiner
> > > > to deal with extreme skew efficiently. A use case that came up at
> PayPal.
> > > >
> > > > DATAFU-177 <https://issues.apache.org/jira/browse/DATAFU-177>- Add
> > > > dedupByAllExcept. This method is for deduplicating otherwise
> identical
> > > rows
> > > > with differing ids. Also a use case that came up at PayPal.
> > > >
> > > > DATAFU-179 <https://issues.apache.org/jira/browse/DATAFU-179> -
> support
> > > > Spark 3.3.x and 3.4.x. Self-explanatory. I did this one, and it
> didn't
> > > take
> > > > much, but I'd be glad for some good double-checking.
> > > >
> > > > A note to interested non-committers - *we welcome your review and
> > > comments*!
> > > > Feel free to write in either the Jira issues or the Github PRs.
> > > >
> > > > Cheers,
> > > > Eyal
> > > >
> > >
> >
>

Reply via email to