Re: [DataFusion] Question about Accumulator API and maybe potential bugs

2022-01-04 Thread LM
Hi Jorge, That makes sense, thanks for the clarification. Thanks, Lin On Mon, 3 Jan 2022 at 23:49, Jorge Cardoso Leitão wrote: > Hi, > > The accumulator API is designed to accept multiple columns (e.g. the > pearson correlation takes 2 columns, not one). [0] corresponds to > the first column

Re: [DataFusion] Question about Accumulator API and maybe potential bugs

2022-01-03 Thread Jorge Cardoso Leitão
Hi, The accumulator API is designed to accept multiple columns (e.g. the pearson correlation takes 2 columns, not one). [0] corresponds to the first column passed to the accumulator. All concrete implementations of accumulators in DataFusion atm only accept one column (Sum, Avg, Count, Min, Max),

[DataFusion] Question about Accumulator API and maybe potential bugs

2022-01-03 Thread LM
Hi All, I just started looking into DataFusion and am considering using it as the platform for our next gen analytics solution. To get started, I tried to add a few functions such as stddev. While writing the code I noticed some discrepancies (it may also be my unfamiliarity of the code base) in