[GitHub] [arrow-datafusion] realno commented on pull request #1525: Add stddev operator

GitBox Sat, 08 Jan 2022 15:07:07 -0800


realno commented on pull request #1525:
URL: 
https://github.com/apache/arrow-datafusion/pull/1525#issuecomment-1008174462



   @alamb thanks for the through review! I will open a followup PR to add 
`batch_update` and `batch_merge` as discussed.
   
   I want to discuss the aggregator interface a bit further. The `update` 
function doesn't seem to provide much value with the current architecture, it 
seems under all circumstances `batch_update` will perform better. Would it be a 
reasonable choice to just keep `batch_update` (and remove `update`)? This way 
we can get rid a lot of code dealing with record level ScalarValue arithmetic. 
IMO it only makes sense to differentiate the two if `update` can provide 
additional value such as lower latency in streaming context. 
   
   With that said I am still new to the code base and could have missed 
something here. It would be nice to hear more people's opinion. Also please let 
me know if there is a more appropriate forum to discuss this.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-datafusion] realno commented on pull request #1525: Add stddev operator

Reply via email to