Brian Hulette created BEAM-12550:
------------------------------------
Summary: Implement parallelized skew and kurtosis
Key: BEAM-12550
URL: https://issues.apache.org/jira/browse/BEAM-12550
Project: Beam
Issue Type: Improvement
Components: dsl-dataframe
Reporter: Brian Hulette
skew and kurtosis should be parallelizable using an [approach similar to the one used for std and
var|https://github.com/apache/beam/blob/a0f5e932d8a9aa491b16361abdc629b5e9a483f6/sdks/python/apache_beam/dataframe/frames.py#L1307-L1310].
See
https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Higher-order_statistics
for how to extend that approach to the third and fourth central moments, which
skew and kurtosis require.
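
A rough sketch of what the combine step could look like, assuming the pairwise update formulas from the Wikipedia section linked above. This is plain Python, not Beam code: the names Moments, moments_of, combine, skew and kurtosis are hypothetical and do not exist in frames.py. Each partition is reduced to (n, mean, M2, M3, M4), where Mk is the sum of k-th powers of deviations from the mean, and partial results are merged pairwise. The final step applies the bias-corrected sample estimators, which I believe is what pandas reports by default, so skew needs n >= 3 and kurtosis needs n >= 4.

```python
import math
from typing import NamedTuple, Sequence


class Moments(NamedTuple):
    """Per-partition summary: count, mean, and sums of powers of deviations."""
    n: int
    mean: float
    m2: float
    m3: float
    m4: float


def moments_of(values: Sequence[float]) -> Moments:
    """Compute the summary for one partition directly (two passes)."""
    n = len(values)
    if n == 0:
        return Moments(0, 0.0, 0.0, 0.0, 0.0)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    return Moments(n, mean,
                   sum(d ** 2 for d in dev),
                   sum(d ** 3 for d in dev),
                   sum(d ** 4 for d in dev))


def combine(a: Moments, b: Moments) -> Moments:
    """Merge two partition summaries without revisiting the data."""
    if a.n == 0:
        return b
    if b.n == 0:
        return a
    n = a.n + b.n
    delta = b.mean - a.mean
    mean = a.mean + delta * b.n / n
    m2 = a.m2 + b.m2 + delta ** 2 * a.n * b.n / n
    m3 = (a.m3 + b.m3
          + delta ** 3 * a.n * b.n * (a.n - b.n) / n ** 2
          + 3 * delta * (a.n * b.m2 - b.n * a.m2) / n)
    m4 = (a.m4 + b.m4
          + delta ** 4 * a.n * b.n * (a.n ** 2 - a.n * b.n + b.n ** 2) / n ** 3
          + 6 * delta ** 2 * (a.n ** 2 * b.m2 + b.n ** 2 * a.m2) / n ** 2
          + 4 * delta * (a.n * b.m3 - b.n * a.m3) / n)
    return Moments(n, mean, m2, m3, m4)


def skew(m: Moments) -> float:
    """Bias-corrected sample skewness; requires n >= 3 and nonzero variance."""
    g1 = math.sqrt(m.n) * m.m3 / m.m2 ** 1.5
    return math.sqrt(m.n * (m.n - 1)) / (m.n - 2) * g1


def kurtosis(m: Moments) -> float:
    """Bias-corrected excess kurtosis; requires n >= 4 and nonzero variance."""
    g2 = m.n * m.m4 / m.m2 ** 2 - 3
    return (m.n - 1) / ((m.n - 2) * (m.n - 3)) * ((m.n + 1) * g2 + 6)
```

Folding combine over the per-partition results (for example with functools.reduce) should give the same summary as moments_of on the concatenated data, up to floating-point error. Since combine is associative in exact arithmetic, it seems like a natural fit for the same kind of combiner that std and var already use.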