Brian Hulette created BEAM-12550:
------------------------------------

             Summary: Implement parallelized skew and kurtosis 
                 Key: BEAM-12550
                 URL: https://issues.apache.org/jira/browse/BEAM-12550
             Project: Beam
          Issue Type: Improvement
          Components: dsl-dataframe
            Reporter: Brian Hulette


skew and kurtosis should be parallelizable using a similar [approach as std and 
var|https://github.com/apache/beam/blob/a0f5e932d8a9aa491b16361abdc629b5e9a483f6/sdks/python/apache_beam/dataframe/frames.py#L1307-L1310].
 See 
https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Higher-order_statistics
which has information on extending that approach to calculating the third and 
fourth central moments, needed for skew and kurtosis.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to