Frank McQuillan created MADLIB-1166:
---------------------------------------

             Summary: Correlation - do not drop whole row if a target column is NULL
                 Key: MADLIB-1166
                 URL: https://issues.apache.org/jira/browse/MADLIB-1166
             Project: Apache MADlib
          Issue Type: Improvement
          Components: Module: Descriptive Statistics
            Reporter: Frank McQuillan
             Fix For: v2.0


http://madlib.apache.org/docs/latest/group__grp__correlation.html

The current implementation drops any row that contains a NULL in any target 
column. As a result, the non-NULL values in that row do not contribute to any 
of the computed correlations.

The proposed improvement is to stop dropping the whole row when a target 
column is NULL, and instead compute each pairwise correlation from every row 
in which both columns of the pair are non-NULL, to make the most use of the 
data.
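
To illustrate the difference, here is a minimal Python sketch (not MADlib's 
SQL implementation). The function names and sample data are hypothetical, and 
None stands in for NULL. It compares dropping whole rows with keeping every 
row in which both columns of a given pair are present.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def listwise_corr(rows, i, j):
    """Behavior described as current: keep only rows with no None in any column."""
    complete = [r for r in rows if all(v is not None for v in r)]
    return pearson([r[i] for r in complete], [r[j] for r in complete])


def pairwise_corr(rows, i, j):
    """Proposed behavior: keep every row where columns i and j are both present."""
    usable = [r for r in rows if r[i] is not None and r[j] is not None]
    return pearson([r[i] for r in usable], [r[j] for r in usable])


# Hypothetical sample data: the third column is None in two of five rows.
rows = [
    (1.0, 2.0, 1.0),
    (2.0, 4.0, None),
    (3.0, 7.0, 2.0),
    (4.0, 8.0, None),
    (5.0, 9.0, 5.0),
]

# Columns 0 and 1 are fully populated, yet listwise deletion uses only 3 rows
# for this pair, while the pairwise approach uses all 5.
print(listwise_corr(rows, 0, 1))
print(pairwise_corr(rows, 0, 1))
```

For a pair that involves the column with missing values (columns 0 and 2 
here), both approaches use the same three rows and give the same result. A 
real implementation would also need to handle pairs with fewer than two 
usable rows or with zero variance, which this sketch does not.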



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
