Frank McQuillan created MADLIB-1166:
---------------------------------------
Summary: Correlation - do not drop whole row if a target column is
NULL
Key: MADLIB-1166
URL: https://issues.apache.org/jira/browse/MADLIB-1166
Project: Apache MADlib
Issue Type: Improvement
Components: Module: Descriptive Statistics
Reporter: Frank McQuillan
Fix For: v2.0
http://madlib.apache.org/docs/latest/group__grp__correlation.html
Current implementation ignores a row that contains NULL entirely. This means
any correlation in such a row (with NULLs) does not contribute to the final
answer.
This improvement is not to drop the whole row if a target column is NULL, but
rather to do pairwise correlation for any target columns that exist, to make
most use of the data.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)