pranav.lo...@cloudera.com has uploaded this change for review. ( http://gerrit.cloudera.org:8080/20345
Change subject: IMPALA-12361: Implementation of regr_count(), regr_avgx() and regr_avgy() ...................................................................... IMPALA-12361: Implementation of regr_count(), regr_avgx() and regr_avgy() The linear regression functions fit an ordinary-least-squares regression line to a set of number pairs. They can be used both as aggregate and analytic functions. regr_count() returns an integer that is the number of non-null number pairs used to fit the regression line. regr_avgx() evaluates the average of the independent variable(x) of the regression line. It computes avg(expr2) after the elimination of null (expr1, expr2). regr_avgy() evaluates the average of the dependent variable(y) of the regression line. It computes avg(expr1) after the elimination of null (expr1, expr2). Testing: The functions are extensively tested and cross-checked with Hive. The tests can be found in aggregation.test. Change-Id: Ia17c565758565b868d3a54b3ebc12da51c37e143 --- M be/src/exprs/aggregate-functions-ir.cc M be/src/exprs/aggregate-functions.h M bin/cmake_aux/create_py3_virtualenv.sh M fe/src/main/java/org/apache/impala/catalog/BuiltinsDb.java M testdata/workloads/functional-query/queries/QueryTest/aggregation.test 5 files changed, 964 insertions(+), 29 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/45/20345/2 -- To view, visit http://gerrit.cloudera.org:8080/20345 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: newchange Gerrit-Change-Id: Ia17c565758565b868d3a54b3ebc12da51c37e143 Gerrit-Change-Number: 20345 Gerrit-PatchSet: 2 Gerrit-Owner: Anonymous Coward <pranav.lo...@cloudera.com> Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>