pranav.lo...@cloudera.com has uploaded this change for review.
( http://gerrit.cloudera.org:8080/20345 )


Change subject: IMPALA-12361: Implementation of regr_count(), regr_avgx() and 
regr_avgy()
......................................................................

IMPALA-12361: Implementation of regr_count(), regr_avgx() and regr_avgy()

The linear regression functions fit an ordinary-least-squares regression
line to a set of number pairs. They can be used both as aggregate and
analytic functions.

regr_count() returns, as an integer, the number of (expr1, expr2)
pairs in which both values are non-null. These are the pairs used to
fit the regression line.
regr_avgx() returns the average of the independent variable (x) of
the regression line. It computes avg(expr2) after eliminating every
(expr1, expr2) pair in which either value is null.
regr_avgy() returns the average of the dependent variable (y) of
the regression line. It computes avg(expr1) after eliminating every
(expr1, expr2) pair in which either value is null.
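
The null-elimination rule described above can be illustrated with a
minimal Python sketch. This is not the code in the patch (the backend
change is in C++); the helper name _non_null_pairs and the behaviour on
empty input (0 for the count, None for the averages) are assumptions
made for illustration only.

```python
from typing import Iterable, List, Optional, Tuple

# Each row is (expr1, expr2), i.e. (y, x): dependent value first.
Pair = Tuple[Optional[float], Optional[float]]


def _non_null_pairs(pairs: Iterable[Pair]) -> List[Tuple[float, float]]:
    """Hypothetical helper: keep only pairs where both values are non-null."""
    return [(y, x) for y, x in pairs if y is not None and x is not None]


def regr_count(pairs: Iterable[Pair]) -> int:
    """Number of pairs that survive null elimination."""
    return len(_non_null_pairs(pairs))


def regr_avgx(pairs: Iterable[Pair]) -> Optional[float]:
    """Average of expr2 (x) over the surviving pairs.

    Returning None when no pair survives is an assumption of this sketch.
    """
    kept = _non_null_pairs(pairs)
    if not kept:
        return None
    return sum(x for _, x in kept) / len(kept)


def regr_avgy(pairs: Iterable[Pair]) -> Optional[float]:
    """Average of expr1 (y) over the surviving pairs.

    Returning None when no pair survives is an assumption of this sketch.
    """
    kept = _non_null_pairs(pairs)
    if not kept:
        return None
    return sum(y for y, _ in kept) / len(kept)
```

Note that a row with a null in either position is dropped from both
averages, so regr_avgx() can differ from a plain avg(expr2) over the
same rows.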

Testing:
The functions are extensively tested and their results are
cross-checked against Hive. The tests are in aggregation.test.

Change-Id: Ia17c565758565b868d3a54b3ebc12da51c37e143
---
M be/src/exprs/aggregate-functions-ir.cc
M be/src/exprs/aggregate-functions.h
M bin/cmake_aux/create_py3_virtualenv.sh
M fe/src/main/java/org/apache/impala/catalog/BuiltinsDb.java
M testdata/workloads/functional-query/queries/QueryTest/aggregation.test
5 files changed, 964 insertions(+), 29 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/45/20345/2
--
To view, visit http://gerrit.cloudera.org:8080/20345
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ia17c565758565b868d3a54b3ebc12da51c37e143
Gerrit-Change-Number: 20345
Gerrit-PatchSet: 2
Gerrit-Owner: Anonymous Coward <pranav.lo...@cloudera.com>
Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>
