[jira] [Commented] (DRILL-1487) Drill window functions return wrong results

Timothy Chen (JIRA) Mon, 13 Oct 2014 17:04:08 -0700

    [ 
https://issues.apache.org/jira/browse/DRILL-1487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170266#comment-14170266
 ]


Timothy Chen commented on DRILL-1487:
-------------------------------------

Hi [~jni], so if I understand correctly, you're saying that when an order by is 
added, the aggregate for each row must include every row in the same partition 
that shares that row's order by value?

In other words, with dataset
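employee_id | position_id | salary
     1      |      1      |   4
     2      |      1      |   4
     3      |      2      |   2

The sum with partition by and order by on position id returns:

employee_id | position_id | sum
     1      |      1      |   8
     2      |      1      |   8
     3      |      2      |   2

instead of 

employee_id | position_id | sum
     1      |      1      |   4
     2      |      1      |   8
     3      |      2      |   2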

Where the 2nd table is a sliding window sum?
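
To make the two readings concrete, here is a minimal sketch in plain Python (not Drill code). It assumes the first table corresponds to the SQL default frame when ORDER BY is present (RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW, where rows with equal order by values are peers) and the second to a ROWS frame. The names `windowed_sum` and `frame` are made up for illustration.

```python
from collections import defaultdict

# Rows from the example dataset: (employee_id, position_id, salary)
DATA = [(1, 1, 4), (2, 1, 4), (3, 2, 2)]

def windowed_sum(rows, frame):
    """SUM(salary) OVER (PARTITION BY position_id ORDER BY position_id).

    frame="range": frame ends at the last peer of the current row
                   (all rows with the same ORDER BY value are included).
    frame="rows":  frame ends at the current physical row (running sum).
    """
    partitions = defaultdict(list)
    for row in rows:
        partitions[row[1]].append(row)          # PARTITION BY position_id

    result = []
    for key in sorted(partitions):
        # ORDER BY position_id; Python's sort is stable, so ties keep input order.
        part = sorted(partitions[key], key=lambda r: r[1])
        for i, (emp, pos, _) in enumerate(part):
            if frame == "range":
                framed = [r for r in part if r[1] <= pos]
            elif frame == "rows":
                framed = part[: i + 1]
            else:
                raise ValueError("frame must be 'range' or 'rows'")
            result.append((emp, pos, sum(r[2] for r in framed)))
    return result

print(windowed_sum(DATA, "range"))
print(windowed_sum(DATA, "rows"))
```

Because the partition key and the order key are the same column here, every row in a partition is a peer of every other, so the "range" reading collapses to the plain per-partition sum.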

And also what happens when we add configurable offsets?
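
For the offsets question, one possible reading is a frame such as ROWS BETWEEN n PRECEDING AND m FOLLOWING. A rough sketch of that behaviour within a single, already ordered partition follows; the function name and the sample values are hypothetical and only illustrate how the frame is clipped at partition edges.

```python
def sum_rows_between(values, preceding, following):
    """Sum over a frame of `preceding` rows before and `following` rows
    after the current row, clipped at the partition boundaries."""
    out = []
    for i in range(len(values)):
        lo = max(0, i - preceding)                 # do not run past the first row
        hi = min(len(values), i + following + 1)   # do not run past the last row
        out.append(sum(values[lo:hi]))
    return out

# Hypothetical salaries for one partition, already in ORDER BY order.
salaries = [4, 4, 2, 5]
print(sum_rows_between(salaries, preceding=1, following=1))
```

With offsets the frame is defined by row position rather than by peer groups, so tied order by values no longer pull each other into the frame automatically.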



> Drill window functions return wrong results
> -------------------------------------------
>
>                 Key: DRILL-1487
>                 URL: https://issues.apache.org/jira/browse/DRILL-1487
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Neeraja
>            Assignee: Timothy Chen
>
> The following window function query is meant to show how a given employee's 
> salary compares to the avg(salary) for his/her position. The query executes 
> fine but returns wrong results (avg(salary) is expected to stay the same 
> within a given window, i.e. position_id):
> 0: jdbc:drill:zk=local> SELECT employee_id,position_id, salary, avg(salary) 
> OVER (PARTITION BY position_id order by position_id) FROM cp.`employee.json` 
> order by employee_id;
> +-------------+-------------+------------+--------------------+
> | employee_id | position_id |   salary   |       EXPR$3       |
> +-------------+-------------+------------+--------------------+
> | 1           | 1           | 80000.0    | 80000.0            |
> | 2           | 2           | 40000.0    | 37500.0            |
> | 4           | 2           | 40000.0    | 38333.333333333336 |
> | 5           | 2           | 35000.0    | 35000.0            |
> | 6           | 3           | 25000.0    | 25000.0            |
> | 7           | 4           | 15000.0    | 15000.0            |
> | 8           | 11          | 10000.0    | 14333.333333333334 |
> | 9           | 11          | 17000.0    | 17000.0            |



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
