[
https://issues.apache.org/jira/browse/IMPALA-7564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626577#comment-16626577
]
Paul Rogers commented on IMPALA-7564:
-
Great description. I think we can tease apart
[
https://issues.apache.org/jira/browse/IMPALA-7604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626508#comment-16626508
]
Paul Rogers commented on IMPALA-7604:
-
Thanks, [~tarmstrong], for the very clear exp
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626483#comment-16626483
]
Paul Rogers edited comment on IMPALA-7310 at 9/24/18 9:27 PM:
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626483#comment-16626483
]
Paul Rogers commented on IMPALA-7310:
-
Final solution is even simpler, since we don'
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Summary: Improve cardinality and selectivity estimates (was: Improve
default selectivity values)
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626203#comment-16626203
]
Paul Rogers commented on IMPALA-7601:
-
Based on the above reasoning, here is a recom
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Summary: Improve default selectivity values (was: Define better default
selectivity values)
> Im
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Summary: Define better default selectivity values (was: Define a-priori
selectivity and NDV value
[
https://issues.apache.org/jira/browse/IMPALA-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7603:
Summary: Incorrect NDV expression for col1 mathop col2 (was: Incorrect NDV
expression for col1 op
[
https://issues.apache.org/jira/browse/IMPALA-7608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7608:
Description:
Impala makes heavy use of stats, which is a good thing. Stats feed into query
planni
Paul Rogers created IMPALA-7608:
---
Summary: Estimate row count from file size when no stats available
Key: IMPALA-7608
URL: https://issues.apache.org/jira/browse/IMPALA-7608
Project: IMPALA
Issu
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
[
https://issues.apache.org/jira/browse/IMPALA-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16624019#comment-16624019
]
Paul Rogers commented on IMPALA-7560:
-
Created a unit test for this.
{noformat}
[
https://issues.apache.org/jira/browse/IMPALA-7604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623051#comment-16623051
]
Paul Rogers commented on IMPALA-7604:
-
[~tarmstrong], in my experience, using planne
Paul Rogers created IMPALA-7604:
---
Summary: In AggregationNode.computeStats, handle cardinality
overflow better
Key: IMPALA-7604
URL: https://issues.apache.org/jira/browse/IMPALA-7604
Project: IMPALA
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622856#comment-16622856
]
Paul Rogers edited comment on IMPALA-7310 at 9/21/18 12:04 AM:
---
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622856#comment-16622856
]
Paul Rogers edited comment on IMPALA-7310 at 9/20/18 11:36 PM:
---
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622856#comment-16622856
]
Paul Rogers commented on IMPALA-7310:
-
The planner uses NDVs to make binary decision
[
https://issues.apache.org/jira/browse/IMPALA-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622779#comment-16622779
]
Paul Rogers commented on IMPALA-7603:
-
Turns out that a similar limitation exists fo
[
https://issues.apache.org/jira/browse/IMPALA-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7603:
Description:
Consider the
[[{{ExprNdvTest}}|https://github.com/apache/impala/blob/master/fe/src/t
[
https://issues.apache.org/jira/browse/IMPALA-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7603:
Description:
Consider the
[{{ExprNdvTest}}|https://github.com/apache/impala/blob/master/fe/src/te
Paul Rogers created IMPALA-7603:
---
Summary: Incorrect NDV expression for col1 op col2
Key: IMPALA-7603
URL: https://issues.apache.org/jira/browse/IMPALA-7603
Project: IMPALA
Issue Type: Bug
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622712#comment-16622712
]
Paul Rogers edited comment on IMPALA-7310 at 9/20/18 9:08 PM:
Paul Rogers created IMPALA-7602:
---
Summary: Definition of NDV differs between planner and stats
mechanism
Key: IMPALA-7602
URL: https://issues.apache.org/jira/browse/IMPALA-7602
Project: IMPALA
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622712#comment-16622712
]
Paul Rogers commented on IMPALA-7310:
-
Per the suggestion of [~jeszyb], created IMPA
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622709#comment-16622709
]
Paul Rogers commented on IMPALA-7601:
-
Please see [~tarmstr...@cloudera.com]'s comme
[
https://issues.apache.org/jira/browse/IMPALA-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated IMPALA-7601:
Description:
Impala makes extensive use of table stats during query planning. For example,
the ND
Paul Rogers created IMPALA-7601:
---
Summary: Define a-priori selectivity and NDV values
Key: IMPALA-7601
URL: https://issues.apache.org/jira/browse/IMPALA-7601
Project: IMPALA
Issue Type: Improve
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621434#comment-16621434
]
Paul Rogers edited comment on IMPALA-7310 at 9/20/18 3:02 AM:
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621434#comment-16621434
]
Paul Rogers commented on IMPALA-7310:
-
Odd. Looked at the tests in {{ExprNdvTest}}.
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621096#comment-16621096
]
Paul Rogers edited comment on IMPALA-7310 at 9/20/18 1:50 AM:
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621222#comment-16621222
]
Paul Rogers commented on IMPALA-7310:
-
[~jeszyb], agree completely. Here I'm digging
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621196#comment-16621196
]
Paul Rogers commented on IMPALA-7310:
-
Here, it is worth pointing out the risk of an
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621171#comment-16621171
]
Paul Rogers commented on IMPALA-7310:
-
The original description pointed out the meth
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621126#comment-16621126
]
Paul Rogers commented on IMPALA-7310:
-
As noted above, the code uses -1 as an "undef
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621096#comment-16621096
]
Paul Rogers commented on IMPALA-7310:
-
Simplest case: a binary predicate. Current be
[
https://issues.apache.org/jira/browse/IMPALA-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16620946#comment-16620946
]
Paul Rogers commented on IMPALA-7560:
-
The table in DRILL-5254 suggests how to use t
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16619968#comment-16619968
]
Paul Rogers commented on IMPALA-7310:
-
Simple reproduction:
{noformat}
create table
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on IMPALA-7310 started by Paul Rogers.
---
> Compute Stats not computing NULLs as a distinct value causing wrong estimates
>
[
https://issues.apache.org/jira/browse/IMPALA-7310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers reassigned IMPALA-7310:
---
Assignee: Paul Rogers
> Compute Stats not computing NULLs as a distinct value causing wrong
[
https://issues.apache.org/jira/browse/IMPALA-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16619827#comment-16619827
]
Paul Rogers edited comment on IMPALA-7560 at 9/19/18 1:34 AM:
[
https://issues.apache.org/jira/browse/IMPALA-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16619827#comment-16619827
]
Paul Rogers commented on IMPALA-7560:
-
Turns out that Apache Drill did a similar ana
601 - 647 of 647 matches
Mail list logo