[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368694#comment-15368694 ] ASF GitHub Bot commented on DRILL-4673: --- Github user parthchandra commented on the issue: https://github.com/apache/drill/pull/541 There are two files that are still showing up as binary. Can you update the patch? > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368665#comment-15368665 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70153763 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/planner/sql/parser/SqlDropTable.java --- @@ -92,4 +105,6 @@ public SqlIdentifier getTableIdentifier() { return tableName; } + public boolean checkTableExistance() { return tableExistenceCheck; } --- End diff -- Done. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368656#comment-15368656 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70153568 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/planner/sql/parser/SqlDropTable.java --- @@ -57,13 +63,20 @@ public SqlOperator getOperator() { @Override public List getOperandList() { -return Collections.singletonList((SqlNode) tableName); +List ops = Lists.newArrayList(); --- End diff -- Done. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368654#comment-15368654 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70153513 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/planner/sql/handlers/DropTableHandler.java --- @@ -69,6 +71,15 @@ public PhysicalPlan getPlan(SqlNode sqlNode) throws ValidationException, RelConv .build(logger); } +final Table existingTable = SqlHandlerUtil.getTableFromSchema(drillSchema, tableName); + +if (dropTableNode.checkTableExistance()) { + if (existingTable == null || existingTable.getJdbcTableType() != Schema.TableType.TABLE){ --- End diff -- Done. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368647#comment-15368647 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70153040 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/planner/sql/handlers/DropTableHandler.java --- @@ -69,6 +71,15 @@ public PhysicalPlan getPlan(SqlNode sqlNode) throws ValidationException, RelConv .build(logger); } +final Table existingTable = SqlHandlerUtil.getTableFromSchema(drillSchema, tableName); --- End diff -- I'm not sure about returning `null ` by `getTableFromSchema(...)` only if the table does not exist. Therefore renamed it to `tableToDrop ` and moved inside `if` block > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4746) Verification Failures (Decimal values) in drill's regression tests
[ https://issues.apache.org/jira/browse/DRILL-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368588#comment-15368588 ] Chun Chang commented on DRILL-4746: --- This test fails as well due to the change: /root/drillAutomation/framework-master/framework/resources/Advanced/tpcds/tpcds_sf1/original/text/query9.sql > Verification Failures (Decimal values) in drill's regression tests > -- > > Key: DRILL-4746 > URL: https://issues.apache.org/jira/browse/DRILL-4746 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Data Types, Storage - Text & CSV >Affects Versions: 1.7.0 >Reporter: Rahul Challapalli >Assignee: Arina Ielchiieva >Priority: Critical > Fix For: 1.8.0 > > > We started seeing the below 4 functional test failures in drill's extended > tests [1]. The data for the below tests can be downloaded from [2] > {code} > framework/resources/Functional/aggregates/tpcds_variants/text/aggregate28.q > framework/resources/Functional/tpcds/impala/text/q43.q > framework/resources/Functional/tpcds/variants/text/q6_1.sql > framework/resources/Functional/aggregates/tpcds_variants/text/aggregate29.q > {code} > The failures started showing up from the commit [3] > [1] https://github.com/mapr/drill-test-framework > [2] http://apache-drill.s3.amazonaws.com/files/tpcds-sf1-text.tgz > [3] > https://github.com/apache/drill/commit/223507b76ff6c2227e667ae4a53f743c92edd295 > Let me know if more information is needed to reproduce this issue. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Closed] (DRILL-2385) count on complex objects failed with missing function implementation
[ https://issues.apache.org/jira/browse/DRILL-2385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chun Chang closed DRILL-2385. - added automation tests. > count on complex objects failed with missing function implementation > > > Key: DRILL-2385 > URL: https://issues.apache.org/jira/browse/DRILL-2385 > Project: Apache Drill > Issue Type: Bug > Components: Functions - Drill >Affects Versions: 0.8.0 >Reporter: Chun Chang >Assignee: Vitalii Diravka >Priority: Minor > Fix For: 1.7.0 > > > #Wed Mar 04 01:23:42 EST 2015 > git.commit.id.abbrev=71b6bfe > Have a complex type looks like the following: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.sia from > `complex.json` t limit 1; > ++ > |sia | > ++ > | [1,11,101,1001] | > ++ > {code} > A count on the complex type will fail with missing function implementation: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.gbyi, count(t.sia) > countsia from `complex.json` t group by t.gbyi; > Query failed: RemoteRpcException: Failure while running fragment., Schema is > currently null. You must call buildSchema(SelectionVectorMode) before this > container can return a schema. [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on > qa-node119.qa.lab:31010 ] > [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on qa-node119.qa.lab:31010 ] > Error: exception while executing query: Failure while executing query. > (state=,code=0) > {code} > drillbit.log > {code} > 2015-03-04 13:44:51,383 [2b08832b-9247-e90c-785d-751f02fc1548:frag:2:0] ERROR > o.a.drill.exec.ops.FragmentContext - Fragment Context received failure. > org.apache.drill.exec.exception.SchemaChangeException: Failure while > materializing expression. > Error in expression at index 0. Error: Missing function implementation: > [count(BIGINT-REPEATED)]. Full expression: null. > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.createAggregatorInternal(HashAggBatch.java:210) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.createAggregator(HashAggBatch.java:158) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.buildSchema(HashAggBatch.java:101) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:130) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:67) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.partitionsender.PartitionSenderRootExec.innerNext(PartitionSenderRootExec.java:114) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:57) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:121) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.work.WorkManager$RunnableWrapper.run(WorkManager.java:303) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > [na:1.7.0_45] > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > [na:1.7.0_45] > at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45] > 2015-03-04 13:44:51,383 [2b08832b-9247-e90c-785d-751f02fc1548:frag:2:0] WARN > o.a.d.e.w.fragment.FragmentExecutor - Error while initializing or executing > fragment > java.lang.NullPointerException: Schema is currently null. You must call > buildSchema(SelectionVectorMode) before this container can return a schema. > at > com.google.common.base.Preconditions.checkNotNull(Preconditions.java:208) > ~[guava-14.0.1.jar:na] > at > org.apache.drill.exec.record.VectorContainer.getSchema(VectorContainer.java:261) > ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.record.AbstractRecordBatch.getSchema(AbstractRecordBatch.java:155) > ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at >
[jira] [Updated] (DRILL-4771) Drill should avoid doing the same join twice if count(distinct) exists
[ https://issues.apache.org/jira/browse/DRILL-4771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gautam Kumar Parai updated DRILL-4771: -- Affects Version/s: (was: 1.6.0) 1.2.0 > Drill should avoid doing the same join twice if count(distinct) exists > -- > > Key: DRILL-4771 > URL: https://issues.apache.org/jira/browse/DRILL-4771 > Project: Apache Drill > Issue Type: Improvement >Affects Versions: 1.2.0 >Reporter: Gautam Kumar Parai >Assignee: Gautam Kumar Parai > > When the query has one distinct aggregate and one or more non-distinct > aggregates, the join instance need not produce the join-based plan. We can > generate multi-phase aggregates. Another approach would be to use grouping > sets. However, Drill is unable to support grouping sets and instead relies on > the join-based plan (see the plan below) > {code} > select emp.empno, count(*), avg(distinct dept.deptno) > from sales.emp emp inner join sales.dept dept > on emp.deptno = dept.deptno > group by emp.empno > LogicalProject(EMPNO=[$0], EXPR$1=[$1], EXPR$2=[$3]) > LogicalJoin(condition=[IS NOT DISTINCT FROM($0, $2)], joinType=[inner]) > LogicalAggregate(group=[{0}], EXPR$1=[COUNT()]) > LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) > LogicalJoin(condition=[=($7, $9)], joinType=[inner]) > LogicalTableScan(table=[[CATALOG, SALES, EMP]]) > LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) > LogicalAggregate(group=[{0}], EXPR$2=[AVG($1)]) > LogicalAggregate(group=[{0, 1}]) > LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) > LogicalJoin(condition=[=($7, $9)], joinType=[inner]) > LogicalTableScan(table=[[CATALOG, SALES, EMP]]) > LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) > {code} > The more efficient form should look like this > {code} > select emp.empno, count(*), avg(distinct dept.deptno) > from sales.emp emp inner join sales.dept dept > on emp.deptno = dept.deptno > group by emp.empno > LogicalAggregate(group=[{0}], EXPR$1=[SUM($2)], EXPR$2=[AVG($1)]) > LogicalAggregate(group=[{0, 1}], EXPR$1=[COUNT()]) > LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) > LogicalJoin(condition=[=($7, $9)], joinType=[inner]) > LogicalTableScan(table=[[CATALOG, SALES, EMP]]) > LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (DRILL-4771) Drill should avoid doing the same join twice if count(distinct) exists
Gautam Kumar Parai created DRILL-4771: - Summary: Drill should avoid doing the same join twice if count(distinct) exists Key: DRILL-4771 URL: https://issues.apache.org/jira/browse/DRILL-4771 Project: Apache Drill Issue Type: Improvement Affects Versions: 1.6.0 Reporter: Gautam Kumar Parai Assignee: Gautam Kumar Parai When the query has one distinct aggregate and one or more non-distinct aggregates, the join instance need not produce the join-based plan. We can generate multi-phase aggregates. Another approach would be to use grouping sets. However, Drill is unable to support grouping sets and instead relies on the join-based plan (see the plan below) {code} select emp.empno, count(*), avg(distinct dept.deptno) from sales.emp emp inner join sales.dept dept on emp.deptno = dept.deptno group by emp.empno LogicalProject(EMPNO=[$0], EXPR$1=[$1], EXPR$2=[$3]) LogicalJoin(condition=[IS NOT DISTINCT FROM($0, $2)], joinType=[inner]) LogicalAggregate(group=[{0}], EXPR$1=[COUNT()]) LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) LogicalJoin(condition=[=($7, $9)], joinType=[inner]) LogicalTableScan(table=[[CATALOG, SALES, EMP]]) LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) LogicalAggregate(group=[{0}], EXPR$2=[AVG($1)]) LogicalAggregate(group=[{0, 1}]) LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) LogicalJoin(condition=[=($7, $9)], joinType=[inner]) LogicalTableScan(table=[[CATALOG, SALES, EMP]]) LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) {code} The more efficient form should look like this {code} select emp.empno, count(*), avg(distinct dept.deptno) from sales.emp emp inner join sales.dept dept on emp.deptno = dept.deptno group by emp.empno LogicalAggregate(group=[{0}], EXPR$1=[SUM($2)], EXPR$2=[AVG($1)]) LogicalAggregate(group=[{0, 1}], EXPR$1=[COUNT()]) LogicalProject(EMPNO=[$0], DEPTNO0=[$9]) LogicalJoin(condition=[=($7, $9)], joinType=[inner]) LogicalTableScan(table=[[CATALOG, SALES, EMP]]) LogicalTableScan(table=[[CATALOG, SALES, DEPT]]) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4514) Add describe schema command
[ https://issues.apache.org/jira/browse/DRILL-4514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368325#comment-15368325 ] ASF GitHub Bot commented on DRILL-4514: --- Github user jinfengni commented on the issue: https://github.com/apache/drill/pull/436 The new output looks better. I checked hive's command. Seems it lists schema name and its relevant properties. Does it make sense to add schema | database name in the output? > Add describe schema command > - > > Key: DRILL-4514 > URL: https://issues.apache.org/jira/browse/DRILL-4514 > Project: Apache Drill > Issue Type: New Feature >Affects Versions: Future >Reporter: Arina Ielchiieva >Assignee: Arina Ielchiieva > > Add describe database command which will return directory > associated with a database on the fly. > Syntax: > describe database > describe schema > Output: > {code:sql} > DESCRIBE SCHEMA dfs.tmp; > {code} > {noformat} > ++ > | properties | > ++ > | { > "type" : "file", > "enabled" : true, > "connection" : "file:///", > "config" : null, > "formats" : { > "psv" : { > "type" : "text", > "extensions" : [ "tbl" ], > "delimiter" : "|" > }, > "csv" : { > "type" : "text", > "extensions" : [ "csv" ], > "delimiter" : "," > }, > "tsv" : { > "type" : "text", > "extensions" : [ "tsv" ], > "delimiter" : "\t" > }, > "parquet" : { > "type" : "parquet" > }, > "json" : { > "type" : "json", > "extensions" : [ "json" ] > }, > "avro" : { > "type" : "avro" > }, > "sequencefile" : { > "type" : "sequencefile", > "extensions" : [ "seq" ] > }, > "csvh" : { > "type" : "text", > "extensions" : [ "csvh" ], > "extractHeader" : true, > "delimiter" : "," > } > }, > "location" : "/tmp", > "writable" : true, > "defaultInputFormat" : null > } | > ++ > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-2385) count on complex objects failed with missing function implementation
[ https://issues.apache.org/jira/browse/DRILL-2385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368194#comment-15368194 ] Chun Chang commented on DRILL-2385: --- Verified fix with: {noformat} 0: jdbc:drill:schema=dfs.drillTestDir> select * from sys.version; +-+---+-++-++ | version | commit_id | commit_message |commit_time | build_email | build_time | +-+---+-++-++ | 1.8.0-SNAPSHOT | 05c42eae79ce3e309028b3824f9449b98e329f29 | DRILL-4707: Fix memory leak or incorrect query result in case two column names are case-insensitive identical. | 29.06.2016 @ 08:15:13 PDT | inram...@gmail.com | 07.07.2016 @ 10:50:40 PDT | +-+---+-++-++ 1 row selected (0.44 seconds) 0: jdbc:drill:schema=dfs.drillTestDir> use dfs.drillTestDirComplexJson; +---+--+ | ok | summary | +---+--+ | true | Default schema changed to [dfs.drillTestDirComplexJson] | +---+--+ 1 row selected (0.478 seconds) 0: jdbc:drill:schema=dfs.drillTestDir> select t.sia from `complex.json` t limit 1; +--+ | sia| +--+ | [1,11,101,1001] | +--+ 1 row selected (0.363 seconds) 0: jdbc:drill:schema=dfs.drillTestDir> select t.gbyi, count(t.sia) countsia from `complex.json` t group by t.gbyi; +---+---+ | gbyi | countsia | +---+---+ | 1 | 6667 | | 2 | 6667 | | 3 | 6667 | | 4 | 6667 | | 5 | 6667 | | 6 | 6667 | | 7 | 6667 | | 8 | 6667 | | 9 | 6667 | | 10| 6667 | | 11| | | 12| | | 13| | | 14| | | 15| | +---+---+ 15 rows selected (1.693 seconds) {noformat} > count on complex objects failed with missing function implementation > > > Key: DRILL-2385 > URL: https://issues.apache.org/jira/browse/DRILL-2385 > Project: Apache Drill > Issue Type: Bug > Components: Functions - Drill >Affects Versions: 0.8.0 >Reporter: Chun Chang >Assignee: Vitalii Diravka >Priority: Minor > Fix For: 1.7.0 > > > #Wed Mar 04 01:23:42 EST 2015 > git.commit.id.abbrev=71b6bfe > Have a complex type looks like the following: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.sia from > `complex.json` t limit 1; > ++ > |sia | > ++ > | [1,11,101,1001] | > ++ > {code} > A count on the complex type will fail with missing function implementation: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.gbyi, count(t.sia) > countsia from `complex.json` t group by t.gbyi; > Query failed: RemoteRpcException: Failure while running fragment., Schema is > currently null. You must call buildSchema(SelectionVectorMode) before this > container can return a schema. [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on > qa-node119.qa.lab:31010 ] > [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on qa-node119.qa.lab:31010 ] > Error: exception while executing query: Failure while executing query. > (state=,code=0) > {code} > drillbit.log > {code} > 2015-03-04 13:44:51,383 [2b08832b-9247-e90c-785d-751f02fc1548:frag:2:0] ERROR > o.a.drill.exec.ops.FragmentContext - Fragment Context received failure. > org.apache.drill.exec.exception.SchemaChangeException: Failure while > materializing expression. > Error in expression at index 0. Error: Missing function implementation: > [count(BIGINT-REPEATED)]. Full expression: null. > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.createAggregatorInternal(HashAggBatch.java:210) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at >
[jira] [Updated] (DRILL-4199) Add Support for HBase 1.X
[ https://issues.apache.org/jira/browse/DRILL-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4199: - Reviewer: Abhishek Girish (was: Jacques Nadeau) > Add Support for HBase 1.X > - > > Key: DRILL-4199 > URL: https://issues.apache.org/jira/browse/DRILL-4199 > Project: Apache Drill > Issue Type: New Feature > Components: Storage - HBase >Affects Versions: 1.7.0 >Reporter: Divjot singh >Assignee: Aditya Kishore > Fix For: 1.7.0 > > > Is there any Road map to upgrade the Hbase version to 1.x series. Currently > drill supports Hbase 0.98 version. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-2593) 500 error when crc for a query profile is out of sync
[ https://issues.apache.org/jira/browse/DRILL-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-2593: - Reviewer: Krystal > 500 error when crc for a query profile is out of sync > - > > Key: DRILL-2593 > URL: https://issues.apache.org/jira/browse/DRILL-2593 > Project: Apache Drill > Issue Type: Bug > Components: Client - HTTP >Affects Versions: 0.7.0 >Reporter: Jason Altekruse >Assignee: Arina Ielchiieva > Fix For: 1.7.0 > > Attachments: warning1.JPG, warning2.JPG > > > To reproduce, on a machine where an embedded drillbit has been run, edit one > of the profiles stored in /tmp/drill/profiles and try to navigate to the > profiles page on the Web UI. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4716) status.json doesn't work in drill ui
[ https://issues.apache.org/jira/browse/DRILL-4716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4716: - Reviewer: Krystal > status.json doesn't work in drill ui > > > Key: DRILL-4716 > URL: https://issues.apache.org/jira/browse/DRILL-4716 > Project: Apache Drill > Issue Type: Bug > Components: Client - HTTP >Affects Versions: 1.6.0 >Reporter: Arina Ielchiieva >Assignee: Arina Ielchiieva >Priority: Minor > Fix For: 1.7.0 > > > 1. http://localhost:8047/status returns "Running!" > But http://localhost:8047/status.json gives error. > {code} > { > "errorMessage" : "HTTP 404 Not Found" > } > {code} > 2. Remove link to System Options on page http://localhost:8047/status as > redundant. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4770) ParquetRecordReader throws NPE querying a single int64 column file
[ https://issues.apache.org/jira/browse/DRILL-4770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chun Chang updated DRILL-4770: -- Attachment: int64_10_bs10k_ps1k_uncompressed.parquet > ParquetRecordReader throws NPE querying a single int64 column file > -- > > Key: DRILL-4770 > URL: https://issues.apache.org/jira/browse/DRILL-4770 > Project: Apache Drill > Issue Type: Bug > Components: Storage - Parquet >Affects Versions: 1.8.0 >Reporter: Chun Chang > Attachments: int64_10_bs10k_ps1k_uncompressed.parquet > > > I have a parquet file with a single int64 column. > {noformat} > [root@perfnode166 parquet-mr]# java -jar > parquet-tools/target/parquet-tools-1.8.2-SNAPSHOT.jar dump > /mapr/drill50.perf.lab/drill/testdata/parquet_storage/int64_10_bs10k_ps1k_uncompressed.parquet > row group 0 > > int64_field_required: INT64 UNCOMPRESSED DO:0 FPO:4 SZ:55/55/1.00 VC:10 > [more]... > int64_field_required TV=10 RL=0 DL=0 > > > page 0: DLE:RLE RLE:RLE VLE:DELTA_BINARY_PACKED ST:[min: 0, max: > [more]... VC:10 > INT64 int64_field_required > > *** row group 1 of 1, values 1 to 10 *** > value 1: R:0 D:0 V:0 > value 2: R:0 D:0 V:1 > value 3: R:0 D:0 V:2 > value 4: R:0 D:0 V:3 > value 5: R:0 D:0 V:4 > value 6: R:0 D:0 V:5 > value 7: R:0 D:0 V:6 > value 8: R:0 D:0 V:7 > value 9: R:0 D:0 V:8 > value 10: R:0 D:0 V:9 > {noformat} > Drill version: > {noformat} > 0: jdbc:drill:schema=dfs.drillTestDir> select * from sys.version; > +-+---+-++-++ > | version | commit_id | > commit_message > |commit_time | build_email | >build_time | > +-+---+-++-++ > | 1.8.0-SNAPSHOT | 05c42eae79ce3e309028b3824f9449b98e329f29 | DRILL-4707: > Fix memory leak or incorrect query result in case two column names are > case-insensitive identical. | 29.06.2016 @ 08:15:13 PDT | > inram...@gmail.com | 07.07.2016 @ 10:50:40 PDT | > +-+---+-++-++ > 1 row selected (0.44 seconds) > {noformat} > drill throws NPE: > {noformat} > 2016-07-08 11:08:55,156 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO > o.a.drill.exec.work.foreman.Foreman - Query text for query id > 288013c7-f122-f6be-936e-c18ebe9b92ef: select * from > dfs.`drill/testdata/parquet_storage/int64_10_bs10k_ps1k_uncompressed.parquet` > 2016-07-08 11:08:55,292 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO > o.a.d.exec.store.parquet.Metadata - Took 0 ms to get file statuses > 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO > o.a.d.exec.store.parquet.Metadata - Fetch parquet metadata: Executed 1 out of > 1 using 1 threads. Time: 2ms total, 2.423069ms avg, 2ms max. > 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO > o.a.d.exec.store.parquet.Metadata - Fetch parquet metadata: Executed 1 out of > 1 using 1 threads. Earliest start: 1.347000 μs, Latest start: 1.347000 μs, > Average start: 1.347000 μs . > 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO > o.a.d.exec.store.parquet.Metadata - Took 2 ms to read file metadata > 2016-07-08 11:08:55,377 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO > o.a.d.e.w.fragment.FragmentExecutor - > 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State change requested > AWAITING_ALLOCATION --> RUNNING > 2016-07-08 11:08:55,377 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO > o.a.d.e.w.f.FragmentStatusReporter - > 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State to report: RUNNING > 2016-07-08 11:08:55,386 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO > o.a.d.e.w.fragment.FragmentExecutor - > 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State change requested RUNNING --> >
[jira] [Updated] (DRILL-2385) count on complex objects failed with missing function implementation
[ https://issues.apache.org/jira/browse/DRILL-2385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-2385: - Reviewer: Chun Chang > count on complex objects failed with missing function implementation > > > Key: DRILL-2385 > URL: https://issues.apache.org/jira/browse/DRILL-2385 > Project: Apache Drill > Issue Type: Bug > Components: Functions - Drill >Affects Versions: 0.8.0 >Reporter: Chun Chang >Assignee: Vitalii Diravka >Priority: Minor > Fix For: 1.7.0 > > > #Wed Mar 04 01:23:42 EST 2015 > git.commit.id.abbrev=71b6bfe > Have a complex type looks like the following: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.sia from > `complex.json` t limit 1; > ++ > |sia | > ++ > | [1,11,101,1001] | > ++ > {code} > A count on the complex type will fail with missing function implementation: > {code} > 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.gbyi, count(t.sia) > countsia from `complex.json` t group by t.gbyi; > Query failed: RemoteRpcException: Failure while running fragment., Schema is > currently null. You must call buildSchema(SelectionVectorMode) before this > container can return a schema. [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on > qa-node119.qa.lab:31010 ] > [ 12856530-3133-45be-bdf4-ef8cc784f7b3 on qa-node119.qa.lab:31010 ] > Error: exception while executing query: Failure while executing query. > (state=,code=0) > {code} > drillbit.log > {code} > 2015-03-04 13:44:51,383 [2b08832b-9247-e90c-785d-751f02fc1548:frag:2:0] ERROR > o.a.drill.exec.ops.FragmentContext - Fragment Context received failure. > org.apache.drill.exec.exception.SchemaChangeException: Failure while > materializing expression. > Error in expression at index 0. Error: Missing function implementation: > [count(BIGINT-REPEATED)]. Full expression: null. > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.createAggregatorInternal(HashAggBatch.java:210) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.createAggregator(HashAggBatch.java:158) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.aggregate.HashAggBatch.buildSchema(HashAggBatch.java:101) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:130) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:67) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.partitionsender.PartitionSenderRootExec.innerNext(PartitionSenderRootExec.java:114) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:57) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:121) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.work.WorkManager$RunnableWrapper.run(WorkManager.java:303) > [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > [na:1.7.0_45] > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > [na:1.7.0_45] > at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45] > 2015-03-04 13:44:51,383 [2b08832b-9247-e90c-785d-751f02fc1548:frag:2:0] WARN > o.a.d.e.w.fragment.FragmentExecutor - Error while initializing or executing > fragment > java.lang.NullPointerException: Schema is currently null. You must call > buildSchema(SelectionVectorMode) before this container can return a schema. > at > com.google.common.base.Preconditions.checkNotNull(Preconditions.java:208) > ~[guava-14.0.1.jar:na] > at > org.apache.drill.exec.record.VectorContainer.getSchema(VectorContainer.java:261) > ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at > org.apache.drill.exec.record.AbstractRecordBatch.getSchema(AbstractRecordBatch.java:155) > ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT] > at >
[jira] [Updated] (DRILL-3623) Limit 0 should avoid execution when querying a known schema
[ https://issues.apache.org/jira/browse/DRILL-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-3623: - Reviewer: Krystal > Limit 0 should avoid execution when querying a known schema > --- > > Key: DRILL-3623 > URL: https://issues.apache.org/jira/browse/DRILL-3623 > Project: Apache Drill > Issue Type: Sub-task > Components: Storage - Hive >Affects Versions: 1.1.0 > Environment: MapR cluster >Reporter: Andries Engelbrecht >Assignee: Sudheesh Katkam > Labels: doc-impacting > Fix For: 1.7.0 > > > Running a select * from hive.table limit 0 does not return (hangs). > Select * from hive.table limit 1 works fine > Hive table is about 6GB with 330 files with parquet using snappy compression. > Data types are int, bigint, string and double. > Querying directory with parquet files through the DFS plugin works fine > select * from dfs.root.`/user/hive/warehouse/database/table` limit 0; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-3474) Add implicit file columns support
[ https://issues.apache.org/jira/browse/DRILL-3474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-3474: - Reviewer: Krystal > Add implicit file columns support > - > > Key: DRILL-3474 > URL: https://issues.apache.org/jira/browse/DRILL-3474 > Project: Apache Drill > Issue Type: New Feature > Components: Metadata >Affects Versions: 1.1.0 >Reporter: Jim Scott >Assignee: Arina Ielchiieva > Labels: doc-impacting > Fix For: 1.7.0 > > > I could not find another ticket which talks about this ... > The file name should be a column which can be selected or filtered when > querying a directory just like dir0, dir1 are available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4707) Conflicting columns names under case-insensitive policy lead to either memory leak or incorrect result
[ https://issues.apache.org/jira/browse/DRILL-4707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4707: - Reviewer: Chun Chang > Conflicting columns names under case-insensitive policy lead to either memory > leak or incorrect result > -- > > Key: DRILL-4707 > URL: https://issues.apache.org/jira/browse/DRILL-4707 > Project: Apache Drill > Issue Type: Bug >Reporter: Jinfeng Ni >Assignee: Jinfeng Ni >Priority: Critical > Fix For: 1.8.0 > > > On latest master branch: > {code} > select version, commit_id, commit_message from sys.version; > +-+---+-+ > | version | commit_id | > commit_message | > +-+---+-+ > | 1.7.0-SNAPSHOT | 3186217e5abe3c6c2c7e504cdb695567ff577e4c | DRILL-4607: > Add a split function that allows to separate string by a delimiter | > +-+---+-+ > {code} > If a query has two conflicting column names under case-insensitive policy, > Drill will either hit memory leak, or incorrect issue. > Q1. > {code} > select r_regionkey as XYZ, r_name as xyz FROM cp.`tpch/region.parquet`; > Error: SYSTEM ERROR: IllegalStateException: Memory was leaked by query. > Memory leaked: (131072) > Allocator(op:0:0:1:Project) 100/131072/2490368/100 > (res/actual/peak/limit) > Fragment 0:0 > {code} > Q2: return only one column in the result. > {code} > select n_nationkey as XYZ, n_regionkey as xyz FROM cp.`tpch/nation.parquet`; > +--+ > | XYZ | > +--+ > | 0| > | 1| > | 1| > | 1| > | 4| > | 0| > | 3| > {code} > The cause of the problem seems to be that the Project thinks the two incoming > columns as identical (since Drill adopts case-insensitive for column names in > execution). > The planner should make sure that the conflicting columns are resolved, since > execution is name-based. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-3559) Make filename available to sql statments just like dirN
[ https://issues.apache.org/jira/browse/DRILL-3559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-3559: - Reviewer: Krystal > Make filename available to sql statments just like dirN > --- > > Key: DRILL-3559 > URL: https://issues.apache.org/jira/browse/DRILL-3559 > Project: Apache Drill > Issue Type: Improvement > Components: SQL Parser >Affects Versions: 1.1.0 >Reporter: Stefán Baxter >Assignee: Arina Ielchiieva >Priority: Minor > Labels: doc-impacting > Fix For: 1.7.0 > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4693) Incorrect column ordering when CONVERT_FROM() json is used
[ https://issues.apache.org/jira/browse/DRILL-4693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4693: - Reviewer: Chun Chang > Incorrect column ordering when CONVERT_FROM() json is used > --- > > Key: DRILL-4693 > URL: https://issues.apache.org/jira/browse/DRILL-4693 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Relational Operators, Query Planning & > Optimization >Affects Versions: 1.6.0 >Reporter: Aman Sinha >Assignee: Aman Sinha > Fix For: 1.7.0 > > > For the following query, the column order in the results is wrong..it should > be col1, col2, col3. > {noformat} > 0: jdbc:drill:zk=local> select 'abc' as col1, convert_from('{"x" : "y"}', > 'json') as col2, 'xyz' as col3 from cp.`tpch/region.parquet`; > +---+---++ > | col1 | col3 |col2| > +---+---++ > | abc | xyz | {"x":"y"} | > | abc | xyz | {"x":"y"} | > | abc | xyz | {"x":"y"} | > | abc | xyz | {"x":"y"} | > | abc | xyz | {"x":"y"} | > +---+---++ > {noformat} > The EXPLAIN plan: > {noformat} > 0: jdbc:drill:zk=local> explain plan for select 'abc' as col1, > convert_from('{"x" : "y"}', 'json') as col2, 'xyz' as col3 from > cp.`tpch/region.parquet`; > +--+--+ > | text | json | > +--+--+ > | 00-00Screen > 00-01 Project(col1=['abc'], col2=[CONVERT_FROMJSON('{"x" : "y"}')], > col3=['xyz']) > 00-02Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath > [path=classpath:/tpch/region.parquet]], > selectionRoot=classpath:/tpch/region.parquet, numFiles=1, > usedMetadataFile=false, columns=[]]]) > {noformat} > This happens on current master branch as well as 1.6.0 and even earlier (I > checked 1.4.0 as well which also has the same behavior). So it is a > pre-existing bug. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4733) max(dir0) reading more columns than necessary
[ https://issues.apache.org/jira/browse/DRILL-4733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4733: - Reviewer: Chun Chang > max(dir0) reading more columns than necessary > - > > Key: DRILL-4733 > URL: https://issues.apache.org/jira/browse/DRILL-4733 > Project: Apache Drill > Issue Type: Bug > Components: Query Planning & Optimization, Storage - Parquet >Affects Versions: 1.7.0 >Reporter: Rahul Challapalli >Assignee: Arina Ielchiieva >Priority: Critical > Fix For: 1.7.0 > > Attachments: bug.tgz > > > The below query started to fail from this commit : > 3209886a8548eea4a2f74c059542672f8665b8d2 > {code} > select max(dir0) from dfs.`/drill/testdata/bug/2016`; > Error: UNSUPPORTED_OPERATION ERROR: Streaming aggregate does not support > schema changes > Fragment 0:0 > [Error Id: b0060205-e9a6-428a-9803-7b4312b2c6f4 on qa-node190.qa.lab:31010] > (state=,code=0) > {code} > The sub-folders contains files which do have schema change for one column > "contributions" (int32 vs double). However prior to this commit we did not > fail in the scenario. Log files and test data are attached -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (DRILL-4701) Fix log name and missing lines in logs on Web UI
[ https://issues.apache.org/jira/browse/DRILL-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4701: - Reviewer: Krystal > Fix log name and missing lines in logs on Web UI > > > Key: DRILL-4701 > URL: https://issues.apache.org/jira/browse/DRILL-4701 > Project: Apache Drill > Issue Type: Bug >Reporter: Arina Ielchiieva >Assignee: Arina Ielchiieva > Fix For: 1.7.0 > > > 1. When the log files are downloaded from the ui, the name of the downloaded > file is "download". We should save the file with the same name as the log > file (ie. drillbit.log) > 2. The last N lines of the log file displayed in the web UI do not match the > log file itself. Some lines are missing compared with actual log. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (DRILL-4770) ParquetRecordReader throws NPE querying a single int64 column file
Chun Chang created DRILL-4770: - Summary: ParquetRecordReader throws NPE querying a single int64 column file Key: DRILL-4770 URL: https://issues.apache.org/jira/browse/DRILL-4770 Project: Apache Drill Issue Type: Bug Components: Storage - Parquet Affects Versions: 1.8.0 Reporter: Chun Chang I have a parquet file with a single int64 column. {noformat} [root@perfnode166 parquet-mr]# java -jar parquet-tools/target/parquet-tools-1.8.2-SNAPSHOT.jar dump /mapr/drill50.perf.lab/drill/testdata/parquet_storage/int64_10_bs10k_ps1k_uncompressed.parquet row group 0 int64_field_required: INT64 UNCOMPRESSED DO:0 FPO:4 SZ:55/55/1.00 VC:10 [more]... int64_field_required TV=10 RL=0 DL=0 page 0: DLE:RLE RLE:RLE VLE:DELTA_BINARY_PACKED ST:[min: 0, max: [more]... VC:10 INT64 int64_field_required *** row group 1 of 1, values 1 to 10 *** value 1: R:0 D:0 V:0 value 2: R:0 D:0 V:1 value 3: R:0 D:0 V:2 value 4: R:0 D:0 V:3 value 5: R:0 D:0 V:4 value 6: R:0 D:0 V:5 value 7: R:0 D:0 V:6 value 8: R:0 D:0 V:7 value 9: R:0 D:0 V:8 value 10: R:0 D:0 V:9 {noformat} Drill version: {noformat} 0: jdbc:drill:schema=dfs.drillTestDir> select * from sys.version; +-+---+-++-++ | version | commit_id | commit_message |commit_time | build_email | build_time | +-+---+-++-++ | 1.8.0-SNAPSHOT | 05c42eae79ce3e309028b3824f9449b98e329f29 | DRILL-4707: Fix memory leak or incorrect query result in case two column names are case-insensitive identical. | 29.06.2016 @ 08:15:13 PDT | inram...@gmail.com | 07.07.2016 @ 10:50:40 PDT | +-+---+-++-++ 1 row selected (0.44 seconds) {noformat} drill throws NPE: {noformat} 2016-07-08 11:08:55,156 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO o.a.drill.exec.work.foreman.Foreman - Query text for query id 288013c7-f122-f6be-936e-c18ebe9b92ef: select * from dfs.`drill/testdata/parquet_storage/int64_10_bs10k_ps1k_uncompressed.parquet` 2016-07-08 11:08:55,292 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO o.a.d.exec.store.parquet.Metadata - Took 0 ms to get file statuses 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO o.a.d.exec.store.parquet.Metadata - Fetch parquet metadata: Executed 1 out of 1 using 1 threads. Time: 2ms total, 2.423069ms avg, 2ms max. 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO o.a.d.exec.store.parquet.Metadata - Fetch parquet metadata: Executed 1 out of 1 using 1 threads. Earliest start: 1.347000 μs, Latest start: 1.347000 μs, Average start: 1.347000 μs . 2016-07-08 11:08:55,295 [288013c7-f122-f6be-936e-c18ebe9b92ef:foreman] INFO o.a.d.exec.store.parquet.Metadata - Took 2 ms to read file metadata 2016-07-08 11:08:55,377 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State change requested AWAITING_ALLOCATION --> RUNNING 2016-07-08 11:08:55,377 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO o.a.d.e.w.f.FragmentStatusReporter - 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State to report: RUNNING 2016-07-08 11:08:55,386 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State change requested RUNNING --> FAILED 2016-07-08 11:08:55,386 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 288013c7-f122-f6be-936e-c18ebe9b92ef:0:0: State change requested FAILED --> FINISHED 2016-07-08 11:08:55,387 [288013c7-f122-f6be-936e-c18ebe9b92ef:frag:0:0] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: NullPointerException Fragment 0:0 [Error Id: 21fcc35b-6151-46b6-a750-0ce6f2141a7d on
[jira] [Updated] (DRILL-4588) Enable JMXReporter to Expose Metrics
[ https://issues.apache.org/jira/browse/DRILL-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Ollala updated DRILL-4588: - Reviewer: Krystal > Enable JMXReporter to Expose Metrics > > > Key: DRILL-4588 > URL: https://issues.apache.org/jira/browse/DRILL-4588 > Project: Apache Drill > Issue Type: Bug >Reporter: Sudheesh Katkam >Assignee: Sudheesh Katkam > Fix For: 1.7.0 > > > -There is a static initialization order issue that needs to be fixed.- > The code is commented out. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-3643) NTILE(0) returns RuntimeException
[ https://issues.apache.org/jira/browse/DRILL-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368107#comment-15368107 ] ASF GitHub Bot commented on DRILL-3643: --- Github user adeneche closed the pull request at: https://github.com/apache/drill/pull/124 > NTILE(0) returns RuntimeException > - > > Key: DRILL-3643 > URL: https://issues.apache.org/jira/browse/DRILL-3643 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Flow >Affects Versions: 1.2.0 > Environment: private-branch > https://github.com/adeneche/incubator-drill/tree/new-window-funcs >Reporter: Khurram Faraaz >Assignee: Deneche A. Hakim >Priority: Minor > Labels: window_function > Fix For: 1.2.0 > > > NTILE(0) returns Divide by zero error, on developers private branch > {code} > 0: jdbc:drill:schema=dfs.tmp> select col7 , col9 , ntile(0) over(order by > col0) lastVal_col9 from FEWRWSPQQ_101; > java.lang.RuntimeException: java.sql.SQLException: SYSTEM ERROR: > ArithmeticException: / by zero > Fragment 0:0 > [Error Id: 18d44cb4-f4f3-46eb-a69d-8ace607ef8fe on centos-03.qa.lab:31010] > at sqlline.IncrementalRows.hasNext(IncrementalRows.java:73) > at > sqlline.TableOutputFormat$ResizingRowsProvider.next(TableOutputFormat.java:87) > at sqlline.TableOutputFormat.print(TableOutputFormat.java:118) > at sqlline.SqlLine.print(SqlLine.java:1583) > at sqlline.Commands.execute(Commands.java:852) > at sqlline.Commands.sql(Commands.java:751) > at sqlline.SqlLine.dispatch(SqlLine.java:738) > at sqlline.SqlLine.begin(SqlLine.java:612) > at sqlline.SqlLine.start(SqlLine.java:366) > at sqlline.SqlLine.main(SqlLine.java:259) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367893#comment-15367893 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70099210 --- Diff: exec/java-exec/src/main/codegen/includes/parserImpls.ftl --- @@ -245,15 +247,17 @@ SqlNode SqlCreateTable() : * Parses a drop table statement. * DROP TABLE table_name; */ -SqlNode SqlDropTable() : +SqlNode SqlDropTableIfExists() : { SqlParserPos pos; +boolean tableExistenceCheck = false; } { { pos = getPos(); } +[ { tableExistenceCheck = true; } ] { -return new SqlDropTable(pos, CompoundIdentifier()); +return new SqlDropTable(pos, CompoundIdentifier(), tableExistenceCheck); --- End diff -- Done in a new commit. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367891#comment-15367891 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70098841 --- Diff: contrib/storage-hive/core/src/test/java/org/apache/drill/exec/fn/hive/TestInbuiltHiveUDFs.java --- @@ -84,4 +84,10 @@ public void testGetJsonObject() throws Exception { "first_name","Bh","last_name","Venkata","position","Store")) .go(); } + + @Test // DRILL-3272 + public void testIf() throws Exception { +printResult(testSqlWithResults("select `if`(1999 > 2000, 'latest', 'old') from hive.kv limit 1")); --- End diff -- Done in the new commit. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-4673) Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on command return
[ https://issues.apache.org/jira/browse/DRILL-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367889#comment-15367889 ] ASF GitHub Bot commented on DRILL-4673: --- Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/541#discussion_r70098588 --- Diff: exec/java-exec/src/main/codegen/includes/parserImpls.ftl --- @@ -199,15 +199,17 @@ SqlNode SqlCreateOrReplaceView() : * Parses a drop view statement. * DROP VIEW view_name; */ -SqlNode SqlDropView() : +SqlNode SqlDropViewIfExists() : { SqlParserPos pos; +boolean viewExistenceCheck = false; } { { pos = getPos(); } +[ { viewExistenceCheck = true; } ] { -return new SqlDropView(pos, CompoundIdentifier()); +return new SqlDropView(pos, CompoundIdentifier(), viewExistenceCheck); --- End diff -- I did accordingly to `SqlCreateView` (without `OrReplace`) but agree with new names. Done. > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command return > - > > Key: DRILL-4673 > URL: https://issues.apache.org/jira/browse/DRILL-4673 > Project: Apache Drill > Issue Type: New Feature > Components: Functions - Drill >Reporter: Vitalii Diravka >Assignee: Vitalii Diravka >Priority: Minor > Labels: drill > > Implement "DROP TABLE IF EXISTS" for drill to prevent FAILED status on > command "DROP TABLE" return if table doesn't exist. > The same for "DROP VIEW IF EXISTS" -- This message was sent by Atlassian JIRA (v6.3.4#6332)