[jira] [Created] (DRILL-6902) Extra limit operator is not needed

2018-12-12 Thread Robert Hou (JIRA)
Robert Hou created DRILL-6902:
-

 Summary: Extra limit operator is not needed
 Key: DRILL-6902
 URL: https://issues.apache.org/jira/browse/DRILL-6902
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 1.15.0
Reporter: Robert Hou
Assignee: Pritesh Maker


For TPCDS query 49, there is an extra limit operator that is not needed.

Here is the query:
{noformat}
SELECT 'web' AS channel, 
   web.item, 
   web.return_ratio, 
   web.return_rank, 
   web.currency_rank 
FROM   (SELECT item, 
   return_ratio, 
   currency_ratio, 
   Rank() 
 OVER ( 
   ORDER BY return_ratio)   AS return_rank, 
   Rank() 
 OVER ( 
   ORDER BY currency_ratio) AS currency_rank 
FROM   (SELECT ws.ws_item_sk   AS 
   item, 
   ( Cast(Sum(COALESCE(wr.wr_return_quantity, 0)) AS 
DEC(15, 
  4)) / 
 Cast( 
 Sum(COALESCE(ws.ws_quantity, 0)) AS DEC(15, 4)) ) AS 
   return_ratio, 
   ( Cast(Sum(COALESCE(wr.wr_return_amt, 0)) AS DEC(15, 4)) 
 / Cast( 
 Sum( 
 COALESCE(ws.ws_net_paid, 0)) AS DEC(15, 
 4)) ) AS 
   currency_ratio 
FROM   web_sales ws 
   LEFT OUTER JOIN web_returns wr 
ON ( ws.ws_order_number = 
wr.wr_order_number 
 AND ws.ws_item_sk = wr.wr_item_sk ), 
   date_dim 
WHERE  wr.wr_return_amt > 1 
   AND ws.ws_net_profit > 1 
   AND ws.ws_net_paid > 0 
   AND ws.ws_quantity > 0 
   AND ws_sold_date_sk = d_date_sk 
   AND d_year = 1999 
   AND d_moy = 12 
GROUP  BY ws.ws_item_sk) in_web) web 
WHERE  ( web.return_rank <= 10 
  OR web.currency_rank <= 10 ) 
UNION 
SELECT 'catalog' AS channel, 
   catalog.item, 
   catalog.return_ratio, 
   catalog.return_rank, 
   catalog.currency_rank 
FROM   (SELECT item, 
   return_ratio, 
   currency_ratio, 
   Rank() 
 OVER ( 
   ORDER BY return_ratio)   AS return_rank, 
   Rank() 
 OVER ( 
   ORDER BY currency_ratio) AS currency_rank 
FROM   (SELECT cs.cs_item_sk   AS 
   item, 
   ( Cast(Sum(COALESCE(cr.cr_return_quantity, 0)) AS 
DEC(15, 
  4)) / 
 Cast( 
 Sum(COALESCE(cs.cs_quantity, 0)) AS DEC(15, 4)) ) AS 
   return_ratio, 
   ( Cast(Sum(COALESCE(cr.cr_return_amount, 0)) AS DEC(15, 
4 
  )) / 
 Cast(Sum( 
 COALESCE(cs.cs_net_paid, 0)) AS DEC( 
 15, 4)) ) AS 
   currency_ratio 
FROM   catalog_sales cs 
   LEFT OUTER JOIN catalog_returns cr 
ON ( cs.cs_order_number = 
cr.cr_order_number 
 AND cs.cs_item_sk = cr.cr_item_sk ), 
   date_dim 
WHERE  cr.cr_return_amount > 1 
   AND cs.cs_net_profit > 1 
   AND cs.cs_net_paid > 0 
   AND cs.cs_quantity > 0 
   AND cs_sold_date_sk = d_date_sk 
   AND d_year = 1999 
   AND d_moy = 12 
GROUP  BY cs.cs_item_sk) in_cat) catalog 
WHERE  ( catalog.return_rank <= 10 
  OR catalog.currency_rank <= 10 ) 
UNION 
SELECT 'store' AS channel, 
   store.item, 
   store.return_ratio, 
   store.return_rank, 
   store.currency_rank 
FROM   (SELECT item, 
   return_ratio, 
   currency_ratio, 
   Rank() 
 OVER ( 
   ORDER BY return_ratio)   AS return_rank, 
   Rank() 
 OVER ( 
   ORDER BY currency_ratio) AS currency_rank 
FROM   (SELECT sts.ss_item_sk   AS 
   item, 
   ( Cast(Sum(COALESCE(sr.sr_return_quantity, 0)

[jira] [Created] (DRILL-6901) Move SchemaBuilder from test to main for use outside tests

2018-12-12 Thread Paul Rogers (JIRA)
Paul Rogers created DRILL-6901:
--

 Summary: Move SchemaBuilder from test to main for use outside tests
 Key: DRILL-6901
 URL: https://issues.apache.org/jira/browse/DRILL-6901
 Project: Apache Drill
  Issue Type: Improvement
Affects Versions: 1.14.0
Reporter: Paul Rogers
Assignee: Paul Rogers
 Fix For: 1.15.0


The {{SchemaBuilder}} class provides a fluent way to build a {{TupleMetadata}}. 
It currently resides in the test name space as it was original used only for 
tests. Move it into a "main" package for use by regular code.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (DRILL-6900) maven-release-plugin failure on the release:perform stage

2018-12-12 Thread Vitalii Diravka (JIRA)
Vitalii Diravka created DRILL-6900:
--

 Summary: maven-release-plugin failure on the release:perform stage
 Key: DRILL-6900
 URL: https://issues.apache.org/jira/browse/DRILL-6900
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Affects Versions: 1.14.0
Reporter: Vitalii Diravka
Assignee: Vitalii Diravka
 Fix For: 1.15.0


{{release:perform}} stage fails on current master branch. It blocs to prepare 
the release candidate: {code}
{{mvn release:perform 
-DconnectionUrl=scm:git:g...@github.com:vkorukanti/drill.git -DskipTests 
-Darguments="-Dgpg.passphrase=${GPG_PASSPHRASE} -DskipTests=true 
-DconnectionUrl=scm:git:g...@github.com:vkorukanti/drill.git"}}
{code}

{code}
[INFO] [ERROR] 
/home/vitalii/src/release/drill/target/checkout/contrib/storage-hive/core/src/main/java/org/apache/drill/exec/expr/fn/impl/hive/AbstractDrillPrimitiveObjectInspector.java:[26,47]
 error: cannot find symbol
[INFO]   symbol:   class PrimitiveTypeInfo
[INFO]   location: class AbstractDrillPrimitiveObjectInspector
[INFO] [INFO] 100 errors 
[INFO] [INFO] -
[INFO] [INFO] 

[INFO] [INFO] Reactor Summary for Apache Drill Root POM 1.15.0:
[INFO] [INFO] 
[INFO] [INFO] Apache Drill Root POM .. SUCCESS [ 
37.579 s]
[INFO] [INFO] tools/Parent Pom ... SUCCESS [  
6.360 s]
[INFO] [INFO] tools/freemarker codegen tooling ... SUCCESS [ 
26.726 s]
[INFO] [INFO] Drill Protocol . SUCCESS [ 
41.030 s]
[INFO] [INFO] Common (Logical Plan, Base expressions)  SUCCESS [ 
27.754 s]
[INFO] [INFO] Logical Plan, Base expressions . SUCCESS [ 
29.154 s]
[INFO] [INFO] exec/Parent Pom  SUCCESS [  
6.252 s]
[INFO] [INFO] exec/memory/Parent Pom . SUCCESS [  
6.074 s]
[INFO] [INFO] exec/memory/base ... SUCCESS [ 
24.909 s]
[INFO] [INFO] exec/rpc ... SUCCESS [ 
24.191 s]
[INFO] [INFO] exec/Vectors ... SUCCESS 
[01:50 min]
[INFO] [INFO] contrib/Parent Pom . SUCCESS [  
6.173 s]
[INFO] [INFO] contrib/data/Parent Pom  SUCCESS [  
6.058 s]
[INFO] [INFO] contrib/data/tpch-sample-data .. SUCCESS [ 
19.704 s]
[INFO] [INFO] exec/Java Execution Engine . SUCCESS 
[01:57 min]
[INFO] [INFO] exec/JDBC Driver using dependencies  SUCCESS [ 
32.619 s]
[INFO] [INFO] JDBC JAR with all dependencies . FAILURE 
[01:16 min]
[INFO] [INFO] Drill-on-YARN .. SKIPPED
[INFO] [INFO] contrib/kudu-storage-plugin  SKIPPED
[INFO] [INFO] contrib/opentsdb-storage-plugin  SKIPPED
[INFO] [INFO] contrib/mongo-storage-plugin ... SKIPPED
[INFO] [INFO] contrib/hbase-storage-plugin ... SKIPPED
[INFO] [INFO] contrib/jdbc-storage-plugin  SKIPPED
[INFO] [INFO] contrib/hive-storage-plugin/Parent Pom . SKIPPED
[INFO] [INFO] contrib/hive-storage-plugin/hive-exec-shaded ... SKIPPED
[INFO] [INFO] contrib/mapr-format-plugin . SKIPPED
[INFO] [INFO] contrib/hive-storage-plugin/core ... SKIPPED
[INFO] [INFO] contrib/kafka-storage-plugin ... SKIPPED
[INFO] [INFO] contrib/drill-udfs . SKIPPED
[INFO] [INFO] Packaging and Distribution Assembly  SKIPPED
[INFO] [INFO] 

[INFO] [INFO] BUILD FAILURE
[INFO] [INFO] 

[INFO] [INFO] Total time:  09:58 min
[INFO] [INFO] Finished at: 2018-12-12T19:18:08+02:00
[INFO] [INFO] 

[INFO] [ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.8.0:compile (default-compile) 
on project drill-storage-hive-core: Compilation failure: Compilation failure: 
[INFO] [ERROR] 
/home/vitalii/src/release/drill/target/checkout/contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveMetadataProvider.java:[32,36]
 error: package org.apache.hadoop.hive.common does not exist
[INFO] [ERROR] 
/home/vitalii/src/release/drill/target/checkout/contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveMetadataProvider.java:[33,34]
 error: package org.apache.hadoop.hive.conf does not exist
[INFO] [ERROR] 
/home/vitalii/src/relea

[jira] [Created] (DRILL-6899) Fix timestamp issues in unit tests ignored with DRILL-6833

2018-12-12 Thread Gautam Parai (JIRA)
Gautam Parai created DRILL-6899:
---

 Summary: Fix timestamp issues in unit tests ignored with DRILL-6833
 Key: DRILL-6899
 URL: https://issues.apache.org/jira/browse/DRILL-6899
 Project: Apache Drill
  Issue Type: Bug
Affects Versions: 1.15.0
Reporter: Gautam Parai
Assignee: Gautam Parai
 Fix For: 1.16.0


{{The following tests were disabled in the PR for DRILL-6833}}

{{IndexPlanTest.testCastTimestampPlan() - Re-enable after the MapRDB format 
plugin issue is fixed.}}
{{IndexPlanTest.testRowkeyJoinPushdown_13() - Re-enable the testcase after 
fixing the execution issue with HashJoin used as Rowkeyjoin.}}
{{IndexPlanTest.testRowkeyJoinPushdown_12() - Remove the testcase since the 
SemiJoin transformation makes the rowkeyjoinpushdown transformation invalid.}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (DRILL-6881) Hash-Table insert and probe: Compare hash values before keys

2018-12-12 Thread Boaz Ben-Zvi (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-6881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boaz Ben-Zvi updated DRILL-6881:

Fix Version/s: (was: 1.16.0)

> Hash-Table insert and probe: Compare hash values before keys
> 
>
> Key: DRILL-6881
> URL: https://issues.apache.org/jira/browse/DRILL-6881
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Execution - Relational Operators
>Affects Versions: 1.14.0
>Reporter: Boaz Ben-Zvi
>Assignee: Boaz Ben-Zvi
>Priority: Major
>
>   When checking for existence of a key in the hash table (during _put_ or 
> _probe_ operations), the value of that key is compared (using generated code) 
> with a potential match key (same bucket). 
>    This comparison is slightly expensive (e.g., long keys, multi column keys, 
> checking null conditions, NaN, etc). Instead, if the hash-values of the two 
> keys are compared first (at practically zero cost), then the costly 
> comparison can be avoided in case the hash values don't match.
>  This code change is trivial, and given that the relevant Hash-Table code is 
> *hot code*, then even minute improvements could add up.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6898) Web UI cannot be used without internet connection (jquery loaded from ajax.googleapis.com)

2018-12-12 Thread Arina Ielchiieva (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-6898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16719135#comment-16719135
 ] 

Arina Ielchiieva commented on DRILL-6898:
-

Might be fixed in https://issues.apache.org/jira/browse/DRILL-6776.

> Web UI cannot be used without internet connection (jquery loaded from 
> ajax.googleapis.com)
> --
>
> Key: DRILL-6898
> URL: https://issues.apache.org/jira/browse/DRILL-6898
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Web Server
>Affects Versions: 1.14.0
>Reporter: Paul Bormans
>Priority: Major
>
> When opening the web ui in an environment that does not have an internet 
> connection, then the jquery js library is not loaded and the website does not 
> function as it should.
> One solution can be to add a configuration option to use local/packages 
> javascript libraries iso loading these from a CDN.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (DRILL-6898) Web UI cannot be used without internet connection (jquery loaded from ajax.googleapis.com)

2018-12-12 Thread Paul Bormans (JIRA)
Paul Bormans created DRILL-6898:
---

 Summary: Web UI cannot be used without internet connection (jquery 
loaded from ajax.googleapis.com)
 Key: DRILL-6898
 URL: https://issues.apache.org/jira/browse/DRILL-6898
 Project: Apache Drill
  Issue Type: Improvement
  Components: Web Server
Affects Versions: 1.14.0
Reporter: Paul Bormans


When opening the web ui in an environment that does not have an internet 
connection, then the jquery js library is not loaded and the website does not 
function as it should.

One solution can be to add a configuration option to use local/packages 
javascript libraries iso loading these from a CDN.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (DRILL-6893) Invalid output for star and self-join queries for RDBMS Storage Plugin

2018-12-12 Thread Vitalii Diravka (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitalii Diravka updated DRILL-6893:
---
Description: 
Invalid output for star and self-join queries for RDBMS Storage Plugin:
{code:java}
0: jdbc:drill:zk=local> SELECT * FROM (SELECT * FROM 
mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t` INNER JOIN 
(SELECT * FROM mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t0` 
ON `t`.`PersonID` = `t0`.`PersonID` ;
+---+--+-+--++---+--+---+
| PersonID  | OrderID  | ItemID  | GroupID  | PersonID0  | OrderID0  | ItemID0  
| GroupID0  |
+---+--+-+--++---+--+---+
| 10| 10   | 10  | 10   | null   | null  | null 
| null  |
+---+--+-+--++---+--+---+
1 row selected (1.402 seconds)
0: jdbc:drill:zk=local> select * from sys.version;
+--+---+++++
| version  | commit_id |
 commit_message |commit_time |
build_email | build_time |
+--+---+++++
| 1.15.0-SNAPSHOT  | 100a68b314230d4cf327477f7a10f9c650720513  | DRILL-540: 
Allow querying hive views in Drill  | 30.11.2018 @ 10:50:46 EET  | 
vitalii.dira...@gmail.com  | 10.12.2018 @ 15:46:54 EET  |
+--+---+++++
1 row selected (0.302 seconds)
{code}
The same result in older 1.11.0 Drill version:
{code:java}
0: jdbc:drill:zk=local> SELECT * FROM (SELECT * FROM 
mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t` INNER JOIN 
(SELECT * FROM mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t0` 
ON `t`.`PersonID` = `t0`.`PersonID`;
+---+--+-+--++---+--+---+
| PersonID  | OrderID  | ItemID  | GroupID  | PersonID0  | OrderID0  | ItemID0  
| GroupID0  |
+---+--+-+--++---+--+---+
| 10| 10   | 10  | 10   | null   | null  | null 
| null  |
+---+--+-+--++---+--+---+
1 row selected (1.344 seconds)
0: jdbc:drill:zk=local> select * from sys.version;
+--+---+--+-+++
| version  | commit_id |
commit_message| commit_time |
build_email | build_time |
+--+---+--+-+++
| 1.11.0   | 4220fb2fffbc81883df3e5fea575fa0a584852b3  | [maven-release-plugin] 
prepare release drill-1.11.0  | 24.07.2017 @ 16:47:07 EEST  | 
vitalii.dira...@gmail.com  | 06.12.2018 @ 14:36:39 EET  |
+--+---+--+-+++
1 row selected (0.271 seconds)
{code}
But the same query in MySQL:
{code:java}
mysql> select * from `mscIdentities3` t1 join `mscIdentities3` t2 on 
t1.`PersonId` = t2.`PersonID` where t1.`PersonID` = 10;
+--+-++-+--+-++-+
| PersonID | OrderID | ItemID | GroupID | PersonID | OrderID | ItemID | GroupID 
|
+--+-++-+--+-++-+
|   10 |  10 | 10 |  10 |   10 |  10 | 10 |  10 
|
+--+-++-+--+-++-+
1 row in set (1.27 sec)
{code}
Drill physical plan for above query is:
{code:java}
00-00Screen : rowType = RecordType(INTEGER PersonID, INTEGER OrderID, 
INTEGER ItemID, INTEGER GroupID, INTEGER PersonID0, INTEGER OrderID0, INTEGER 
ItemID0, INTEGER GroupID0): rowcount = 15.0, cumulative cost = {31.5 rows, 
136.5 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 735
00-01

[jira] [Updated] (DRILL-6893) Invalid output for star and self-join queries for RDBMS Storage Plugin

2018-12-12 Thread Vitalii Diravka (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitalii Diravka updated DRILL-6893:
---
Summary: Invalid output for star and self-join queries for RDBMS Storage 
Plugin  (was: Invalid output for star and self-join queries for RDBS Storage 
Plugin)

> Invalid output for star and self-join queries for RDBMS Storage Plugin
> --
>
> Key: DRILL-6893
> URL: https://issues.apache.org/jira/browse/DRILL-6893
> Project: Apache Drill
>  Issue Type: Bug
>  Components: Storage - JDBC
>Affects Versions: 1.14.0
> Environment: mysql-5.7.23-0ubuntu0.18.04.1
> mysql-connector-java-5.1.39-bin.jar
>Reporter: Vitalii Diravka
>Priority: Major
> Fix For: Future
>
>
> Invalid output for star and self-join queries for RDBS Storage Plugin:
> {code}
> 0: jdbc:drill:zk=local> SELECT * FROM (SELECT * FROM 
> mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t` INNER JOIN 
> (SELECT * FROM mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t0` 
> ON `t`.`PersonID` = `t0`.`PersonID` ;
> +---+--+-+--++---+--+---+
> | PersonID  | OrderID  | ItemID  | GroupID  | PersonID0  | OrderID0  | 
> ItemID0  | GroupID0  |
> +---+--+-+--++---+--+---+
> | 10| 10   | 10  | 10   | null   | null  | null   
>   | null  |
> +---+--+-+--++---+--+---+
> 1 row selected (1.402 seconds)
> 0: jdbc:drill:zk=local> select * from sys.version;
> +--+---+++++
> | version  | commit_id |  
>commit_message |commit_time |
> build_email | build_time |
> +--+---+++++
> | 1.15.0-SNAPSHOT  | 100a68b314230d4cf327477f7a10f9c650720513  | DRILL-540: 
> Allow querying hive views in Drill  | 30.11.2018 @ 10:50:46 EET  | 
> vitalii.dira...@gmail.com  | 10.12.2018 @ 15:46:54 EET  |
> +--+---+++++
> 1 row selected (0.302 seconds)
> {code}
> The same result in older 1.11.0 Drill version:
> {code}
> 0: jdbc:drill:zk=local> SELECT * FROM (SELECT * FROM 
> mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t` INNER JOIN 
> (SELECT * FROM mysql.`testdb`.`mscIdentities3` WHERE `PersonID` = 10) AS `t0` 
> ON `t`.`PersonID` = `t0`.`PersonID`;
> +---+--+-+--++---+--+---+
> | PersonID  | OrderID  | ItemID  | GroupID  | PersonID0  | OrderID0  | 
> ItemID0  | GroupID0  |
> +---+--+-+--++---+--+---+
> | 10| 10   | 10  | 10   | null   | null  | null   
>   | null  |
> +---+--+-+--++---+--+---+
> 1 row selected (1.344 seconds)
> 0: jdbc:drill:zk=local> select * from sys.version;
> +--+---+--+-+++
> | version  | commit_id |
> commit_message| commit_time |
> build_email | build_time |
> +--+---+--+-+++
> | 1.11.0   | 4220fb2fffbc81883df3e5fea575fa0a584852b3  | 
> [maven-release-plugin] prepare release drill-1.11.0  | 24.07.2017 @ 16:47:07 
> EEST  | vitalii.dira...@gmail.com  | 06.12.2018 @ 14:36:39 EET  |
> +--+---+--+-+++
> 1 row selected (0.271 seconds)
> {code}
> But the same query in MySQL:
> {code}
> mysql> select * from `mscIdentities3` t1 join `mscIdentities3` t2 on 
> t1.`PersonId` = t2.

[jira] [Commented] (DRILL-6878) Index Plan Test failures after DRILL-6833

2018-12-12 Thread Vitalii Diravka (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718911#comment-16718911
 ] 

Vitalii Diravka commented on DRILL-6878:


Merged to Apache Drill master branch with commit id b954c57c

> Index Plan Test failures after DRILL-6833
> -
>
> Key: DRILL-6878
> URL: https://issues.apache.org/jira/browse/DRILL-6878
> Project: Apache Drill
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Gautam Parai
>Assignee: Gautam Parai
>Priority: Blocker
>  Labels: ready-to-commit
> Fix For: 1.15.0
>
>
> [ERROR] Failures: 
> [ERROR] IndexPlanTest.testCastTimestampPlan:1148 Incorrect number of rows 
> returned by query. expected:<1> but was:<0>
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_1:1702 Did not find expected 
> pattern in plan: RowKeyJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $2)], joinType=[inner], semi-join: =[true])
> 00-05 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')])
> 00-06 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`], maxwidth=2]])
> 00-04 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=($0 =
> {"$date":"2013-02-04T22:34:38.000Z"}
> ), indexName=hash_i_cast_timestamp_firstlogin], columns=[`_id`], 
> maxwidth=10]])
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_10:1883 Incorrect number of rows 
> returned by query. expected:<1> but was:<0>
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_12:1918 Did not find expected 
> pattern in plan: HashJoin(.*[
> ])+.*Scan.*indexName=i_state_city_dl(.*[
> ])+.*RowKeyJoin(.*[
> ])+.*RestrictedJsonTableGroupScan(.*[
> ])+.*HashAgg(group=[\{0}])(.*[
> ])+.*HashJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $5)], joinType=[inner], semi-join: =[true])
> 00-05 HashJoin(condition=[=($2, $3)], joinType=[inner], semi-join: =[false])
> 00-08 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')], ITEM2=[ITEM($2, 'city')])
> 00-11 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`, `address`.`city`], maxwidth=2]])
> 00-07 Project(ITEM0=[$0], ITEM1=[$1])
> 00-10 Project(ITEM=[ITEM($0, 'city')], ITEM1=[ITEM($0, 'state')])
> 00-14 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=(address.state = "pc"), indexName=i_state_city_dl], 
> columns=[`address`.`city`, `address`.`state`], maxwidth=1]])
> 00-04 Project(EXPR$0=[CAST(CAST($0):INTEGER):VARCHAR(10) CHARACTER SET 
> "UTF-16LE" COLLATE "UTF-16LE$en_US$primary"])
> 00-06 Project(rowid=[$1], ITEM=[$2], ITEM2=[$3], ITEM0=[$0])
> 00-09 HashJoin(condition=[=($2, $0)], joinType=[inner], semi-join: =[false])
> 00-13 Project(ITEM=[ITEM($0, 'city')])
> 00-16 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`address`.`city`], maxwidth=2]])
> 00-12 Project(rowid=[$0], ITEM0=[$1], ITEM2=[$2])
> 00-15 SelectionVectorRemover
> 00-17 Filter(condition=[=(CAST($2):TIMESTAMP(0), 2013-02-04 22:34:38)])
> 00-18 Project(rowid=[$0], ITEM=[ITEM($1, 'city')], ITEM2=[ITEM(ITEM($2, 
> 'irs'), 'firstlogin')])
> 00-19 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`rowid`, `address`.`city`, 
> `activity`.`irs`.`firstlogin`], maxwidth=2]])
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_6:1799 Did not find expected 
> pattern in plan: RowKeyJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $2)], joinType=[inner], semi-join: =[true])
> 00-05 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')])
> 00-07 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`], maxwidth=2]])
> 00-04 Project(EXPR$0=[CAST(CAST($0):INTEGER):VARCHAR(10) CHARACTER SET 
> "UTF-16LE" COLLATE "UTF-16LE$en_US$primary"])
> 00-06 Project(rowid=[$1], ITEM=[$2], ITEM2=[$3], ITEM0=[$0])
> 00-08 HashJoin(condition=[=($2, $0)], joinType=[inner], semi-join: =[false])
> 00-10 Project(ITEM=[ITEM($0, 'city')])
> 00-12 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=Js

[jira] [Updated] (DRILL-6878) Index Plan Test failures after DRILL-6833

2018-12-12 Thread Vitalii Diravka (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitalii Diravka updated DRILL-6878:
---
Labels: ready-to-commit  (was: )

> Index Plan Test failures after DRILL-6833
> -
>
> Key: DRILL-6878
> URL: https://issues.apache.org/jira/browse/DRILL-6878
> Project: Apache Drill
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Gautam Parai
>Assignee: Gautam Parai
>Priority: Blocker
>  Labels: ready-to-commit
> Fix For: 1.15.0
>
>
> [ERROR] Failures: 
> [ERROR] IndexPlanTest.testCastTimestampPlan:1148 Incorrect number of rows 
> returned by query. expected:<1> but was:<0>
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_1:1702 Did not find expected 
> pattern in plan: RowKeyJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $2)], joinType=[inner], semi-join: =[true])
> 00-05 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')])
> 00-06 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`], maxwidth=2]])
> 00-04 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=($0 =
> {"$date":"2013-02-04T22:34:38.000Z"}
> ), indexName=hash_i_cast_timestamp_firstlogin], columns=[`_id`], 
> maxwidth=10]])
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_10:1883 Incorrect number of rows 
> returned by query. expected:<1> but was:<0>
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_12:1918 Did not find expected 
> pattern in plan: HashJoin(.*[
> ])+.*Scan.*indexName=i_state_city_dl(.*[
> ])+.*RowKeyJoin(.*[
> ])+.*RestrictedJsonTableGroupScan(.*[
> ])+.*HashAgg(group=[\{0}])(.*[
> ])+.*HashJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $5)], joinType=[inner], semi-join: =[true])
> 00-05 HashJoin(condition=[=($2, $3)], joinType=[inner], semi-join: =[false])
> 00-08 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')], ITEM2=[ITEM($2, 'city')])
> 00-11 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`, `address`.`city`], maxwidth=2]])
> 00-07 Project(ITEM0=[$0], ITEM1=[$1])
> 00-10 Project(ITEM=[ITEM($0, 'city')], ITEM1=[ITEM($0, 'state')])
> 00-14 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=(address.state = "pc"), indexName=i_state_city_dl], 
> columns=[`address`.`city`, `address`.`state`], maxwidth=1]])
> 00-04 Project(EXPR$0=[CAST(CAST($0):INTEGER):VARCHAR(10) CHARACTER SET 
> "UTF-16LE" COLLATE "UTF-16LE$en_US$primary"])
> 00-06 Project(rowid=[$1], ITEM=[$2], ITEM2=[$3], ITEM0=[$0])
> 00-09 HashJoin(condition=[=($2, $0)], joinType=[inner], semi-join: =[false])
> 00-13 Project(ITEM=[ITEM($0, 'city')])
> 00-16 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`address`.`city`], maxwidth=2]])
> 00-12 Project(rowid=[$0], ITEM0=[$1], ITEM2=[$2])
> 00-15 SelectionVectorRemover
> 00-17 Filter(condition=[=(CAST($2):TIMESTAMP(0), 2013-02-04 22:34:38)])
> 00-18 Project(rowid=[$0], ITEM=[ITEM($1, 'city')], ITEM2=[ITEM(ITEM($2, 
> 'irs'), 'firstlogin')])
> 00-19 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`rowid`, `address`.`city`, 
> `activity`.`irs`.`firstlogin`], maxwidth=2]])
> [ERROR] IndexPlanTest.testRowkeyJoinPushdown_6:1799 Did not find expected 
> pattern in plan: RowKeyJoin
> 00-00 Screen
> 00-01 Project(ssn=[$0])
> 00-02 Project(ssn=[$1])
> 00-03 HashJoin(condition=[=($0, $2)], joinType=[inner], semi-join: =[true])
> 00-05 Project(_id=[$0], ITEM=[ITEM($1, 'ssn')])
> 00-07 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], columns=[`_id`, `id`.`ssn`], maxwidth=2]])
> 00-04 Project(EXPR$0=[CAST(CAST($0):INTEGER):VARCHAR(10) CHARACTER SET 
> "UTF-16LE" COLLATE "UTF-16LE$en_US$primary"])
> 00-06 Project(rowid=[$1], ITEM=[$2], ITEM2=[$3], ITEM0=[$0])
> 00-08 HashJoin(condition=[=($2, $0)], joinType=[inner], semi-join: =[false])
> 00-10 Project(ITEM=[ITEM($0, 'city')])
> 00-12 Scan(table=[[hbase, index_test_primary]], groupscan=[JsonTableGroupScan 
> [ScanSpec=JsonScanSpec [tableName=maprfs:///tmp/index_test_primary, 
> condition=null], column

[jira] [Commented] (DRILL-6879) Indicate a warning in the WebUI when a query makes little to no progress for a while

2018-12-12 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-6879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718815#comment-16718815
 ] 

ASF GitHub Bot commented on DRILL-6879:
---

arina-ielchiieva commented on a change in pull request #1572: DRILL-6879: Show 
warnings for potential performance issues
URL: https://github.com/apache/drill/pull/1572#discussion_r240975519
 
 

 ##
 File path: 
exec/java-exec/src/main/java/org/apache/drill/exec/server/rest/profile/ProfileWrapper.java
 ##
 @@ -73,6 +73,10 @@ public ProfileWrapper(final QueryProfile profile, 
DrillConfig drillConfig) {
 final List majors = new 
ArrayList<>(profile.getFragmentProfileList());
 Collections.sort(majors, Comparators.majorId);
 
+//Setting warning thresholds for performance-degrading queries (DRILL-6879)
 
 Review comment:
   Do not use static methods, just pass drillConfig in the constructor for each 
of there classes and set warning threshold value during instance creation.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Indicate a warning in the WebUI when a query makes little to no progress for 
> a while
> 
>
> Key: DRILL-6879
> URL: https://issues.apache.org/jira/browse/DRILL-6879
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Execution - Monitoring, Web Server
>Affects Versions: 1.14.0
>Reporter: Kunal Khatua
>Assignee: Kunal Khatua
>Priority: Major
>  Labels: user-experience
> Fix For: 1.16.0
>
> Attachments: image-2018-12-04-11-54-54-247.png, 
> image-2018-12-06-11-19-00-339.png, image-2018-12-06-11-27-14-719.png
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> When running a very large query on a cluster with limited resource, we 
> noticed that one of the node's VM thread freezes the fragment threads as it 
> tries to do some work (GC perhaps?). This is a clear indication that the 
> query is stuck in a weird state where it might not recover from.
>  Under such circumstances, it makes sense to cancel or atleast warn the user 
> on that page of the query exceeding a certain threshold. 
>  For detecting this, the user will find that the {{Last Progress}} column in 
> the Fragments Overview section will show large times.
> !image-2018-12-04-11-54-54-247.png|width=969,height=336!
> In addition, there are instances where a query might have buffered operators 
> spilling to disk, which also hits performance (and, subsequently, longer run 
> times). Calling out this skew can be very useful.
> !image-2018-12-06-11-27-14-719.png|width=969,height=256!  
> Or there might be cases where a single fragment takes much longer than the 
> average (indicated by an extreme skew in the Gantt chart).
> !image-2018-12-06-11-19-00-339.png|width=969,height=150!
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)