Hi
With using phoenix-4.8.1-HBase-1.1-bin I have problems doing a join with hints.
Doing a select with a join on two tables when trying to use multiple hints
phoenix does include them in the execution plan:
Should be: CLIENT 1-CHUNK 0 ROWS 0 BYTES PARALLEL 1-WAY ROUND ROBIN RANGE SCAN
OVER C12_SIDR_CDR_20160820_INDEX ['293400115278890'] )
For example:
SELECT /*+ INDEX(SIDR_CDR_20160820 c12_SIDR_CDR_20160820_INDEX),
USE_SORT_MERGE_JOIN */
crm.c5, crm.c6,crm.c7, crm.c10, crm.c11, cdr.c2, cdr.c4, cdr.c7, cdr.c8,
cdr.c9, cdr.c10, cdr.c11, cdr.c12, cdr.c5, cdr.c6, cdr.c19, cdr.c20, cdr.c15,
cdr.c16, cdr.c13, cdr.c5
FROM
SIDR_CDR_20160820 cdr,
SIDR_CRM crm
WHERE
((cdr.c8 IN (1, 130, 29, 3, 2, 131, 31, 132, 30, 133, 136, 137, 135, 134,
150))
AND (cdr.c12 = '293400115278890'))
AND (cdr.c2 <= TO_DATE('20.08.2016 23:59:59', 'dd.MM.yyyy HH:mm:ss'))
AND crm.c1 = '0' || SUBSTR(cdr.c5, 4)
AND cdr.c2 >= crm.c2
AND (cdr.c2 < crm.c3 OR crm.c3 IS NULL);
+----------------------------------------------------------------------------------------------------------------------------------------------------------------+
| SORT-MERGE-JOIN (INNER) TABLES
|
| CLIENT 794-CHUNK 41607713 ROWS 20709402545 BYTES PARALLEL 794-WAY FULL
SCAN OVER SIDR_CDR_20160820
|
| SERVER FILTER BY (C8 IN
(1,2,3,29,30,31,130,131,132,133,134,135,136,137,150) AND C12 =
'293400115278890' AND C2 <= TIMESTAMP '2016-08-20 23:59:59.000' |
| SERVER SORTED BY [('0' || SUBSTR(CDR.C5, 4, null))]
|
| CLIENT MERGE SORT
|
| AND
|
| CLIENT 35-CHUNK 1864177 ROWS 865075995 BYTES PARALLEL 1-WAY FULL SCAN
OVER SIDR_CRM
|
| CLIENT FILTER BY (CDR.C2 >= CRM.C2 AND (CDR.C2 < CRM.C3 OR CRM.C3 IS NULL))
|
+----------------------------------------------------------------------------------------------------------------------------------------------------------------+
When further investigating this, phoenix I think has two errors:
1. This is a query without a join only selecting from SIDR_CDR_20160820
cdr. I am not able to mix a real table name (for hints) and synonime (for
select), the second query has the correct execution plan
explain SELECT /*+ INDEX(SIDR_CDR_20160820 c12_SIDR_CDR_20160820_INDEX)*/
cdr.c2, cdr.c4, cdr.c7, cdr.c8, cdr.c9, cdr.c10, cdr.c11, cdr.c12, cdr.c5,
cdr.c6, cdr.c19, cdr.c20, cdr.c15, cdr.c16, cdr.c13, cdr.c5 FROM
SIDR_CDR_20160820 cdr WHERE ((cdr.c8 IN (1, 130, 29, 3, 2, 131, 31, 132, 30,
133, 136, 137, 135, 134, 150)) AND (cdr.c12 = '293400115278890')) AND (cdr.c2
<= TO_DATE('20.08.2016 23:59:59', 'dd.MM.yyyy HH:mm:ss'));
+----------------------------------------------------------------------------------------------------------------------------------------------------------------+
| CLIENT 794-CHUNK 41607713 ROWS 20709402545 BYTES PARALLEL 1-WAY ROUND ROBIN
FULL SCAN OVER SIDR_CDR_20160820
|
| SERVER FILTER BY ((C8 IN
(1,2,3,29,30,31,130,131,132,133,134,135,136,137,150) AND C12 =
'293400115278890') AND C2 <= TIMESTAMP '2016-08-20 23:59:59.000') |
+----------------------------------------------------------------------------------------------------------------------------------------------------------------+
A correct execution plan with the same query only with a changed hint /*+
INDEX(cdr c12_SIDR_CDR_20160820_INDEX)*/
0: jdbc:phoenix:lxdr1> explain SELECT /*+ INDEX(cdr
c12_SIDR_CDR_20160820_INDEX)*/ cdr.c2, cdr.c4, cdr.c7, cdr.c8, cdr.c9, cdr.c10,
cdr.c11, cdr.c12, cdr.c5, cdr.c6, cdr.c19, cdr.c20, cdr.c15, cdr.c16, cdr.c13,
cdr.c5 FROM SIDR_CDR_20160820 cdr WHERE ((cdr.c8 IN (1, 130, 29, 3, 2, 131, 31,
132, 30, 133, 136, 137, 135, 134, 150)) AND (cdr.c12 = '293400115278890')) AND
(cdr.c2 <= TO_DATE('20.08.2016 23:59:59', 'dd.MM.yyyy HH:mm:ss'));
+-----------------------------------------------------------------------------------------------------------------------------------+
| PLAN
|
+-----------------------------------------------------------------------------------------------------------------------------------+
| CLIENT 794-CHUNK 41607713 ROWS 20709402545 BYTES PARALLEL 1-WAY ROUND ROBIN
FULL SCAN OVER SIDR_CDR_20160820 |
| SERVER FILTER BY C8 IN
(1,2,3,29,30,31,130,131,132,133,134,135,136,137,150)
|
| SKIP-SCAN-JOIN TABLE 0
|
| CLIENT 1-CHUNK 0 ROWS 0 BYTES PARALLEL 1-WAY ROUND ROBIN RANGE SCAN
OVER C12_SIDR_CDR_20160820_INDEX ['293400115278890'] |
| SERVER FILTER BY FIRST KEY ONLY AND "C2" <= TIMESTAMP '2016-08-20
23:59:59.000' |
| DYNAMIC SERVER FILTER BY ("CDR.C1", "CDR.C2", "CDR.C3", "CDR.C4") IN
(($422.$424, $422.$425, $422.$426, $422.$427)) |
+-----------------------------------------------------------------------------------------------------------------------------------+
2. Phoenix throws an exception parsing two hints at least with a join:
The hint
SELECT /*+ INDEX(cdr c12_SIDR_CDR_20160820_INDEX), USE_SORT_MERGE_JOIN */
0: jdbc:phoenix:lxdr1> explain SELECT /*+ INDEX(cdr
c12_SIDR_CDR_20160820_INDEX), USE_SORT_MERGE_JOIN */ crm.c5, crm.c6,crm.c7,
crm.c10, crm.c11, cdr.c2, cdr.c4, cdr.c7, cdr.c8, cdr.c9, cdr.c10, cdr.c11,
cdr.c12, cdr.c5, cdr.c6, cdr.c19, cdr.c20, cdr.c15, cdr.c16, cdr.c13, cdr.c5
FROM SIDR_CDR_20160820 cdr, SIDR_CRM crm WHERE ((cdr.c8 IN (1, 130, 29, 3, 2,
131, 31, 132, 30, 133, 136, 137, 135, 134, 150)) AND (cdr.c12 =
'293400115278890')) AND (cdr.c2 <= TO_DATE('20.08.2016 23:59:59', 'dd.MM.yyyy
HH:mm:ss')) AND crm.c1 = '0' || SUBSTR(cdr.c5, 4) AND cdr.c2 >= crm.c2 AND
(cdr.c2 < crm.c3 OR crm.c3 IS NULL);
java.lang.NullPointerException
at
org.apache.phoenix.parse.IndexExpressionParseNodeRewriter.<init>(IndexExpressionParseNodeRewriter.java:43)
at
org.apache.phoenix.compile.JoinCompiler.optimize(JoinCompiler.java:1250)
at
org.apache.phoenix.compile.QueryCompiler.compileSelect(QueryCompiler.java:194)
at
org.apache.phoenix.compile.QueryCompiler.compile(QueryCompiler.java:157)
at
org.apache.phoenix.jdbc.PhoenixStatement$ExecutableSelectStatement.compilePlan(PhoenixStatement.java:406)
at
org.apache.phoenix.jdbc.PhoenixStatement$ExecutableSelectStatement.compilePlan(PhoenixStatement.java:380)
at
org.apache.phoenix.jdbc.PhoenixStatement$ExecutableExplainStatement.compilePlan(PhoenixStatement.java:465)
at
org.apache.phoenix.jdbc.PhoenixStatement$ExecutableExplainStatement.compilePlan(PhoenixStatement.java:445)
at
org.apache.phoenix.jdbc.PhoenixStatement$1.call(PhoenixStatement.java:271)
at
org.apache.phoenix.jdbc.PhoenixStatement$1.call(PhoenixStatement.java:266)
at org.apache.phoenix.call.CallRunner.run(CallRunner.java:53)
at
org.apache.phoenix.jdbc.PhoenixStatement.executeQuery(PhoenixStatement.java:265)
at
org.apache.phoenix.jdbc.PhoenixStatement.execute(PhoenixStatement.java:1446)
at sqlline.Commands.execute(Commands.java:822)
at sqlline.Commands.sql(Commands.java:732)
at sqlline.SqlLine.dispatch(SqlLine.java:807)
at sqlline.SqlLine.begin(SqlLine.java:681)
at sqlline.SqlLine.start(SqlLine.java:398)
at sqlline.SqlLine.main(SqlLine.java:292)
I will appreciate any help.
Regards, Gasper