[jira] [Created] (IMPALA-10726) cdpd master compilation failing

2021-06-02 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-10726:
--

 Summary: cdpd master compilation failing
 Key: IMPALA-10726
 URL: https://issues.apache.org/jira/browse/IMPALA-10726
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa


We have compilation error in the latest cdpd build:
{code:java}
08:45:44 [ERROR] 
/data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[100,44]
 cannot find symbol
08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
08:45:44 [ERROR]   location: package org.apache.hadoop.hive.metastore.api
08:45:44 [ERROR] 
/data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[2840,48]
 cannot find symbol
08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
08:45:44 [ERROR]   location: class 
org.apache.impala.catalog.metastore.MetastoreServiceHandler{code}
See job for more info:

https://master-03.jenkins.cloudera.com/job/impala-cdpd-master-core/55/testReport/junit/generate_junitxml/build/a0f544c01ed359a6c400cd6ffb6cba97/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-10726) cdpd master FE compilation failing

2021-06-02 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-10726:
---
Summary: cdpd master FE compilation failing  (was: cdpd master compilation 
failing)

> cdpd master FE compilation failing
> --
>
> Key: IMPALA-10726
> URL: https://issues.apache.org/jira/browse/IMPALA-10726
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Priority: Blocker
>  Labels: build-failure
>
> We have compilation error in the latest cdpd build:
> {code:java}
> 08:45:44 [ERROR] 
> /data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[100,44]
>  cannot find symbol
> 08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
> 08:45:44 [ERROR]   location: package org.apache.hadoop.hive.metastore.api
> 08:45:44 [ERROR] 
> /data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[2840,48]
>  cannot find symbol
> 08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
> 08:45:44 [ERROR]   location: class 
> org.apache.impala.catalog.metastore.MetastoreServiceHandler{code}
> See job for more info:
> https://master-03.jenkins.cloudera.com/job/impala-cdpd-master-core/55/testReport/junit/generate_junitxml/build/a0f544c01ed359a6c400cd6ffb6cba97/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-10726) cdpd master FE compilation failing

2021-06-02 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa closed IMPALA-10726.
--
Resolution: Duplicate

> cdpd master FE compilation failing
> --
>
> Key: IMPALA-10726
> URL: https://issues.apache.org/jira/browse/IMPALA-10726
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Priority: Blocker
>  Labels: build-failure
>
> We have compilation error in the latest cdpd build:
> {code:java}
> 08:45:44 [ERROR] 
> /data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[100,44]
>  cannot find symbol
> 08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
> 08:45:44 [ERROR]   location: package org.apache.hadoop.hive.metastore.api
> 08:45:44 [ERROR] 
> /data0/jenkins/workspace/impala-cdpd-master-core/repos/Impala/fe/src/main/java/org/apache/impala/catalog/metastore/MetastoreServiceHandler.java:[2840,48]
>  cannot find symbol
> 08:45:44 [ERROR]   symbol:   class GetOpenTxnsRequest
> 08:45:44 [ERROR]   location: class 
> org.apache.impala.catalog.metastore.MetastoreServiceHandler{code}
> See job for more info:
> https://master-03.jenkins.cloudera.com/job/impala-cdpd-master-core/55/testReport/junit/generate_junitxml/build/a0f544c01ed359a6c400cd6ffb6cba97/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-10733) TestKuduOperations.test_replica_selection failing

2021-06-07 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-10733:
--

 Summary: TestKuduOperations.test_replica_selection failing
 Key: IMPALA-10733
 URL: https://issues.apache.org/jira/browse/IMPALA-10733
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa
Assignee: Wenzhe Zhou


Noticed in the recent builds these failures:
{code:java}
query_test/test_kudu.py:567: in test_replica_selection
assert cursor.fetchall() == [(100,)]
E   assert [(85,)] == [(100,)]
E At index 0 diff: (85,) != (100,)
E Use -v to get the full diff
{code}

Job: https://master-03.jenkins.cloudera.com/job/impala-cdh-7.1-maint-core-s3/23/

[~wzhou] I'm assigning this to you, since you had relevant commits when this 
failure appeared. Could you please check?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-10734) Failure in webserver.test_web_pages.TestWebPage.test_query_stmt

2021-06-07 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-10734:
--

 Summary: Failure in 
webserver.test_web_pages.TestWebPage.test_query_stmt 
 Key: IMPALA-10734
 URL: https://issues.apache.org/jira/browse/IMPALA-10734
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa


Noticed exception in recent exhaustive run:

{code:java}
webserver/test_web_pages.py:469: in test_query_stmt
assert check_if_contains, "No matching statement found in the jsons."
E   AssertionError: No matching statement found in the jsons.
E   assert False
{code}

Job: 
https://master-03.jenkins.cloudera.com/job/impala-cdpd-master-exhaustive/41/




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10621) TestSpillingDebugActionDimensions.test_spilling_large_rows hanging

2021-06-07 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17358842#comment-17358842
 ] 

Norbert Luksa commented on IMPALA-10621:


Not hanging, but the test failed now with the following exception. Maybe it's 
relevant?

{code:java}
query_test/test_spilling.py:83: in test_spilling_large_rows
self.run_test_case('QueryTest/spilling-large-rows', vector, unique_database)
common/impala_test_suite.py:709: in run_test_case
self.__verify_results_and_errors(vector, test_section, result, use_db)
common/impala_test_suite.py:545: in __verify_results_and_errors
replace_filenames_with_placeholder)
common/test_result_verifier.py:469: in verify_raw_results
VERIFIER_MAP[verifier](expected, actual)
common/test_result_verifier.py:278: in verify_query_result_is_equal
assert expected_results == actual_results
E   assert Comparing QueryTestResults (expected vs actual):
E '0',10 == '0',10
E '0',1000 == '0',1000
E '1',10 == '1',10
E '1',1000 == '1',1000
E '2',10 == '2',10
E '2',1000 == '2',1000
E '3',10 == '3',10
E '3',1000 == '3',1000
E '4',10 == '4',10
E '4',1000 == '4',1000
E '5',10 == '5',10
E '5',1000 != '6',10
E '6',10 != '6',1000
E '6',1000 != '7',10
E '7',10 != '8',10
E '7',1000 != '8',1000
E '8',10 != '9',10
E '8',1000 != '9',1000
E '9',10 != None
E '9',1000 != None
E Number of rows returned (expected vs actual): 20 != 18
{code}

Job: 
https://master-03.jenkins.cloudera.com/job/impala-asf-master-exhaustive-release/43/

> TestSpillingDebugActionDimensions.test_spilling_large_rows hanging
> --
>
> Key: IMPALA-10621
> URL: https://issues.apache.org/jira/browse/IMPALA-10621
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
>
> Saw a possible haning in test_spilling_large_rows in a downstream exhaustive 
> release build. The query didn't finish in 12mins:
> {code:java}
> query_test/test_spilling.py:83: in test_spilling_large_rows
> self.run_test_case('QueryTest/spilling-large-rows', vector, 
> unique_database)
> common/impala_test_suite.py:665: in run_test_case
> result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
> common/impala_test_suite.py:603: in __exec_in_impala
> result = self.__execute_query(target_impalad_client, query, user=user)
> common/impala_test_suite.py:923: in __execute_query
> return impalad_client.execute(query, user=user)
> common/impala_connection.py:205: in execute
> return self.__beeswax_client.execute(sql_stmt, user=user)
> beeswax/impala_beeswax.py:205: in execute
> result = self.fetch_results(query_string, handle)
> beeswax/impala_beeswax.py:452: in fetch_results
> exec_result = self.__fetch_results(query_handle, max_rows)
> beeswax/impala_beeswax.py:463: in __fetch_results
> results = self.__do_rpc(lambda: self.imp_service.fetch(handle, False, 
> fetch_rows))
> beeswax/impala_beeswax.py:520: in __do_rpc
> raise ImpalaBeeswaxException(self.__build_error_message(b), b)
> E   ImpalaBeeswaxException: ImpalaBeeswaxException:
> EINNER EXCEPTION: 
> EMESSAGE: Query b94d5cd2894cbe68:68acfd63 cancelled due to 
> unresponsive backend: 127.0.0.1:27000 has not sent a report in 753530ms (max 
> allowed lag is 72ms) {code}
> impalad.INFO indicates it's waiting for the coordinator fragment to finish:
> {code:java}
> I0325 23:54:16.146754 16521 Frontend.java:1600] 
> b94d5cd2894cbe68:68acfd63] Analyzing query: select 
> group_concat(string_col), length(bigstr) from bigstrs2
> group by bigstr db: test_spilling_large_rows_6df9f183
> I0325 23:54:16.147089 16521 BaseAuthorizationChecker.java:110] 
> b94d5cd2894cbe68:68acfd63] Authorization check took 0 ms
> I0325 23:54:16.147109 16521 Frontend.java:1643] 
> b94d5cd2894cbe68:68acfd63] Analysis and authorization finished.
> I0325 23:54:16.148593 20605 admission-controller.cc:1747] 
> b94d5cd2894cbe68:68acfd63] Trying to admit 
> id=b94d5cd2894cbe68:68acfd63 in pool_name=default-pool 
> executor_group_name=default per_host_mem_estimate=309.98 MB 
> dedicated_coord_mem_estimate=134.99 MB max_requests=-1 max_queued=200 
> max_mem=-1.00 B
> I0325 23:54:16.148609 20605 admission-controller.cc:1755] 
> b94d5cd2894cbe68:68acfd63] Stats: agg_num_running=0, 
> agg_num_queued=0, agg_mem_reserved=1.00 GB,  local_host(local_mem_admitted=0, 
> num_admitted_running=0, num_queued=0, backend_mem_reserved=0, 
> topN_query_stats: queries=[3e42ce0b9

[jira] [Assigned] (IMPALA-10922) test_orc_stats failing on exhaustive builds

2021-09-21 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-10922:
--

Assignee: Quanlong Huang  (was: Norbert Luksa)

> test_orc_stats failing on exhaustive builds
> ---
>
> Key: IMPALA-10922
> URL: https://issues.apache.org/jira/browse/IMPALA-10922
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Vihang Karajgaonkar
>Assignee: Quanlong Huang
>Priority: Blocker
>  Labels: broken-build
>
> test_orc_stats.py is failing on certain exhaustive builds. The stack trace of 
> the failure looks like below.
> {noformat}
> query_test/test_orc_stats.py:40: in test_orc_stats
> self.run_test_case('QueryTest/orc-stats', vector, use_db=unique_database)
> common/impala_test_suite.py:779: in run_test_case
> update_section=pytest.config.option.update_results)
> common/test_result_verifier.py:653: in verify_runtime_profile
> % (function, field, expected_value, actual_value, op, actual))
> E   AssertionError: Aggregation of SUM over RowsRead did not match expected 
> results.
> E   EXPECTED VALUE:
> E   0
> E   
> E   
> E   ACTUAL VALUE:
> E   10
> E   
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10922) test_orc_stats failing on exhaustive builds

2021-09-21 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418008#comment-17418008
 ] 

Norbert Luksa commented on IMPALA-10922:


Reassigned to [~stigahuang] as he finished the work in IMPALA-6505.

> test_orc_stats failing on exhaustive builds
> ---
>
> Key: IMPALA-10922
> URL: https://issues.apache.org/jira/browse/IMPALA-10922
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Vihang Karajgaonkar
>Assignee: Quanlong Huang
>Priority: Blocker
>  Labels: broken-build
>
> test_orc_stats.py is failing on certain exhaustive builds. The stack trace of 
> the failure looks like below.
> {noformat}
> query_test/test_orc_stats.py:40: in test_orc_stats
> self.run_test_case('QueryTest/orc-stats', vector, use_db=unique_database)
> common/impala_test_suite.py:779: in run_test_case
> update_section=pytest.config.option.update_results)
> common/test_result_verifier.py:653: in verify_runtime_profile
> % (function, field, expected_value, actual_value, op, actual))
> E   AssertionError: Aggregation of SUM over RowsRead did not match expected 
> results.
> E   EXPECTED VALUE:
> E   0
> E   
> E   
> E   ACTUAL VALUE:
> E   10
> E   
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10822) Allow multiple group returns for LDAP search bind authentication group search

2021-12-20 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17463046#comment-17463046
 ] 

Norbert Luksa commented on IMPALA-10822:


Hi [~tmate], this can be closed now, right?

> Allow multiple group returns for LDAP search bind authentication group search
> -
>
> Key: IMPALA-10822
> URL: https://issues.apache.org/jira/browse/IMPALA-10822
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend, Security
>Reporter: Tamas Mate
>Assignee: Tamas Mate
>Priority: Major
>
> Group search uses the same LdapSearch method that the user which has a 
> condition that only one result should be returned, otherwise the 
> authentication is ambiguous. However, this condition is too restrictive for 
> group searches and multiple search results should be allowed.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-4741) ORDER BY behavior with UNION is incorrect

2019-11-07 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-4741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16969143#comment-16969143
 ] 

Norbert Luksa commented on IMPALA-4741:
---

Had a look at the parser, it looks like it's deliberately different from 
MySQL's parsing, [as stated in a 
comment|https://github.com/apache/impala/blob/f49f8d8a32d12128eafb4c76632ca2908d22fa28/fe/src/main/cup/sql-parser.cup#L2281].
 Looks like the current parser is designed around UNION, and the binding of the 
ORDER BY relies on its left precedence. 

Tried to play around the precedence, but did not find a simple solution. 
Changing the precedence for ORDER BY just for this scenario would introduce 
conflicts at other parts of the parser which did not seem to be easy fixes. 
Also, in another go I tried to define precedence between UNION and SELECT, but 
that did not work either, breaking other things.

> ORDER BY behavior with UNION is incorrect
> -
>
> Key: IMPALA-4741
> URL: https://issues.apache.org/jira/browse/IMPALA-4741
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.8.0
>Reporter: Greg Rahn
>Assignee: Norbert Luksa
>Priority: Critical
>  Labels: correctness, incompatibility, ramp-up, sql-language, 
> tpc-ds
> Attachments: query36a.sql, query49.sql
>
>
> When a query uses the UNION, EXCEPT, or INTERSECT operators, the ORDER BY 
> clause must be specified at the end of the statement and the results of the 
> combined queries are sorted.  ORDER BY clauses are not allowed in individual 
> branches unless the branch is enclosed by parentheses.
> There are two bugs currently:
> # An ORDER BY is allowed in a branch of a UNION that is not enclosed in 
> parentheses
> # The final ORDER BY of a UNION is attached to the nearest branch when it 
> should be sorting the combined results of the UNION(s)
> For example, this is not valid syntax but is allowed in Impala
> {code}
> select * from t1 order by 1
> union all
> select * from t2
> {code}
> And for queries like this, the ORDER BY should order the unioned result, not 
> just the nearest branch which is the current behavior.
> {code}
> select * from t1
> union all
> select * from t2
> order by 1
> {code}
> If one wants ordering within a branch, the query block must be enclosed by 
> parentheses like such:
> {code}
> (select * from t1 order by 1)
> union all
> (select * from t2 order by 2)
> {code}
> Here is an example where incorrect results are returned.
> Impala
> {code}
> [impalad:21000] > select r_regionkey, r_name from region union all select 
> r_regionkey, r_name from region order by 1 limit 2;
> +-+-+
> | r_regionkey | r_name  |
> +-+-+
> | 0   | AFRICA  |
> | 1   | AMERICA |
> | 2   | ASIA|
> | 3   | EUROPE  |
> | 4   | MIDDLE EAST |
> | 0   | AFRICA  |
> | 1   | AMERICA |
> +-+-+
> Fetched 7 row(s) in 0.12s
> {code}
> PostgreSQL
> {code}
> tpch=# select r_regionkey, r_name from region union all select r_regionkey, 
> r_name from region order by 1 limit 2;
>  r_regionkey |  r_name
> -+---
>0 | AFRICA
>0 | AFRICA
> (2 rows) 
> {code}
> see also https://cloud.google.com/spanner/docs/query-syntax#syntax_5



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-4618) # hosts is inaccurate in exec summary when mt_dop > 0

2019-11-13 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-4618:
-

Assignee: Norbert Luksa

> # hosts is inaccurate in exec summary when mt_dop > 0
> -
>
> Key: IMPALA-4618
> URL: https://issues.apache.org/jira/browse/IMPALA-4618
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.8.0
>Reporter: Tim Armstrong
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: multithreading, ramp-up, supportability
>
> I'm seeing this on a 3-node minicluster:
> {code}
> Operator   #Hosts   Avg Time   Max Time   #Rows  Est. #Rows   Peak Mem  
> Est. Peak Mem  Detail
> -
> 06:AGGREGATE1  212.433us  212.433us   0   164.00 B
> -1.00 B  FINALIZE  
> 05:EXCHANGE 10.000ns0.000ns   0   1  0
> -1.00 B  UNPARTITIONED 
> 02:AGGREGATE9  131.439us  167.821us   0   164.00 B
>10.00 MB
> 04:AGGREGATE90.000ns0.000ns   0  -1  183.42 MB
>   128.00 MB
> 03:EXCHANGE 9  707.426ms  823.681ms  20.97M  -1  0
>   0  HASH(`_c0`)   
> 01:AGGREGATE95s222ms5s355ms  22.89M  -13.83 MB
>   128.00 MB  STREAMING 
> 00:SCAN HDFS92s623ms2s896ms  23.33M  -1  492.02 KB
>88.00 MB  default.big_uuids
> {code}
> It seems like it's reporting the # of fragment instances instead of the 
> number of hosts.
> We should probably report both #Hosts and #Instances if mt_dop is enabled 
> (and maybe in all cases)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-4618) # hosts is inaccurate in exec summary when mt_dop > 0

2019-11-14 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-4618 started by Norbert Luksa.
-
> # hosts is inaccurate in exec summary when mt_dop > 0
> -
>
> Key: IMPALA-4618
> URL: https://issues.apache.org/jira/browse/IMPALA-4618
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.8.0
>Reporter: Tim Armstrong
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: multithreading, ramp-up, supportability
>
> I'm seeing this on a 3-node minicluster:
> {code}
> Operator   #Hosts   Avg Time   Max Time   #Rows  Est. #Rows   Peak Mem  
> Est. Peak Mem  Detail
> -
> 06:AGGREGATE1  212.433us  212.433us   0   164.00 B
> -1.00 B  FINALIZE  
> 05:EXCHANGE 10.000ns0.000ns   0   1  0
> -1.00 B  UNPARTITIONED 
> 02:AGGREGATE9  131.439us  167.821us   0   164.00 B
>10.00 MB
> 04:AGGREGATE90.000ns0.000ns   0  -1  183.42 MB
>   128.00 MB
> 03:EXCHANGE 9  707.426ms  823.681ms  20.97M  -1  0
>   0  HASH(`_c0`)   
> 01:AGGREGATE95s222ms5s355ms  22.89M  -13.83 MB
>   128.00 MB  STREAMING 
> 00:SCAN HDFS92s623ms2s896ms  23.33M  -1  492.02 KB
>88.00 MB  default.big_uuids
> {code}
> It seems like it's reporting the # of fragment instances instead of the 
> number of hosts.
> We should probably report both #Hosts and #Instances if mt_dop is enabled 
> (and maybe in all cases)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work stopped] (IMPALA-4741) ORDER BY behavior with UNION is incorrect

2019-11-14 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-4741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-4741 stopped by Norbert Luksa.
-
> ORDER BY behavior with UNION is incorrect
> -
>
> Key: IMPALA-4741
> URL: https://issues.apache.org/jira/browse/IMPALA-4741
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.8.0
>Reporter: Greg Rahn
>Assignee: Norbert Luksa
>Priority: Critical
>  Labels: correctness, incompatibility, ramp-up, sql-language, 
> tpc-ds
> Attachments: query36a.sql, query49.sql
>
>
> When a query uses the UNION, EXCEPT, or INTERSECT operators, the ORDER BY 
> clause must be specified at the end of the statement and the results of the 
> combined queries are sorted.  ORDER BY clauses are not allowed in individual 
> branches unless the branch is enclosed by parentheses.
> There are two bugs currently:
> # An ORDER BY is allowed in a branch of a UNION that is not enclosed in 
> parentheses
> # The final ORDER BY of a UNION is attached to the nearest branch when it 
> should be sorting the combined results of the UNION(s)
> For example, this is not valid syntax but is allowed in Impala
> {code}
> select * from t1 order by 1
> union all
> select * from t2
> {code}
> And for queries like this, the ORDER BY should order the unioned result, not 
> just the nearest branch which is the current behavior.
> {code}
> select * from t1
> union all
> select * from t2
> order by 1
> {code}
> If one wants ordering within a branch, the query block must be enclosed by 
> parentheses like such:
> {code}
> (select * from t1 order by 1)
> union all
> (select * from t2 order by 2)
> {code}
> Here is an example where incorrect results are returned.
> Impala
> {code}
> [impalad:21000] > select r_regionkey, r_name from region union all select 
> r_regionkey, r_name from region order by 1 limit 2;
> +-+-+
> | r_regionkey | r_name  |
> +-+-+
> | 0   | AFRICA  |
> | 1   | AMERICA |
> | 2   | ASIA|
> | 3   | EUROPE  |
> | 4   | MIDDLE EAST |
> | 0   | AFRICA  |
> | 1   | AMERICA |
> +-+-+
> Fetched 7 row(s) in 0.12s
> {code}
> PostgreSQL
> {code}
> tpch=# select r_regionkey, r_name from region union all select r_regionkey, 
> r_name from region order by 1 limit 2;
>  r_regionkey |  r_name
> -+---
>0 | AFRICA
>0 | AFRICA
> (2 rows) 
> {code}
> see also https://cloud.google.com/spanner/docs/query-syntax#syntax_5



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-4741) ORDER BY behavior with UNION is incorrect

2019-11-14 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-4741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-4741:
-

Assignee: (was: Norbert Luksa)

> ORDER BY behavior with UNION is incorrect
> -
>
> Key: IMPALA-4741
> URL: https://issues.apache.org/jira/browse/IMPALA-4741
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.8.0
>Reporter: Greg Rahn
>Priority: Critical
>  Labels: correctness, incompatibility, ramp-up, sql-language, 
> tpc-ds
> Attachments: query36a.sql, query49.sql
>
>
> When a query uses the UNION, EXCEPT, or INTERSECT operators, the ORDER BY 
> clause must be specified at the end of the statement and the results of the 
> combined queries are sorted.  ORDER BY clauses are not allowed in individual 
> branches unless the branch is enclosed by parentheses.
> There are two bugs currently:
> # An ORDER BY is allowed in a branch of a UNION that is not enclosed in 
> parentheses
> # The final ORDER BY of a UNION is attached to the nearest branch when it 
> should be sorting the combined results of the UNION(s)
> For example, this is not valid syntax but is allowed in Impala
> {code}
> select * from t1 order by 1
> union all
> select * from t2
> {code}
> And for queries like this, the ORDER BY should order the unioned result, not 
> just the nearest branch which is the current behavior.
> {code}
> select * from t1
> union all
> select * from t2
> order by 1
> {code}
> If one wants ordering within a branch, the query block must be enclosed by 
> parentheses like such:
> {code}
> (select * from t1 order by 1)
> union all
> (select * from t2 order by 2)
> {code}
> Here is an example where incorrect results are returned.
> Impala
> {code}
> [impalad:21000] > select r_regionkey, r_name from region union all select 
> r_regionkey, r_name from region order by 1 limit 2;
> +-+-+
> | r_regionkey | r_name  |
> +-+-+
> | 0   | AFRICA  |
> | 1   | AMERICA |
> | 2   | ASIA|
> | 3   | EUROPE  |
> | 4   | MIDDLE EAST |
> | 0   | AFRICA  |
> | 1   | AMERICA |
> +-+-+
> Fetched 7 row(s) in 0.12s
> {code}
> PostgreSQL
> {code}
> tpch=# select r_regionkey, r_name from region union all select r_regionkey, 
> r_name from region order by 1 limit 2;
>  r_regionkey |  r_name
> -+---
>0 | AFRICA
>0 | AFRICA
> (2 rows) 
> {code}
> see also https://cloud.google.com/spanner/docs/query-syntax#syntax_5



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-8046) Support CREATE TABLE from an ORC file

2019-11-21 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-8046:
-

Assignee: Norbert Luksa

> Support CREATE TABLE from an ORC file
> -
>
> Key: IMPALA-8046
> URL: https://issues.apache.org/jira/browse/IMPALA-8046
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Major
>
> Impala supports creating a table using the schema of a file. However, only 
> parquet is supported currently. This ticket tracks adding support for 
> creating table from ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-9179) Enable reading from bucketed tables

2019-11-21 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-9179.
---
Resolution: Implemented

[IMPALA-8593|https://issues.apache.org/jira/browse/IMPALA-8593] only prohibited 
writing to bucketed tables, while reading is left enabled.

> Enable reading from bucketed tables
> ---
>
> Key: IMPALA-9179
> URL: https://issues.apache.org/jira/browse/IMPALA-9179
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Reading from bucketed tables should just work out of the box, so we can just 
> enable it and add proper testing. INSERTs must still raise an error.
> Later (in a separate jira) we can improve join and aggregation performance of 
> bucketed tables.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-9179) Enable reading from bucketed tables

2019-11-21 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa closed IMPALA-9179.
-

> Enable reading from bucketed tables
> ---
>
> Key: IMPALA-9179
> URL: https://issues.apache.org/jira/browse/IMPALA-9179
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Reading from bucketed tables should just work out of the box, so we can just 
> enable it and add proper testing. INSERTs must still raise an error.
> Later (in a separate jira) we can improve join and aggregation performance of 
> bucketed tables.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8046) Support CREATE TABLE from an ORC file

2019-11-21 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8046 started by Norbert Luksa.
-
> Support CREATE TABLE from an ORC file
> -
>
> Key: IMPALA-8046
> URL: https://issues.apache.org/jira/browse/IMPALA-8046
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Major
>
> Impala supports creating a table using the schema of a file. However, only 
> parquet is supported currently. This ticket tracks adding support for 
> creating table from ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-6660) -0/+0 floating point do not compare as equal in hash table

2019-11-27 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-6660.
---
Fix Version/s: Impala 3.4.0
   Resolution: Fixed

> -0/+0 floating point do not compare as equal in hash table
> --
>
> Key: IMPALA-6660
> URL: https://issues.apache.org/jira/browse/IMPALA-6660
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.6.0, Impala 2.7.0, Impala 2.8.0, Impala 2.9.0, 
> Impala 2.10.0, Impala 2.11.0, Impala 3.0, Impala 2.12.0
>Reporter: Tim Armstrong
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: correctness, ramp-up
> Fix For: Impala 3.4.0
>
>
> This can happen because we hash the binary representation of the numbers. 
> -0/0 should be treated as equal for hash joins.
> {noformat}
> [localhost:21000] > select * from (select cast("-0" as float) c1) v1,  
> (select cast("0" as float) c2) v2 where v1.c1 = v2.c2;
> Fetched 0 row(s) in 0.12s
> [localhost:21000] > select * from (select cast("0" as float) c1) v1,  (select 
> cast("0" as float) c2) v2 where v1.c1 = v2.c2;
> +++
> | c1 | c2 |
> +++
> | 0  | 0  |
> +++
> Fetched 1 row(s) in 0.11s
> [localhost:21000] > select * from (select cast("-0" as float) c1) v1,  
> (select cast("-0" as float) c2) v2 where v1.c1 = v2.c2;
> +++
> | c1 | c2 |
> +++
> | -0 | -0 |
> +++
> Fetched 1 row(s) in 0.11s
> {noformat}
> With aggregations, we get separate groups. I could see the argument either 
> way on whether this is the preferred behaviour for group by, since group by 
> already handles equality of NULL differently. The behaviour here is tied to 
> the behaviour in the join right now, so we should make sure to add a test for 
> this case when fixing the join.
> {noformat}
> [localhost:21000] > select distinct * from (values(cast("-0" as float)), 
> (cast("0" as float))) v;
> +-+
> | cast('-0' as float) |
> +-+
> | -0  |
> | 0   |
> +-+
> {noformat}
> *Workaround*
> Casting the floating point numbers to decimal fixes the problem.
> *Proposed solution*
> The frontend could wrap floating point expressions in the hash join or hash 
> aggregation in a normalisation function that converts -0 to +0.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-4618) # hosts is inaccurate in exec summary when mt_dop > 0

2019-11-28 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16984388#comment-16984388
 ] 

Norbert Luksa commented on IMPALA-4618:
---

[~arodoni] should I create an issue for updating the docs/update them myself?

As Tim mentioned there are some places showing the result of SUMMARY.

> # hosts is inaccurate in exec summary when mt_dop > 0
> -
>
> Key: IMPALA-4618
> URL: https://issues.apache.org/jira/browse/IMPALA-4618
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.8.0
>Reporter: Tim Armstrong
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: multithreading, ramp-up, supportability
>
> I'm seeing this on a 3-node minicluster:
> {code}
> Operator   #Hosts   Avg Time   Max Time   #Rows  Est. #Rows   Peak Mem  
> Est. Peak Mem  Detail
> -
> 06:AGGREGATE1  212.433us  212.433us   0   164.00 B
> -1.00 B  FINALIZE  
> 05:EXCHANGE 10.000ns0.000ns   0   1  0
> -1.00 B  UNPARTITIONED 
> 02:AGGREGATE9  131.439us  167.821us   0   164.00 B
>10.00 MB
> 04:AGGREGATE90.000ns0.000ns   0  -1  183.42 MB
>   128.00 MB
> 03:EXCHANGE 9  707.426ms  823.681ms  20.97M  -1  0
>   0  HASH(`_c0`)   
> 01:AGGREGATE95s222ms5s355ms  22.89M  -13.83 MB
>   128.00 MB  STREAMING 
> 00:SCAN HDFS92s623ms2s896ms  23.33M  -1  492.02 KB
>88.00 MB  default.big_uuids
> {code}
> It seems like it's reporting the # of fragment instances instead of the 
> number of hosts.
> We should probably report both #Hosts and #Instances if mt_dop is enabled 
> (and maybe in all cases)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9207) Impala Doc: Document adding #Inst in exec summary

2019-11-29 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9207:
-

 Summary: Impala Doc: Document adding #Inst in exec summary
 Key: IMPALA-9207
 URL: https://issues.apache.org/jira/browse/IMPALA-9207
 Project: IMPALA
  Issue Type: Sub-task
Reporter: Norbert Luksa
Assignee: Alexandra Rodoni


https://gerrit.cloudera.org/#/c/14715/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-4618) # hosts is inaccurate in exec summary when mt_dop > 0

2019-12-05 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-4618.
---
Fix Version/s: Impala 3.4.0
   Resolution: Fixed

> # hosts is inaccurate in exec summary when mt_dop > 0
> -
>
> Key: IMPALA-4618
> URL: https://issues.apache.org/jira/browse/IMPALA-4618
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.8.0
>Reporter: Tim Armstrong
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: multithreading, ramp-up, supportability
> Fix For: Impala 3.4.0
>
>
> I'm seeing this on a 3-node minicluster:
> {code}
> Operator   #Hosts   Avg Time   Max Time   #Rows  Est. #Rows   Peak Mem  
> Est. Peak Mem  Detail
> -
> 06:AGGREGATE1  212.433us  212.433us   0   164.00 B
> -1.00 B  FINALIZE  
> 05:EXCHANGE 10.000ns0.000ns   0   1  0
> -1.00 B  UNPARTITIONED 
> 02:AGGREGATE9  131.439us  167.821us   0   164.00 B
>10.00 MB
> 04:AGGREGATE90.000ns0.000ns   0  -1  183.42 MB
>   128.00 MB
> 03:EXCHANGE 9  707.426ms  823.681ms  20.97M  -1  0
>   0  HASH(`_c0`)   
> 01:AGGREGATE95s222ms5s355ms  22.89M  -13.83 MB
>   128.00 MB  STREAMING 
> 00:SCAN HDFS92s623ms2s896ms  23.33M  -1  492.02 KB
>88.00 MB  default.big_uuids
> {code}
> It seems like it's reporting the # of fragment instances instead of the 
> number of hosts.
> We should probably report both #Hosts and #Instances if mt_dop is enabled 
> (and maybe in all cases)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-9175) Revisit the error handling logics in ORC scanner

2019-12-13 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9175 started by Norbert Luksa.
-
> Revisit the error handling logics in ORC scanner
> 
>
> Key: IMPALA-9175
> URL: https://issues.apache.org/jira/browse/IMPALA-9175
> Project: IMPALA
>  Issue Type: Task
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Blocker
>
> This is a task to revisit all the corresponding error handling logics in the 
> ORC scanner comparing to the Parquet scanner. For each kind of error handling 
> in the parquet scanner, make sure we already handle it in the orc scanner, 
> otherwise create separate JIRAs to handle them.
> Also need to make sure whether the exposed error messages are enough for 
> debugging. For instance, one frequently encountered error when Impala has 
> stale metadata of an ORC file is:
> {code:java}
> Encountered parse error in tail of ORC file 
> hdfs://hadoop2cluster/user/hive-0.13.1/warehouse/bi_ucar.db/alliance_driver_stat_hour_api/dt=2019-08-09/part-6:
>  Invalid ORC postscript length
> {code}
> It'd be better to also print the postscript length we read and the file size. 
> So users can know whether the file is corrupt (so need data regeneration) or 
> the metadata is stale (so need refresh). We may need changes in the ORC lib 
> for these.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-9226) Improve string allocations of the ORC scanner

2019-12-19 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9226 started by Norbert Luksa.
-
> Improve string allocations of the ORC scanner
> -
>
> Key: IMPALA-9226
> URL: https://issues.apache.org/jira/browse/IMPALA-9226
> Project: IMPALA
>  Issue Type: Improvement
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: orc
>
> Currently the ORC scanner allocates new memory for each string values (except 
> for fixed size strings):
> [https://github.com/apache/impala/blob/85425b81f04c856d7d5ec375242303f78ec7964e/be/src/exec/orc-column-readers.cc#L172]
> Besides the too many allocations and copying it's also bad for memory 
> locality.
> Since ORC-501 StringVectorBatch has a member named 'blob' that contains the 
> strings in the batch: 
> [https://github.com/apache/orc/blob/branch-1.6/c%2B%2B/include/orc/Vector.hh#L126]
> 'blob' has type DataBuffer which is movable, so Impala might be able to get 
> ownership of it. Or, at least we could copy the whole blob array instead of 
> copying the strings one-by-one.
> ORC-501 is included in ORC version 1.6, but Impala currently only uses ORC 
> 1.5.5.
> ORC 1.6 also introduces a new string vector type, EncodedStringVectorBatch:
> [https://github.com/apache/orc/blob/e40b9a7205d51995f11fe023c90769c0b7c4bb93/c%2B%2B/include/orc/Vector.hh#L153]
> It uses dictionary encoding for storing the values. Impala could copy/move 
> the dictionary as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8046) Support CREATE TABLE from an ORC file

2020-01-15 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-8046.
---
Fix Version/s: Impala 3.4.0
   Resolution: Implemented

> Support CREATE TABLE from an ORC file
> -
>
> Key: IMPALA-8046
> URL: https://issues.apache.org/jira/browse/IMPALA-8046
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Major
> Fix For: Impala 3.4.0
>
>
> Impala supports creating a table using the schema of a file. However, only 
> parquet is supported currently. This ticket tracks adding support for 
> creating table from ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9310) OrcStringColumnReader could move blob instead of copy

2020-01-21 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9310:
-

 Summary: OrcStringColumnReader could move blob instead of copy
 Key: IMPALA-9310
 URL: https://issues.apache.org/jira/browse/IMPALA-9310
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa


Since IMPALA-9226, in OrcStringColumnReader the blobs of a batch are copied to 
a freshly allocated space by Impala. As Zoltán pointed out in a 
[review|https://gerrit.cloudera.org/#/c/15051/1/be/src/exec/orc-column-readers.cc@158],
 a possible improvement is moving a blob by finding a way of getting hold of 
the memory allocated by the ORC library instead of copying.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-9290) ORC scanner should support schema evolution between date and timestamp types

2020-01-27 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-9290:
-

Assignee: Norbert Luksa

> ORC scanner should support schema evolution between date and timestamp types
> 
>
> Key: IMPALA-9290
> URL: https://issues.apache.org/jira/browse/IMPALA-9290
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.3.0
>Reporter: Gabor Kaszab
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: orc
>
> *This is the desired use case:*
> 1. Create an ORC table TBL1 with a DATE column.
> 2. Create an ORC table TBL2 with a TIMESTAMP column that has the same 
> location as TBL1.
> 3. Insert some DATE values into TBL1 and some TIMESTAMP values into TBL2.
> 4. select from TBL1 returns both DATE and TIMESTAMP values (converted to 
> DATE).
> 5. select from TBL2 returns both DATE and TIMESTAMPS values. The DATE values 
> are converted to TIMESTAMP.
> Without this feature Impala return an error:
> {code:java}
> ERROR: Type mismatch: table column DATE is map to column timestamp in ORC 
> file 'hdfs://localhost:20500/test-warehouse/orc_date_tbl/00_0_copy_1'
> {code}
> *Note:*
> With https://issues.apache.org/jira/browse/IMPALA-8801 implementing Date type 
> for ORC it is possible to read date values in ORC format. However, writing is 
> still not supported and has to be done by Hive.
> *Let me copy-paste a code review comment from IMPALA-8801 as a suggestion for 
> the implementation:*
> We can modify OrcTimestampReader to support reading orc::TimestampVectorBatch 
> into Date type slots. In its constructor it knows which kind of slots 
> (timestamp or date) it's writting to. So in ReadValue() it can have different 
> behaviors based on different modes (timestamp values => timestamp slots / 
> timestamp values => date slots). We can do the same on OrcDateColumnReader to 
> let it support reading ORC Date values into Timestamp type slots.
> Note that the life cycle of a OrcColumnReader is within the life cycle of the 
> HdfsOrcScanner which only reads a split of an ORC file, and an ORC file can't 
> have two types for one column (e.g. column1 is timestamp in stripe1 and is 
> date in stripe2). So we don't need to deal with different batch types in 
> UpdateInputBatch().
> BTW, It'd be better to add test coverage for this type compactibility check 
> in test_scanners.py (See TestOrc.test_type_conversions).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-9290) ORC scanner should support schema evolution between date and timestamp types

2020-01-31 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9290 started by Norbert Luksa.
-
> ORC scanner should support schema evolution between date and timestamp types
> 
>
> Key: IMPALA-9290
> URL: https://issues.apache.org/jira/browse/IMPALA-9290
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.3.0
>Reporter: Gabor Kaszab
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: orc
>
> *This is the desired use case:*
> 1. Create an ORC table TBL1 with a DATE column.
> 2. Create an ORC table TBL2 with a TIMESTAMP column that has the same 
> location as TBL1.
> 3. Insert some DATE values into TBL1 and some TIMESTAMP values into TBL2.
> 4. select from TBL1 returns both DATE and TIMESTAMP values (converted to 
> DATE).
> 5. select from TBL2 returns both DATE and TIMESTAMPS values. The DATE values 
> are converted to TIMESTAMP.
> Without this feature Impala return an error:
> {code:java}
> ERROR: Type mismatch: table column DATE is map to column timestamp in ORC 
> file 'hdfs://localhost:20500/test-warehouse/orc_date_tbl/00_0_copy_1'
> {code}
> *Note:*
> With https://issues.apache.org/jira/browse/IMPALA-8801 implementing Date type 
> for ORC it is possible to read date values in ORC format. However, writing is 
> still not supported and has to be done by Hive.
> *Let me copy-paste a code review comment from IMPALA-8801 as a suggestion for 
> the implementation:*
> We can modify OrcTimestampReader to support reading orc::TimestampVectorBatch 
> into Date type slots. In its constructor it knows which kind of slots 
> (timestamp or date) it's writting to. So in ReadValue() it can have different 
> behaviors based on different modes (timestamp values => timestamp slots / 
> timestamp values => date slots). We can do the same on OrcDateColumnReader to 
> let it support reading ORC Date values into Timestamp type slots.
> Note that the life cycle of a OrcColumnReader is within the life cycle of the 
> HdfsOrcScanner which only reads a split of an ORC file, and an ORC file can't 
> have two types for one column (e.g. column1 is timestamp in stripe1 and is 
> date in stripe2). So we don't need to deal with different batch types in 
> UpdateInputBatch().
> BTW, It'd be better to add test coverage for this type compactibility check 
> in test_scanners.py (See TestOrc.test_type_conversions).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-02-04 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029728#comment-17029728
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Not sure what could be the problem there.
The data loading completed successfully and tests querying the same table 
passed.
The test failed twice, then passed after. Let's see if this happens again?
https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core/314/testReport/junit/org.apache.impala.analysis/AnalyzeDDLTest/TestCreateTableLikeFileOrc/history/

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagy

[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-02-13 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17036126#comment-17036126
 ] 

Norbert Luksa commented on IMPALA-9351:
---

I don't think there is anything special, looks like this table is created like 
any other table.
Maybe the location/filename can be different sometimes? 
[~stigahuang], asking since you have worked more with ORC, have you experienced 
anything like that with your ORC files?

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
Th

[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-02-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17038412#comment-17038412
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Thanks, [~joemcdonnell], sent in https://gerrit.cloudera.org/#/c/15234/

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additiona

[jira] [Assigned] (IMPALA-6505) Min-Max predicate push down in ORC scanner

2020-02-18 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-6505:
-

Assignee: Norbert Luksa

> Min-Max predicate push down in ORC scanner
> --
>
> Key: IMPALA-6505
> URL: https://issues.apache.org/jira/browse/IMPALA-6505
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-02-21 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041951#comment-17041951
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Thanks again for the help, Joe, sent in [https://gerrit.cloudera.org/#/c/15266/]

 

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
F

[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-02-22 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17042459#comment-17042459
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Thanks Tim!

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9420) test_scanners.TestOrc.test_type conversions fails after first run

2020-02-25 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9420:
-

 Summary: test_scanners.TestOrc.test_type conversions fails after 
first run
 Key: IMPALA-9420
 URL: https://issues.apache.org/jira/browse/IMPALA-9420
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa
Assignee: Gabor Kaszab


The mentioned test passes on the first run, but fails later on, finding more 
rows than expected.
 By running
{code:java}
hdfs dfs -ls -R / | grep union_comlextypes
{code}
we can find that the previously created files are not cleaned up, so Impala 
will find and scan them.

The problem could be that the union_complextypes and ill_complextypes tables 
are created as externals.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9420) test_scanners.TestOrc.test_type conversions fails after first run

2020-02-25 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-9420:
--
Labels: orc ramp-up  (was: )

> test_scanners.TestOrc.test_type conversions fails after first run
> -
>
> Key: IMPALA-9420
> URL: https://issues.apache.org/jira/browse/IMPALA-9420
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Assignee: Gabor Kaszab
>Priority: Major
>  Labels: orc, ramp-up
>
> The mentioned test passes on the first run, but fails later on, finding more 
> rows than expected.
>  By running
> {code:java}
> hdfs dfs -ls -R / | grep union_comlextypes
> {code}
> we can find that the previously created files are not cleaned up, so Impala 
> will find and scan them.
> The problem could be that the union_complextypes and ill_complextypes tables 
> are created as externals.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-9226) Improve string allocations of the ORC scanner

2020-03-02 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-9226.
---
Fix Version/s: Impala 4.0
   Resolution: Fixed

> Improve string allocations of the ORC scanner
> -
>
> Key: IMPALA-9226
> URL: https://issues.apache.org/jira/browse/IMPALA-9226
> Project: IMPALA
>  Issue Type: Improvement
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: orc
> Fix For: Impala 4.0
>
>
> Currently the ORC scanner allocates new memory for each string values (except 
> for fixed size strings):
> [https://github.com/apache/impala/blob/85425b81f04c856d7d5ec375242303f78ec7964e/be/src/exec/orc-column-readers.cc#L172]
> Besides the too many allocations and copying it's also bad for memory 
> locality.
> Since ORC-501 StringVectorBatch has a member named 'blob' that contains the 
> strings in the batch: 
> [https://github.com/apache/orc/blob/branch-1.6/c%2B%2B/include/orc/Vector.hh#L126]
> 'blob' has type DataBuffer which is movable, so Impala might be able to get 
> ownership of it. Or, at least we could copy the whole blob array instead of 
> copying the strings one-by-one.
> ORC-501 is included in ORC version 1.6, but Impala currently only uses ORC 
> 1.5.5.
> ORC 1.6 also introduces a new string vector type, EncodedStringVectorBatch:
> [https://github.com/apache/orc/blob/e40b9a7205d51995f11fe023c90769c0b7c4bb93/c%2B%2B/include/orc/Vector.hh#L153]
> It uses dictionary encoding for storing the values. Impala could copy/move 
> the dictionary as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-03-02 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17048996#comment-17048996
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Looks like the test has not failed since the last fix. [~joemcdonnell], what do 
you think, can we close this Jira now?

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues

[jira] [Work started] (IMPALA-9447) TupleRowCompareTest.DecimalTest crashes on release build

2020-03-03 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9447 started by Norbert Luksa.
-
> TupleRowCompareTest.DecimalTest crashes on release build
> 
>
> Key: IMPALA-9447
> URL: https://issues.apache.org/jira/browse/IMPALA-9447
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.4.0
>Reporter: Joe McDonnell
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build
>
> TupleRowCompareTest.DecimalTest crashes on the release build with the 
> following stack:
> {noformat}
> Operating system: Linux
>   0.0.0 Linux 4.15.0-64-generic #73~16.04.1-Ubuntu SMP Fri 
> Sep 13 09:56:18 UTC 2019 x86_64
> CPU: amd64
>  family 6 model 94 stepping 3
>  1 CPUGPU: UNKNOWNCrash reason:  SIGSEGV
> Crash address: 0x0
> Process uptime: not availableThread 0 (crashed)
>  0  unifiedbetests!int 
> impala::TupleRowCompareTest::DecimalDecimalTest
>  >(long, long, long, long, int, int) [tuple-row-compare-test.cc : 121 + 0x0]
> rax = 0x0d2b2598   rdx = 0x0021
> rcx = 0x0e34e640   rbx = 0x0d1def00
> rsi = 0x05b9   rdi = 0x0d1df0d8
> rbp = 0x7ffcc5aff280   rsp = 0x7ffcc5aff150
>  r8 = 0xr9 = 0x0064
> r10 = 0x   r11 = 0x0001
> r12 = 0x0d1df0d8   r13 = 0x0002
> r14 = 0x7ffcc5aff101   r15 = 0x0d2b2580
> rip = 0x014b6981
> Found by: given as instruction pointer in context
>  1  unifiedbetests!impala::TupleRowCompareTest_DecimalTest_Test::TestBody() 
> [tuple-row-compare-test.cc : 560 + 0x30]
> rbx = 0x7ffcc5aff2b0   rbp = 0x7ffcc5aff2f0
> rsp = 0x7ffcc5aff290   r12 = 0x7ffcc5aff2a0
> r13 = 0x7ffcc5aff2c0   r14 = 0x0d1def00
> r15 = 0x03399ef0   rip = 0x014a0c81
> Found by: call frame info
>  2  unifiedbetests!void 
> testing::internal::HandleExceptionsInMethodIfSupported void>(testing::Test*, void (testing::Test::*)(), char const*) + 0x33
> rbx = 0x014a01d0   rbp = 0x
> rsp = 0x7ffcc5aff300   r12 = 0x0d1def00
> r13 = 0x03576465   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033b4f03
> Found by: call frame info
>  3  unifiedbetests!testing::Test::Run() + 0xba
> rbx = 0x0d1def00   rbp = 0x03399ef0
> rsp = 0x7ffcc5aff340   r12 = 0x06aa2000
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acb5a
> Found by: call frame info
>  4  unifiedbetests!testing::TestInfo::Run() + 0x118
> rbx = 0x06ad48f0   rbp = 0x06aa2000
> rsp = 0x7ffcc5aff360   r12 = 0x06a90de0
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acca8
> Found by: call frame info
>  5  unifiedbetests!testing::TestCase::Run() + 0xb5
> rbx = 0x000b   rbp = 0x06ac5500
> rsp = 0x7ffcc5aff3a0   r12 = 0x06aa2000
> r13 = 0x06a90de0   r14 = 0x03399ef0
> r15 = 0x01709c90b7de   rip = 0x033acd85
> Found by: call frame info
>  6  unifiedbetests!testing::internal::UnitTestImpl::RunAllTests() + 0x258
> rbx = 0x06aa2000   rbp = 0x06a90de0
> rsp = 0x7ffcc5aff3f0   r12 = 0x006c
> r13 = 0x   r14 = 0x01709c90b7de
> r15 = 0x0478d2d4   rip = 0x033ae008
> Found by: call frame info{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9447) TupleRowCompareTest.DecimalTest crashes on release build

2020-03-03 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17050030#comment-17050030
 ] 

Norbert Luksa commented on IMPALA-9447:
---

This is test issue, caused by gcc generating unaligned instructions like 
int128_t. They raises SegmentFault when addresses are not aligned to 16 bytes.

Sent in a fix: https://gerrit.cloudera.org/#/c/15347/

> TupleRowCompareTest.DecimalTest crashes on release build
> 
>
> Key: IMPALA-9447
> URL: https://issues.apache.org/jira/browse/IMPALA-9447
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.4.0
>Reporter: Joe McDonnell
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build
>
> TupleRowCompareTest.DecimalTest crashes on the release build with the 
> following stack:
> {noformat}
> Operating system: Linux
>   0.0.0 Linux 4.15.0-64-generic #73~16.04.1-Ubuntu SMP Fri 
> Sep 13 09:56:18 UTC 2019 x86_64
> CPU: amd64
>  family 6 model 94 stepping 3
>  1 CPUGPU: UNKNOWNCrash reason:  SIGSEGV
> Crash address: 0x0
> Process uptime: not availableThread 0 (crashed)
>  0  unifiedbetests!int 
> impala::TupleRowCompareTest::DecimalDecimalTest
>  >(long, long, long, long, int, int) [tuple-row-compare-test.cc : 121 + 0x0]
> rax = 0x0d2b2598   rdx = 0x0021
> rcx = 0x0e34e640   rbx = 0x0d1def00
> rsi = 0x05b9   rdi = 0x0d1df0d8
> rbp = 0x7ffcc5aff280   rsp = 0x7ffcc5aff150
>  r8 = 0xr9 = 0x0064
> r10 = 0x   r11 = 0x0001
> r12 = 0x0d1df0d8   r13 = 0x0002
> r14 = 0x7ffcc5aff101   r15 = 0x0d2b2580
> rip = 0x014b6981
> Found by: given as instruction pointer in context
>  1  unifiedbetests!impala::TupleRowCompareTest_DecimalTest_Test::TestBody() 
> [tuple-row-compare-test.cc : 560 + 0x30]
> rbx = 0x7ffcc5aff2b0   rbp = 0x7ffcc5aff2f0
> rsp = 0x7ffcc5aff290   r12 = 0x7ffcc5aff2a0
> r13 = 0x7ffcc5aff2c0   r14 = 0x0d1def00
> r15 = 0x03399ef0   rip = 0x014a0c81
> Found by: call frame info
>  2  unifiedbetests!void 
> testing::internal::HandleExceptionsInMethodIfSupported void>(testing::Test*, void (testing::Test::*)(), char const*) + 0x33
> rbx = 0x014a01d0   rbp = 0x
> rsp = 0x7ffcc5aff300   r12 = 0x0d1def00
> r13 = 0x03576465   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033b4f03
> Found by: call frame info
>  3  unifiedbetests!testing::Test::Run() + 0xba
> rbx = 0x0d1def00   rbp = 0x03399ef0
> rsp = 0x7ffcc5aff340   r12 = 0x06aa2000
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acb5a
> Found by: call frame info
>  4  unifiedbetests!testing::TestInfo::Run() + 0x118
> rbx = 0x06ad48f0   rbp = 0x06aa2000
> rsp = 0x7ffcc5aff360   r12 = 0x06a90de0
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acca8
> Found by: call frame info
>  5  unifiedbetests!testing::TestCase::Run() + 0xb5
> rbx = 0x000b   rbp = 0x06ac5500
> rsp = 0x7ffcc5aff3a0   r12 = 0x06aa2000
> r13 = 0x06a90de0   r14 = 0x03399ef0
> r15 = 0x01709c90b7de   rip = 0x033acd85
> Found by: call frame info
>  6  unifiedbetests!testing::internal::UnitTestImpl::RunAllTests() + 0x258
> rbx = 0x06aa2000   rbp = 0x06a90de0
> rsp = 0x7ffcc5aff3f0   r12 = 0x006c
> r13 = 0x   r14 = 0x01709c90b7de
> r15 = 0x0478d2d4   rip = 0x033ae008
> Found by: call frame info{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-9447) TupleRowCompareTest.DecimalTest crashes on release build

2020-03-03 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17050030#comment-17050030
 ] 

Norbert Luksa edited comment on IMPALA-9447 at 3/3/20 9:16 AM:
---

This is test issue, caused by gcc generating unaligned instructions like movaps 
for int128_t. They raises SegmentFault when addresses are not aligned to 16 
bytes.

Sent in a fix: [https://gerrit.cloudera.org/#/c/15347/]


was (Author: norbertluksa):
This is test issue, caused by gcc generating unaligned instructions like 
int128_t. They raises SegmentFault when addresses are not aligned to 16 bytes.

Sent in a fix: https://gerrit.cloudera.org/#/c/15347/

> TupleRowCompareTest.DecimalTest crashes on release build
> 
>
> Key: IMPALA-9447
> URL: https://issues.apache.org/jira/browse/IMPALA-9447
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.4.0
>Reporter: Joe McDonnell
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build
>
> TupleRowCompareTest.DecimalTest crashes on the release build with the 
> following stack:
> {noformat}
> Operating system: Linux
>   0.0.0 Linux 4.15.0-64-generic #73~16.04.1-Ubuntu SMP Fri 
> Sep 13 09:56:18 UTC 2019 x86_64
> CPU: amd64
>  family 6 model 94 stepping 3
>  1 CPUGPU: UNKNOWNCrash reason:  SIGSEGV
> Crash address: 0x0
> Process uptime: not availableThread 0 (crashed)
>  0  unifiedbetests!int 
> impala::TupleRowCompareTest::DecimalDecimalTest
>  >(long, long, long, long, int, int) [tuple-row-compare-test.cc : 121 + 0x0]
> rax = 0x0d2b2598   rdx = 0x0021
> rcx = 0x0e34e640   rbx = 0x0d1def00
> rsi = 0x05b9   rdi = 0x0d1df0d8
> rbp = 0x7ffcc5aff280   rsp = 0x7ffcc5aff150
>  r8 = 0xr9 = 0x0064
> r10 = 0x   r11 = 0x0001
> r12 = 0x0d1df0d8   r13 = 0x0002
> r14 = 0x7ffcc5aff101   r15 = 0x0d2b2580
> rip = 0x014b6981
> Found by: given as instruction pointer in context
>  1  unifiedbetests!impala::TupleRowCompareTest_DecimalTest_Test::TestBody() 
> [tuple-row-compare-test.cc : 560 + 0x30]
> rbx = 0x7ffcc5aff2b0   rbp = 0x7ffcc5aff2f0
> rsp = 0x7ffcc5aff290   r12 = 0x7ffcc5aff2a0
> r13 = 0x7ffcc5aff2c0   r14 = 0x0d1def00
> r15 = 0x03399ef0   rip = 0x014a0c81
> Found by: call frame info
>  2  unifiedbetests!void 
> testing::internal::HandleExceptionsInMethodIfSupported void>(testing::Test*, void (testing::Test::*)(), char const*) + 0x33
> rbx = 0x014a01d0   rbp = 0x
> rsp = 0x7ffcc5aff300   r12 = 0x0d1def00
> r13 = 0x03576465   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033b4f03
> Found by: call frame info
>  3  unifiedbetests!testing::Test::Run() + 0xba
> rbx = 0x0d1def00   rbp = 0x03399ef0
> rsp = 0x7ffcc5aff340   r12 = 0x06aa2000
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acb5a
> Found by: call frame info
>  4  unifiedbetests!testing::TestInfo::Run() + 0x118
> rbx = 0x06ad48f0   rbp = 0x06aa2000
> rsp = 0x7ffcc5aff360   r12 = 0x06a90de0
> r13 = 0x0d1def00   r14 = 0x01709c90bbdc
> r15 = 0x03399ef0   rip = 0x033acca8
> Found by: call frame info
>  5  unifiedbetests!testing::TestCase::Run() + 0xb5
> rbx = 0x000b   rbp = 0x06ac5500
> rsp = 0x7ffcc5aff3a0   r12 = 0x06aa2000
> r13 = 0x06a90de0   r14 = 0x03399ef0
> r15 = 0x01709c90b7de   rip = 0x033acd85
> Found by: call frame info
>  6  unifiedbetests!testing::internal::UnitTestImpl::RunAllTests() + 0x258
> rbx = 0x06aa2000   rbp = 0x06a90de0
> rsp = 0x7ffcc5aff3f0   r12 = 0x006c
> r13 = 0x   r14 = 0x01709c90b7de
> r15 = 0x0478d2d4   rip = 0x033ae008
> Found by: call frame info{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-6505) Min-Max predicate push down in ORC scanner

2020-03-03 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-6505 started by Norbert Luksa.
-
> Min-Max predicate push down in ORC scanner
> --
>
> Key: IMPALA-6505
> URL: https://issues.apache.org/jira/browse/IMPALA-6505
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Norbert Luksa
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-03-03 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-9351.
---
Fix Version/s: Impala 3.4.0
   Resolution: Fixed

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
> Fix For: Impala 3.4.0
>
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h.

[jira] [Work started] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-03-03 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9351 started by Norbert Luksa.
-
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9464) Tuple::DeepCopyVarlenData crashes on memcpy

2020-03-05 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9464:
-

 Summary: Tuple::DeepCopyVarlenData crashes on memcpy
 Key: IMPALA-9464
 URL: https://issues.apache.org/jira/browse/IMPALA-9464
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa


The stack trace is

{code:java}
Thread 1 (LWP 29440):
[...]
#6 
#7 0x0037e4e896ce in memcpy () from ./lib64/libc.so.6
#8 0x00b3fcc6 in impala::Tuple::DeepCopyVarlenData 
(this=0x7f318b494948, desc=..., pool=pool@entry=0x7f3486927468) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/tuple.cc:103
#9 0x00b3fdf4 in impala::Tuple::DeepCopy 
(this=this@entry=0x7f31f0b200f6, dst=dst@entry=0x7f318b494948, desc=..., 
pool=pool@entry=0x7f3486927468) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/tuple.cc:92
#10 0x00b3fe8f in impala::Tuple::DeepCopy (this=0x7f31f0b200f6, 
desc=..., pool=0x7f3486927468) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/tuple.cc:83
#11 0x011e5811 in impala::DataStreamSender::Channel::AddRow 
(this=0x7f349da09500, row=row@entry=0x191a7f80) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/data-stream-sender.cc:258
#12 0x011e63af in impala::DataStreamSender::Send (this=0x7f3485ca7860, 
state=0x21b32e80, batch=0x262ae750) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/data-stream-sender.cc:475
#13 0x00b72124 in impala::FragmentInstanceState::ExecInternal 
(this=this@entry=0x1a4ad680) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/fragment-instance-state.cc:275
#14 0x00b749f2 in impala::FragmentInstanceState::Exec 
(this=this@entry=0x1a4ad680) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/fragment-instance-state.cc:89
#15 0x00b64038 in impala::QueryState::ExecFInstance (this=0x14347600, 
fis=0x1a4ad680) at 
/usr/src/debug/impala-2.11.0-cdh5.14.2/be/src/runtime/query-state.cc:382
[..]
{code}

Relevant part of Tuple::DeepCopyVarlenData

{code:c++}
char* string_copy = reinterpret_cast(pool->Allocate(string_v->len));
Ubsan::MemCpy(string_copy, string_v->ptr, string_v->len);
{code}

We could rewrite the function to return Status. This does not solve the root 
cause, but at least we could avoid crashes if the allocation fails.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9513) query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails on exhaustive tests

2020-03-16 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9513:
-

 Summary: 
query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails on 
exhaustive tests
 Key: IMPALA-9513
 URL: https://issues.apache.org/jira/browse/IMPALA-9513
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa


Encountered the mentioned test failures in recent exhaustive tests. The failed 
assertion is: 
{code:java}
query_test/test_kudu.py:436: in test_column_storage_attributes assert 
cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
datetime.date(2010, 1, 1)) E
{code}

Looks like it is caused by IMPALA-8800.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9513) query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails on exhaustive tests

2020-03-16 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-9513:
--
Description: 
Encountered the mentioned test failures in recent exhaustive tests. The failed 
assertion is: 
{code:java}
query_test/test_kudu.py:436: in test_column_storage_attributes assert 
cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
datetime.date(2010, 1, 1)) E
{code}

Looks like it is caused by https://gerrit.cloudera.org/#/c/14705/.

  was:
Encountered the mentioned test failures in recent exhaustive tests. The failed 
assertion is: 
{code:java}
query_test/test_kudu.py:436: in test_column_storage_attributes assert 
cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
datetime.date(2010, 1, 1)) E
{code}

Looks like it is caused by IMPALA-8800.


> query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails 
> on exhaustive tests
> 
>
> Key: IMPALA-9513
> URL: https://issues.apache.org/jira/browse/IMPALA-9513
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Priority: Blocker
>
> Encountered the mentioned test failures in recent exhaustive tests. The 
> failed assertion is: 
> {code:java}
> query_test/test_kudu.py:436: in test_column_storage_attributes assert 
> cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
> 0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
> datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
> 0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
> datetime.date(2010, 1, 1)) E
> {code}
> Looks like it is caused by https://gerrit.cloudera.org/#/c/14705/.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9513) query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails on exhaustive tests

2020-03-16 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060280#comment-17060280
 ] 

Norbert Luksa commented on IMPALA-9513:
---

[~vladimir_committer] could you please take a look?

> query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails 
> on exhaustive tests
> 
>
> Key: IMPALA-9513
> URL: https://issues.apache.org/jira/browse/IMPALA-9513
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Priority: Blocker
>
> Encountered the mentioned test failures in recent exhaustive tests. The 
> failed assertion is: 
> {code:java}
> query_test/test_kudu.py:436: in test_column_storage_attributes assert 
> cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
> 0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
> datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
> 0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
> datetime.date(2010, 1, 1)) E
> {code}
> Looks like it is caused by https://gerrit.cloudera.org/#/c/14705/.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Reopened] (IMPALA-8857) test_kudu_col_not_null_changed may fail because client reads older timestamp

2020-03-16 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reopened IMPALA-8857:
---

Encountered the issue recently in the following build:
https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/9713/

Not sure if it's relevant, but a similar test, test_kudu_col_changed failed due 
to network error here:
https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/9710/

> test_kudu_col_not_null_changed may fail because client reads older timestamp
> 
>
> Key: IMPALA-8857
> URL: https://issues.apache.org/jira/browse/IMPALA-8857
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Thomas Tauber-Marshall
>Priority: Critical
>  Labels: flaky
>
> {noformat}
> uery_test/test_kudu.py:242: in test_kudu_col_not_null_changed
> assert len(cursor.fetchall()) == 100
> E   assert 61 == 100
> E+  where 61 = len([(0, None), (2, None), (4, None), (11, None), (12, 
> None), (19, None), ...])
> E+where [(0, None), (2, None), (4, None), (11, None), (12, None), 
> (19, None), ...] =  >()
> E+  where  > = 
> .fetchall
> {noformat}
> I believe this is a flaky tests, since there's no attempt to pass the 
> timestamp from the kudu client that did the insert to the impala client 
> that's doing the reading.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9463) TestInsertQueries.test_insert failed on S3

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060979#comment-17060979
 ] 

Norbert Luksa commented on IMPALA-9463:
---

In a recent build, query_test.test_mt_dop.TestMtDopParquet.test_mt_dop_insert 
also failed with a similar exception:
{code:java}
ImpalaBeeswaxException: ImpalaBeeswaxException: 
Query aborted:InternalException: Error adding partitions CAUSED BY:
  MetaException: java.io.IOException: Got exception: 
java.io.FileNotFoundException getVersionMarkerItem on ../VERSION: 
com.amazonaws.services.dynamodbv2.model.ResourceNotFoundException:
  Requested resource not found (Service: AmazonDynamoDBv2; 
Status Code: 400;
Error Code: ResourceNotFoundException;
Request ID: 
0S5CT4S81MJ7HFQL60P9021D4VVV4KQNSO5AEMVJF66Q9ASUAAJG)
{code}

Stacktrace:

{code:java}
query_test/test_mt_dop.py:122: in test_mt_dop_insert
self.run_test_case('QueryTest/insert', vector)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:659:
 in run_test_case
result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:594:
 in __exec_in_impala
result = self.__execute_query(target_impalad_client, query, user=user)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:934:
 in __execute_query
return impalad_client.execute(query, user=user)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_connection.py:205:
 in execute
return self.__beeswax_client.execute(sql_stmt, user=user)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:187:
 in execute
handle = self.__execute_query(query_string.strip(), user=user)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:365:
 in __execute_query
self.wait_for_finished(handle)
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:386:
 in wait_for_finished
raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
E   ImpalaBeeswaxException: ImpalaBeeswaxException:
EQuery aborted:InternalException: Error adding partitions
E   CAUSED BY: MetaException: java.io.IOException: Got exception: 
java.io.FileNotFoundException getVersionMarkerItem on ../VERSION: 
com.amazonaws.services.dynamodbv2.model.ResourceNotFoundException: Requested 
resource not found (Service: AmazonDynamoDBv2; Status Code: 400; Error Code: 
ResourceNotFoundException; Request ID: 
0S5CT4S81MJ7HFQL60P9021D4VVV4KQNSO5AEMVJF66Q9ASUAAJG)
{code}


> TestInsertQueries.test_insert failed on S3
> --
>
> Key: IMPALA-9463
> URL: https://issues.apache.org/jira/browse/IMPALA-9463
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Zoltán Borók-Nagy
>Priority: Major
>  Labels: broken-build
>
> query_test.test_insert.TestInsertQueries.test_insert failed on S3 with the 
> following error:
> {noformat}
> ImpalaBeeswaxException: ImpalaBeeswaxException:
> Query aborted:InternalException: Error adding partitions CAUSED BY:
>   MetaException: java.io.IOException: Got exception: 
> java.io.FileNotFoundException getVersionMarkerItem on ../VERSION:
> com.amazonaws.services.dynamodbv2.model.ResourceNotFoundException:
>   Requested resource not found (Service: AmazonDynamoDBv2;
> Status Code: 400;
> Error Code: ResourceNotFoundException;
> Request ID: 
> 3NHON9GO4LOH94UN11B7KJ81KBVV4KQNSO5AEMVJF66Q9ASUAAJG){noformat}
> Stack trace of the test was:
> {noformat}
> query_test/test_insert.py:138: in test_insert
> multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 1)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:659:
>  in run_test_case
> result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:594:
>  in __exec_in_impala
> result = self.__execute_query(target_impalad_client, query, user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:934:
>  in __execute_query
> return impalad_client.execute(query, user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_connection.py:205:
>  in execute
> return self.__beeswax_client.execute(sql_stmt, user=user)
> /data/jenkins/workspace/impala-asf-master-core

[jira] [Commented] (IMPALA-9463) TestInsertQueries.test_insert failed on S3

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060981#comment-17060981
 ] 

Norbert Luksa commented on IMPALA-9463:
---

The failed build was: 
[impala-asf-master-core-s3|https://master-02.jenkins.cloudera.com/job/impala-asf-master-core-s3/578/]

> TestInsertQueries.test_insert failed on S3
> --
>
> Key: IMPALA-9463
> URL: https://issues.apache.org/jira/browse/IMPALA-9463
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Zoltán Borók-Nagy
>Priority: Major
>  Labels: broken-build
>
> query_test.test_insert.TestInsertQueries.test_insert failed on S3 with the 
> following error:
> {noformat}
> ImpalaBeeswaxException: ImpalaBeeswaxException:
> Query aborted:InternalException: Error adding partitions CAUSED BY:
>   MetaException: java.io.IOException: Got exception: 
> java.io.FileNotFoundException getVersionMarkerItem on ../VERSION:
> com.amazonaws.services.dynamodbv2.model.ResourceNotFoundException:
>   Requested resource not found (Service: AmazonDynamoDBv2;
> Status Code: 400;
> Error Code: ResourceNotFoundException;
> Request ID: 
> 3NHON9GO4LOH94UN11B7KJ81KBVV4KQNSO5AEMVJF66Q9ASUAAJG){noformat}
> Stack trace of the test was:
> {noformat}
> query_test/test_insert.py:138: in test_insert
> multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 1)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:659:
>  in run_test_case
> result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:594:
>  in __exec_in_impala
> result = self.__execute_query(target_impalad_client, query, user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_test_suite.py:934:
>  in __execute_query
> return impalad_client.execute(query, user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/common/impala_connection.py:205:
>  in execute
> return self.__beeswax_client.execute(sql_stmt, user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:187:
>  in execute
> handle = self.__execute_query(query_string.strip(), user=user)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:365:
>  in __execute_query
> self.wait_for_finished(handle)
> /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py:386:
>  in wait_for_finished
> raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
> E   ImpalaBeeswaxException: ImpalaBeeswaxException:
> EQuery aborted:InternalException: Error adding partitions
> E   CAUSED BY: MetaException: java.io.IOException: Got exception: 
> java.io.FileNotFoundException getVersionMarkerItem on ../VERSION: 
> com.amazonaws.services.dynamodbv2.model.ResourceNotFoundException: Requested 
> resource not found (Service: AmazonDynamoDBv2; Status Code: 400; Error Code: 
> ResourceNotFoundException; Request ID: 
> 3NHON9GO4LOH94UN11B7KJ81KBVV4KQNSO5AEMVJF66Q9ASUAAJG){noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9446) query_test.test_compressed_formats.TestReadZtsdLibCompressedFile fails on S3

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060993#comment-17060993
 ] 

Norbert Luksa commented on IMPALA-9446:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9446: Fix bug that impala failed to read zstd file on s3

S3 filesystem doesn't have a block concept, so scheduler splits each
file into a smaller scan ranges. Adding ZSTD as a case in the switch
statement to add support for zstd.

Testing done:
In mini-cluster, read external table located on s3 zstd file.
Run end to end test successfully on s3.

Change-Id: I13f8837fda7454ddb4bb47d20a675d6315a3462d
Reviewed-on: http://gerrit.cloudera.org:8080/15391
Reviewed-by: Impala Public Jenkins 
Tested-by: Tim Armstrong 

> query_test.test_compressed_formats.TestReadZtsdLibCompressedFile fails on S3
> 
>
> Key: IMPALA-9446
> URL: https://issues.apache.org/jira/browse/IMPALA-9446
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.4.0
>Reporter: Joe McDonnell
>Assignee: Xiaomeng Zhang
>Priority: Blocker
>  Labels: broken-build
>
> On the S3 configuration, 
> query_test.test_compressed_formats.TestReadZtsdLibCompressedFile is 
> encountering the following error:
>  
> {noformat}
> query_test/test_compressed_formats.py:325: in test_query_large_file
> result = self.client.execute("select count(*) from %s" % 
> self.COMPRESSED_TABLE_NAME)
> common/impala_connection.py:205: in execute
> return self.__beeswax_client.execute(sql_stmt, user=user)
> beeswax/impala_beeswax.py:187: in execute
> handle = self.__execute_query(query_string.strip(), user=user)
> beeswax/impala_beeswax.py:365: in __execute_query
> self.wait_for_finished(handle)
> beeswax/impala_beeswax.py:386: in wait_for_finished
> raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
> E   ImpalaBeeswaxException: ImpalaBeeswaxException:
> EQuery aborted:Rejected query from pool default-pool: Error during 
> scheduling: Invalid file descriptor compression code: 10{noformat}
> The error comes from here:
>  
> [https://github.com/apache/impala/blob/master/be/src/util/flat_buffer.cc#L63]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9430) Kerberos configs should be passed through to Kerberos libraries even if principal is not set

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060994#comment-17060994
 ] 

Norbert Luksa commented on IMPALA-9430:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9430: always pass through kerberos configs

The behaviour of kerberos-related command line flags is changed so that
their values are always passed through to underlying libraries,
even if Kerberos isn't enabled for internal communication in Impala.

This is good because:
* Various libraries that communicate with external systems may use
  kerberos for outgoing connections, if *incoming* connections are
  not authenticated.
  e.g. it might just be enabled for HMS. Having them pick up different
  kerberos settings for outgoing connections if kerberos is disabled
  for incoming connections is a little weird. This
  is a safer default that reduces chances of inadvertant
  misconfigurations.
* It matches the documentation of the flags.

Some validations are still disabled when --principal is not set,
e.g. we don't check the replay cache directory. This is to avoid
any potential regressions or startup failures on non-kerberised
clusters.

Testing:
Added unit tests for flag validation and env var setting on the
code paths that I touched.

Change-Id: If4bb311c7ab7173232aab36c5ed801f93f38f5b9
Reviewed-on: http://gerrit.cloudera.org:8080/15340
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 

> Kerberos configs should be passed through to Kerberos libraries even if 
> principal is not set
> 
>
> Key: IMPALA-9430
> URL: https://issues.apache.org/jira/browse/IMPALA-9430
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: kerberos, security
> Fix For: Impala 3.4.0
>
>
> InitKerberosEnv() configures native and JDK kerberos implementations based on 
> command-line flags: 
> https://github.com/apache/impala/blob/d1b42c836c3458a2ef3662c0b0b1fd8fbf8f2baf/be/src/rpc/authentication.cc#L866
>  . It only does this when --principal is set.
> It's possible that Impala can be set up to use kerberos to communicate with 
> some external services, e.g. HMS or Hive, even if --principal is not set, 
> since those clients read in config XML files that are independent of the 
> Impala flags. This isn't a recommended configuration and requires a fair bit 
> of expertise to get right, but I think it's very surprising that the configs 
> *don't* get passed through in the case. The documentation doesn't mention 
> this behaviour.
> The suggested change here is to apply the config changes independent of the 
> value of --principal. It should be a noop if kerberos is not configured for 
> any services.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9359) Recover gracefully from corrupt kerberos credential cache

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060995#comment-17060995
 ] 

Norbert Luksa commented on IMPALA-9359:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9359: recover from corrupt kerberos ccache

This is a clean cherry-pick of KUDU-3050. The original commit
message is below.

KUDU-3050: recover from corrupt kerberos ccache

This handles two failure modes:
* krb5_cc_start_seq_get() can fail if the kerberos credential cache gets
  corrupted on disk, e.g. is truncated.
* the renewal can fail to find a credential in the credential cache,
  either if it is missing or the renewal thread hits an error while
  reading through credentials.

Also add some additional logging and limit the max backoff time
to make it easier to debug other kinds of renewal errors.

The test triggers a pre-existing memory leak bug in some older
Kerberos libraries. Added a suppression for leak sanitizer
to ClientNegotiation::CheckGSSAPI() to suppress it.

Test:
Add a test that exercises the recovery logic after truncating
the credential cache. The test failed before this change.

Change-Id: I86567f16816d1c6729679398ce56296744cb30c9
Reviewed-on: http://gerrit.cloudera.org:8080/15407
Reviewed-by: Thomas Tauber-Marshall 
Tested-by: Impala Public Jenkins 

> Recover gracefully from corrupt kerberos credential cache
> -
>
> Key: IMPALA-9359
> URL: https://issues.apache.org/jira/browse/IMPALA-9359
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Security
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: kerberos
> Fix For: Impala 3.4.0
>
>
> # Start up a kerberized Impala cluster
> # Corrupt the kerberos ticket cache used by impala /tmp/krb5cc_impala_internal
> # Observe queries fail. The details depend a lot on timing, etc. I have seen 
> communication failures between impalads and with other systems, e.g. HDFS.
> # The system will stay wedge in this state indefinitely
> We have seen this happen once in production from /tmp filling up.
> I prototyped a fix that amounts to re-running Kinit() to blow away the broken 
> credential cache. It needs more work to be production-ready



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9456) Allow disabling kerberos for incoming internal and external connections even if --principal is set

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060996#comment-17060996
 ] 

Norbert Luksa commented on IMPALA-9456:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9456: allow disabling kerberos selectively

There are specific use cases where we need to talk to kerberized
services (HMS, etc) and want to keep our TGT up to date using Impala's
kinit infrastructure, but don't want to kerberize all connections.

Adds --skip_internal_kerberos_auth and --skip_external_kerberos_auth
to disable Kerberos authentication for incoming connections even if
--principal is set. The daemon does a kinit and keeps tickets
up-to-date with the background thread even if kerberos is disabled
for all incoming connections.

Behaviour only changes when those flags are toggled.

The change required restructuring the code a bit, specifically
pulling the call to InitKerberosForServer() out of
SecureAuthProvider, which I think is a net improvement that makes
the control flow clearer.

Testing:
Add unit tests to:
* confirm that the kinit occurs even with auth disabled.
* confirm that incoming KRPC connections do not require authentication.

I would have liked to add automated tests for thrift interfaces but did
not have the infrastructure to unit-test it. I think the changes I made
are fairly low risk because they do not increase the number of code paths
in AuthManager::Init() and can be verified by inspection. The tests I
would have liked to add are:
* confirm that incoming external connections do not require auth
* confirm that internal thrift connections do not require auth.

Manually started kerberized minicluster with internal/external kerberos
disabled, e.g. with the command line:

  start-impala-cluster.py
  --impalad_args=--skip_external_kerberos_auth=true
  --impalad_args=--skip_internal_kerberos_auth=true
  --state_store_args=--skip_internal_kerberos_auth=true
  --catalogd_args=--skip_internal_kerberos_auth=true

Confirmed that impala-shell connected without -k when
--skip_external_kerberos_auth=true and requires -k otherwise.
Confirmed that we could run queries against HDFS tables even
with internal and external auth disabled. Checked logs to see
that tickets were reacquired.

  I0303 08:33:49.319911 16079 init.cc:303] Successfully reacquired a new 
kerberos TGT

Tested that a partially kerberised minicluster worked as expected, i.e.
that impalad <-> catalog/statestore connections can have auth disabled
so that the impalad can authenticate even if it does not have the right
principal set. The first start-impala-cluster.py command below succeeds
but the second and third fail because they cannot authenticate with the
catalog and statestore respectively because the processes did not kinit
as the impala principal.

  kinit -kt impala.keytab tarmstrong/localh...@example.com
  start-impala-cluster.py --impalad_args='--principal="" --be_principal=""
  --keytab_file="" --krb5_ccname="/tmp/krb5cc_1000"'
  --state_store_args=--skip_internal_kerberos_auth=true
  --catalogd_args=--skip_internal_kerberos_auth=true
  kinit -kt impala.keytab tarmstrong/localh...@example.com
  start-impala-cluster.py --impalad_args='--principal="" --be_principal=""
  --keytab_file="" --krb5_ccname="/tmp/krb5cc_1000"'
  --state_store_args=--skip_internal_kerberos_auth=false
  --catalogd_args=--skip_internal_kerberos_auth=true
  kinit -kt impala.keytab tarmstrong/localh...@example.com
  start-impala-cluster.py --impalad_args='--principal="" --be_principal=""
  --keytab_file="" --krb5_ccname="/tmp/krb5cc_1000"'
  --state_store_args=--skip_internal_kerberos_auth=false
  --catalogd_args=--skip_internal_kerberos_auth=true

Change-Id: I3b1c641e05e588287e4d9d9cd8389d96fc71cf74
Reviewed-on: http://gerrit.cloudera.org:8080/15351
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 

> Allow disabling kerberos for incoming internal and external connections even 
> if --principal is set
> --
>
> Key: IMPALA-9456
> URL: https://issues.apache.org/jira/browse/IMPALA-9456
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: kerberos, security
> Fix For: Impala 3.4.0
>
>
> This would be useful in cases where we need to talk to kerberized services 
> (HMS, etc) and want to keep our TGT up to date using Impala's kinit 
> infrastructure, but don't want to kerberize all connections.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e

[jira] [Commented] (IMPALA-8800) Add DATE type support to Kudu scanner

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-8800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060997#comment-17060997
 ] 

Norbert Luksa commented on IMPALA-8800:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-8800: Added support of Kudu DATE type to Impala

This patch supports reading and writing DATE values
to Kudu tables. It does not add min-max filter runtime
support, but there is followup JIRA IMPALA-9294.
Corresponding Kudu JIRA is KUDU-2632.

Change-Id: I91656749a58ac769b54c2a63bdd4f85c89520b32
Reviewed-on: http://gerrit.cloudera.org:8080/14705
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 

> Add DATE type support to Kudu scanner
> -
>
> Key: IMPALA-8800
> URL: https://issues.apache.org/jira/browse/IMPALA-8800
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Attila Jeges
>Assignee: Attila Jeges
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8800) Add DATE type support to Kudu scanner

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-8800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060999#comment-17060999
 ] 

Norbert Luksa commented on IMPALA-8800:
---

[~attilaj] could you have a look at IMPALA-9513?

> Add DATE type support to Kudu scanner
> -
>
> Key: IMPALA-8800
> URL: https://issues.apache.org/jira/browse/IMPALA-8800
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Attila Jeges
>Assignee: Attila Jeges
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9490) DOC: Include brief statement of support for reading Apache Hudi optimized table

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061000#comment-17061000
 ] 

Norbert Luksa commented on IMPALA-9490:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9490 [DOCS] State support for reading Apache Hudi optimized table

Added a row impala_file_format.xml, created impala_hudi.xml, added new file to 
ditamap, changed id of concept tag from orc to hudi.

Change-Id: If210cd545a8deb059e66fd36e62e0df4402fc96c
Reviewed-on: http://gerrit.cloudera.org:8080/15418
Tested-by: Impala Public Jenkins 
Reviewed-by: Tim Armstrong 

> DOC: Include brief statement of support for reading Apache Hudi optimized 
> table
> ---
>
> Key: IMPALA-9490
> URL: https://issues.apache.org/jira/browse/IMPALA-9490
> Project: IMPALA
>  Issue Type: Documentation
>  Components: Backend
>Affects Versions: Impala 3.4.0
>Reporter: Kris Hahn
>Assignee: Kris Hahn
>Priority: Major
>
> Document experimental support for Apache Hudi Read Optimized Table. See 
> IMPALA-8778.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9369) Inserts on large tables could be very slow when event processing it turned on

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061001#comment-17061001
 ] 

Norbert Luksa commented on IMPALA-9369:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9369: Make createInsertEvents() async.

This patch makes the createInsertEvents() method async to avoid
blocking the insert code path for long periods for tables with
large number of partitions and files.

Currently the createInsertEvents() method fires the HMS insert
event one partition at a time. This makes insert statements
with thousands of new files significantly slower. This change
makes the createInsertEvent() call asynchronous by making it
run in a separate thread.

Testing:
- Ran MetastoreEventsProcessorTest#testInsertEvents.
- Ran test_events_processing::test_insert_events.

Change-Id: I97802a5c03abc067fccf9e3a9d0047324626706e
Reviewed-on: http://gerrit.cloudera.org:8080/15263
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 

> Inserts on large tables could be very slow when event processing it turned on
> -
>
> Key: IMPALA-9369
> URL: https://issues.apache.org/jira/browse/IMPALA-9369
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Catalog
>Reporter: Vihang Karajgaonkar
>Assignee: Anurag Mantripragada
>Priority: Critical
>
> In case where large number files are being inserted into a table, the 
> {{createInsertEvents}} method fires insert events to HMS for each partition 
> one take a time. This could be very slow for a insert statement which is 
> added hundreds or thousands of files.
> We should see if we can fire the insert events asynchronously instead of 
> blocking the query from returning to the user until all the insert events are 
> fired.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9504) DOC: Remove "experimental" from description of ORC support.

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061003#comment-17061003
 ] 

Norbert Luksa commented on IMPALA-9504:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9446: Fix bug that impala failed to read zstd file on s3

S3 filesystem doesn't have a block concept, so scheduler splits each
file into a smaller scan ranges. Adding ZSTD as a case in the switch
statement to add support for zstd.

Testing done:
In mini-cluster, read external table located on s3 zstd file.
Run end to end test successfully on s3.

Change-Id: I13f8837fda7454ddb4bb47d20a675d6315a3462d
Reviewed-on: http://gerrit.cloudera.org:8080/15391
Reviewed-by: Impala Public Jenkins 
Tested-by: Tim Armstrong 

> DOC: Remove "experimental" from description of ORC support.
> ---
>
> Key: IMPALA-9504
> URL: https://issues.apache.org/jira/browse/IMPALA-9504
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Kris Hahn
>Assignee: Kris Hahn
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9513) query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails on exhaustive tests

2020-03-17 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-9513:
--
Labels: build-failure  (was: )

> query_test.test_kudu.TestKuduOperations.test_column_storage_attributes fails 
> on exhaustive tests
> 
>
> Key: IMPALA-9513
> URL: https://issues.apache.org/jira/browse/IMPALA-9513
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Norbert Luksa
>Priority: Blocker
>  Labels: build-failure
>
> Encountered the mentioned test failures in recent exhaustive tests. The 
> failed assertion is: 
> {code:java}
> query_test/test_kudu.py:436: in test_column_storage_attributes assert 
> cursor.fetchall() == \ E assert [(0, True, 0, 0, 0, 0, ...)] == [(0, True, 0, 
> 0, 0, 0, ...)] E At index 0 diff: (0, True, 0, 0, 0, 0, 0.0, 0.0, '0', 
> datetime.datetime(2009, 1, 1, 0, 0), Decimal('0'), '2010-01-01') != (0, True, 
> 0, 0, 0, 0, 0.0, 0.0, '0', datetime.datetime(2009, 1, 1, 0, 0), 0, 
> datetime.date(2010, 1, 1)) E
> {code}
> Looks like it is caused by https://gerrit.cloudera.org/#/c/14705/.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9467) Impala Doc: Improve Impala shell usability by enabling live_progress in the interactive mode

2020-03-17 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061112#comment-17061112
 ] 

Norbert Luksa commented on IMPALA-9467:
---

Looks like ASF Jira bot failed to copy the commit message, so here it is for 
reference:
IMPALA-9467: [DOCS] live_progress enabled by default in interactive mode

The following documents were impacted by the change:
- impala_live_progress.xml, revised to explain new behavior
- impala_shell_options.xml, added --disable_live_progress option

Change-Id: I94e624b7bb916ecb5aeb4f007c0610807f7b18cf
Reviewed-on: http://gerrit.cloudera.org:8080/15442
Tested-by: Impala Public Jenkins 
Reviewed-by: Alice Fan 
Reviewed-by: Joe McDonnell 

> Impala Doc: Improve Impala shell usability by enabling live_progress in the 
> interactive mode
> 
>
> Key: IMPALA-9467
> URL: https://issues.apache.org/jira/browse/IMPALA-9467
> Project: IMPALA
>  Issue Type: Documentation
>  Components: Clients
>Reporter: Alice Fan
>Assignee: Kris Hahn
>Priority: Major
>
> https://gerrit.cloudera.org/#/c/15219/
> We enable shell option live_progress in interactive mode by default. As for 
> in the non-interactive mode, live reporting is not supported. Impala-shell 
> will disable live_progress if the mode is detected. Need to update the doc to 
> reflect the changes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9114) TestKuduHMSIntegration failing: Kudu create table failing

2020-03-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062590#comment-17062590
 ] 

Norbert Luksa commented on IMPALA-9114:
---

Found a similar exception during a [data-loading 
test|https://master-02.jenkins.cloudera.com/job/impala-asf-master-core-data-load/941/]:
{code:sql}
ERROR: CREATE TABLE functional_kudu.alltypes (
  id INT PRIMARY KEY,
  bool_col BOOLEAN,
  tinyint_col TINYINT,
  smallint_col SMALLINT,
  int_col INT,
  bigint_col BIGINT,
  float_col FLOAT,
  double_col DOUBLE,
  date_string_col STRING,
  string_col STRING,
  timestamp_col TIMESTAMP,
  year INT,
  month INT
)
PARTITION BY HASH (id) PARTITIONS 3 STORED AS KUDU


Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/bin/load-data.py",
 line 208, in exec_impala_query_from_file
result = impala_client.execute(query)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 187, in execute
handle = self.__execute_query(query_string.strip(), user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 363, in __execute_query
handle = self.execute_query_async(query_string, user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 357, in execute_query_async
handle = self.__do_rpc(lambda: self.imp_service.query(query,))
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 520, in __do_rpc
raise ImpalaBeeswaxException(self.__build_error_message(b), b)
ImpalaBeeswaxException: ImpalaBeeswaxException:
 INNER EXCEPTION: 
 MESSAGE: AnalysisException: Cannot analyze Kudu table 'alltypes': Error 
determining if Kudu's integration with the Hive Metastore is enabled: too many 
attempts: KuduRpc(method=getHiveMetastoreConfig, tablet=null, attempt=101, 
TimeoutTracker(timeout=18, elapsed=171248), Trace Summary(170894 ms): 
Sent(0), Received(0), Delayed(100), MasterRefresh(0), AuthRefresh(0), 
Truncated: false
 Delayed: (UNKNOWN, [ getHiveMetastoreConfig, 100 ]))

{code}

> TestKuduHMSIntegration failing: Kudu create table failing
> -
>
> Key: IMPALA-9114
> URL: https://issues.apache.org/jira/browse/IMPALA-9114
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 3.4.0
>Reporter: Bikramjeet Vig
>Assignee: Hao Hao
>Priority: Minor
>  Labels: broken-build
>
> {noformat}
> Error Message
> ImpalaBeeswaxException: ImpalaBeeswaxException:  INNER EXCEPTION:  'beeswaxd.ttypes.BeeswaxException'>  MESSAGE: AnalysisException: Cannot 
> analyze Kudu table 't': Error determining if Kudu's integration with the Hive 
> Metastore is enabled: cannot complete before timeout: 
> KuduRpc(method=getHiveMetastoreConfig, tablet=null, attempt=97, 
> TimeoutTracker(timeout=18, elapsed=178723), Trace Summary(177842 ms): 
> Sent(0), Received(0), Delayed(96), MasterRefresh(0), AuthRefresh(0), 
> Truncated: false  Delayed: (UNKNOWN, [ getHiveMetastoreConfig, 96 ]))
> Stacktrace
> custom_cluster/test_kudu.py:150: in test_create_managed_kudu_tables
> self.run_test_case('QueryTest/kudu_create', vector, 
> use_db=unique_database)
> common/impala_test_suite.py:621: in run_test_case
> result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
> common/impala_test_suite.py:556: in __exec_in_impala
> result = self.__execute_query(target_impalad_client, query, user=user)
> common/impala_test_suite.py:893: in __execute_query
> return impalad_client.execute(query, user=user)
> common/impala_connection.py:205: in execute
> return self.__beeswax_client.execute(sql_stmt, user=user)
> beeswax/impala_beeswax.py:187: in execute
> handle = self.__execute_query(query_string.strip(), user=user)
> beeswax/impala_beeswax.py:362: in __execute_query
> handle = self.execute_query_async(query_string, user=user)
> beeswax/impala_beeswax.py:356: in execute_query_async
> handle = self.__do_rpc(lambda: self.imp_service.query(query,))
> beeswax/impala_beeswax.py:519: in __do_rpc
> raise ImpalaBeeswaxException(self.__build_error_message(b), b)
> E   ImpalaBeeswaxException: ImpalaBeeswaxException:
> EINNER EXCEPTION: 
> EMESSAGE: AnalysisException: Cannot analyze Kudu table 't': Error 
> determining if Kudu's integration with the Hive Metastore is enabled: cannot 
> complete before timeout: KuduRpc(method=getHiveMetastoreConfig, tablet=null, 
> attempt=97, TimeoutTracker(timeout=18, elapsed=178723), Trace 
> Summary(177842 ms): Sent(0), Received(0), Delayed(96), MasterRefresh(0), 
> AuthRefresh(0), Truncated:

[jira] [Comment Edited] (IMPALA-9114) TestKuduHMSIntegration failing: Kudu create table failing

2020-03-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062590#comment-17062590
 ] 

Norbert Luksa edited comment on IMPALA-9114 at 3/19/20, 1:56 PM:
-

Found a similar exception during a [data-loading 
test|https://master-02.jenkins.cloudera.com/job/impala-asf-master-core-data-load/941/]
 in create-functional-query-exhaustive-impala-generated-kudu-none-none.sql:
{code:sql}
ERROR: CREATE TABLE functional_kudu.alltypes (
  id INT PRIMARY KEY,
  bool_col BOOLEAN,
  tinyint_col TINYINT,
  smallint_col SMALLINT,
  int_col INT,
  bigint_col BIGINT,
  float_col FLOAT,
  double_col DOUBLE,
  date_string_col STRING,
  string_col STRING,
  timestamp_col TIMESTAMP,
  year INT,
  month INT
)
PARTITION BY HASH (id) PARTITIONS 3 STORED AS KUDU


Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/bin/load-data.py",
 line 208, in exec_impala_query_from_file
result = impala_client.execute(query)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 187, in execute
handle = self.__execute_query(query_string.strip(), user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 363, in __execute_query
handle = self.execute_query_async(query_string, user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 357, in execute_query_async
handle = self.__do_rpc(lambda: self.imp_service.query(query,))
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 520, in __do_rpc
raise ImpalaBeeswaxException(self.__build_error_message(b), b)
ImpalaBeeswaxException: ImpalaBeeswaxException:
 INNER EXCEPTION: 
 MESSAGE: AnalysisException: Cannot analyze Kudu table 'alltypes': Error 
determining if Kudu's integration with the Hive Metastore is enabled: too many 
attempts: KuduRpc(method=getHiveMetastoreConfig, tablet=null, attempt=101, 
TimeoutTracker(timeout=18, elapsed=171248), Trace Summary(170894 ms): 
Sent(0), Received(0), Delayed(100), MasterRefresh(0), AuthRefresh(0), 
Truncated: false
 Delayed: (UNKNOWN, [ getHiveMetastoreConfig, 100 ]))

{code}


was (Author: norbertluksa):
Found a similar exception during a [data-loading 
test|https://master-02.jenkins.cloudera.com/job/impala-asf-master-core-data-load/941/]:
{code:sql}
ERROR: CREATE TABLE functional_kudu.alltypes (
  id INT PRIMARY KEY,
  bool_col BOOLEAN,
  tinyint_col TINYINT,
  smallint_col SMALLINT,
  int_col INT,
  bigint_col BIGINT,
  float_col FLOAT,
  double_col DOUBLE,
  date_string_col STRING,
  string_col STRING,
  timestamp_col TIMESTAMP,
  year INT,
  month INT
)
PARTITION BY HASH (id) PARTITIONS 3 STORED AS KUDU


Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/bin/load-data.py",
 line 208, in exec_impala_query_from_file
result = impala_client.execute(query)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 187, in execute
handle = self.__execute_query(query_string.strip(), user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 363, in __execute_query
handle = self.execute_query_async(query_string, user=user)
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 357, in execute_query_async
handle = self.__do_rpc(lambda: self.imp_service.query(query,))
  File 
"/data/jenkins/workspace/impala-asf-master-core-data-load/repos/Impala/tests/beeswax/impala_beeswax.py",
 line 520, in __do_rpc
raise ImpalaBeeswaxException(self.__build_error_message(b), b)
ImpalaBeeswaxException: ImpalaBeeswaxException:
 INNER EXCEPTION: 
 MESSAGE: AnalysisException: Cannot analyze Kudu table 'alltypes': Error 
determining if Kudu's integration with the Hive Metastore is enabled: too many 
attempts: KuduRpc(method=getHiveMetastoreConfig, tablet=null, attempt=101, 
TimeoutTracker(timeout=18, elapsed=171248), Trace Summary(170894 ms): 
Sent(0), Received(0), Delayed(100), MasterRefresh(0), AuthRefresh(0), 
Truncated: false
 Delayed: (UNKNOWN, [ getHiveMetastoreConfig, 100 ]))

{code}

> TestKuduHMSIntegration failing: Kudu create table failing
> -
>
> Key: IMPALA-9114
> URL: https://issues.apache.org/jira/browse/IMPALA-9114
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 3.4.0
>Reporter: Bikramjeet Vig
>Assignee: Hao Hao
>Priority: Minor
>

[jira] [Commented] (IMPALA-7181) Fix flaky test shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening

2020-03-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062600#comment-17062600
 ] 

Norbert Luksa commented on IMPALA-7181:
---

This happened again in 
[impala-cdpd-master-core:|https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core/393/]
{code:java}
/data/jenkins/workspace/impala-cdpd-master-core/repos/Impala/tests/shell/test_shell_commandline.py:822:
 in test_socket_opening
self._validate_expected_socket_connected(vector, args2, s)
/data/jenkins/workspace/impala-cdpd-master-core/repos/Impala/tests/shell/test_shell_commandline.py:792:
 in _validate_expected_socket_connected
connection, client_address = sock.accept()
/usr/lib64/python2.7/socket.py:202: in accept
sock, addr = self._sock.accept()
E   error: [Errno 4] Interrupted system call
{code}

> Fix flaky test 
> shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening 
> -
>
> Key: IMPALA-7181
> URL: https://issues.apache.org/jira/browse/IMPALA-7181
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.13.0, Impala 3.1.0
>Reporter: Vincent Tran
>Assignee: Vincent Tran
>Priority: Blocker
>  Labels: broken-build
> Fix For: Impala 2.13.0, Impala 3.1.0
>
>
> shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening uses 
> netcat to listen to an ephemeral port to verify impala-shell socket opening 
> behavior.
> The port is hardcoded to 42000 which can fail the test if this port is used 
> by an outgoing socket.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7181) Fix flaky test shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening

2020-03-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062604#comment-17062604
 ] 

Norbert Luksa commented on IMPALA-7181:
---

[~thundergun] could you take a look?

> Fix flaky test 
> shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening 
> -
>
> Key: IMPALA-7181
> URL: https://issues.apache.org/jira/browse/IMPALA-7181
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.13.0, Impala 3.1.0
>Reporter: Vincent Tran
>Assignee: Vincent Tran
>Priority: Blocker
>  Labels: broken-build
> Fix For: Impala 2.13.0, Impala 3.1.0
>
>
> shell/test_shell_commandline.py::TestImpalaShell::test_socket_opening uses 
> netcat to listen to an ephemeral port to verify impala-shell socket opening 
> behavior.
> The port is hardcoded to 42000 which can fail the test if this port is used 
> by an outgoing socket.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-9572) Impalad crash when process decimal value

2020-03-30 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-9572:
-

Assignee: Csaba Ringhofer  (was: Norbert Luksa)

> Impalad crash when process decimal value
> 
>
> Key: IMPALA-9572
> URL: https://issues.apache.org/jira/browse/IMPALA-9572
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 4.0
>Reporter: Yongzhi Chen
>Assignee: Csaba Ringhofer
>Priority: Blocker
>  Labels: broken-build, crash
> Attachments: 
> impalad.impala-ec2-centos74-m5-4xlarge-ondemand-1a73.vpc.cloudera.com.jenkins.log.ERROR.20200328-031659.18222.gz
>
>
> In impala-asf-master-exhaustive build, minidump shows impalad crashed:
> Crash reason:  SIGABRT
> Crash address: 0x7d1472e
> Process uptime: not available
> Thread 396 (crashed)
>  0  libc-2.17.so + 0x351f7
> rax = 0x   rdx = 0x0006
> rcx = 0x   rbx = 0x07246600
> rsi = 0x4273   rdi = 0x472e
> rbp = 0x7f6e43ee7050   rsp = 0x7f6e43ee6ce8
>  r8 = 0xr9 = 0x7f6e43ee6b60
> r10 = 0x0008   r11 = 0x0206
> r12 = 0x07246680   r13 = 0x0044
> r14 = 0x0724dfc4   r15 = 0x07246600
> rip = 0x7f6f2b1da1f7
> Found by: given as instruction pointer in context
>  1  libc-2.17.so + 0x368e8
> rbp = 0x7f6e43ee7050   rsp = 0x7f6e43ee6cf0
> rip = 0x7f6f2b1db8e8
> Found by: stack scanning
>  2  impalad!google_breakpad::ExceptionHandler::HandleSignal(int, siginfo_t*, 
> void*) + 0x1e0
> rbp = 0x7f6e43ee7050   rsp = 0x7f6e43ee6d78
> rip = 0x04ed0840
> Found by: stack scanning
>  3  impalad!google::DumpStackTraceAndExit() + 0x24
> rbp = 0x7f6e43ee7050   rsp = 0x7f6e43ee6e20
> rip = 0x04ea2554
> Found by: stack scanning
>  4  impalad!google::LogMessage::Fail() + 0xd
> rbx = 0x07246600   rbp = 0x7f6e43ee7050
> rsp = 0x7f6e43ee6ed0   rip = 0x04e98fad
> Found by: call frame info
>  5  impalad!google::LogMessage::SendToLog() + 0x2b2
> rbx = 0x07246600   rbp = 0x7f6e43ee7050
> rsp = 0x7f6e43ee6ee0   rip = 0x04e9a852
> Found by: call frame info
>  6  impalad!google::LogMessage::Flush() + 0x157
> rbx = 0x7f6e43ee7090   rbp = 0x7f6f2bd8a6a0
> rsp = 0x7f6e43ee7060   r12 = 0x7f6e43ee707f
> r13 = 0x072554f8   r14 = 0x7f6e43ee7120
> r15 = 0x18954c50   rip = 0x04e98987
> Found by: call frame info
>  7  impalad!google::LogMessageFatal::~LogMessageFatal() + 0xe
> rbx = 0x7f6e43ee7120   rbp = 0x7f6e43ee7160
> rsp = 0x7f6e43ee70e0   r12 = 0x0001
> r13 = 0x072554f8   r14 = 0x000118ba
> r15 = 0x18954c50   rip = 0x04e9bf4e
> Found by: call frame info
>  8  impalad!impala::StrideWriter::StrideWriter(long*, long) [mem-util.h 
> : 40 + 0xd]
> rbx = 0x0001   rbp = 0x7f6e43ee7160
> rsp = 0x7f6e43ee7100   r12 = 0x0001
> r13 = 0x072554f8   r14 = 0x000118ba
> r15 = 0x18954c50   rip = 0x02c07535
> Found by: call frame info
>  9  
> impalad!impala::BaseScalarColumnReader::FillPositionsInCandidateRange(int, 
> int, unsigned char*, int) [parquet-column-readers.cc : 1319 + 0x1f]
> rbx = 0x17d4   rbp = 0x7f6e43ee7270
> rsp = 0x7f6e43ee7170   r12 = 0x
> r13 = 0x0039   r14 = 0x000118ba
> r15 = 0x18954c50   rip = 0x02c008d3
> Found by: call frame info
> 10  impalad!impala::ScalarColumnReader, 
> (parquet::Type::type)7, true>::MaterializeValueBatchRepeatedDefLevel(int, 
> int, unsigned char*, int*) [parquet-column-readers.cc : 612 + 0x2b]
> rbx = 0x0007   rbp = 0x7f6e43ee73a0
> rsp = 0x7f6e43ee7280   r12 = 0x
> r13 = 0x0039   r14 = 0x000118ba
> r15 = 0x18954c50   rip = 0x02c731f7
> Found by: call frame info
> 11  impalad!bool impala::ScalarColumnReader, 
> (parquet::Type::type)7, true>::ReadValueBatch(int, int, unsigned char*, 
> int*) [parquet-column-readers.cc : 468 + 0x26]
> rbx = 0x   rbp = 0x7f6e43ee7520
> rsp = 0x7f6e43ee73b0   r12 = 0x0400
> r13 = 0x0039   r14 = 0x000118ba
> r15 = 0x18954c50   rip = 0x02c54db1
> Found by: call frame info
> 12  impalad!impala::ScalarColumnReader, 
> (parquet::Type::type)7, true>::ReadValueBatch(impala::MemPool*, int, int, 
> uns

[jira] [Commented] (IMPALA-8755) Implement Z-ordering for Impala

2020-05-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-8755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17110961#comment-17110961
 ] 

Norbert Luksa commented on IMPALA-8755:
---

Hi, I'll try to make some time to do this this week.

> Implement Z-ordering for Impala
> ---
>
> Key: IMPALA-8755
> URL: https://issues.apache.org/jira/browse/IMPALA-8755
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Implement Z-ordering for Impala: [https://en.wikipedia.org/wiki/Z-order_curve]
> A Z-order curve defines an ordering on multi-dimensional data. Data sorted 
> that way can be efficiently filtered by min/max statistics regarding to the 
> columns participating in the ordering.
> Impala currently only supports lexicographic ordering via the SORT BY clause. 
> This strongly prefers the first column, i.e. given the "SORT BY A, B, C" 
> clause => A will be totally ordered (hence filtering on A will be very 
> efficient), but values belonging to B and C will be scattered throughout the 
> data set (hence filtering on B or C will barely do any good).
> We could add a new clause, e.g. a "ZSORT BY" clause to Impala that writes the 
> data in Z-order.
> "ZSORT BY A, B C" would cluster the rows in a way that filtering on A, B, or 
> C would be equally efficient.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8755) Implement Z-ordering for Impala

2020-06-30 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-8755.
---
Target Version: Impala 4.0
Resolution: Implemented

> Implement Z-ordering for Impala
> ---
>
> Key: IMPALA-8755
> URL: https://issues.apache.org/jira/browse/IMPALA-8755
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Implement Z-ordering for Impala: [https://en.wikipedia.org/wiki/Z-order_curve]
> A Z-order curve defines an ordering on multi-dimensional data. Data sorted 
> that way can be efficiently filtered by min/max statistics regarding to the 
> columns participating in the ordering.
> Impala currently only supports lexicographic ordering via the SORT BY clause. 
> This strongly prefers the first column, i.e. given the "SORT BY A, B, C" 
> clause => A will be totally ordered (hence filtering on A will be very 
> efficient), but values belonging to B and C will be scattered throughout the 
> data set (hence filtering on B or C will barely do any good).
> We could add a new clause, e.g. a "ZSORT BY" clause to Impala that writes the 
> data in Z-order.
> "ZSORT BY A, B C" would cluster the rows in a way that filtering on A, B, or 
> C would be equally efficient.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9912) Move unlock_zorder_sort flag to the graveyard

2020-06-30 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-9912:
-

 Summary: Move unlock_zorder_sort flag to the graveyard
 Key: IMPALA-9912
 URL: https://issues.apache.org/jira/browse/IMPALA-9912
 Project: IMPALA
  Issue Type: Task
Reporter: Norbert Luksa
Assignee: Norbert Luksa






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Reopened] (IMPALA-10054) test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly

2020-08-19 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reopened IMPALA-10054:


> test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly
> ---
>
> Key: IMPALA-10054
> URL: https://issues.apache.org/jira/browse/IMPALA-10054
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Attila Jeges
>Assignee: Riza Suminto
>Priority: Blocker
>  Labels: broken-build, flaky
> Fix For: Impala 4.0
>
>
> test_multiple_sort_run_bytes_limits  introduced in IMPALA-6692 seems to be 
> flaky.
> Jenkins job that triggered the error:
> https://jenkins.impala.io/job/parallel-all-tests-nightly/1173
> Failing job:
> https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/2899/testReport/
> {code}
> Stacktrace
> query_test/test_sort.py:89: in test_multiple_sort_run_bytes_limits
> assert "SpilledRuns: " + spilled_runs in query_result.runtime_profile
> E   assert ('SpilledRuns: ' + '3') in 'Query 
> (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE WARNING: Query profile 
> created while running a DEBUG buil... 27.999ms\n - WriteIoBytes: 
> 0\n - WriteIoOps: 0 (0)\n - WriteIoWaitTime: 
> 0.000ns\n'
> E+  where 'Query (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE 
> WARNING: Query profile created while running a DEBUG buil... 27.999ms\n   
>   - WriteIoBytes: 0\n - WriteIoOps: 0 (0)\n - 
> WriteIoWaitTime: 0.000ns\n' = 
>  0x7f51da77fb50>.runtime_profile
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10054) test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly

2020-08-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180362#comment-17180362
 ] 

Norbert Luksa commented on IMPALA-10054:


[~rizaon] this has reoccurred in 
[https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core-asan/454/]

Could you please have a look at it?

> test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly
> ---
>
> Key: IMPALA-10054
> URL: https://issues.apache.org/jira/browse/IMPALA-10054
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Attila Jeges
>Assignee: Riza Suminto
>Priority: Blocker
>  Labels: broken-build, flaky
> Fix For: Impala 4.0
>
>
> test_multiple_sort_run_bytes_limits  introduced in IMPALA-6692 seems to be 
> flaky.
> Jenkins job that triggered the error:
> https://jenkins.impala.io/job/parallel-all-tests-nightly/1173
> Failing job:
> https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/2899/testReport/
> {code}
> Stacktrace
> query_test/test_sort.py:89: in test_multiple_sort_run_bytes_limits
> assert "SpilledRuns: " + spilled_runs in query_result.runtime_profile
> E   assert ('SpilledRuns: ' + '3') in 'Query 
> (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE WARNING: Query profile 
> created while running a DEBUG buil... 27.999ms\n - WriteIoBytes: 
> 0\n - WriteIoOps: 0 (0)\n - WriteIoWaitTime: 
> 0.000ns\n'
> E+  where 'Query (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE 
> WARNING: Query profile created while running a DEBUG buil... 27.999ms\n   
>   - WriteIoBytes: 0\n - WriteIoOps: 0 (0)\n - 
> WriteIoWaitTime: 0.000ns\n' = 
>  0x7f51da77fb50>.runtime_profile
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-10094) TestResetMetadata.test_refresh_updated_partitions fails due to connection error

2020-08-19 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-10094:
--

 Summary: TestResetMetadata.test_refresh_updated_partitions fails 
due to connection error
 Key: IMPALA-10094
 URL: https://issues.apache.org/jira/browse/IMPALA-10094
 Project: IMPALA
  Issue Type: Bug
Reporter: Norbert Luksa
Assignee: Vihang Karajgaonkar


This has occurred in the last few builds in impala-cdpd-master-staging-core-s3.

Error message:
{code:java}
metadata/test_reset_metadata.py:49: in test_refresh_updated_partitions
"alter table {0} add partition (year=2020, month=8)".format(tbl))
common/impala_test_suite.py:983: in run_stmt_in_hive
raise RuntimeError(stderr)
E   RuntimeError: SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   ERROR StatusLogger No log4j2 configuration file found. Using default 
configuration: logging only errors to the console. Set system property 
'log4j2.debug' to show Log4j2 internal initialization logging.
E   SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   Connecting to jdbc:hive2://localhost:11050
E   20/08/18 05:10:24 [main]: WARN jdbc.HiveConnection: Failed to connect to 
localhost:11050
E   Could not open connection to the HS2 server. Please check the server URI 
and if the URI is correct, then ask the administrator to check the server 
status.
E   Error: Could not open client transport with JDBC Uri: 
jdbc:hive2://localhost:11050: java.net.ConnectException: Connection refused 
(Connection refused) (state=08S01,code=0){code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-10094) TestResetMetadata.test_refresh_updated_partitions fails due to connection error

2020-08-19 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-10094:
---
Description: 
This has occurred in the last few builds in impala-cdpd-master-staging-core-s3: 
[https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-staging-core-s3/14/]

Error message:
{code:java}
metadata/test_reset_metadata.py:49: in test_refresh_updated_partitions
"alter table {0} add partition (year=2020, month=8)".format(tbl))
common/impala_test_suite.py:983: in run_stmt_in_hive
raise RuntimeError(stderr)
E   RuntimeError: SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   ERROR StatusLogger No log4j2 configuration file found. Using default 
configuration: logging only errors to the console. Set system property 
'log4j2.debug' to show Log4j2 internal initialization logging.
E   SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   Connecting to jdbc:hive2://localhost:11050
E   20/08/18 05:10:24 [main]: WARN jdbc.HiveConnection: Failed to connect to 
localhost:11050
E   Could not open connection to the HS2 server. Please check the server URI 
and if the URI is correct, then ask the administrator to check the server 
status.
E   Error: Could not open client transport with JDBC Uri: 
jdbc:hive2://localhost:11050: java.net.ConnectException: Connection refused 
(Connection refused) (state=08S01,code=0){code}
 

  was:
This has occurred in the last few builds in impala-cdpd-master-staging-core-s3.

Error message:
{code:java}
metadata/test_reset_metadata.py:49: in test_refresh_updated_partitions
"alter table {0} add partition (year=2020, month=8)".format(tbl))
common/impala_test_suite.py:983: in run_stmt_in_hive
raise RuntimeError(stderr)
E   RuntimeError: SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   ERROR StatusLogger No log4j2 configuration file found. Using default 
configuration: logging only errors to the console. Set system property 
'log4j2.debug' to show Log4j2 internal initialization logging.
E   SLF4J: Class path contains multiple SLF4J bindings.
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/apache-hive-3.1.3000.7.2.2.0-135-bin/lib/log4j-slf4j-impl-2.10.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-cdpd-master-staging-core-s3/Impala-Toolchain/cdp_components-5250295/hadoop-3.1.1.7.2.2.0-135/share/hadoop/common/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
E   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
E   SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
E   Connecting to jdbc:hive2://localhost:11050
E   20/08/18 05:10:24 [main]: WARN jdbc.HiveConnection: Failed to connect to 
localhost:11050
E   Could not open c

[jira] [Commented] (IMPALA-9923) Data loading of TPC-DS ORC fails with "Fail to get checksum"

2020-08-19 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180432#comment-17180432
 ] 

Norbert Luksa commented on IMPALA-9923:
---

This also occurred in functional data loads: 
[https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-exhaustive/454/]

The error from 
load-functional-query-exhaustive-hive-generated-orc-def-block.sql.log:
{code:java}
INFO  : Loading data to table functional_orc_def.complextypestbl_medium from 
hdfs://localhost:20500/test-warehouse/managed/complextypestbl_medium_orc_defINFO
  : Loading data to table functional_orc_def.complextypestbl_medium from 
hdfs://localhost:20500/test-warehouse/managed/complextypestbl_medium_orc_defERROR
 : FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.MoveTask. java.io.IOException: Fail to get 
checksum, since file 
/test-warehouse/managed/complextypestbl_medium_orc_def/base_001/_orc_acid_version
 is under construction.INFO  : Completed executing 
command(queryId=jenkins_20200816064113_1cafbc10-8d60-4acb-8812-567fb8e911c6); 
Time taken: 4.04 secondsINFO  : OKError: Error while compiling statement: 
FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.MoveTask. java.io.IOException: Fail to get 
checksum, since file 
/test-warehouse/managed/complextypestbl_medium_orc_def/base_001/_orc_acid_version
 is under construction. (state=08S01,code=1)java.sql.SQLException: Error while 
compiling statement: FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.MoveTask. java.io.IOException: Fail to get 
checksum, since file 
/test-warehouse/managed/complextypestbl_medium_orc_def/base_001/_orc_acid_version
 is under construction. at 
org.apache.hive.jdbc.HiveStatement.waitForOperationToComplete(HiveStatement.java:401)
 at org.apache.hive.jdbc.HiveStatement.execute(HiveStatement.java:266) at 
org.apache.hive.beeline.Commands.executeInternal(Commands.java:1007) at 
org.apache.hive.beeline.Commands.execute(Commands.java:1217) at 
org.apache.hive.beeline.Commands.sql(Commands.java:1146) at 
org.apache.hive.beeline.BeeLine.dispatch(BeeLine.java:1497) at 
org.apache.hive.beeline.BeeLine.execute(BeeLine.java:1355) at 
org.apache.hive.beeline.BeeLine.executeFile(BeeLine.java:1329) at 
org.apache.hive.beeline.BeeLine.begin(BeeLine.java:1127) at 
org.apache.hive.beeline.BeeLine.begin(BeeLine.java:1082) at 
org.apache.hive.beeline.BeeLine.mainWithInputRedirection(BeeLine.java:546) at 
org.apache.hive.beeline.BeeLine.main(BeeLine.java:528) at 
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498) at 
org.apache.hadoop.util.RunJar.run(RunJar.java:318) at 
org.apache.hadoop.util.RunJar.main(RunJar.java:232)Closing: 0: 
jdbc:hive2://localhost:11050/default;auth=none
{code}

> Data loading of TPC-DS ORC fails with "Fail to get checksum"
> 
>
> Key: IMPALA-9923
> URL: https://issues.apache.org/jira/browse/IMPALA-9923
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Reporter: Tim Armstrong
>Assignee: Zoltán Borók-Nagy
>Priority: Critical
>  Labels: broken-build, flaky
> Attachments: load-tpcds-core-hive-generated-orc-def-block.sql, 
> load-tpcds-core-hive-generated-orc-def-block.sql.log
>
>
> {noformat}
> INFO  : Loading data to table tpcds_orc_def.store_sales partition 
> (ss_sold_date_sk=null) from 
> hdfs://localhost:20500/test-warehouse/managed/tpcds.store_sales_orc_def
> INFO  : 
> ERROR : FAILED: Execution Error, return code 1 from 
> org.apache.hadoop.hive.ql.exec.MoveTask. java.io.IOException: Fail to get 
> checksum, since file 
> /test-warehouse/managed/tpcds.store_sales_orc_def/ss_sold_date_sk=2451646/base_003/_orc_acid_version
>  is under construction.
> INFO  : Completed executing 
> command(queryId=ubuntu_20200707055650_a1958916-1e85-4db5-b1bc-cc63d80b3537); 
> Time taken: 14.512 seconds
> INFO  : OK
> Error: Error while compiling statement: FAILED: Execution Error, return code 
> 1 from org.apache.hadoop.hive.ql.exec.MoveTask. java.io.IOException: Fail to 
> get checksum, since file 
> /test-warehouse/managed/tpcds.store_sales_orc_def/ss_sold_date_sk=2451646/base_003/_orc_acid_version
>  is under construction. (state=08S01,code=1)
> java.sql.SQLException: Error while compiling statement: FAILED: Execution 
> Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask. 
> java.io.IOException: Fail to get checksum, since file 
> /test-warehouse/managed/tpcds.store_sales_orc_def/ss_sold_date_sk=2451646/base_003/_orc_aci

[jira] [Commented] (IMPALA-9920) BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check

2020-08-23 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182829#comment-17182829
 ] 

Norbert Luksa commented on IMPALA-9920:
---

This happened again in

[https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-staging-core-ubsan/15/]

Also, not sure if the same, but I saw similar errors in the log for 

[https://master-02.jenkins.cloudera.com/job/impala-asf-master-core-ubsan/56/]

Attached the logs.

> BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check
> -
>
> Key: IMPALA-9920
> URL: https://issues.apache.org/jira/browse/IMPALA-9920
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 4.0
> Environment: BUILD_TAG
> jenkins-impala-cdpd-master-core-ubsan-108
>Reporter: Wenzhe Zhou
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: broken-build, build, test
> Attachments: buffer-pool-test.ERROR
>
>
> BufferPoolTest.WriteErrorBlacklistHolepunch failed with following error 
> messages:
> Error Message
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
> Stacktrace
> /data/jenkins/workspace/impala-cdpd-master-core-ubsan/repos/Impala/be/src/runtime/bufferpool/buffer-pool-test.cc:1763
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
>  
> Saw following messages in buffer-pool-test.ERROR log file:
>  F0702 14:17:08.745453 8976 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.581707 12727 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.671219 12728 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.840062 12940 buffer-pool.cc:216] Check failed: 
> page_handle->is_pinned() 
> F0702 14:17:15.167520 13459 buffer-pool.cc:493] Check failed: 
> spilling_enabled() 
> F0702 14:17:15.326829 13946 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> E0702 14:17:17.119957 16180 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5.
>  Access denied for the process' user errno=13
> E0702 14:17:17.270885 16570 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e.
>  Access denied for the process' user errno=13
> E0702 14:17:17.436445 16964 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a.
>  Access denied for the process' user errno=13



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Reopened] (IMPALA-9920) BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check

2020-08-23 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reopened IMPALA-9920:
---

> BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check
> -
>
> Key: IMPALA-9920
> URL: https://issues.apache.org/jira/browse/IMPALA-9920
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 4.0
> Environment: BUILD_TAG
> jenkins-impala-cdpd-master-core-ubsan-108
>Reporter: Wenzhe Zhou
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: broken-build, build, test
> Attachments: buffer-pool-test-master-core-ubsan-56.ERROR, 
> buffer-pool-test.ERROR
>
>
> BufferPoolTest.WriteErrorBlacklistHolepunch failed with following error 
> messages:
> Error Message
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
> Stacktrace
> /data/jenkins/workspace/impala-cdpd-master-core-ubsan/repos/Impala/be/src/runtime/bufferpool/buffer-pool-test.cc:1763
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
>  
> Saw following messages in buffer-pool-test.ERROR log file:
>  F0702 14:17:08.745453 8976 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.581707 12727 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.671219 12728 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.840062 12940 buffer-pool.cc:216] Check failed: 
> page_handle->is_pinned() 
> F0702 14:17:15.167520 13459 buffer-pool.cc:493] Check failed: 
> spilling_enabled() 
> F0702 14:17:15.326829 13946 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> E0702 14:17:17.119957 16180 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5.
>  Access denied for the process' user errno=13
> E0702 14:17:17.270885 16570 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e.
>  Access denied for the process' user errno=13
> E0702 14:17:17.436445 16964 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a.
>  Access denied for the process' user errno=13



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9920) BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check

2020-08-23 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-9920:
--
Attachment: buffer-pool-test-master-core-ubsan-56.ERROR

> BufferPoolTest.WriteErrorBlacklistHolepunch failed on FindPageInDir check
> -
>
> Key: IMPALA-9920
> URL: https://issues.apache.org/jira/browse/IMPALA-9920
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 4.0
> Environment: BUILD_TAG
> jenkins-impala-cdpd-master-core-ubsan-108
>Reporter: Wenzhe Zhou
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: broken-build, build, test
> Attachments: buffer-pool-test-master-core-ubsan-56.ERROR, 
> buffer-pool-test.ERROR
>
>
> BufferPoolTest.WriteErrorBlacklistHolepunch failed with following error 
> messages:
> Error Message
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
> Stacktrace
> /data/jenkins/workspace/impala-cdpd-master-core-ubsan/repos/Impala/be/src/runtime/bufferpool/buffer-pool-test.cc:1763
> Value of: FindPageInDir(pages[NO_ERROR_QUERY], good_dir) != NULL
>  Actual: false
> Expected: true
>  
> Saw following messages in buffer-pool-test.ERROR log file:
>  F0702 14:17:08.745453 8976 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.581707 12727 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.671219 12728 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> F0702 14:17:14.840062 12940 buffer-pool.cc:216] Check failed: 
> page_handle->is_pinned() 
> F0702 14:17:15.167520 13459 buffer-pool.cc:493] Check failed: 
> spilling_enabled() 
> F0702 14:17:15.326829 13946 reservation-tracker.cc:389] Check failed: bytes 
> <= unused_reservation() (1024 vs. 0) 
> E0702 14:17:17.119957 16180 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/impala-scratch/:_354de439-8418-4013-8ebe-55214f8396c5.
>  Access denied for the process' user errno=13
> E0702 14:17:17.270885 16570 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_21e8f7c1-2a63-44d5-8a5c-4ba78e9acb6e.
>  Access denied for the process' user errno=13
> E0702 14:17:17.436445 16964 tmp-file-mgr.cc:334] Error for temporary file 
> '/tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a':
>  Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-1884.vpc.cloudera.com:22000: open() 
> failed for 
> /tmp/buffer-pool-test.0/impala-scratch/:_78d2ef68-254a-4738-baf6-6e80b3b1ee2a.
>  Access denied for the process' user errno=13



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9799) Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries

2020-08-23 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182831#comment-17182831
 ] 

Norbert Luksa commented on IMPALA-9799:
---

Saw this again in

 [https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core-asan/456/]

Stacktrace:
{code:java}
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in add_session lambda: fn(self)) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
 in add_session_helper fn() 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in  lambda: fn(self)) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:120:
 in test_query_stmts_v6_with_result_spooling 
self.run_query_stmts_test({'spool_query_results': 'true'}) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
 in run_query_stmts_test self.__test_invalid_result_caching("SELECT COUNT(*) 
FROM functional.alltypes") 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
 in __test_invalid_result_caching assert 0 == 
impalad.get_num_in_flight_queries() E assert 0 == 1 E + where 1 = >() E + where 
> = 
.get_num_in_flight_queries

{code}

> Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries
> -
>
> Key: IMPALA-9799
> URL: https://issues.apache.org/jira/browse/IMPALA-9799
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Quanlong Huang
>Assignee: Sahil Takiar
>Priority: Critical
>  Labels: broken-build
> Fix For: Impala 4.0
>
>
> Saw two failures for this test in different jenkins jobs:
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6 (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in 
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:110:
>  in test_query_stmts_v6
> self.run_query_stmts_test()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
>  in run_query_stmts_test
> self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
> functional.alltypes")
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
>  in __test_invalid_result_caching
> assert 0 == impalad.get_num_in_flight_queries()
> E   assert 0 == 1
> E+  where 1 =  >()
> E+where  > = 
>  0x6d25d10>.get_num_in_flight_queries{code}
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6_with_result_spooling 
> (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in 
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:120:
>  in test_query_stmts_v6_with_result_spooling
> self.run_query_stmts_test({'spool_query_results': 'true'})
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
>  in run_query_stmts_test
> self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
> functional.alltypes")
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
>  in __test_invalid_result_caching
> assert 0 == impalad.get_num_in_flight_queries()
> E   assert 0 == 1
> E+  where 1 =  >()
> E+where  > = 
>  0x81d4990>.get_num_in_flight_queries{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-9799) Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries

2020-08-23 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182831#comment-17182831
 ] 

Norbert Luksa edited comment on IMPALA-9799 at 8/23/20, 7:19 PM:
-

Saw this again in

 [https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core-asan/456/]

Stacktrace:
{code:java}
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in add_session
lambda: fn(self))
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
 in add_session_helper
fn()
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in 
lambda: fn(self))
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:120:
 in test_query_stmts_v6_with_result_spooling
self.run_query_stmts_test({'spool_query_results': 'true'})
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
 in run_query_stmts_test
self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
functional.alltypes")
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
 in __test_invalid_result_caching
assert 0 == impalad.get_num_in_flight_queries()
E   assert 0 == 1
E+  where 1 = >()
E+where > = 
.get_num_in_flight_queries
{code}


was (Author: norbertluksa):
Saw this again in

 [https://master-02.jenkins.cloudera.com/job/impala-cdpd-master-core-asan/456/]

Stacktrace:
{code:java}
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in add_session lambda: fn(self)) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
 in add_session_helper fn() 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
 in  lambda: fn(self)) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:120:
 in test_query_stmts_v6_with_result_spooling 
self.run_query_stmts_test({'spool_query_results': 'true'}) 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
 in run_query_stmts_test self.__test_invalid_result_caching("SELECT COUNT(*) 
FROM functional.alltypes") 
/data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
 in __test_invalid_result_caching assert 0 == 
impalad.get_num_in_flight_queries() E assert 0 == 1 E + where 1 = >() E + where 
> = 
.get_num_in_flight_queries

{code}

> Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries
> -
>
> Key: IMPALA-9799
> URL: https://issues.apache.org/jira/browse/IMPALA-9799
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Quanlong Huang
>Assignee: Sahil Takiar
>Priority: Critical
>  Labels: broken-build
> Fix For: Impala 4.0
>
>
> Saw two failures for this test in different jenkins jobs:
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6 (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in 
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:110:
>  in test_query_stmts_v6
> self.run_query_stmts_test()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
>  in run_query_stmts_test
> self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
> functional.alltypes")
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
>  in __test_invalid_result_caching
> assert 0 == impalad.get_num_in_flight_queries()
> E   assert 0 == 1
> E+  where 1 =  >()
> E+where  > = 
>  0x6d25d10>.get_num_in_flight_queries{code}
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6_with_result_spooling 
> (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan

[jira] [Reopened] (IMPALA-9799) Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries

2020-08-23 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reopened IMPALA-9799:
---

> Flakiness in TestFetchFirst due to wrong results of get_num_in_flight_queries
> -
>
> Key: IMPALA-9799
> URL: https://issues.apache.org/jira/browse/IMPALA-9799
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Quanlong Huang
>Assignee: Sahil Takiar
>Priority: Critical
>  Labels: broken-build
> Fix For: Impala 4.0
>
>
> Saw two failures for this test in different jenkins jobs:
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6 (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in 
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:110:
>  in test_query_stmts_v6
> self.run_query_stmts_test()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
>  in run_query_stmts_test
> self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
> functional.alltypes")
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
>  in __test_invalid_result_caching
> assert 0 == impalad.get_num_in_flight_queries()
> E   assert 0 == 1
> E+  where 1 =  >()
> E+where  > = 
>  0x6d25d10>.get_num_in_flight_queries{code}
> hs2.test_fetch_first.TestFetchFirst.test_query_stmts_v6_with_result_spooling 
> (from pytest)
>  Stacktrace:
> {code:java}
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in add_session
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:44:
>  in add_session_helper
> fn()
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/hs2_test_suite.py:63:
>  in 
> lambda: fn(self))
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:120:
>  in test_query_stmts_v6_with_result_spooling
> self.run_query_stmts_test({'spool_query_results': 'true'})
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:181:
>  in run_query_stmts_test
> self.__test_invalid_result_caching("SELECT COUNT(*) FROM 
> functional.alltypes")
> /data/jenkins/workspace/impala-cdpd-master-core-asan/repos/Impala/tests/hs2/test_fetch_first.py:63:
>  in __test_invalid_result_caching
> assert 0 == impalad.get_num_in_flight_queries()
> E   assert 0 == 1
> E+  where 1 =  >()
> E+where  > = 
>  0x81d4990>.get_num_in_flight_queries{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10054) test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly

2020-08-23 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182832#comment-17182832
 ] 

Norbert Luksa commented on IMPALA-10054:


Right, that is the problem. Should we leave this Jira open until it gets 
cherry-picked or we can close it now? (the current process is not entirely 
clear to me)

> test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly
> ---
>
> Key: IMPALA-10054
> URL: https://issues.apache.org/jira/browse/IMPALA-10054
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Attila Jeges
>Assignee: Riza Suminto
>Priority: Blocker
>  Labels: broken-build, flaky
> Fix For: Impala 4.0
>
>
> test_multiple_sort_run_bytes_limits  introduced in IMPALA-6692 seems to be 
> flaky.
> Jenkins job that triggered the error:
> https://jenkins.impala.io/job/parallel-all-tests-nightly/1173
> Failing job:
> https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/2899/testReport/
> {code}
> Stacktrace
> query_test/test_sort.py:89: in test_multiple_sort_run_bytes_limits
> assert "SpilledRuns: " + spilled_runs in query_result.runtime_profile
> E   assert ('SpilledRuns: ' + '3') in 'Query 
> (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE WARNING: Query profile 
> created while running a DEBUG buil... 27.999ms\n - WriteIoBytes: 
> 0\n - WriteIoOps: 0 (0)\n - WriteIoWaitTime: 
> 0.000ns\n'
> E+  where 'Query (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE 
> WARNING: Query profile created while running a DEBUG buil... 27.999ms\n   
>   - WriteIoBytes: 0\n - WriteIoOps: 0 (0)\n - 
> WriteIoWaitTime: 0.000ns\n' = 
>  0x7f51da77fb50>.runtime_profile
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10054) test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly

2020-09-01 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17188238#comment-17188238
 ] 

Norbert Luksa commented on IMPALA-10054:


I see, thanks for clarifying!

> test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly
> ---
>
> Key: IMPALA-10054
> URL: https://issues.apache.org/jira/browse/IMPALA-10054
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Attila Jeges
>Assignee: Riza Suminto
>Priority: Blocker
>  Labels: broken-build, flaky
> Fix For: Impala 4.0
>
>
> test_multiple_sort_run_bytes_limits  introduced in IMPALA-6692 seems to be 
> flaky.
> Jenkins job that triggered the error:
> https://jenkins.impala.io/job/parallel-all-tests-nightly/1173
> Failing job:
> https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/2899/testReport/
> {code}
> Stacktrace
> query_test/test_sort.py:89: in test_multiple_sort_run_bytes_limits
> assert "SpilledRuns: " + spilled_runs in query_result.runtime_profile
> E   assert ('SpilledRuns: ' + '3') in 'Query 
> (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE WARNING: Query profile 
> created while running a DEBUG buil... 27.999ms\n - WriteIoBytes: 
> 0\n - WriteIoOps: 0 (0)\n - WriteIoWaitTime: 
> 0.000ns\n'
> E+  where 'Query (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE 
> WARNING: Query profile created while running a DEBUG buil... 27.999ms\n   
>   - WriteIoBytes: 0\n - WriteIoOps: 0 (0)\n - 
> WriteIoWaitTime: 0.000ns\n' = 
>  0x7f51da77fb50>.runtime_profile
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-10054) test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly

2020-09-01 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-10054.

Resolution: Fixed

> test_multiple_sort_run_bytes_limits fails in parallel-all-tests-nightly
> ---
>
> Key: IMPALA-10054
> URL: https://issues.apache.org/jira/browse/IMPALA-10054
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.0
>Reporter: Attila Jeges
>Assignee: Riza Suminto
>Priority: Blocker
>  Labels: broken-build, flaky
> Fix For: Impala 4.0
>
>
> test_multiple_sort_run_bytes_limits  introduced in IMPALA-6692 seems to be 
> flaky.
> Jenkins job that triggered the error:
> https://jenkins.impala.io/job/parallel-all-tests-nightly/1173
> Failing job:
> https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/2899/testReport/
> {code}
> Stacktrace
> query_test/test_sort.py:89: in test_multiple_sort_run_bytes_limits
> assert "SpilledRuns: " + spilled_runs in query_result.runtime_profile
> E   assert ('SpilledRuns: ' + '3') in 'Query 
> (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE WARNING: Query profile 
> created while running a DEBUG buil... 27.999ms\n - WriteIoBytes: 
> 0\n - WriteIoOps: 0 (0)\n - WriteIoWaitTime: 
> 0.000ns\n'
> E+  where 'Query (id=404da0b1e56e7248:120789cd):\n  DEBUG MODE 
> WARNING: Query profile created while running a DEBUG buil... 27.999ms\n   
>   - WriteIoBytes: 0\n - WriteIoOps: 0 (0)\n - 
> WriteIoWaitTime: 0.000ns\n' = 
>  0x7f51da77fb50>.runtime_profile
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-09-10 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17193532#comment-17193532
 ] 

Norbert Luksa commented on IMPALA-9351:
---

Thanks [~stigahuang], I've reassigned the jira.

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Norbert Luksa
>Priority: Blocker
>  Labels: broken-build, flaky-test
> Fix For: Impala 3.4.0
>
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org

[jira] [Assigned] (IMPALA-9351) AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path

2020-09-10 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-9351:
-

Assignee: Quanlong Huang  (was: Norbert Luksa)

> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to non-existing path
> -
>
> Key: IMPALA-9351
> URL: https://issues.apache.org/jira/browse/IMPALA-9351
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Fang-Yu Rao
>Assignee: Quanlong Huang
>Priority: Blocker
>  Labels: broken-build, flaky-test
> Fix For: Impala 3.4.0
>
>
> AnalyzeDDLTest.TestCreateTableLikeFileOrc failed due to a non-existing path. 
> Specifically, we see the following error message.
> {code:java}
> Error Message
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
> {code}
> The stack trace is provided in the following.
> {code:java}
> Stacktrace
> java.lang.AssertionError: 
> Error during analysis:
> org.apache.impala.common.AnalysisException: Cannot infer schema, path does 
> not exist: 
> hdfs://localhost:20500/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0
> sql:
> create table if not exists newtbl_DNE like orc 
> '/test-warehouse/functional_orc_def.db/complextypes_fileformat/00_0'
>   at org.junit.Assert.fail(Assert.java:88)
>   at 
> org.apache.impala.common.FrontendFixture.analyzeStmt(FrontendFixture.java:397)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:244)
>   at 
> org.apache.impala.common.FrontendTestBase.AnalyzesOk(FrontendTestBase.java:185)
>   at 
> org.apache.impala.analysis.AnalyzeDDLTest.TestCreateTableLikeFileOrc(AnalyzeDDLTest.java:2045)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:272)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:386)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:323)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:143)
> {code}
> This test was recently added by [~norbertluksa], and [~boroknagyz] gave a +2, 
> maybe [~boroknagyz] could provide some insight into this? Thanks!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h.

[jira] [Created] (IMPALA-10411) Build failed for impala-cdh-7.1-maint-exhaustive due to permission denied error

2020-12-28 Thread Norbert Luksa (Jira)
Norbert Luksa created IMPALA-10411:
--

 Summary: Build failed for impala-cdh-7.1-maint-exhaustive due to 
permission denied error
 Key: IMPALA-10411
 URL: https://issues.apache.org/jira/browse/IMPALA-10411
 Project: IMPALA
  Issue Type: Bug
  Components: Jenkins
Reporter: Norbert Luksa


The error is:
{code:java}
19:39:47 [ERROR] Failed to execute goal on project impala-minimal-hive-exec: 
Could not resolve dependencies for project 
org.apache.impala:impala-minimal-hive-exec:jar:0.1-SNAPSHOT: Failed to collect 
dependencies for [org.apache.hive:hive-exec:jar:3.1.3000.7.1.6.0-104 
(compile)]: Failed to read artifact descriptor for 
net.minidev:json-smart:jar:2.3-SNAPSHOT: Could not transfer artifact 
net.minidev:json-smart:pom:2.3-SNAPSHOT from/to impala.cdh.repo 
(https://native-toolchain.s3.amazonaws.com/build/cdh_components/1814051/maven): 
Access denied to: 
https://native-toolchain.s3.amazonaws.com/build/cdh_components/1814051/maven/net/minidev/json-smart/2.3-SNAPSHOT/json-smart-2.3-SNAPSHOT.pom,
 ReasonPhrase:Forbidden. -> [Help 1] 19:39:47 [ERROR] 19:39:47 [ERROR] To see 
the full stack trace of the errors, re-run Maven with the -e switch. 19:39:47 
[ERROR] Re-run Maven using the -X switch to enable full debug logging. 19:39:47 
[ERROR] 19:39:47 [ERROR] For more information about the errors and possible 
solutions, please read the following articles: 19:39:47 [ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/DependencyResolutionException 
19:39:47 mvn -U -s 
/data/jenkins/workspace/impala-cdh-7.1-maint-exhaustive/repos/Impala-auxiliary-tests/jenkins/m2-settings.xml
 -U -B install -DskipTests exited with code 0
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-10411) Build failed for impala-cdh-7.1-maint-exhaustive due to access denied error

2020-12-28 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-10411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa updated IMPALA-10411:
---
Summary: Build failed for impala-cdh-7.1-maint-exhaustive due to access 
denied error  (was: Build failed for impala-cdh-7.1-maint-exhaustive due to 
permission denied error)

> Build failed for impala-cdh-7.1-maint-exhaustive due to access denied error
> ---
>
> Key: IMPALA-10411
> URL: https://issues.apache.org/jira/browse/IMPALA-10411
> Project: IMPALA
>  Issue Type: Bug
>  Components: Jenkins
>Reporter: Norbert Luksa
>Priority: Major
>  Labels: broken-build
>
> The error is:
> {code:java}
> 19:39:47 [ERROR] Failed to execute goal on project impala-minimal-hive-exec: 
> Could not resolve dependencies for project 
> org.apache.impala:impala-minimal-hive-exec:jar:0.1-SNAPSHOT: Failed to 
> collect dependencies for [org.apache.hive:hive-exec:jar:3.1.3000.7.1.6.0-104 
> (compile)]: Failed to read artifact descriptor for 
> net.minidev:json-smart:jar:2.3-SNAPSHOT: Could not transfer artifact 
> net.minidev:json-smart:pom:2.3-SNAPSHOT from/to impala.cdh.repo 
> (https://native-toolchain.s3.amazonaws.com/build/cdh_components/1814051/maven):
>  Access denied to: 
> https://native-toolchain.s3.amazonaws.com/build/cdh_components/1814051/maven/net/minidev/json-smart/2.3-SNAPSHOT/json-smart-2.3-SNAPSHOT.pom,
>  ReasonPhrase:Forbidden. -> [Help 1] 19:39:47 [ERROR] 19:39:47 [ERROR] To see 
> the full stack trace of the errors, re-run Maven with the -e switch. 19:39:47 
> [ERROR] Re-run Maven using the -X switch to enable full debug logging. 
> 19:39:47 [ERROR] 19:39:47 [ERROR] For more information about the errors and 
> possible solutions, please read the following articles: 19:39:47 [ERROR] 
> [Help 1] 
> http://cwiki.apache.org/confluence/display/MAVEN/DependencyResolutionException
>  19:39:47 mvn -U -s 
> /data/jenkins/workspace/impala-cdh-7.1-maint-exhaustive/repos/Impala-auxiliary-tests/jenkins/m2-settings.xml
>  -U -B install -DskipTests exited with code 0
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-12403) Kerberos authentication fails when connecting with a proxy user that passes LDAP user and group filters but does not delegate another user

2023-09-15 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-12403.

Resolution: Fixed

> Kerberos authentication fails when connecting with a proxy user that passes 
> LDAP user and group filters but does not delegate another user
> --
>
> Key: IMPALA-12403
> URL: https://issues.apache.org/jira/browse/IMPALA-12403
> Project: IMPALA
>  Issue Type: Bug
>  Components: be
>Reporter: Gergely Farkas
>Assignee: Gergely Farkas
>Priority: Major
>
> When connecting with a proxy user without _doAs_ request parameter or 
> _impala.doas.user_ connection config then the filters are executed with the 
> authenticated user itself, however, in case of Kerberos auth, the 
> authenticated user is a Kerberos user principal which will definitely not 
> pass the LDAP checks, because LDAP filters here need to be checked with a 
> short username (that needs to be extracted from the Kerberos user principal).
> During the Kerberos authentication process, the short username is checked ( 
> see 
> [https://github.com/apache/impala/blob/master/be/src/rpc/authentication.cc#L757-L764]),
>  , the only point where it doesn't work like that is this: 
> [https://github.com/apache/impala/blob/master/be/src/service/impala-hs2-server.cc#L394-L403]
> [https://github.com/apache/impala/blob/master/be/src/util/auth-util.cc#L43-L52]
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12403) Kerberos authentication fails when connecting with a proxy user that passes LDAP user and group filters but does not delegate another user

2023-09-15 Thread Norbert Luksa (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17765482#comment-17765482
 ] 

Norbert Luksa commented on IMPALA-12403:


Change is merged: https://gerrit.cloudera.org/#/c/20421/

> Kerberos authentication fails when connecting with a proxy user that passes 
> LDAP user and group filters but does not delegate another user
> --
>
> Key: IMPALA-12403
> URL: https://issues.apache.org/jira/browse/IMPALA-12403
> Project: IMPALA
>  Issue Type: Bug
>  Components: be
>Reporter: Gergely Farkas
>Assignee: Gergely Farkas
>Priority: Major
>
> When connecting with a proxy user without _doAs_ request parameter or 
> _impala.doas.user_ connection config then the filters are executed with the 
> authenticated user itself, however, in case of Kerberos auth, the 
> authenticated user is a Kerberos user principal which will definitely not 
> pass the LDAP checks, because LDAP filters here need to be checked with a 
> short username (that needs to be extracted from the Kerberos user principal).
> During the Kerberos authentication process, the short username is checked ( 
> see 
> [https://github.com/apache/impala/blob/master/be/src/rpc/authentication.cc#L757-L764]),
>  , the only point where it doesn't work like that is this: 
> [https://github.com/apache/impala/blob/master/be/src/service/impala-hs2-server.cc#L394-L403]
> [https://github.com/apache/impala/blob/master/be/src/util/auth-util.cc#L43-L52]
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8752) Add Jaro-winkler edit distance and similarity built-in function

2019-07-11 Thread Norbert Luksa (JIRA)
Norbert Luksa created IMPALA-8752:
-

 Summary: Add Jaro-winkler edit distance and similarity built-in 
function
 Key: IMPALA-8752
 URL: https://issues.apache.org/jira/browse/IMPALA-8752
 Project: IMPALA
  Issue Type: New Feature
Reporter: Norbert Luksa
Assignee: Norbert Luksa


References:
 * [Apache commons - JaroWinklerDistance 
|[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerDistance.html]]
 * [Apache commons - JaroWinklerSimilarity 
|[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerSimilarity.html]]
 * [Oracle - 
JARO_WINKLER[_SIMILARITY]|[https://oracle-base.com/articles/11g/utl_match-string-matching-in-oracle]]

Notable difference:
 * With similarity, the Oracle version returns a normalized result ranging from 
0 to 100.
 * In the Appache version, null values result in exceptions.
 * Apache rounds the values to two digits 

The scaling factor of the algorithm can be added as an extra/default argument.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8755) Implement Z-ordering for Impala

2019-07-17 Thread Norbert Luksa (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8755 started by Norbert Luksa.
-
> Implement Z-ordering for Impala
> ---
>
> Key: IMPALA-8755
> URL: https://issues.apache.org/jira/browse/IMPALA-8755
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Implement Z-ordering for Impala: [https://en.wikipedia.org/wiki/Z-order_curve]
> A Z-order curve defines an ordering on multi-dimensional data. Data sorted 
> that way can be efficiently filtered by min/max statistics regarding to the 
> columns participating in the ordering.
> Impala currently only supports lexicographic ordering via the SORT BY clause. 
> This strongly prefers the first column, i.e. given the "SORT BY A, B, C" 
> clause => A will be totally ordered (hence filtering on A will be very 
> efficient), but values belonging to B and C will be scattered throughout the 
> data set (hence filtering on B or C will barely do any good).
> We could add a new clause, e.g. a "ZSORT BY" clause to Impala that writes the 
> data in Z-order.
> "ZSORT BY A, B C" would cluster the rows in a way that filtering on A, B, or 
> C would be equally efficient.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8752) Add Jaro-winkler edit distance and similarity built-in function

2019-07-17 Thread Norbert Luksa (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8752 started by Norbert Luksa.
-
> Add Jaro-winkler edit distance and similarity built-in function
> ---
>
> Key: IMPALA-8752
> URL: https://issues.apache.org/jira/browse/IMPALA-8752
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Norbert Luksa
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
>
> References:
>  * [Apache commons - JaroWinklerDistance 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerDistance.html]]
>  * [Apache commons - JaroWinklerSimilarity 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerSimilarity.html]]
>  * [Oracle - 
> JARO_WINKLER[_SIMILARITY]|[https://oracle-base.com/articles/11g/utl_match-string-matching-in-oracle]]
> Notable difference:
>  * With similarity, the Oracle version returns a normalized result ranging 
> from 0 to 100.
>  * In the Appache version, null values result in exceptions.
>  * Apache rounds the values to two digits 
> The scaling factor of the algorithm can be added as an extra/default argument.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-7770) SPLIT_PART to support negative indexes

2019-07-18 Thread Norbert Luksa (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa reassigned IMPALA-7770:
-

Assignee: Norbert Luksa

> SPLIT_PART to support negative indexes
> --
>
> Key: IMPALA-7770
> URL: https://issues.apache.org/jira/browse/IMPALA-7770
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tristan Stevens
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
>
> Request is for SPLIT_PART to support negative indexes i.e. support right to 
> left searching.
> See Snowflake documentation for details: 
> [https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html|https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html]
> partNr: Requested part of the split (1-based). 0 is treated as 1. If the 
> value is negative, the parts are counted from the right side of the string.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7770) SPLIT_PART to support negative indexes

2019-07-30 Thread Norbert Luksa (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896104#comment-16896104
 ] 

Norbert Luksa commented on IMPALA-7770:
---

Feel free to review:

https://gerrit.cloudera.org/#/c/13880/

> SPLIT_PART to support negative indexes
> --
>
> Key: IMPALA-7770
> URL: https://issues.apache.org/jira/browse/IMPALA-7770
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tristan Stevens
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
>
> Request is for SPLIT_PART to support negative indexes i.e. support right to 
> left searching.
> See Snowflake documentation for details: 
> [https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html|https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html]
> partNr: Requested part of the split (1-based). 0 is treated as 1. If the 
> value is negative, the parts are counted from the right side of the string.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8752) Add Jaro-winkler edit distance and similarity built-in function

2019-07-30 Thread Norbert Luksa (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896106#comment-16896106
 ] 

Norbert Luksa commented on IMPALA-8752:
---

https://gerrit.cloudera.org/#/c/13870/

> Add Jaro-winkler edit distance and similarity built-in function
> ---
>
> Key: IMPALA-8752
> URL: https://issues.apache.org/jira/browse/IMPALA-8752
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Norbert Luksa
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
>
> References:
>  * [Apache commons - JaroWinklerDistance 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerDistance.html]]
>  * [Apache commons - JaroWinklerSimilarity 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerSimilarity.html]]
>  * [Oracle - 
> JARO_WINKLER[_SIMILARITY]|[https://oracle-base.com/articles/11g/utl_match-string-matching-in-oracle]]
> Notable difference:
>  * With similarity, the Oracle version returns a normalized result ranging 
> from 0 to 100.
>  * In the Appache version, null values result in exceptions.
>  * Apache rounds the values to two digits 
> The scaling factor of the algorithm can be added as an extra/default argument.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7770) SPLIT_PART to support negative indexes

2019-07-31 Thread Norbert Luksa (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896930#comment-16896930
 ] 

Norbert Luksa commented on IMPALA-7770:
---

Yes, it's only a small change.

> SPLIT_PART to support negative indexes
> --
>
> Key: IMPALA-7770
> URL: https://issues.apache.org/jira/browse/IMPALA-7770
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tristan Stevens
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
>
> Request is for SPLIT_PART to support negative indexes i.e. support right to 
> left searching.
> See Snowflake documentation for details: 
> [https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html|https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html]
> partNr: Requested part of the split (1-based). 0 is treated as 1. If the 
> value is negative, the parts are counted from the right side of the string.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8752) Add Jaro-winkler edit distance and similarity built-in function

2019-08-14 Thread Norbert Luksa (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-8752.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Add Jaro-winkler edit distance and similarity built-in function
> ---
>
> Key: IMPALA-8752
> URL: https://issues.apache.org/jira/browse/IMPALA-8752
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Norbert Luksa
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
> Fix For: Impala 3.3.0
>
>
> References:
>  * [Apache commons - JaroWinklerDistance 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerDistance.html]]
>  * [Apache commons - JaroWinklerSimilarity 
> |[https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/JaroWinklerSimilarity.html]]
>  * [Oracle - 
> JARO_WINKLER[_SIMILARITY]|[https://oracle-base.com/articles/11g/utl_match-string-matching-in-oracle]]
> Notable difference:
>  * With similarity, the Oracle version returns a normalized result ranging 
> from 0 to 100.
>  * In the Appache version, null values result in exceptions.
>  * Apache rounds the values to two digits 
> The scaling factor of the algorithm can be added as an extra/default argument.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-7770) SPLIT_PART to support negative indexes

2019-08-15 Thread Norbert Luksa (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Norbert Luksa resolved IMPALA-7770.
---
   Resolution: Implemented
Fix Version/s: Impala 3.3.0

> SPLIT_PART to support negative indexes
> --
>
> Key: IMPALA-7770
> URL: https://issues.apache.org/jira/browse/IMPALA-7770
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tristan Stevens
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: built-in-function
> Fix For: Impala 3.3.0
>
>
> Request is for SPLIT_PART to support negative indexes i.e. support right to 
> left searching.
> See Snowflake documentation for details: 
> [https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html|https://docs.snowflake.net/manuals/sql-reference/functions/split_part.html]
> partNr: Requested part of the split (1-based). 0 is treated as 1. If the 
> value is negative, the parts are counted from the right side of the string.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8498) Write column index for floating types when NaN is not present

2019-09-12 Thread Norbert Luksa (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8498 started by Norbert Luksa.
-
> Write column index for floating types when NaN is not present
> -
>
> Key: IMPALA-8498
> URL: https://issues.apache.org/jira/browse/IMPALA-8498
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>  Labels: ramp-up
>
> IMPALA-7304 disabled column index writing for floating point columns until 
> PARQUET-1222 is resolved.
> PARQUET-1222 is responsible for defining a total order for floating values, 
> but the problematic values are only the NaNs. Therefore we can write the 
> column index if NaNs are not present in the data. Parquet-MR also does this, 
> following the principles in 
> [https://github.com/apache/parquet-format/blob/75eb7a7b84e6e62bfb09668b6d8d40b12597456e/src/main/thrift/parquet.thrift#L827-L834]
>  
> Impala should follow this behavior, and also when storing zeroes, it should 
> store -0.0 as minimum and +0.0 as maximum.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



  1   2   >