[jira] [Resolved] (IMPALA-7527) Expose fetch-from-catalogd cache and latency metrics in profiles

2018-10-03 Thread Vuk Ercegovac (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vuk Ercegovac resolved IMPALA-7527.
---
Resolution: Fixed

> Expose fetch-from-catalogd cache and latency metrics in profiles
> 
>
> Key: IMPALA-7527
> URL: https://issues.apache.org/jira/browse/IMPALA-7527
> Project: IMPALA
>  Issue Type: Sub-task
>Affects Versions: Impala 3.1.0
>Reporter: Todd Lipcon
>Assignee: Vuk Ercegovac
>Priority: Major
> Fix For: Impala 3.1.0
>
>
> Since we now have some caching and potential remote calls on the planning 
> path, it's important to be able to understand how that contributes to the 
> performance of planning. This JIRA tracks adding such information to the 
> profile.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-6742) Profiles of running queries should include execution summary

2018-10-03 Thread Yongjun Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yongjun Zhang reassigned IMPALA-6742:
-

Assignee: Yongjun Zhang

> Profiles of running queries should include execution summary
> 
>
> Key: IMPALA-6742
> URL: https://issues.apache.org/jira/browse/IMPALA-6742
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Balazs Jeszenszky
>Assignee: Yongjun Zhang
>Priority: Minor
>
> The profile omits the execution summary when taken off of a running query. It 
> would be helpful to have it there as well.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7651) Add Kudu support to scheduler-related query hints and options

2018-10-03 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-7651:

Issue Type: Bug  (was: Documentation)

> Add Kudu support to scheduler-related query hints and options
> -
>
> Key: IMPALA-7651
> URL: https://issues.apache.org/jira/browse/IMPALA-7651
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 3.1.0
>Reporter: Lars Volker
>Assignee: Alex Rodoni
>Priority: Major
>
> Scheduling for Kudu works the same way for HDFS and Kudu, however our docs 
> pages don't mention the latter: 
> https://impala.apache.org/docs/build/html/topics/impala_schedule_random_replica.html
> We should add Kudu to the docs for the SCHEDULE_RANDOM_REPLICA query option 
> and the RANDOM_REPLICA hint.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-7651) Add Kudu support to scheduler-related query hints and options

2018-10-03 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni reassigned IMPALA-7651:
---

Assignee: Alex Rodoni

> Add Kudu support to scheduler-related query hints and options
> -
>
> Key: IMPALA-7651
> URL: https://issues.apache.org/jira/browse/IMPALA-7651
> Project: IMPALA
>  Issue Type: Documentation
>  Components: Docs
>Affects Versions: Impala 3.1.0
>Reporter: Lars Volker
>Assignee: Alex Rodoni
>Priority: Major
>
> Scheduling for Kudu works the same way for HDFS and Kudu, however our docs 
> pages don't mention the latter: 
> https://impala.apache.org/docs/build/html/topics/impala_schedule_random_replica.html
> We should add Kudu to the docs for the SCHEDULE_RANDOM_REPLICA query option 
> and the RANDOM_REPLICA hint.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7652) Fix BytesRead for Kudu scans

2018-10-03 Thread Thomas Tauber-Marshall (JIRA)
Thomas Tauber-Marshall created IMPALA-7652:
--

 Summary: Fix BytesRead for Kudu scans
 Key: IMPALA-7652
 URL: https://issues.apache.org/jira/browse/IMPALA-7652
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Thomas Tauber-Marshall


In the runtime profile, all scan nodes have a BytesRead counter. Kudu scan 
nodes do not actually set this counter, which creates confusion when Kudu scans 
report 0 bytes read even though they did in fact read data.

We should either fill in this counter for Kudu scans, or if its difficult to 
get the count of bytes read from Kudu then we should eliminate the counter for 
Kudu scan node profiles.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7561) Expired queries should end up in a consistent state (EXCEPTION, CANCELLED, or something else)

2018-10-03 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-7561:
--
Description: 
ClientRequestState treats "eos" as a terminal state and won't transition to 
EXCEPTION or CANCELLED out of it. See the below code snippet from 
ClientRequestState::Cancel():

{code}
bool already_done = eos_ || operation_state_ == 
TOperationState::ERROR_STATE;
if (!already_done && cause != NULL) {
  DCHECK(!cause->ok());
  discard_result(UpdateQueryStatus(*cause));
  query_events_->MarkEvent("Cancelled");
  DCHECK_EQ(operation_state_, TOperationState::ERROR_STATE);
}
{code}

If the cancellation is initiated by the server (e.g. because of query_timeout_s 
or exec_time_limit_s) then we should arguably treat that as an exception and 
put the query into the EXCEPTION state (because it is not a user-initiated 
cancellation) if it is not already in an EXCEPTION or CANCELLED state. That way 
the client can see that the expiry happened and access the cause of expiry.

One argument against using the EXCEPTION state is that admins or users might 
think that something went wrong and needs to be investigated, see IMPALA-471.

A separate issue is that user-initiated cancellation should use the CANCELLED 
state: IMPALA-1262

  was:
ClientRequestState treats "eos" as a terminal state and won't transition to 
EXCEPTION or CANCELLED out of it. See the below code snippet from 
ClientRequestState::Cancel():

{code}
bool already_done = eos_ || operation_state_ == 
TOperationState::ERROR_STATE;
if (!already_done && cause != NULL) {
  DCHECK(!cause->ok());
  discard_result(UpdateQueryStatus(*cause));
  query_events_->MarkEvent("Cancelled");
  DCHECK_EQ(operation_state_, TOperationState::ERROR_STATE);
}
{code}

I think that if the cancellation is initiated by the server (e.g. because of 
query_timeout_s or exec_time_limit_s) then we should treat that as an exception 
and put the query into the EXCEPTION state (because it is not a user-initiated 
cancellation) if it is not already in an EXCEPTION or CANCELLED state. That way 
the client can see that the expiry happened and access the cause of expiry.

A separate issue is that user-initiated cancellation should use the CANCELLED 
state: IMPALA-1262


> Expired queries should end up in a consistent state (EXCEPTION, CANCELLED, or 
> something else)
> -
>
> Key: IMPALA-7561
> URL: https://issues.apache.org/jira/browse/IMPALA-7561
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tim Armstrong
>Priority: Major
>  Labels: query-lifecycle
>
> ClientRequestState treats "eos" as a terminal state and won't transition to 
> EXCEPTION or CANCELLED out of it. See the below code snippet from 
> ClientRequestState::Cancel():
> {code}
> bool already_done = eos_ || operation_state_ == 
> TOperationState::ERROR_STATE;
> if (!already_done && cause != NULL) {
>   DCHECK(!cause->ok());
>   discard_result(UpdateQueryStatus(*cause));
>   query_events_->MarkEvent("Cancelled");
>   DCHECK_EQ(operation_state_, TOperationState::ERROR_STATE);
> }
> {code}
> If the cancellation is initiated by the server (e.g. because of 
> query_timeout_s or exec_time_limit_s) then we should arguably treat that as 
> an exception and put the query into the EXCEPTION state (because it is not a 
> user-initiated cancellation) if it is not already in an EXCEPTION or 
> CANCELLED state. That way the client can see that the expiry happened and 
> access the cause of expiry.
> One argument against using the EXCEPTION state is that admins or users might 
> think that something went wrong and needs to be investigated, see IMPALA-471.
> A separate issue is that user-initiated cancellation should use the CANCELLED 
> state: IMPALA-1262



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7561) Expired queries should end up in a consistent state (EXCEPTION, CANCELLED, or something else)

2018-10-03 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-7561:
--
Summary: Expired queries should end up in a consistent state (EXCEPTION, 
CANCELLED, or something else)  (was: Expired queries should not stay in 
FINISHED state when at eos)

> Expired queries should end up in a consistent state (EXCEPTION, CANCELLED, or 
> something else)
> -
>
> Key: IMPALA-7561
> URL: https://issues.apache.org/jira/browse/IMPALA-7561
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tim Armstrong
>Priority: Major
>  Labels: query-lifecycle
>
> ClientRequestState treats "eos" as a terminal state and won't transition to 
> EXCEPTION or CANCELLED out of it. See the below code snippet from 
> ClientRequestState::Cancel():
> {code}
> bool already_done = eos_ || operation_state_ == 
> TOperationState::ERROR_STATE;
> if (!already_done && cause != NULL) {
>   DCHECK(!cause->ok());
>   discard_result(UpdateQueryStatus(*cause));
>   query_events_->MarkEvent("Cancelled");
>   DCHECK_EQ(operation_state_, TOperationState::ERROR_STATE);
> }
> {code}
> I think that if the cancellation is initiated by the server (e.g. because of 
> query_timeout_s or exec_time_limit_s) then we should treat that as an 
> exception and put the query into the EXCEPTION state (because it is not a 
> user-initiated cancellation) if it is not already in an EXCEPTION or 
> CANCELLED state. That way the client can see that the expiry happened and 
> access the cause of expiry.
> A separate issue is that user-initiated cancellation should use the CANCELLED 
> state: IMPALA-1262



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-4714) Idle session expired query goes in to exception state - And this is confusing

2018-10-03 Thread Balazs Jeszenszky (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16637112#comment-16637112
 ] 

Balazs Jeszenszky commented on IMPALA-4714:
---

SGTM

> Idle session expired query goes in to exception state - And this is confusing
> -
>
> Key: IMPALA-4714
> URL: https://issues.apache.org/jira/browse/IMPALA-4714
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.6.0
>Reporter: Mala Chikka Kempanna
>Priority: Major
>  Labels: query-lifecycle
>
> After setting idle_session_timeout , impala server, after completing 
> execution of query, moves it into exception state , if there is no client 
> activity.
> Example profile excerpt showing this behavior:
> {code}
> Query Timeline
> Start execution: 0ns (0ns)
> Planning finished: 9ms (9ms)
> Child queries finished: 8.3m (8.3m)
> Metastore update finished: 8.3m (661ms)
> Rows available: 8.3m (0ns)
> Cancelled: 11.3m (3.0m)
> Unregister query: 12.0m (42.55s)
> {code}
> Query status and query state-
> {code}
> Query Type: DDL
> Query State: EXCEPTION
> Start Time: Dec 22, 2016 11:45:01 AM
> End Time: Dec 22, 2016 11:57:01 AM
> Duration: 11m, 59s
> Admission Result: Unknown
> Client Fetch Wait Time: 3.7m
> Client Fetch Wait Time Percentage: 31
> Connected User: admin
> DDL Type: COMPUTE_STATS
> File Formats:
> Impala Version: impalad version 2.5.0-cdh5.7.2 RELEASE (build 
> 1140f8289dc0d2b1517bcf70454bb4575eb8cc70)
> Network Address: 10.17.100.123:44618
> Out of Memory: false
> Planning Wait Time: 9ms
> Planning Wait Time Percentage: 0
> Query Status: Query d141e0d996c91e72:bb8726fb917537bb expired due to client 
> inactivity (timeout is 3m)
> Session ID: 3043ff5042860968:8f92bc3bd2a0ca83
> Session Type: HIVESERVER2
> {code}
> Though query status string is very clear saying "expired due to client 
> inactivity (timeout is 3m)", the problem is with "Query State: EXCEPTION"
> This makes user, think something went wrong with query execution.
> So I recommend that queries completed, but expired due to client-inactivity 
> be marked as 
> "Query State: FINISHED"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-4714) Idle session expired query goes in to exception state - And this is confusing

2018-10-03 Thread Tim Armstrong (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16637111#comment-16637111
 ] 

Tim Armstrong commented on IMPALA-4714:
---

Yeah that's a valid point, maybe an argument for using CANCELED. I think 
FINISHED has problems in terms of confusing clients - I recently saw an example 
of a client that kept the session containing the query alive indefinitely 
because it thought the query in a FINISHED state was still active and kept 
pinging the query for its state. How about I use IMPALA-7561 to track the 
general issue of rethinking what state these queries should go into?

> Idle session expired query goes in to exception state - And this is confusing
> -
>
> Key: IMPALA-4714
> URL: https://issues.apache.org/jira/browse/IMPALA-4714
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.6.0
>Reporter: Mala Chikka Kempanna
>Priority: Major
>  Labels: query-lifecycle
>
> After setting idle_session_timeout , impala server, after completing 
> execution of query, moves it into exception state , if there is no client 
> activity.
> Example profile excerpt showing this behavior:
> {code}
> Query Timeline
> Start execution: 0ns (0ns)
> Planning finished: 9ms (9ms)
> Child queries finished: 8.3m (8.3m)
> Metastore update finished: 8.3m (661ms)
> Rows available: 8.3m (0ns)
> Cancelled: 11.3m (3.0m)
> Unregister query: 12.0m (42.55s)
> {code}
> Query status and query state-
> {code}
> Query Type: DDL
> Query State: EXCEPTION
> Start Time: Dec 22, 2016 11:45:01 AM
> End Time: Dec 22, 2016 11:57:01 AM
> Duration: 11m, 59s
> Admission Result: Unknown
> Client Fetch Wait Time: 3.7m
> Client Fetch Wait Time Percentage: 31
> Connected User: admin
> DDL Type: COMPUTE_STATS
> File Formats:
> Impala Version: impalad version 2.5.0-cdh5.7.2 RELEASE (build 
> 1140f8289dc0d2b1517bcf70454bb4575eb8cc70)
> Network Address: 10.17.100.123:44618
> Out of Memory: false
> Planning Wait Time: 9ms
> Planning Wait Time Percentage: 0
> Query Status: Query d141e0d996c91e72:bb8726fb917537bb expired due to client 
> inactivity (timeout is 3m)
> Session ID: 3043ff5042860968:8f92bc3bd2a0ca83
> Session Type: HIVESERVER2
> {code}
> Though query status string is very clear saying "expired due to client 
> inactivity (timeout is 3m)", the problem is with "Query State: EXCEPTION"
> This makes user, think something went wrong with query execution.
> So I recommend that queries completed, but expired due to client-inactivity 
> be marked as 
> "Query State: FINISHED"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-1262) user initiated query cancellation should return operation_state = CANCELED_STATE

2018-10-03 Thread Tim Armstrong (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16637110#comment-16637110
 ] 

Tim Armstrong commented on IMPALA-1262:
---

[~jeszyb] mentioned on IMPALA-4714 that EXCEPTION might cause confusion because 
admins look at queries in the EXCEPTION state to understand errors.

> user initiated query cancellation should return operation_state = 
> CANCELED_STATE
> 
>
> Key: IMPALA-1262
> URL: https://issues.apache.org/jira/browse/IMPALA-1262
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 1.4
>Reporter: Dan Hecht
>Assignee: Bikramjeet Vig
>Priority: Minor
>  Labels: query-lifecycle
>
> I've been told that if the user cancels the query, our intention is to not 
> set query_state to EXCEPTION.  (Is that the desired behavior?)  Adding a test 
> case to test_cancellation.py as follows, shows that this is not the current 
> behavior (at least not in all circumstances).
> {code}
> --- a/tests/query_test/test_cancellation.py
> +++ b/tests/query_test/test_cancellation.py
> @@ -102,6 +102,7 @@ class TestCancellation(ImpalaTestSuite):
>sleep(vector.get_value('cancel_delay'))
>assert self.client.get_state(handle) != 
> self.client.QUERY_STATES['EXCEPTION']
>cancel_result = self.client.cancel(handle)
> +  assert self.client.get_state(handle) != 
> self.client.QUERY_STATES['EXCEPTION']
>assert cancel_result.status_code == 0,\
>'Unexpected status code from cancel request: %s' % cancel_result
>thread.join()
> {code}
> One example of how this can happen is if the cancel happens while the wait 
> thread is still in WaitInternal, then WaitInternal will return 
> Status::CANCELLED, and UpdateQueryStatus indiscriminately sets query_state_ 
> to EXCEPTION if !ok().  But, I don't think that side of the execution can 
> distinguish between user initiated cancel and error initiated cancel.  
> Probably, QueryExecState::Cancel needs to remember which kind of cancellation 
> it was so that the cancelled side can do the right thing.
> We don't have a general workaround.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7561) Expired queries should not stay in FINISHED state when at eos

2018-10-03 Thread Tim Armstrong (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16637107#comment-16637107
 ] 

Tim Armstrong commented on IMPALA-7561:
---

See [~jeszyb]'s comment on IMPALA-4714 for a reason why we shouldn't use 
EXCEPTION as the state here - it makes users/admins think something went wrong.

> Expired queries should not stay in FINISHED state when at eos
> -
>
> Key: IMPALA-7561
> URL: https://issues.apache.org/jira/browse/IMPALA-7561
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Tim Armstrong
>Priority: Major
>  Labels: query-lifecycle
>
> ClientRequestState treats "eos" as a terminal state and won't transition to 
> EXCEPTION or CANCELLED out of it. See the below code snippet from 
> ClientRequestState::Cancel():
> {code}
> bool already_done = eos_ || operation_state_ == 
> TOperationState::ERROR_STATE;
> if (!already_done && cause != NULL) {
>   DCHECK(!cause->ok());
>   discard_result(UpdateQueryStatus(*cause));
>   query_events_->MarkEvent("Cancelled");
>   DCHECK_EQ(operation_state_, TOperationState::ERROR_STATE);
> }
> {code}
> I think that if the cancellation is initiated by the server (e.g. because of 
> query_timeout_s or exec_time_limit_s) then we should treat that as an 
> exception and put the query into the EXCEPTION state (because it is not a 
> user-initiated cancellation) if it is not already in an EXCEPTION or 
> CANCELLED state. That way the client can see that the expiry happened and 
> access the cause of expiry.
> A separate issue is that user-initiated cancellation should use the CANCELLED 
> state: IMPALA-1262



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7651) Add Kudu support to scheduler-related query hints and options

2018-10-03 Thread Lars Volker (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16637092#comment-16637092
 ] 

Lars Volker commented on IMPALA-7651:
-

CC [~arodoni_cloudera], [~twmarshall]

> Add Kudu support to scheduler-related query hints and options
> -
>
> Key: IMPALA-7651
> URL: https://issues.apache.org/jira/browse/IMPALA-7651
> Project: IMPALA
>  Issue Type: Documentation
>  Components: Docs
>Affects Versions: Impala 3.1.0
>Reporter: Lars Volker
>Priority: Major
>
> Scheduling for Kudu works the same way for HDFS and Kudu, however our docs 
> pages don't mention the latter: 
> https://impala.apache.org/docs/build/html/topics/impala_schedule_random_replica.html
> We should add Kudu to the docs for the SCHEDULE_RANDOM_REPLICA query option 
> and the RANDOM_REPLICA hint.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7651) Add Kudu support to scheduler-related query hints and options

2018-10-03 Thread Lars Volker (JIRA)
Lars Volker created IMPALA-7651:
---

 Summary: Add Kudu support to scheduler-related query hints and 
options
 Key: IMPALA-7651
 URL: https://issues.apache.org/jira/browse/IMPALA-7651
 Project: IMPALA
  Issue Type: Documentation
  Components: Docs
Affects Versions: Impala 3.1.0
Reporter: Lars Volker


Scheduling for Kudu works the same way for HDFS and Kudu, however our docs 
pages don't mention the latter: 
https://impala.apache.org/docs/build/html/topics/impala_schedule_random_replica.html

We should add Kudu to the docs for the SCHEDULE_RANDOM_REPLICA query option and 
the RANDOM_REPLICA hint.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-7595) Check failed: IsValidTime(time_) at timestamp-value.h:322

2018-10-03 Thread Csaba Ringhofer (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Csaba Ringhofer resolved IMPALA-7595.
-
   Resolution: Fixed
Fix Version/s: Impala 3.1.0

> Check failed: IsValidTime(time_) at timestamp-value.h:322 
> --
>
> Key: IMPALA-7595
> URL: https://issues.apache.org/jira/browse/IMPALA-7595
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.1.0
>Reporter: Tim Armstrong
>Assignee: Csaba Ringhofer
>Priority: Blocker
>  Labels: broken-build, crash
> Fix For: Impala 3.1.0
>
>
> See https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/3197/. hash is 
> 23c7d7e57b7868eedbf5a9a4bc4aafd6066a04fb
> Some of the fuzz tests stand out amongst the tests that were running at the 
> same time as the crash, particularly:
>  19:12:17 [gw4] PASSED 
> query_test/test_scanners_fuzz.py::TestScannersFuzzing::test_fuzz_alltypes[exec_option:
>  {'debug_action': '-1:OPEN:SET_DENY_RESERVATION_PROBABILITY@1.0', 
> 'abort_on_error': False, 'mem_limit': '512m', 'num_nodes': 0} | table_format: 
> parquet/none] 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-4714) Idle session expired query goes in to exception state - And this is confusing

2018-10-03 Thread Balazs Jeszenszky (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16636601#comment-16636601
 ] 

Balazs Jeszenszky commented on IMPALA-4714:
---

[~tarmstrong], hmm. From a user POV, this is really a problem. There's a reason 
these clients don't close their queries and timeouts are our recommended 
solution, making a timed out query completely expected. This way EXCEPTION 
state represents both normal and non-normal termination paths.

There are probably more complete (and more involved) solutions than just 
turning these queries into FINISHED ones (e.g. TIMEDOUT state or similar). For 
users, EXCEPTION means there's something to look into, not simply that it was 
closed by the server. Until we come up with a more complete solution, 
considering these queries normal (ie. FINISHED) would ease the pain.

> Idle session expired query goes in to exception state - And this is confusing
> -
>
> Key: IMPALA-4714
> URL: https://issues.apache.org/jira/browse/IMPALA-4714
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.6.0
>Reporter: Mala Chikka Kempanna
>Priority: Major
>  Labels: query-lifecycle
>
> After setting idle_session_timeout , impala server, after completing 
> execution of query, moves it into exception state , if there is no client 
> activity.
> Example profile excerpt showing this behavior:
> {code}
> Query Timeline
> Start execution: 0ns (0ns)
> Planning finished: 9ms (9ms)
> Child queries finished: 8.3m (8.3m)
> Metastore update finished: 8.3m (661ms)
> Rows available: 8.3m (0ns)
> Cancelled: 11.3m (3.0m)
> Unregister query: 12.0m (42.55s)
> {code}
> Query status and query state-
> {code}
> Query Type: DDL
> Query State: EXCEPTION
> Start Time: Dec 22, 2016 11:45:01 AM
> End Time: Dec 22, 2016 11:57:01 AM
> Duration: 11m, 59s
> Admission Result: Unknown
> Client Fetch Wait Time: 3.7m
> Client Fetch Wait Time Percentage: 31
> Connected User: admin
> DDL Type: COMPUTE_STATS
> File Formats:
> Impala Version: impalad version 2.5.0-cdh5.7.2 RELEASE (build 
> 1140f8289dc0d2b1517bcf70454bb4575eb8cc70)
> Network Address: 10.17.100.123:44618
> Out of Memory: false
> Planning Wait Time: 9ms
> Planning Wait Time Percentage: 0
> Query Status: Query d141e0d996c91e72:bb8726fb917537bb expired due to client 
> inactivity (timeout is 3m)
> Session ID: 3043ff5042860968:8f92bc3bd2a0ca83
> Session Type: HIVESERVER2
> {code}
> Though query status string is very clear saying "expired due to client 
> inactivity (timeout is 3m)", the problem is with "Query State: EXCEPTION"
> This makes user, think something went wrong with query execution.
> So I recommend that queries completed, but expired due to client-inactivity 
> be marked as 
> "Query State: FINISHED"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org