[jira] [Commented] (IMPALA-8786) BufferedPlanRootSink should directly write to a QueryResultSet if one is available

2019-07-23 Thread Tim Armstrong (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891517#comment-16891517
 ] 

Tim Armstrong commented on IMPALA-8786:
---

Maybe the problem has changed enough that it's a little different, but I 
remember last time I thought about it that I had some concerns about handling 
the batch boundaries, since the batches coming from query execution and the 
batches requested by the client won't generally line up to each other.

I think maybe if we're only doing this as a short-circuit optimisation you 
don't run into major problems, since you can just append as many rows will fit 
in the current batch, and append the rest to the buffer.

Anyway, it would be nice to do an experiment to see how large the benefit of a 
fast path was - it's possible that this part of query execution is fast enough 
anyway compared to the network and clients.

> BufferedPlanRootSink should directly write to a QueryResultSet if one is 
> available
> --
>
> Key: IMPALA-8786
> URL: https://issues.apache.org/jira/browse/IMPALA-8786
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Sahil Takiar
>Assignee: Sahil Takiar
>Priority: Major
>
> {{BufferedPlanRootSink}} uses a {{RowBatchQueue}} to buffer {{RowBatch}}-es 
> and then the consumer thread reads them and writes them to a given 
> {{QueryResultSet}}. Implementations of {{RowBatchQueue}} might end up copying 
> the buffered {{RowBatch}}-es (e.g. if the queue is backed by a 
> {{BufferedTupleStream}}). An optimization would be for the producer thread to 
> directly write to the consumer {{QueryResultSet}}. This optimization would 
> only be triggered if (1) the queue is empty, and (2) the consumer thread has 
> a {{QueryResultSet}} available for writing.
> This "fast path" is useful in a few different scenarios:
>  * If the consumer is faster than at reading rows than the producer is at 
> sending them; in this case, the overhead of buffering rows in a 
> {{RowBatchQueue}} can be completely avoided
>  * For queries that return under 1024 its likely that the consumer will 
> produce a {{QueryResultSet}} before the first {{RowBatch}} is returned 
> (except perhaps for very trivial queries)



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7290) Move impala-shell to use HS2

2019-07-23 Thread Tim Armstrong (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891511#comment-16891511
 ] 

Tim Armstrong commented on IMPALA-7290:
---

[~arodoni_cloudera] the user-facing differences were kept as minimal as 
possible. The display rounding for floating point numbers may be slightly 
different, e.g. 8.071 vs 8.072 - the underlying value is the same 
but they're displayed differently.

It does fix a user-facing bug with handling of tab characters: 
https://issues.apache.org/jira/browse/IMPALA-1840

One minor difference in behaviour that could have consequences on some 
deployments, is that impala-shell is now connecting to the HS2 endpoint that's 
shared with more services, so you could exhaust fe_service_threads sooner. Not 
sure if that's an important enough issue to document.

One caveat that's worth documenting is that you can't connect to the HS2 
interface of older versions of Impala, because the impala-shell HS2 support 
depends on some extensions that were only added to this version.

> Move impala-shell to use HS2
> 
>
> Key: IMPALA-7290
> URL: https://issues.apache.org/jira/browse/IMPALA-7290
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Clients
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: maintainability, shell
> Fix For: Impala 3.3.0
>
>
> Most clients have moved to the HS2 interface. impala-shell is one of the 
> laggards. We should switch impala-shell to use the newer and more standard 
> interface.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-8615) Impala Doc: Doc admission control support for scalable config params

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-8615:

Description: https://gerrit.cloudera.org/#/c/13906/

> Impala Doc: Doc admission control support for scalable config params
> 
>
> Key: IMPALA-8615
> URL: https://issues.apache.org/jira/browse/IMPALA-8615
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Bikramjeet Vig
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>
> https://gerrit.cloudera.org/#/c/13906/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8615) Impala Doc: Doc admission control support for scalable config params

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8615 started by Alex Rodoni.
---
> Impala Doc: Doc admission control support for scalable config params
> 
>
> Key: IMPALA-8615
> URL: https://issues.apache.org/jira/browse/IMPALA-8615
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Bikramjeet Vig
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7290) Move impala-shell to use HS2

2019-07-23 Thread Alex Rodoni (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891499#comment-16891499
 ] 

Alex Rodoni commented on IMPALA-7290:
-

[~tarmstrong] Besides, the --protocol flag, what difference would users see b/w 
beeswax and hs2?

> Move impala-shell to use HS2
> 
>
> Key: IMPALA-7290
> URL: https://issues.apache.org/jira/browse/IMPALA-7290
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Clients
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: maintainability, shell
> Fix For: Impala 3.3.0
>
>
> Most clients have moved to the HS2 interface. impala-shell is one of the 
> laggards. We should switch impala-shell to use the newer and more standard 
> interface.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8696) Cmake dependencies for container build do not work correctly

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-8696.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Cmake dependencies for container build do not work correctly
> 
>
> Key: IMPALA-8696
> URL: https://issues.apache.org/jira/browse/IMPALA-8696
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
> Fix For: Impala 3.3.0
>
>
> I've noticed in many cases that containers are not reliably rebuilt when 
> inputs change, e.g. if the binaries are rebuilt. Investigate and fix this, 
> it's a major development headache.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8622) Generate list of docker images generated by the build

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-8622.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Generate list of docker images generated by the build
> -
>
> Key: IMPALA-8622
> URL: https://issues.apache.org/jira/browse/IMPALA-8622
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: docker
> Fix For: Impala 3.3.0
>
>
> Some consumers of the Impala docker images will want to publish them to a 
> repo. Currently they would have to hardcode a list of the docker images 
> produced by the Impala build to publish them, or do something brittle like 
> "docker image ls" to deduce what images were built.
> It would be useful if the build produced a list of the images in some 
> consumable format that would be updated (either manually or automatically) 
> when the set of images changes. If it needs to be manually updated, we should 
> have some kind of check to verify that it isn't stale.
> One option is to produce JSON, something like:
> {noformat}
> {
>   “docker_images”: {
>“impala_base”: “$docker_registry/impala_base”,
>” impalad_coord_exec”: “$docker_registry/impalad_coord_exec”,
>   
>   }
> }
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8696) Cmake dependencies for container build do not work correctly

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-8696.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Cmake dependencies for container build do not work correctly
> 
>
> Key: IMPALA-8696
> URL: https://issues.apache.org/jira/browse/IMPALA-8696
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
> Fix For: Impala 3.3.0
>
>
> I've noticed in many cases that containers are not reliably rebuilt when 
> inputs change, e.g. if the binaries are rebuilt. Investigate and fix this, 
> it's a major development headache.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (IMPALA-8622) Generate list of docker images generated by the build

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-8622.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Generate list of docker images generated by the build
> -
>
> Key: IMPALA-8622
> URL: https://issues.apache.org/jira/browse/IMPALA-8622
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: docker
> Fix For: Impala 3.3.0
>
>
> Some consumers of the Impala docker images will want to publish them to a 
> repo. Currently they would have to hardcode a list of the docker images 
> produced by the Impala build to publish them, or do something brittle like 
> "docker image ls" to deduce what images were built.
> It would be useful if the build produced a list of the images in some 
> consumable format that would be updated (either manually or automatically) 
> when the set of images changes. If it needs to be manually updated, we should 
> have some kind of check to verify that it isn't stale.
> One option is to produce JSON, something like:
> {noformat}
> {
>   “docker_images”: {
>“impala_base”: “$docker_registry/impala_base”,
>” impalad_coord_exec”: “$docker_registry/impalad_coord_exec”,
>   
>   }
> }
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Work started] (IMPALA-8785) DEBUG build images should be tagged differently from release build

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8785 started by Tim Armstrong.
-
> DEBUG build images should be tagged differently from release build
> --
>
> Key: IMPALA-8785
> URL: https://issues.apache.org/jira/browse/IMPALA-8785
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Critical
>  Labels: docker
>
> Currently the container build gives debug and release images the same tag 
> names. This is fine for development, since you'll generally be working on one 
> build at a time, but is confusing if you need to integrate this into a more 
> general build process, since debug and release images are really separate 
> artifacts.
> One example I ran into is a case when I ran ./buildall.sh ... 
> -release_and_debug, then wanted to build docker images for the release 
> artifacts. This is not currently possible in a sane way.
> I think we should tag images with a -debug suffix if they were generated from 
> debug artifacts. start-impala-cluster would need to be smart about picking 
> the image to use. We may want to have different build targets to build the 
> different flavours of image, too.
> Note that there are many flavours of debug images, e.g. ASAN, but I think 
> simply separating release and debug would avoid a lot of confusion.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8434) Alter Db leads to functions missing unless run "refresh functions"

2019-07-23 Thread Quanlong Huang (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891493#comment-16891493
 ] 

Quanlong Huang commented on IMPALA-8434:


[~Xiaomeng Zhang], I think your solution make sense but I can't reproduce the 
hanging of "invalidate metadata" at first. Finally, I figured out that if you 
remove the codes for removing all versions of the tables and functions, the 
"invalidate metadata" command do hang. So I guest your changes may be something 
like this:
{code:java}
diff --git a/fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java 
b/fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java
index 13cb620..9ddb888 100644
--- a/fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java
+++ b/fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java
@@ -19,6 +19,7 @@ package org.apache.impala.catalog;
 
 import java.nio.ByteBuffer;
 import java.util.ArrayDeque;
+import java.util.List;
 import java.util.Set;
 import java.util.concurrent.atomic.AtomicLong;
 import java.util.concurrent.atomic.AtomicReference;
@@ -390,11 +391,16 @@ public class ImpaladCatalog extends Catalog implements 
FeCatalog {
   newDb.setCatalogVersion(catalogVersion);
   addDb(newDb);
   if (existingDb != null) {
+for (Table tbl : existingDb.getTables()) {
+  newDb.addTable(tbl);
+}
+for (List functionList : 
existingDb.getAllFunctions().values()) {
+  for (Function func : functionList) {
+newDb.addFunction(func);
+  }
+}
 CatalogObjectVersionSet.INSTANCE.updateVersions(
 existingDb.getCatalogVersion(), catalogVersion);
-CatalogObjectVersionSet.INSTANCE.removeAll(existingDb.getTables());
-CatalogObjectVersionSet.INSTANCE.removeAll(
-existingDb.getFunctions(null, new PatternMatcher()));
   } else {
 CatalogObjectVersionSet.INSTANCE.addVersion(catalogVersion);
   }
{code}
Actually, we shouldn't remove the codes calling 
"CatalogObjectVersionSet.INSTANCE.removeAll". Because "newDb.addTable" and 
"newDb.addFunction" will also add versions into the CatalogObjectVersionSet 
which is a *multi-set*. Removing those three lines causes the old table 
versions being double counted. So the minVersion_ of CatalogObjectVersionSet 
won't be updated since their counters can't decrease to 0.

For "invalidate metadata", in the coordinator side, database updates are 
processed prior to the table/function updates: 
[https://github.com/apache/impala/blob/eb97c746d2309fcf78ff3b50751cd5e27101539a/fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java#L124-L129].
 So even if we retain the old tables/functions of a db, they'll be updated 
later in processing the table/function updates of this db. So I think it's no 
problems for "invalidate metadata".

Just uploaded a patch for this: [https://gerrit.cloudera.org/c/13904/]

> Alter Db leads to functions missing unless run "refresh functions" 
> ---
>
> Key: IMPALA-8434
> URL: https://issues.apache.org/jira/browse/IMPALA-8434
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Xiaomeng Zhang
>Assignee: Xiaomeng Zhang
>Priority: Critical
>
> I was testing on master branch. In a database with java and native functions. 
> Run queries below, all functions are missing after alter db until run 
> "refresh functions" in db.
> {code:java}
> [localhost:21000] xm> show functions;
> Query: show functions
> +-++-+---+
> | return type | signature | binary type | is persistent |
> +-++-+---+
> | STRING | add10impala(STRING) | JAVA | true |
> | STRING | add10udf(STRING) | JAVA | true |
> | INT | add2(INT, INT) | NATIVE | true |
> | INT | add2xm(INT, INT) | NATIVE | true |
> | INT | addtwomaster(INT, INT) | NATIVE | true |
> +-++-+---+
> Fetched 5 row(s) in 0.01s
> [localhost:21000] xm> alter database xm set owner user impala218;
> Query: alter database xm set owner user impala218
> +---+
> | summary |
> +---+
> | Updated database. |
> +---+
> Fetched 1 row(s) in 0.59s
> [localhost:21000] xm> show functions;
> Query: show functions
> Fetched 0 row(s) in 0.01s
> [localhost:21000] xm> refresh functions xm;
> Query: refresh functions xm
> Query submitted at: 2019-04-18 14:19:00 (Coordinator: 
> http://xiaomeng-OptiPlex-9020:25000)
> Query progress can be monitored at: 
> http://xiaomeng-OptiPlex-9020:25000/query_plan?query_id=fa40cdffde223550:df2a6cc0
> Fetched 0 row(s) in 0.08s
> [localhost:21000] xm> show functions;
> Query: show functions
> 

[jira] [Commented] (IMPALA-8622) Generate list of docker images generated by the build

2019-07-23 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891480#comment-16891480
 ] 

ASF subversion and git services commented on IMPALA-8622:
-

Commit f689daef7f140fb2e227ee96218f31076b5ba257 in impala's branch 
refs/heads/master from Tim Armstrong
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f689dae ]

IMPALA-8622,IMPALA-8696: fix docker dependencies, add image list

Adds a plain-text space-separated image list in
docker/docker-images.txt. This is generated based on the images built by
CMake, so is kept in sync with images added to or removed from the
CMake file.

Duplicated logic per image is removed - instead there is a helper
function that is called for each daemon image to be built.

Rips out the timestamp mechanism that was intended to avoid unnecessary
container rebuilds, but has turned out to be brittle. Instead the
containers are rebuilt each time the rule is invoked.

This moves some subdirectories so that the image tag matches the
subdirectory, to simplify the build scripts.

Change-Id: I4d8e215e9b07c6491faa4751969a30f0ed373fe3
Reviewed-on: http://gerrit.cloudera.org:8080/13899
Tested-by: Impala Public Jenkins 
Reviewed-by: Lars Volker 


> Generate list of docker images generated by the build
> -
>
> Key: IMPALA-8622
> URL: https://issues.apache.org/jira/browse/IMPALA-8622
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: docker
>
> Some consumers of the Impala docker images will want to publish them to a 
> repo. Currently they would have to hardcode a list of the docker images 
> produced by the Impala build to publish them, or do something brittle like 
> "docker image ls" to deduce what images were built.
> It would be useful if the build produced a list of the images in some 
> consumable format that would be updated (either manually or automatically) 
> when the set of images changes. If it needs to be manually updated, we should 
> have some kind of check to verify that it isn't stale.
> One option is to produce JSON, something like:
> {noformat}
> {
>   “docker_images”: {
>“impala_base”: “$docker_registry/impala_base”,
>” impalad_coord_exec”: “$docker_registry/impalad_coord_exec”,
>   
>   }
> }
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8696) Cmake dependencies for container build do not work correctly

2019-07-23 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891481#comment-16891481
 ] 

ASF subversion and git services commented on IMPALA-8696:
-

Commit f689daef7f140fb2e227ee96218f31076b5ba257 in impala's branch 
refs/heads/master from Tim Armstrong
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f689dae ]

IMPALA-8622,IMPALA-8696: fix docker dependencies, add image list

Adds a plain-text space-separated image list in
docker/docker-images.txt. This is generated based on the images built by
CMake, so is kept in sync with images added to or removed from the
CMake file.

Duplicated logic per image is removed - instead there is a helper
function that is called for each daemon image to be built.

Rips out the timestamp mechanism that was intended to avoid unnecessary
container rebuilds, but has turned out to be brittle. Instead the
containers are rebuilt each time the rule is invoked.

This moves some subdirectories so that the image tag matches the
subdirectory, to simplify the build scripts.

Change-Id: I4d8e215e9b07c6491faa4751969a30f0ed373fe3
Reviewed-on: http://gerrit.cloudera.org:8080/13899
Tested-by: Impala Public Jenkins 
Reviewed-by: Lars Volker 


> Cmake dependencies for container build do not work correctly
> 
>
> Key: IMPALA-8696
> URL: https://issues.apache.org/jira/browse/IMPALA-8696
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>
> I've noticed in many cases that containers are not reliably rebuilt when 
> inputs change, e.g. if the binaries are rebuilt. Investigate and fix this, 
> it's a major development headache.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8616) Document --disconnected_session_timeout

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8616 started by Alex Rodoni.
---
> Document --disconnected_session_timeout
> ---
>
> Key: IMPALA-8616
> URL: https://issues.apache.org/jira/browse/IMPALA-8616
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Thomas Tauber-Marshall
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>
> IMPALA-1653 added a new startup flag, --disconnected_session_timeout, which 
> we should document.
> We might also want to document the new session behavior, that hs2 sessions 
> aren't automatically closed when the connection is closed, though I don't see 
> the old behavior documented anywhere so it may be sufficient just to explain 
> the flag.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8616) Document --disconnected_session_timeout

2019-07-23 Thread Alex Rodoni (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891470#comment-16891470
 ] 

Alex Rodoni commented on IMPALA-8616:
-

https://gerrit.cloudera.org/#/c/13903/

> Document --disconnected_session_timeout
> ---
>
> Key: IMPALA-8616
> URL: https://issues.apache.org/jira/browse/IMPALA-8616
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Thomas Tauber-Marshall
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>
> IMPALA-1653 added a new startup flag, --disconnected_session_timeout, which 
> we should document.
> We might also want to document the new session behavior, that hs2 sessions 
> aren't automatically closed when the connection is closed, though I don't see 
> the old behavior documented anywhere so it may be sufficient just to explain 
> the flag.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8746) Document the new query option DEFAULT_HINTS_INSERT_STATEMENT

2019-07-23 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891449#comment-16891449
 ] 

ASF subversion and git services commented on IMPALA-8746:
-

Commit 55e52287e7d8fb9f3a22a70c26627eabdcaa48d0 in impala's branch 
refs/heads/master from Alex Rodoni
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=55e5228 ]

IMPALA-8746: [DOCS] Document the DEFAULT_HINTS_INSERT_STATEMENT query option

Change-Id: Ia376721f46eb507901f9f64b5c3341dc0f36475b
Reviewed-on: http://gerrit.cloudera.org:8080/13885
Tested-by: Impala Public Jenkins 
Reviewed-by: Bharath Vissapragada 


> Document the new query option DEFAULT_HINTS_INSERT_STATEMENT
> 
>
> Key: IMPALA-8746
> URL: https://issues.apache.org/jira/browse/IMPALA-8746
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Abhishek Rawat
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> Related to IMPALA-8673.
> https://gerrit.cloudera.org/#/c/13885/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8765) Document JSON Udfs

2019-07-23 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891451#comment-16891451
 ] 

ASF subversion and git services commented on IMPALA-8765:
-

Commit 1f4e89e66ed45cb28dfa2c98023464621b3ee7ab in impala's branch 
refs/heads/master from Alex Rodoni
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=1f4e89e ]

IMPALA-8765: [DOCS] Document the GET_JSON_OBJECT function

Change-Id: I7135528e84f685bfe1c32d81f4cedb6afc133e04
Reviewed-on: http://gerrit.cloudera.org:8080/13886
Tested-by: Impala Public Jenkins 
Reviewed-by: Quanlong Huang 


> Document JSON Udfs
> --
>
> Key: IMPALA-8765
> URL: https://issues.apache.org/jira/browse/IMPALA-8765
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> It looks like we missed documenting the new builtin from the parent JIRA
> https://gerrit.cloudera.org/#/c/13886/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8701) Document idle session management after IMPALA-7802

2019-07-23 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891450#comment-16891450
 ] 

ASF subversion and git services commented on IMPALA-8701:
-

Commit 1c9af2cb526b038c080f1b09acc403e8d7378364 in impala's branch 
refs/heads/master from Alex Rodoni
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=1c9af2c ]

IMPALA-8701: [DOCS] Document --idle_client_poll_time_s flag

Change-Id: I32ace786904f564b9c5fa3ed594e2b679b76d5c6
Reviewed-on: http://gerrit.cloudera.org:8080/13896
Tested-by: Impala Public Jenkins 
Reviewed-by: Michael Ho 


> Document idle session management after IMPALA-7802
> --
>
> Key: IMPALA-8701
> URL: https://issues.apache.org/jira/browse/IMPALA-8701
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Affects Versions: Impala 3.3.0
>Reporter: Michael Ho
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> After IMPALA-7802 is fixed, the network connection of an idle session will be 
> closed {{\-\-idle_client_poll_time_s}} seconds latest after the session has 
> been declared idle. This is a change of behavior from previous versions in 
> which the client connection will hang around until the user explicitly closes 
> the session and disconnects.
> By default, {{\-\-idle_client_poll_time_s}} is set to 30 seconds and the 
> previous behavior can be restored by setting to 0. In addition, the session 
> will only be closed if idle session timeout has been configured to be greater 
> than 0 for the session either via the startup flag {{--idle_session_timeout}} 
> or the query option {{IDLE_SESSION_TIMEOUT}}. If idle session timeout is not 
> configured, a session cannot become idle by definition and therefore its 
> connection cannot be closed until the client explicitly closes it.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-8765) Document JSON Udfs

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8765.
---

> Document JSON Udfs
> --
>
> Key: IMPALA-8765
> URL: https://issues.apache.org/jira/browse/IMPALA-8765
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> It looks like we missed documenting the new builtin from the parent JIRA
> https://gerrit.cloudera.org/#/c/13886/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Closed] (IMPALA-8765) Document JSON Udfs

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8765.
---

> Document JSON Udfs
> --
>
> Key: IMPALA-8765
> URL: https://issues.apache.org/jira/browse/IMPALA-8765
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> It looks like we missed documenting the new builtin from the parent JIRA
> https://gerrit.cloudera.org/#/c/13886/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8765) Document JSON Udfs

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8765.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Document JSON Udfs
> --
>
> Key: IMPALA-8765
> URL: https://issues.apache.org/jira/browse/IMPALA-8765
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> It looks like we missed documenting the new builtin from the parent JIRA
> https://gerrit.cloudera.org/#/c/13886/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8765) Document JSON Udfs

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8765.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Document JSON Udfs
> --
>
> Key: IMPALA-8765
> URL: https://issues.apache.org/jira/browse/IMPALA-8765
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> It looks like we missed documenting the new builtin from the parent JIRA
> https://gerrit.cloudera.org/#/c/13886/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (IMPALA-8787) Add compressed formats test for LZO

2019-07-23 Thread Ethan (JIRA)
Ethan created IMPALA-8787:
-

 Summary: Add compressed formats test for LZO
 Key: IMPALA-8787
 URL: https://issues.apache.org/jira/browse/IMPALA-8787
 Project: IMPALA
  Issue Type: Test
  Components: Backend
Reporter: Ethan


Currently, tests/query_test/test_compressed_formats.py, verifies that gzip, 
snappy, bzip, and deflate compressed RC, sequence, and text files are supported 
in Impala. There needs to be testing added for reading text files compressed 
using LZO.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8787) Add compressed formats test for LZO

2019-07-23 Thread Ethan (JIRA)
Ethan created IMPALA-8787:
-

 Summary: Add compressed formats test for LZO
 Key: IMPALA-8787
 URL: https://issues.apache.org/jira/browse/IMPALA-8787
 Project: IMPALA
  Issue Type: Test
  Components: Backend
Reporter: Ethan


Currently, tests/query_test/test_compressed_formats.py, verifies that gzip, 
snappy, bzip, and deflate compressed RC, sequence, and text files are supported 
in Impala. There needs to be testing added for reading text files compressed 
using LZO.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Work started] (IMPALA-7991) Impala 3.3 Doc: Doc Parquet page index

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-7991 started by Alex Rodoni.
---
> Impala 3.3 Doc: Doc Parquet page index
> --
>
> Key: IMPALA-7991
> URL: https://issues.apache.org/jira/browse/IMPALA-7991
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7991) Impala 3.3 Doc: Doc Parquet page index

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-7991:

Description: https://gerrit.cloudera.org/#/c/13900/

> Impala 3.3 Doc: Doc Parquet page index
> --
>
> Key: IMPALA-7991
> URL: https://issues.apache.org/jira/browse/IMPALA-7991
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>
> https://gerrit.cloudera.org/#/c/13900/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8470) Impala Doc: Use a more human-readable flag to switch to a different authorization provider

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8470.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

An example of the flag added to impala_authorization.xml.

> Impala Doc: Use a more human-readable flag to switch to a different 
> authorization provider
> --
>
> Key: IMPALA-8470
> URL: https://issues.apache.org/jira/browse/IMPALA-8470
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (IMPALA-8470) Impala Doc: Use a more human-readable flag to switch to a different authorization provider

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8470.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

An example of the flag added to impala_authorization.xml.

> Impala Doc: Use a more human-readable flag to switch to a different 
> authorization provider
> --
>
> Key: IMPALA-8470
> URL: https://issues.apache.org/jira/browse/IMPALA-8470
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-8470) Impala Doc: Use a more human-readable flag to switch to a different authorization provider

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8470.
---

> Impala Doc: Use a more human-readable flag to switch to a different 
> authorization provider
> --
>
> Key: IMPALA-8470
> URL: https://issues.apache.org/jira/browse/IMPALA-8470
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Closed] (IMPALA-8470) Impala Doc: Use a more human-readable flag to switch to a different authorization provider

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8470.
---

> Impala Doc: Use a more human-readable flag to switch to a different 
> authorization provider
> --
>
> Key: IMPALA-8470
> URL: https://issues.apache.org/jira/browse/IMPALA-8470
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-8701) Document idle session management after IMPALA-7802

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8701.
---
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Document idle session management after IMPALA-7802
> --
>
> Key: IMPALA-8701
> URL: https://issues.apache.org/jira/browse/IMPALA-8701
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Affects Versions: Impala 3.3.0
>Reporter: Michael Ho
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> After IMPALA-7802 is fixed, the network connection of an idle session will be 
> closed {{\-\-idle_client_poll_time_s}} seconds latest after the session has 
> been declared idle. This is a change of behavior from previous versions in 
> which the client connection will hang around until the user explicitly closes 
> the session and disconnects.
> By default, {{\-\-idle_client_poll_time_s}} is set to 30 seconds and the 
> previous behavior can be restored by setting to 0. In addition, the session 
> will only be closed if idle session timeout has been configured to be greater 
> than 0 for the session either via the startup flag {{--idle_session_timeout}} 
> or the query option {{IDLE_SESSION_TIMEOUT}}. If idle session timeout is not 
> configured, a session cannot become idle by definition and therefore its 
> connection cannot be closed until the client explicitly closes it.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-5278) Add Kerberos HTTP SPNEGO authentication support to Impala web consoles

2019-07-23 Thread Chris A (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891367#comment-16891367
 ] 

Chris A commented on IMPALA-5278:
-

Thank you! 

> Add Kerberos HTTP SPNEGO authentication support to Impala web consoles
> --
>
> Key: IMPALA-5278
> URL: https://issues.apache.org/jira/browse/IMPALA-5278
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Catalog, Clients
>Reporter: Chris A
>Priority: Major
>  Labels: kerberos, security, spnego, web-console, web-ui
> Fix For: Impala 3.3.0
>
>
> https://issues.apache.org/jira/browse/HADOOP-7119 and 
> https://issues.apache.org/jira/browse/HBASE-5291 show SPNEGO being added to 
> web consoles and web UIs to provide security through Kerberos (as a single 
> sign-on solution)
> According to current documentation 
> (https://www.cloudera.com/documentation/enterprise/5-5-x/topics/impala_kerberos.html),
>  Kerberos HTTP SPNEGO is not supported in Impala's Web UIs. Is there a way to 
> implement this in the Impala WebUIs?



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8575) Impala Doc: New query options for Parquet writing

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8575 started by Alex Rodoni.
---
> Impala Doc: New query options for Parquet writing
> -
>
> Key: IMPALA-8575
> URL: https://issues.apache.org/jira/browse/IMPALA-8575
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-8575) Impala Doc: New query options for Parquet writing

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-8575:

Description: https://gerrit.cloudera.org/#/c/13900/

> Impala Doc: New query options for Parquet writing
> -
>
> Key: IMPALA-8575
> URL: https://issues.apache.org/jira/browse/IMPALA-8575
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
>
> https://gerrit.cloudera.org/#/c/13900/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8786) BufferedPlanRootSink should directly write to a QueryResultSet if one is available

2019-07-23 Thread Sahil Takiar (JIRA)
Sahil Takiar created IMPALA-8786:


 Summary: BufferedPlanRootSink should directly write to a 
QueryResultSet if one is available
 Key: IMPALA-8786
 URL: https://issues.apache.org/jira/browse/IMPALA-8786
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Sahil Takiar
Assignee: Sahil Takiar


{{BufferedPlanRootSink}} uses a {{RowBatchQueue}} to buffer {{RowBatch}}-es and 
then the consumer thread reads them and writes them to a given 
{{QueryResultSet}}. Implementations of {{RowBatchQueue}} might end up copying 
the buffered {{RowBatch}}-es (e.g. if the queue is backed by a 
{{BufferedTupleStream}}). An optimization would be for the producer thread to 
directly write to the consumer {{QueryResultSet}}. This optimization would only 
be triggered if (1) the queue is empty, and (2) the consumer thread has a 
{{QueryResultSet}} available for writing.

This "fast path" is useful in a few different scenarios:
 * If the consumer is faster than at reading rows than the producer is at 
sending them; in this case, the overhead of buffering rows in a 
{{RowBatchQueue}} can be completely avoided
 * For queries that return under 1024 its likely that the consumer will produce 
a {{QueryResultSet}} before the first {{RowBatch}} is returned (except perhaps 
for very trivial queries)



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8786) BufferedPlanRootSink should directly write to a QueryResultSet if one is available

2019-07-23 Thread Sahil Takiar (JIRA)
Sahil Takiar created IMPALA-8786:


 Summary: BufferedPlanRootSink should directly write to a 
QueryResultSet if one is available
 Key: IMPALA-8786
 URL: https://issues.apache.org/jira/browse/IMPALA-8786
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Sahil Takiar
Assignee: Sahil Takiar


{{BufferedPlanRootSink}} uses a {{RowBatchQueue}} to buffer {{RowBatch}}-es and 
then the consumer thread reads them and writes them to a given 
{{QueryResultSet}}. Implementations of {{RowBatchQueue}} might end up copying 
the buffered {{RowBatch}}-es (e.g. if the queue is backed by a 
{{BufferedTupleStream}}). An optimization would be for the producer thread to 
directly write to the consumer {{QueryResultSet}}. This optimization would only 
be triggered if (1) the queue is empty, and (2) the consumer thread has a 
{{QueryResultSet}} available for writing.

This "fast path" is useful in a few different scenarios:
 * If the consumer is faster than at reading rows than the producer is at 
sending them; in this case, the overhead of buffering rows in a 
{{RowBatchQueue}} can be completely avoided
 * For queries that return under 1024 its likely that the consumer will produce 
a {{QueryResultSet}} before the first {{RowBatch}} is returned (except perhaps 
for very trivial queries)



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (IMPALA-8785) DEBUG build images should be tagged differently from release build

2019-07-23 Thread Tim Armstrong (JIRA)
Tim Armstrong created IMPALA-8785:
-

 Summary: DEBUG build images should be tagged differently from 
release build
 Key: IMPALA-8785
 URL: https://issues.apache.org/jira/browse/IMPALA-8785
 Project: IMPALA
  Issue Type: Improvement
  Components: Infrastructure
Affects Versions: Impala 3.3.0
Reporter: Tim Armstrong
Assignee: Tim Armstrong


Currently the container build gives debug and release images the same tag 
names. This is fine for development, since you'll generally be working on one 
build at a time, but is confusing if you need to integrate this into a more 
general build process, since debug and release images are really separate 
artifacts.

One example I ran into is a case when I ran ./buildall.sh ... 
-release_and_debug, then wanted to build docker images for the release 
artifacts. This is not currently possible in a sane way.

I think we should tag images with a -debug suffix if they were generated from 
debug artifacts. start-impala-cluster would need to be smart about picking the 
image to use. We may want to have different build targets to build the 
different flavours of image, too.

Note that there are many flavours of debug images, e.g. ASAN, but I think 
simply separating release and debug would avoid a lot of confusion.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (IMPALA-8785) DEBUG build images should be tagged differently from release build

2019-07-23 Thread Tim Armstrong (JIRA)
Tim Armstrong created IMPALA-8785:
-

 Summary: DEBUG build images should be tagged differently from 
release build
 Key: IMPALA-8785
 URL: https://issues.apache.org/jira/browse/IMPALA-8785
 Project: IMPALA
  Issue Type: Improvement
  Components: Infrastructure
Affects Versions: Impala 3.3.0
Reporter: Tim Armstrong
Assignee: Tim Armstrong


Currently the container build gives debug and release images the same tag 
names. This is fine for development, since you'll generally be working on one 
build at a time, but is confusing if you need to integrate this into a more 
general build process, since debug and release images are really separate 
artifacts.

One example I ran into is a case when I ran ./buildall.sh ... 
-release_and_debug, then wanted to build docker images for the release 
artifacts. This is not currently possible in a sane way.

I think we should tag images with a -debug suffix if they were generated from 
debug artifacts. start-impala-cluster would need to be smart about picking the 
image to use. We may want to have different build targets to build the 
different flavours of image, too.

Note that there are many flavours of debug images, e.g. ASAN, but I think 
simply separating release and debug would avoid a lot of confusion.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8784) Implement a RowBatchQueue backed by a BufferedTupleStream

2019-07-23 Thread Sahil Takiar (JIRA)
Sahil Takiar created IMPALA-8784:


 Summary: Implement a RowBatchQueue backed by a BufferedTupleStream
 Key: IMPALA-8784
 URL: https://issues.apache.org/jira/browse/IMPALA-8784
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Sahil Takiar
Assignee: Sahil Takiar


The {{BufferedPlanRootSink}} should use a {{RowBatchQueue}} backed by a 
{{BufferedTupleStream}}. This requires the following changes:
 * Creating a new {{SpillableRowBatchQueue}} that implements {{RowBatchQueue}} 
and internally uses a {{BufferedTupleStream}}
 * Changing the implementation of {{RowBatchQueue}} used by 
{{BufferedPlanRootSink}} to {{SpillableRowBatchQueue}}
 * Update {{PlanRootSink.java}} so that it sets a {{ResourceProfile}} that 
should be used by the {{BufferedPlanRootSink}}
 * Update {{DataSinks.thrift}} so that it passes {{ResourceProfile}}-s from the 
fe/ to the be/
 * {{BufferedPlanRootSink}} should Initialize and close a 
{{ReservationManager}} to be used by the {{BufferedTupleStream}}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-8781) Add additional tests in test_result_spooling.py and validate cancellation logic

2019-07-23 Thread Sahil Takiar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sahil Takiar updated IMPALA-8781:
-
Description: {{test_result_spooling.py}} currently runs a few basic tests 
with result spooling enabled. We should add some more to cover all necessary 
edge cases (ensure all Impala types are returned correctly, UDFs are evaluated 
correctly, etc.) and add tests to validate the cancellation logic in 
{{PlanRootSink}}.  (was: {{test_result_spooling.py}} currently runs a few basic 
tests with result spooling enabled. We should add some more and validate the 
cancellation logic in {{PlanRootSink}}.)

> Add additional tests in test_result_spooling.py and validate cancellation 
> logic
> ---
>
> Key: IMPALA-8781
> URL: https://issues.apache.org/jira/browse/IMPALA-8781
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Sahil Takiar
>Assignee: Sahil Takiar
>Priority: Major
>
> {{test_result_spooling.py}} currently runs a few basic tests with result 
> spooling enabled. We should add some more to cover all necessary edge cases 
> (ensure all Impala types are returned correctly, UDFs are evaluated 
> correctly, etc.) and add tests to validate the cancellation logic in 
> {{PlanRootSink}}.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-8780) Implementation of BufferedPlanRootSink where FlushFinal blocks until all rows are fetched

2019-07-23 Thread Sahil Takiar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sahil Takiar updated IMPALA-8780:
-
Description: 
Implement {{BufferedPlanRootSink}} so that {{FlushFinal}} blocks until all rows 
are fetched. The implementation should use the {{RowBatchQueue}} introduced by 
IMPALA-8779. By blocking in {{FlushFinal}} all non-coordinator fragments will 
be closed if all results fit in the {{RowBatchQueue}}. 
{{BufferedPlanRootSink::Send}} should enqueue each given {{RowBatch}} onto the 
queue and then return. If the queue is full, it should block until there is 
more space left in the queue. {{BufferedPlanRootSink::GetNext}} reads from the 
queue and then fills in the given {{QueryResultSet}} by using the {{DataSink}} 
{{ScalarExprEvaluator}}-s. Since the producer thread can call 
{{BufferedPlanRootSink::Close}} while the consumer is calling 
{{BufferedPlanRootSink::GetNext}} the two methods need to be synchronized so 
that the {{DataSink}} {{MemTracker}}-s are not closed while {{GetNext}} is 
running.

The implementation of {{BufferedPlanRootSink}} should remain the same 
regardless of whether a {{std::queue}} backed {{RowBatchQueue}} or a 
{{BufferedTupleStream}} backed {{RowBatchQueue}} is used.

{{BufferedPlanRootSink}} and {{BlockingPlanRootSink}} are similar in the sense 
that {{BlockingPlanRootSink}} buffers one {{RowBatch}}, so for queries that 
return under 1024 rows, all non-coordinator fragments are closed immediately as 
well. The advantage of {{BufferedPlanRootSink}} is that allows buffering for 1+ 
{{RowBatch}}-es.

  was:Implement {{BufferedPlanRootSink}} so that {{FlushFinal}} blocks until 
all rows are fetched. The implementation should use the {{RowBatchQueue}} 
introduced by IMPALA-8779. By blocking in {{FlushFinal}} all non-coordinator 
fragments will be closed if all results fit in the {{RowBatchQueue}}.


> Implementation of BufferedPlanRootSink where FlushFinal blocks until all rows 
> are fetched
> -
>
> Key: IMPALA-8780
> URL: https://issues.apache.org/jira/browse/IMPALA-8780
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Sahil Takiar
>Assignee: Sahil Takiar
>Priority: Major
>
> Implement {{BufferedPlanRootSink}} so that {{FlushFinal}} blocks until all 
> rows are fetched. The implementation should use the {{RowBatchQueue}} 
> introduced by IMPALA-8779. By blocking in {{FlushFinal}} all non-coordinator 
> fragments will be closed if all results fit in the {{RowBatchQueue}}. 
> {{BufferedPlanRootSink::Send}} should enqueue each given {{RowBatch}} onto 
> the queue and then return. If the queue is full, it should block until there 
> is more space left in the queue. {{BufferedPlanRootSink::GetNext}} reads from 
> the queue and then fills in the given {{QueryResultSet}} by using the 
> {{DataSink}} {{ScalarExprEvaluator}}-s. Since the producer thread can call 
> {{BufferedPlanRootSink::Close}} while the consumer is calling 
> {{BufferedPlanRootSink::GetNext}} the two methods need to be synchronized so 
> that the {{DataSink}} {{MemTracker}}-s are not closed while {{GetNext}} is 
> running.
> The implementation of {{BufferedPlanRootSink}} should remain the same 
> regardless of whether a {{std::queue}} backed {{RowBatchQueue}} or a 
> {{BufferedTupleStream}} backed {{RowBatchQueue}} is used.
> {{BufferedPlanRootSink}} and {{BlockingPlanRootSink}} are similar in the 
> sense that {{BlockingPlanRootSink}} buffers one {{RowBatch}}, so for queries 
> that return under 1024 rows, all non-coordinator fragments are closed 
> immediately as well. The advantage of {{BufferedPlanRootSink}} is that allows 
> buffering for 1+ {{RowBatch}}-es.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-8746) Document the new query option DEFAULT_HINTS_INSERT_STATEMENT

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8746.
---

> Document the new query option DEFAULT_HINTS_INSERT_STATEMENT
> 
>
> Key: IMPALA-8746
> URL: https://issues.apache.org/jira/browse/IMPALA-8746
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Abhishek Rawat
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> Related to IMPALA-8673.
> https://gerrit.cloudera.org/#/c/13885/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-8746) Document the new query option DEFAULT_HINTS_INSERT_STATEMENT

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8746.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Document the new query option DEFAULT_HINTS_INSERT_STATEMENT
> 
>
> Key: IMPALA-8746
> URL: https://issues.apache.org/jira/browse/IMPALA-8746
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Abhishek Rawat
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> Related to IMPALA-8673.
> https://gerrit.cloudera.org/#/c/13885/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (IMPALA-8746) Document the new query option DEFAULT_HINTS_INSERT_STATEMENT

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni resolved IMPALA-8746.
-
   Resolution: Fixed
Fix Version/s: Impala 3.3.0

> Document the new query option DEFAULT_HINTS_INSERT_STATEMENT
> 
>
> Key: IMPALA-8746
> URL: https://issues.apache.org/jira/browse/IMPALA-8746
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Abhishek Rawat
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> Related to IMPALA-8673.
> https://gerrit.cloudera.org/#/c/13885/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-8746) Document the new query option DEFAULT_HINTS_INSERT_STATEMENT

2019-07-23 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-8746.
---

> Document the new query option DEFAULT_HINTS_INSERT_STATEMENT
> 
>
> Key: IMPALA-8746
> URL: https://issues.apache.org/jira/browse/IMPALA-8746
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Reporter: Abhishek Rawat
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc, in_33
> Fix For: Impala 3.3.0
>
>
> Related to IMPALA-8673.
> https://gerrit.cloudera.org/#/c/13885/



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (IMPALA-8783) Add Kerberos SPNEGO support to the http hs2 server

2019-07-23 Thread Thomas Tauber-Marshall (JIRA)
Thomas Tauber-Marshall created IMPALA-8783:
--

 Summary: Add Kerberos SPNEGO support to the http hs2 server
 Key: IMPALA-8783
 URL: https://issues.apache.org/jira/browse/IMPALA-8783
 Project: IMPALA
  Issue Type: Bug
  Components: Clients
Affects Versions: Impala 3.3.0
Reporter: Thomas Tauber-Marshall
Assignee: Thomas Tauber-Marshall


IMPALA-8538 added support for http connections to the hs2 server along with 
LDAP auth support. We should also add support for Kerberos auth via SPNEGO



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8783) Add Kerberos SPNEGO support to the http hs2 server

2019-07-23 Thread Thomas Tauber-Marshall (JIRA)
Thomas Tauber-Marshall created IMPALA-8783:
--

 Summary: Add Kerberos SPNEGO support to the http hs2 server
 Key: IMPALA-8783
 URL: https://issues.apache.org/jira/browse/IMPALA-8783
 Project: IMPALA
  Issue Type: Bug
  Components: Clients
Affects Versions: Impala 3.3.0
Reporter: Thomas Tauber-Marshall
Assignee: Thomas Tauber-Marshall


IMPALA-8538 added support for http connections to the hs2 server along with 
LDAP auth support. We should also add support for Kerberos auth via SPNEGO



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Work started] (IMPALA-8696) Cmake dependencies for container build do not work correctly

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8696 started by Tim Armstrong.
-
> Cmake dependencies for container build do not work correctly
> 
>
> Key: IMPALA-8696
> URL: https://issues.apache.org/jira/browse/IMPALA-8696
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Affects Versions: Impala 3.3.0
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>
> I've noticed in many cases that containers are not reliably rebuilt when 
> inputs change, e.g. if the binaries are rebuilt. Investigate and fix this, 
> it's a major development headache.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-8622) Generate list of docker images generated by the build

2019-07-23 Thread Tim Armstrong (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-8622 started by Tim Armstrong.
-
> Generate list of docker images generated by the build
> -
>
> Key: IMPALA-8622
> URL: https://issues.apache.org/jira/browse/IMPALA-8622
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Infrastructure
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>  Labels: docker
>
> Some consumers of the Impala docker images will want to publish them to a 
> repo. Currently they would have to hardcode a list of the docker images 
> produced by the Impala build to publish them, or do something brittle like 
> "docker image ls" to deduce what images were built.
> It would be useful if the build produced a list of the images in some 
> consumable format that would be updated (either manually or automatically) 
> when the set of images changes. If it needs to be manually updated, we should 
> have some kind of check to verify that it isn't stale.
> One option is to produce JSON, something like:
> {noformat}
> {
>   “docker_images”: {
>“impala_base”: “$docker_registry/impala_base”,
>” impalad_coord_exec”: “$docker_registry/impalad_coord_exec”,
>   
>   }
> }
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-5278) Add Kerberos HTTP SPNEGO authentication support to Impala web consoles

2019-07-23 Thread Thomas Tauber-Marshall (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Tauber-Marshall resolved IMPALA-5278.

   Resolution: Fixed
Fix Version/s: Impala 3.3.0

commit 9f0cd9743a9c364d1eb42f29f67494298ed574ae
Author: Todd Lipcon 
Date:   Mon Jul 1 16:59:24 2019 -0700

Support SPNEGO for Impala webserver

This ports over changes from kudu commit
1f291b77ef0868ac888a850678adc2d7cce65529 which implemented SPNEGO for
the Kudu webserver.

Unfortunately, thorough testing of this is difficult given that curl
isn't currently in the toolchain. I was able to manually test this by
adding a 'sleep(1000)' call into the newly added test case, then setting
up $KRB5_CONFIG in my shell to point to the temporary KDC's environment,
and using 'curl -u : --negotiate http://...' to authenticate.

Strangely, using the version of curl on el7 didn't seem to work properly
(perhaps an el7 curl bug) but using curl on my Ubuntu 18 laptop I was
able to authenticate with SPNEGO.

Change-Id: Ife2b04310e1571d231bf8ee1bcfd3b7afc2edd8f
Reviewed-on: http://gerrit.cloudera.org:8080/13774
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 

> Add Kerberos HTTP SPNEGO authentication support to Impala web consoles
> --
>
> Key: IMPALA-5278
> URL: https://issues.apache.org/jira/browse/IMPALA-5278
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Catalog, Clients
>Reporter: Chris A
>Priority: Major
>  Labels: kerberos, security, spnego, web-console, web-ui
> Fix For: Impala 3.3.0
>
>
> https://issues.apache.org/jira/browse/HADOOP-7119 and 
> https://issues.apache.org/jira/browse/HBASE-5291 show SPNEGO being added to 
> web consoles and web UIs to provide security through Kerberos (as a single 
> sign-on solution)
> According to current documentation 
> (https://www.cloudera.com/documentation/enterprise/5-5-x/topics/impala_kerberos.html),
>  Kerberos HTTP SPNEGO is not supported in Impala's Web UIs. Is there a way to 
> implement this in the Impala WebUIs?



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-5278) Add Kerberos HTTP SPNEGO authentication support to Impala web consoles

2019-07-23 Thread Thomas Tauber-Marshall (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Tauber-Marshall resolved IMPALA-5278.

   Resolution: Fixed
Fix Version/s: Impala 3.3.0

commit 9f0cd9743a9c364d1eb42f29f67494298ed574ae
Author: Todd Lipcon 
Date:   Mon Jul 1 16:59:24 2019 -0700

Support SPNEGO for Impala webserver

This ports over changes from kudu commit
1f291b77ef0868ac888a850678adc2d7cce65529 which implemented SPNEGO for
the Kudu webserver.

Unfortunately, thorough testing of this is difficult given that curl
isn't currently in the toolchain. I was able to manually test this by
adding a 'sleep(1000)' call into the newly added test case, then setting
up $KRB5_CONFIG in my shell to point to the temporary KDC's environment,
and using 'curl -u : --negotiate http://...' to authenticate.

Strangely, using the version of curl on el7 didn't seem to work properly
(perhaps an el7 curl bug) but using curl on my Ubuntu 18 laptop I was
able to authenticate with SPNEGO.

Change-Id: Ife2b04310e1571d231bf8ee1bcfd3b7afc2edd8f
Reviewed-on: http://gerrit.cloudera.org:8080/13774
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 

> Add Kerberos HTTP SPNEGO authentication support to Impala web consoles
> --
>
> Key: IMPALA-5278
> URL: https://issues.apache.org/jira/browse/IMPALA-5278
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Catalog, Clients
>Reporter: Chris A
>Priority: Major
>  Labels: kerberos, security, spnego, web-console, web-ui
> Fix For: Impala 3.3.0
>
>
> https://issues.apache.org/jira/browse/HADOOP-7119 and 
> https://issues.apache.org/jira/browse/HBASE-5291 show SPNEGO being added to 
> web consoles and web UIs to provide security through Kerberos (as a single 
> sign-on solution)
> According to current documentation 
> (https://www.cloudera.com/documentation/enterprise/5-5-x/topics/impala_kerberos.html),
>  Kerberos HTTP SPNEGO is not supported in Impala's Web UIs. Is there a way to 
> implement this in the Impala WebUIs?



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Commented] (IMPALA-7936) Enable better control over Parquet writing

2019-07-23 Thread JIRA


[ 
https://issues.apache.org/jira/browse/IMPALA-7936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891141#comment-16891141
 ] 

Zoltán Borók-Nagy commented on IMPALA-7936:
---

Normally the users won't need to set these query options, but I think it would 
be better to have these documented anyway.

> Enable better control over Parquet writing
> --
>
> Key: IMPALA-7936
> URL: https://issues.apache.org/jira/browse/IMPALA-7936
> Project: IMPALA
>  Issue Type: Improvement
>Reporter: Zoltán Borók-Nagy
>Assignee: Zoltán Borók-Nagy
>Priority: Major
> Fix For: Impala 3.3.0
>
>
> With the introduction of the Parquet page indexes it became desirable to have 
> more control over how Impala writes Parquet files.
> These configuration options (probably implemented as query options) would be:
>  * enable/disable Parquet page index writing (currently we can do it with a 
> command-line argument)
>  * set page-size limits based on row count
>  * -Set truncation length for statistics about string values-   (current 
> truncation length is 64, it is unlikely to have user data that needs longer 
> truncation than that)
> They'd enable writing more complete tests for page filtering. They'd be also 
> useful for fine-tuning the page index for better performance.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8742) ScannerContext::eosr() should depends on scan_range->bytes_to_read() instead of scan_range->len()

2019-07-23 Thread JIRA


[ 
https://issues.apache.org/jira/browse/IMPALA-8742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891133#comment-16891133
 ] 

Zoltán Borók-Nagy commented on IMPALA-8742:
---

Stream::bytes_left() also uses ScanRange::len() instead of bytes_to_read(). We 
don't hit this bug because in the Parquet scanner we issue Stream::GetBuffer() 
only in ReadPageHeader() and we track how many pages we need to read.

There is a TODO in BaseScalarColumnReader::ReadDataPage() about adding a check 
for stream_->eosr(), maybe we should add that check as well after fixing this 
bug. I think I'll add some backend tests for the Stream class, unfortunately 
there isn't any yet.

The other scanners don't issue sub-ranges.

> ScannerContext::eosr() should depends on scan_range->bytes_to_read() instead 
> of scan_range->len()
> -
>
> Key: IMPALA-8742
> URL: https://issues.apache.org/jira/browse/IMPALA-8742
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.2.0
>Reporter: Chen Wang
>Assignee: Zoltán Borók-Nagy
>Priority: Minor
>
> Normally, scan_range->len() equals to scan_range->bytes_to_read().
> But when SubRanges exists, only bytes_to_read was updated, 
> ScannerContext::GetBuffer would read entire scan_range->len() bytes, and I 
> think it would be a bug.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-8782) Impala errors can make profiles huge

2019-07-23 Thread Gabor Kaszab (JIRA)
Gabor Kaszab created IMPALA-8782:


 Summary: Impala errors can make profiles huge
 Key: IMPALA-8782
 URL: https://issues.apache.org/jira/browse/IMPALA-8782
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Affects Versions: Impala 2.9.0
Reporter: Gabor Kaszab


We observed that in some cases Impala could flood the query profile with error 
messages causing other systems that read the profiles crash (e.g. Cloudera 
Manager) due to the size of the profile. Let's consider limiting the number of 
errors we write into the profile.

In the particular case I saw the profile was full of these errors, one of this 
for each row:
{code:java}
Error converting column: 4 to TIMESTAMP\nError converting column: 10 to 
INT\nError converting column: 11 to INT\nError converting column: 13 to 
INT\nError converting column: 15 to INT\nError converting column: 16 to 
INT\nError converting column: 19 to INT\nError converting column: 20 to 
INT\nError converting column: 21 to INT\nError converting column: 22 to 
INT\nError converting column: 23 to INT\nError converting column: 24 to 
INT\nError converting column: 26 to INT\nError converting column: 29 to 
INT\nError converting column: 31 to INT\nError converting column: 35 to 
INT\nError converting column: 36 to INT\nError converting column: 37 to 
INT\nError converting column: 39 to INT\nError converting column: 40 to 
INT\nError converting column: 41 to INT\nError converting column: 42 to 
INT\nError converting column: 43 to INT\nError converting column: 44 to 
INT\nError converting column: 45 to INT\nError converting column: 46 to 
INT\nError converting column: 48 to INT\nError converting column: 50 to 
INT\nError converting column: 51 to INT\nError converting column: 53 to 
INT\nError converting column: 54 to INT\nError converting column: 59 to 
INT\nError converting column: 60 to INT\nError converting column: 63 to 
INT\nError converting column: 64 to INT\nError converting column: 65 to 
INT\nError converting column: 69 to INT\nError converting column: 70 to 
INT\nError converting column: 71 to INT\nError converting column: 72 to 
INT\nError converting column: 73 to INT\nError converting column: 75 to 
INT\nError converting column: 76 to INT\nError converting column: 77 to 
INT\nError converting column: 78 to INT\nError converting column: 79 to 
INT\nError converting column: 80 to INT\nError converting column: 81 to 
INT\nError converting column: 82 to INT\nError converting column: 83 to 
INT\nError converting column: 84 to INT\nError converting column: 85 to 
INT\nError converting column: 87 to INT\nError converting column: 88 to 
INT\nError converting column: 90 to INT\nError converting column: 91 to 
INT\nError converting column: 95 to INT\nError converting column: 96 to 
INT\nError converting column: 97 to INT\nError converting column: 99 to 
INT\nError converting column: 102 to INT\nError converting column: 103 to 
INT\nError converting column: 104 to INT\nError converting column: 106 to 
INT\nError converting column: 107 to INT\nError converting column: 109 to 
INT\nError converting column: 117 to INT\nError converting column: 119 to 
INT\nError parsing row: file: 
hdfs://nameserviceci/data/CTL/encrypt/db/ingest/raw/capm/file=interfaces/dt=20170626/interfaces.1498453200039.snappy,
 before offset: 43276630\n
{code}
As a result, the uncompressed profile grew something like 700MB.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (IMPALA-8782) Impala errors can make profiles huge

2019-07-23 Thread Gabor Kaszab (JIRA)
Gabor Kaszab created IMPALA-8782:


 Summary: Impala errors can make profiles huge
 Key: IMPALA-8782
 URL: https://issues.apache.org/jira/browse/IMPALA-8782
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Affects Versions: Impala 2.9.0
Reporter: Gabor Kaszab


We observed that in some cases Impala could flood the query profile with error 
messages causing other systems that read the profiles crash (e.g. Cloudera 
Manager) due to the size of the profile. Let's consider limiting the number of 
errors we write into the profile.

In the particular case I saw the profile was full of these errors, one of this 
for each row:
{code:java}
Error converting column: 4 to TIMESTAMP\nError converting column: 10 to 
INT\nError converting column: 11 to INT\nError converting column: 13 to 
INT\nError converting column: 15 to INT\nError converting column: 16 to 
INT\nError converting column: 19 to INT\nError converting column: 20 to 
INT\nError converting column: 21 to INT\nError converting column: 22 to 
INT\nError converting column: 23 to INT\nError converting column: 24 to 
INT\nError converting column: 26 to INT\nError converting column: 29 to 
INT\nError converting column: 31 to INT\nError converting column: 35 to 
INT\nError converting column: 36 to INT\nError converting column: 37 to 
INT\nError converting column: 39 to INT\nError converting column: 40 to 
INT\nError converting column: 41 to INT\nError converting column: 42 to 
INT\nError converting column: 43 to INT\nError converting column: 44 to 
INT\nError converting column: 45 to INT\nError converting column: 46 to 
INT\nError converting column: 48 to INT\nError converting column: 50 to 
INT\nError converting column: 51 to INT\nError converting column: 53 to 
INT\nError converting column: 54 to INT\nError converting column: 59 to 
INT\nError converting column: 60 to INT\nError converting column: 63 to 
INT\nError converting column: 64 to INT\nError converting column: 65 to 
INT\nError converting column: 69 to INT\nError converting column: 70 to 
INT\nError converting column: 71 to INT\nError converting column: 72 to 
INT\nError converting column: 73 to INT\nError converting column: 75 to 
INT\nError converting column: 76 to INT\nError converting column: 77 to 
INT\nError converting column: 78 to INT\nError converting column: 79 to 
INT\nError converting column: 80 to INT\nError converting column: 81 to 
INT\nError converting column: 82 to INT\nError converting column: 83 to 
INT\nError converting column: 84 to INT\nError converting column: 85 to 
INT\nError converting column: 87 to INT\nError converting column: 88 to 
INT\nError converting column: 90 to INT\nError converting column: 91 to 
INT\nError converting column: 95 to INT\nError converting column: 96 to 
INT\nError converting column: 97 to INT\nError converting column: 99 to 
INT\nError converting column: 102 to INT\nError converting column: 103 to 
INT\nError converting column: 104 to INT\nError converting column: 106 to 
INT\nError converting column: 107 to INT\nError converting column: 109 to 
INT\nError converting column: 117 to INT\nError converting column: 119 to 
INT\nError parsing row: file: 
hdfs://nameserviceci/data/CTL/encrypt/db/ingest/raw/capm/file=interfaces/dt=20170626/interfaces.1498453200039.snappy,
 before offset: 43276630\n
{code}
As a result, the uncompressed profile grew something like 700MB.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8378) test_cancel_select failed with Invalid query handle

2019-07-23 Thread Quanlong Huang (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-8378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16890958#comment-16890958
 ] 

Quanlong Huang commented on IMPALA-8378:


Hit this too: [https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/6721/]

> test_cancel_select failed with Invalid query handle
> ---
>
> Key: IMPALA-8378
> URL: https://issues.apache.org/jira/browse/IMPALA-8378
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 3.3.0
>Reporter: Gabor Kaszab
>Assignee: Andrew Sherman
>Priority: Critical
>  Labels: broken-build
>
> Error Message
> {code:java}
> query_test/test_cancellation.py:241: in test_cancel_select 
> self.execute_cancel_test(vector) query_test/test_cancellation.py:213: in 
> execute_cancel_test assert 'Cancelled' in str(thread.fetch_results_error) 
> E   assert 'Cancelled' in "ImpalaBeeswaxException:\n INNER EXCEPTION:  'beeswaxd.ttypes.BeeswaxException'>\n MESSAGE: Invalid query handle: 
> b64a91016fe343dd:fefaf600\n" E+  where "ImpalaBeeswaxException:\n 
> INNER EXCEPTION: \n MESSAGE: 
> Invalid query handle: b64a91016fe343dd:fefaf600\n" = 
> str(ImpalaBeeswaxException()) E+where ImpalaBeeswaxException() = 
> .fetch_results_error
> {code}
> Stacktrace
> {code:java}
> query_test/test_cancellation.py:241: in test_cancel_select
> self.execute_cancel_test(vector)
> query_test/test_cancellation.py:213: in execute_cancel_test
> assert 'Cancelled' in str(thread.fetch_results_error)
> E   assert 'Cancelled' in "ImpalaBeeswaxException:\n INNER EXCEPTION:  'beeswaxd.ttypes.BeeswaxException'>\n MESSAGE: Invalid query handle: 
> b64a91016fe343dd:fefaf600\n"
> E+  where "ImpalaBeeswaxException:\n INNER EXCEPTION:  'beeswaxd.ttypes.BeeswaxException'>\n MESSAGE: Invalid query handle: 
> b64a91016fe343dd:fefaf600\n" = str(ImpalaBeeswaxException())
> E+where ImpalaBeeswaxException() =  139721936148224)>.fetch_results_error
> {code}
> Standard error
> {code:java}
> SET 
> client_identifier=query_test/test_cancellation.py::TestCancellationParallel::()::test_cancel_select[protocol:beeswax|table_format:rc/none|exec_option:{'batch_size':0;'num_nodes':0;'disable_codegen_rows_threshold':0;'disable_codegen':False;'abort_on_error':1;'exec_single_no;
> -- executing against localhost:21000
> use tpch_rc;
> -- 2019-03-30 23:15:53,293 INFO MainThread: Started query 
> 5e42c7ec2df1febb:b25eff72
> SET 
> client_identifier=query_test/test_cancellation.py::TestCancellationParallel::()::test_cancel_select[protocol:beeswax|table_format:rc/none|exec_option:{'batch_size':0;'num_nodes':0;'disable_codegen_rows_threshold':0;'disable_codegen':False;'abort_on_error':1;'exec_single_no;
> SET batch_size=0;
> SET num_nodes=0;
> SET disable_codegen_rows_threshold=0;
> SET disable_codegen=False;
> SET abort_on_error=1;
> SET cpu_limit_s=10;
> SET debug_action=0:GETNEXT:WAIT|COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL;
> SET exec_single_node_rows_threshold=0;
> SET buffer_pool_limit=0;
> -- executing async: localhost:21000
> compute stats lineitem;
> -- 2019-03-30 23:15:53,359 INFO MainThread: Started query 
> f54a0ecfa32d40b4:07510d7a
> SET 
> client_identifier=query_test/test_cancellation.py::TestCancellationParallel::()::test_cancel_select[protocol:beeswax|table_format:rc/none|exec_option:{'batch_size':0;'num_nodes':0;'disable_codegen_rows_threshold':0;'disable_codegen':False;'abort_on_error':1;'exec_single_no;
> -- connecting to: localhost:21000
> -- fetching results from:  object at 0x5d535d0>
> -- getting state for operation: 
> 
> -- canceling operation:  object at 0x5d535d0>
> -- 2019-03-30 23:15:53,427 INFO Thread-5: Starting new HTTP connection 
> (1): localhost
> -- closing query for operation handle: 
> 
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org