[GitHub] [druid] lgtm-com[bot] commented on pull request #10241: Fix an OOM in the kafka ingest task due to overestimate thetasketch byte

2020-10-20 Thread GitBox
lgtm-com[bot] commented on pull request #10241: URL: https://github.com/apache/druid/pull/10241#issuecomment-712643300 This pull request **introduces 1 alert** when merging c934270eedd1afe8f7f5bfed467dd20ae78ce504 into c3cb0e8b02c641746a5225bd3651e6e441437f19 - [view on LGTM.com](https://

[GitHub] [druid] egor-ryashin opened a new issue #10520: Druid UI Load Data -> Edit Spec text area redundant reinsert

2020-10-20 Thread GitBox
egor-ryashin opened a new issue #10520: URL: https://github.com/apache/druid/issues/10520 ### Affected Version 0.19.0 ### Description The Druid Edit Spec text area doesn’t allow the spec to be replaced easily. If you delete it, it is reinserted automatically defyin

[GitHub] [druid] tenghuanhe commented on issue #10395: Getting error when accessing materialized view query in web console and in logs using druid 0.19.0: Could not resolve type id 'view' as a subtype

2020-10-20 Thread GitBox
tenghuanhe commented on issue #10395: URL: https://github.com/apache/druid/issues/10395#issuecomment-712690802 @jagtapsantosh in the `MaterializedViewSelectionDruidModule` class, did you register the view subtype like ``` public List getJacksonModules() { return Immutable

[GitHub] [druid] aditya-r-m commented on pull request #9116: WIP: gcloud pubsub indexing service

2020-10-20 Thread GitBox
aditya-r-m commented on pull request #9116: URL: https://github.com/apache/druid/pull/9116#issuecomment-712738367 Hi @cre8ivejp thank you for your interest in the feature. I've almost completed an at-least once ingestion mechanism with basic task count/duration management (off & on dev

[GitHub] [druid] cre8ivejp commented on pull request #9116: WIP: gcloud pubsub indexing service

2020-10-20 Thread GitBox
cre8ivejp commented on pull request #9116: URL: https://github.com/apache/druid/pull/9116#issuecomment-712743739 @aditya-r-m thank you for your response. I really appreciate your help on this. I will prepare all information and send it to you. I will look at the design doc when

[GitHub] [druid] liran-funaro commented on pull request #10335: Configurable Index Type

2020-10-20 Thread GitBox
liran-funaro commented on pull request #10335: URL: https://github.com/apache/druid/pull/10335#issuecomment-712906532 @jihoonson Do you have any more comments or can we proceed to merge this PR? This is an automated message f

[GitHub] [druid] pjain1 commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
pjain1 commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-712980043 @jihoonson are you actively working on your proposal? Do you think you can reuse the `InputStats` strategy from here?

[GitHub] [druid] jihoonson commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
jihoonson commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713013146 > @jihoonson are you actively working on your proposal? Do you think you can reuse the `InputStats` strategy from here? @pjain1 sorry, I forgot about this PR. I could

[GitHub] [druid] pjain1 commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
pjain1 commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713016099 As long as we get the metrics about how many raw bytes are processed from the source (including scans for determining shard specs) I think I am ok with any approach you follow. I

[GitHub] [druid] jihoonson edited a comment on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
jihoonson edited a comment on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713013146 > @jihoonson are you actively working on your proposal? Do you think you can reuse the `InputStats` strategy from here? @pjain1 sorry, I forgot about this PR. I

[GitHub] [druid] jihoonson commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
jihoonson commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713019274 Apologies, I accidentally clicked the button, which published my previous comment while it was incomplete. I have updated it now.

[GitHub] [druid] pjain1 commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
pjain1 commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713033420 Makes sense, so as long as https://github.com/apache/druid/pull/10407#issuecomment-713016099 is satisfied, things seem good to me.

[GitHub] [druid] kroeders commented on pull request #10427: ServerSelectorStrategy to filter servers with missing required lookups

2020-10-20 Thread GitBox
kroeders commented on pull request #10427: URL: https://github.com/apache/druid/pull/10427#issuecomment-713072808 after running the extension on a cluster where the datasource was around 13,000 segments. The random server selector averaged 8.33 ms across several hundred queries while the l

[GitHub] [druid] jihoonson commented on pull request #10407: emit processed bytes metric

2020-10-20 Thread GitBox
jihoonson commented on pull request #10407: URL: https://github.com/apache/druid/pull/10407#issuecomment-713100147 @pjain1 thanks. Yes, per-phase metrics and total metrics will be available for raw input bytes. Other than the issue we have talked about, this PR makes sense to me. I don't think m

[GitHub] [druid] clintropolis commented on a change in pull request #10499: support for vectorizing expressions with non-existent inputs, more consistent type handling for non-vectorized expressions

2020-10-20 Thread GitBox
clintropolis commented on a change in pull request #10499: URL: https://github.com/apache/druid/pull/10499#discussion_r508800090 ## File path: core/src/main/java/org/apache/druid/math/expr/ExprTypeConversion.java ## @@ -31,22 +31,40 @@ * Infer the output type of a list of p

[GitHub] [druid-website-src] druid-matt opened a new pull request #181: add Avesta Technologies

2020-10-20 Thread GitBox
druid-matt opened a new pull request #181: URL: https://github.com/apache/druid-website-src/pull/181 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR

[GitHub] [druid] jihoonson commented on a change in pull request #10335: Configurable Index Type

2020-10-20 Thread GitBox
jihoonson commented on a change in pull request #10335: URL: https://github.com/apache/druid/pull/10335#discussion_r508821432 ## File path: indexing-service/src/main/java/org/apache/druid/indexing/seekablestream/SeekableStreamIndexTaskTuningConfig.java ## @@ -84,10 +87,11 @@ p

[GitHub] [druid] jihoonson merged pull request #10503: Additional documentation for query caching

2020-10-20 Thread GitBox
jihoonson merged pull request #10503: URL: https://github.com/apache/druid/pull/10503 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[druid] branch master updated (c3cb0e8 -> 04546b6)

2020-10-20 Thread jihoonson
This is an automated email from the ASF dual-hosted git repository. jihoonson pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/druid.git. from c3cb0e8 reduce docker image size (#10506) add 04546b6 Additional documentation for query caching (#10503) N

[GitHub] [druid-website-src] fjy merged pull request #181: add Avesta Technologies

2020-10-20 Thread GitBox
fjy merged pull request #181: URL: https://github.com/apache/druid-website-src/pull/181 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[druid-website-src] branch master updated: add Avesta Technologies

2020-10-20 Thread fjy
This is an automated email from the ASF dual-hosted git repository. fjy pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/druid-website-src.git The following commit(s) were added to refs/heads/master by this push: new 69ad718 add Avesta Technologies ne

[GitHub] [druid] mitchlloyd closed pull request #10386: Upgrade AWS SDK

2020-10-20 Thread GitBox
mitchlloyd closed pull request #10386: URL: https://github.com/apache/druid/pull/10386 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [druid] mitchlloyd commented on pull request #10386: Upgrade AWS SDK

2020-10-20 Thread GitBox
mitchlloyd commented on pull request #10386: URL: https://github.com/apache/druid/pull/10386#issuecomment-713154704 Those logs are very helpful. I took another crack at this and most of the errors seem related to Hadoop versions. There appeared to be just 2 basic s3 tests that didn't use

[GitHub] [druid] mounikanakkala opened a new issue #10521: Druid Lookups Auditing

2020-10-20 Thread GitBox
mounikanakkala opened a new issue #10521: URL: https://github.com/apache/druid/issues/10521 Hello Everyone, We update lookups in Druid once every 4 hours by sending a POST update request whose size goes up to 50 MB. We are not adding an extra set of 50 MB sized lookups in ever

[GitHub] [druid] klDen opened a new issue #10522: Ingesting to same Datasource from 2 different Kafka clusters with exact same topic name/specs overwrites previous Supevisor

2020-10-20 Thread GitBox
klDen opened a new issue #10522: URL: https://github.com/apache/druid/issues/10522 ### Affected Version 0.20.0 ### Description Ingesting to the same Datasource from 2 different Kafka clusters with the exact same topic name/specs overwrites the previous Supervisor. - Steps to reproduc

[GitHub] [druid] klDen commented on issue #10522: Ingesting to same Datasource from 2 different Kafka clusters with exact same topic name/specs overwrites previous Supevisor

2020-10-20 Thread GitBox
klDen commented on issue #10522: URL: https://github.com/apache/druid/issues/10522#issuecomment-713178006 Perhaps a solution is to specify a unique ID in the specs to differentiate them? This is an automated message from t

[GitHub] [druid] technomage opened a new issue #10523: Historical server fails to load segments in kubernetes

2020-10-20 Thread GitBox
technomage opened a new issue #10523: URL: https://github.com/apache/druid/issues/10523 We installed Druid using the incubator helm chart. It mounts a data v

[GitHub] [druid] asdf2014 commented on issue #10523: Historical server fails to load segments in kubernetes

2020-10-20 Thread GitBox
asdf2014 commented on issue #10523: URL: https://github.com/apache/druid/issues/10523#issuecomment-713260793 Hi, @technomage. Welcome to Apache Druid's helm chart. @maver1ck, @AWaterColorPen and I are the maintainers of this chart and will be happy to help you get through it. Can you p

[GitHub] [druid] AWaterColorPen commented on issue #10523: Historical server fails to load segments in kubernetes

2020-10-20 Thread GitBox
AWaterColorPen commented on issue #10523: URL: https://github.com/apache/druid/issues/10523#issuecomment-713275266 Hi, @technomage. It seems to be the same issue as https://github.com/helm/charts/issues/22911, https://github.com/helm/charts/issues/23250, https://github.com/helm/charts/issues/23201

[GitHub] [druid] clintropolis merged pull request #10515: Web console: refresh and tighten up the console styles ✨💅💫

2020-10-20 Thread GitBox
clintropolis merged pull request #10515: URL: https://github.com/apache/druid/pull/10515 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[druid] branch master updated (04546b6 -> f391e89)

2020-10-20 Thread cwylie
This is an automated email from the ASF dual-hosted git repository. cwylie pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/druid.git. from 04546b6 Additional documentation for query caching (#10503) add f391e89 Web console: refresh and tighten up the

[GitHub] [druid] zhangyue19921010 opened a new pull request #10524: Kafka dynamic scale ingest tasks

2020-10-20 Thread GitBox
zhangyue19921010 opened a new pull request #10524: URL: https://github.com/apache/druid/pull/10524 ### Description In Druid, users need to set 'taskCount' when submitting a Kafka ingestion supervisor. This has a few limitations: 1. When su

[GitHub] [druid] shashanksinghal opened a new issue #10525: Incorrect results (including nulls) when querying string column with col <> '' and col is not null

2020-10-20 Thread GitBox
shashanksinghal opened a new issue #10525: URL: https://github.com/apache/druid/issues/10525 ### Affected Version 0.18.0 and 0.20.0 ### Description For Druid 0.18.0 with the config useDefaultValueForNull set to false, querying a table with a condition on a string column (let's say col) with
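
For readers unfamiliar with the semantics behind issue #10525, the following is a minimal sketch of how standard SQL three-valued logic treats the predicate `col <> '' and col is not null`. It uses Python's built-in `sqlite3` module as a stand-in engine, not Druid itself; the table name `t`, the `id` column, and the sample rows are assumptions made purely for illustration. Under these rules `NULL <> ''` evaluates to unknown, so a row whose `col` is null should not appear in the result.

```python
import sqlite3

# Hypothetical table and data; SQLite stands in for a SQL-compliant engine,
# it is NOT Druid and says nothing about Druid's internals.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER, col TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?)",
    [(1, "a"), (2, ""), (3, None)],  # row 3 holds a SQL NULL
)


def ids(where):
    """Return the ids of the rows matching the given WHERE clause."""
    rows = conn.execute(f"SELECT id FROM t WHERE {where} ORDER BY id")
    return [r[0] for r in rows]


not_empty = ids("col <> ''")        # NULL <> '' is unknown, so row 3 is dropped
not_null = ids("col IS NOT NULL")   # keeps the empty string, drops the NULL row
both = ids("col <> '' AND col IS NOT NULL")

print(not_empty, not_null, both)
```

In this sketch only the row with a non-empty, non-null value satisfies the combined predicate, which is the behavior the issue title implies was expected when nulls showed up in the results.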