Re: Question related to reuse of BytesColumnVector.vector[][].

2024-10-02 Thread Stamatis Zampetakis
Hello, The error looks like a bug and is data/query specific thus I assume reproducible. I would suggest filing a JIRA ticket with as many details as possible (query, DDLs, logs, plans, data) to reproduce the issue. Best, Stamatis On Sun, Sep 29, 2024 at 7:51 AM lisoda wrote: > > Currently, whe

Question related to reuse of BytesColumnVector.vector[][].

2024-09-28 Thread lisoda
Currently, when we run HIVE version 4.0.0, the Sql often breaks abnormally, and the log message is as follows: 2024-09-29 00:18:06,569 [INFO] [Dispatcher thread {Central}] |HistoryEventHandler.criticalEvents|: [HISTORY][DAG:dag_1721298780048_105514_11][Event:TASK_ATTEMPT_FINISHED]: vertexName

Re: Question on Hive Metastore catalog support

2023-11-16 Thread Butao Zhang
Hi, maybe you can check this ticket https://issues.apache.org/jira/browse/HIVE-26227 Thanks, Butao Zhang Replied Message | From | Flavio Junqueira | | Date | 11/15/2023 17:26 | | To | | | Subject | Question on Hive Metastore catalog support | Hello there, I'm interest

Question on Hive Metastore catalog support

2023-11-15 Thread Flavio Junqueira
Hello there, I'm interested in understanding the Hive Metastore catalog support. I see references in the metastore code to catalogs, for example: https://github.com/apache/hive/blob/17525f169b9a08cd715bfb42899e45b7c689c77a/standalone-metastore/metastore-common/src/main/protobuf/org/apache/h

Re: [DISCUSS] Hive EOL question

2022-06-20 Thread Stamatis Zampetakis
Hi Guangming, There was a recent discussion about EOL Hive releases [1] but it was not conclusive. Feel free to reopen that thread if you have some thoughts on the subject. Best, Stamatis [1] https://lists.apache.org/thread/sxcrcf4v9j630tl9domp0bn4m33bdq0s On Sun, Jun 19, 2022 at 11:20 AM Guan

Re: [DISCUSS] Hive EOL question

2022-06-20 Thread Guangming Lu
Hi, does anyone know the EOL schedule for each Hive release? For example, when will 3.1.0 reach EOL? Best, Guangming

[DISCUSS] Hive EOL question

2022-06-19 Thread Guangming Lu
Hi, does anyone know the EOL schedule for each Hive release? For example, when will 3.1.0 reach EOL? Best, Guangming

Re: question about a beeline variable

2022-02-27 Thread Bitfox
I got the idea it's the null value in Hive. 0: jdbc:hive2://localhost:1/default> select size(null); +--+ | _c0 | +--+ | -1 | +--+ Thanks On Sun, Feb 27, 2022 at 4:02 PM Bitfox wrote: > what does this -1 value mean? > > > set mapred.reduce.tasks; > > +--

question about a beeline variable

2022-02-27 Thread Bitfox
what does this -1 value mean?
> set mapred.reduce.tasks;
+-------------------------+
| set                     |
+-------------------------+
| mapred.reduce.tasks=-1  |
+-------------------------+
1 row selected (0.014 seconds)

HIVE LLAP question

2022-01-13 Thread Oleksiy S
Guys please help. I am using Hive-2.3.6 and I want to configure LLAP with Apache slider. I ran hive --service llap --instances 1 --size 2g --executors 2 --xmx 1024m --cache 1024m --name llap0 --auxhbase=false --startImmediately --directory /tmp After that I can not get llap status curl node2.c

Hive 3.1.0 - A question related to NOTIFICATION_LOG table

2022-01-12 Thread Tale Hive
Hey guys. I use PostgreSQL for the Hive database, and the NOTIFICATION_LOG table has been quite big since I upgraded from Hive 1.2.1 to Hive 3.1.0. Does anyone know the purpose of this NOTIFICATION_LOG table, please? Best regards. Tale

Re: Question regarding lock manager

2021-09-07 Thread Antoine DUBOIS
lege.synchronizer however this change seems very recent and there's no reference to such key in documentation either. Thank you for the tips ;-) De: "Rajesh Balamohan" À: user@hive.apache.org Envoyé: Mardi 7 Septembre 2021 00:46:43 Objet: Re: Question regarding lock mana

Re: Question regarding lock manager

2021-09-06 Thread Rajesh Balamohan
. > > -- > *De: *"Antoine DUBOIS" > *À: *user@hive.apache.org > *Envoyé: *Lundi 6 Septembre 2021 10:03:59 > *Objet: *Re: Question regarding lock manager > > Hello Alan, > Thank you for your answer, > I'm pretty sure I've respected

Re: Question regarding lock manager

2021-09-06 Thread Antoine DUBOIS
sible for the use of zookeeper by hiveserver2. However why using ZooKeeper is useful or relevant is another story I can't tell. Regards - Mail original - De: "Jan Fili" À: user@hive.apache.org Envoyé: Lundi 6 Septembre 2021 16:33:28 Objet: Re: Question regarding lock manag

Re: Question regarding lock manager

2021-09-06 Thread Antoine DUBOIS
À: user@hive.apache.org Envoyé: Lundi 6 Septembre 2021 10:03:59 Objet: Re: Question regarding lock manager Hello Alan, Thank you for your answer, I'm pretty sure I've respected the guide provided and did not set any discovery service. However I'm trying to setup ranger as well a

Re: Question regarding lock manager

2021-09-06 Thread Jan Fili
version but I cannot find a proper > compatibility matrix for all hadoop ecosystem. > Thank you very much. > > Antoine > > > De: "Alan Gates" > À: user@hive.apache.org > Envoyé: Vendredi 3 Septembre 2021 17:51:41 > Objet

Re: Question regarding lock manager

2021-09-06 Thread Antoine DUBOIS
Septembre 2021 17:51:41 Objet: Re: Question regarding lock manager You do not need ZooKeeper to use ACID in Hive. The first thing I would check is that you have configured your system as described on this page: [ https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions | https

Re: Question regarding lock manager

2021-09-03 Thread Alan Gates
You do not need ZooKeeper to use ACID in Hive. The first thing I would check is that you have configured your system as described on this page: https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions. Also, make sure you have not set hive.lock.manager to zookeeper. There are other fea

Question regarding lock manager

2021-09-02 Thread Antoine DUBOIS
Hello, I'm trying to configure ACID hive in a kerberos environment with : Hadoop 3.1.4 deployed in HA considered working and now I'm trying to setup hive with remote metastore and ACID configuration. I may misunderstand what is written in documentation https://cwiki.apache.org/confluence/disp

[Question]Why does Hive not use calcite in adaptor way?

2021-06-25 Thread guangyuan wang
I recently read the design doc of the Hive planner and found that Hive only uses Calcite as a SQL optimizer: it translates the optimized RelNode to a Hive operator tree and then converts that to the Tez execution DAG. Why not implement a Calcite adapter, which would make Calcite easier to use? The lin

Question on metadata before and after compaction

2020-10-08 Thread Sungwoo Park
Hi, I have a question on the consistency between data (e.g., on HDFS) and metadata kept by Metastore before and after compaction. Here is a scenario: 1. We back up the database for Metastore (before performing compaction). 2. We perform compaction. 3. After performing compaction, we lose the

[Question] Hive InputFormat relationship to FetchOperator

2020-07-27 Thread Anthony Virtuoso
Hello Hive Community, I have what will hopefully be a simple question. I'm working on a new, or perhaps enhancement to an existing, InputFormat. As part of this research I'm trying to understand where in the Hive codebase the InputFormat is actually used. From my initial tracing, it

Question about IMPORT/EXPORT

2020-02-10 Thread Thibault VERBEQUE
Hi all, I'm currently working on two kerberized clusters and want to replicate some tables between them, with Hive 3.0.1. I have created two users for this, one for the EXPORT operation and one for the IMPORT operation. But I ran into https://issues.apache.org/jira/browse/HIVE-17606. It seems to me that

Re: review the code question

2019-10-31 Thread Alan
Hi!    Thank you very much for your information. --  -- ??: "Peter Vary"https://cwiki.apache.org/confluence/display/Hive/HowToContribute Thanks, Peter On Oct 30, 2019, at 09:28, <849551...@qq.com> wrote: Hello:      How can I get Commiter to h

Re: review the code question

2019-10-31 Thread Peter Vary
Hi! There is a wiki page outlining the way how to contribute. See: https://cwiki.apache.org/confluence/display/Hive/HowToContribute Thanks, Peter > On Oct 30, 2019, at 09:28, 阿伦 <849551...@qq.com> wrote: > > Hello: > >How

review the code question

2019-10-30 Thread 阿伦
Hello: How can I get a committer to help review the code? Thanks. I am looking forward to your reply.

Re: Question on Hive metastore thrift uri

2019-06-25 Thread Alan Gates
JDBC credentials to the metastore db and configuring the thrift url with the address of the metastore server). Alan. On Mon, Jun 24, 2019 at 9:13 PM reetika agrawal < agrawal.reetika...@gmail.com> wrote: > Hi, > > I have a question , how does the hive connection happen when we con

Question on Hive metastore thrift uri

2019-06-24 Thread reetika agrawal
Hi, I have a question: how does the Hive connection happen when we connect using the Hive metastore thrift URI? Does it go through hiveserver2 -> metastore -> metastore db, or does it connect directly to metastore -> metastore db? If someone could help me with understanding this that would

Re: Standalone Metastore Question

2019-02-26 Thread Alan Gates
The standalone metastore released in 3.0 is the exact same metastore released with Hive 3.0. The only differences are in the install tool 'schematool' and the start and stop script. Hive 3 is being used in production a number of places. I don't know if anyone is running the metastore alone in pr

Standalone Metastore Question

2019-02-26 Thread Abdoulaye Diallo
Hi there, I am new to hive. My goal is to run the Standalone Metastore in the hope to integrate it with Spark/Iceberg without Hadoop/Hive. I downloaded the release from here (version 3.0.0) and successfully initialized a MyS

Re: Hive Order By Question

2019-02-07 Thread Mainak Ghosh
Thanks Gopal. Setting that configuration significantly reduced the query runtime. I chose a value of 0.3f. Is there any empirical way to decide what value to set for this? It is not completely clear from the code how this is being used. Mainak > On Feb 6, 2019, at 7:44 PM, Gopal Vijayaraghava

Re: Hive Order By Question

2019-02-06 Thread Gopal Vijayaraghavan
Hi, That looks like the TopN hash optimization didn't kick in, that must be a settings issue in the install. | Reduce Output Operator | | key expressions: _col0 (type: string) | | sort order: + | |

Re: Hive Order By Question

2019-02-06 Thread Mainak Ghosh
Hey Gopal, I am using Apache Hive v2.3.2. Here is the explain: ++ | Explain | ++ | STAGE DEPENDENCIES:| | Stage-1 is

Re: Hive Order By Question

2019-02-06 Thread Gopal Vijayaraghavan
>I am running an older version of Hive on MR. Does it have it too? Hard to tell without an explain. AFAIK, this was fixed in Aug 2013 - how old is your build? Cheers, Gopal

Re: Hive Order By Question

2019-02-06 Thread Mainak Ghosh
I am running an older version of Hive on MR. Does it have it too? Mainak Sent from my iPhone > On Feb 6, 2019, at 3:43 PM, Gopal Vijayaraghavan wrote: > > >> I expect the maps to do some sorting and limiting in parallel. That way the >> reducer load would be small. I don’t think it does that.

Re: Hive Order By Question

2019-02-06 Thread Gopal Vijayaraghavan
> I expect the maps to do some sorting and limiting in parallel. That way the > reducer load would be small. I don’t think it does that. Can you tell me why?  They do. Which version are you running, is it Tez and do you have an explain for the plan? Cheers, Gopal

Hive Order By Question

2019-02-06 Thread Mainak Ghosh
Hello, Hive Order by is known to be slow. It is slightly odd that it is slow even when we use a limit under strict mode. I am running this query over 3 billion rows with a limit of 20. It takes an hour to run. I expect the maps to do some sorting and limiting in parallel. That way the reducer l
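The behaviour expected in this thread (each map task sorts and limits locally, so the reducer only sees a few rows) can be sketched outside Hive. The snippet below is only an illustration of the general top-N idea in Python, not Hive's TopN hash implementation; the function names and sample partitions are made up for the example.

```python
import heapq

def local_top_n(rows, n):
    """Map side: keep only the n smallest rows of one partition."""
    return heapq.nsmallest(n, rows)

def global_top_n(partitions, n):
    """Reduce side: merge the per-partition candidates and limit again."""
    candidates = []
    for rows in partitions:
        candidates.extend(local_top_n(rows, n))
    return heapq.nsmallest(n, candidates)

# Hypothetical data: three map-side partitions of the ORDER BY column.
partitions = [[42, 7, 19, 88], [3, 56, 21], [64, 5, 30, 11]]
top3 = global_top_n(partitions, 3)
print(top3)
```

With a limit of n, the reduce side receives at most n rows per partition instead of the whole input, which is why the reducer load stays small when the optimization applies.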

Re: Re: Question about INSERT OVERWRITE TABLE with dynamic partition

2018-10-25 Thread anci_...@yahoo.com
ate: 2018-10-25 08:34 To: user Subject: Re: Question about INSERT OVERWRITE TABLE with dynamic partition A logical explanation could be:- In the first query, you are telling hive which partition to overwrite, so a step which actually deletes the partition data and overwrites it with the query res

Re: Question about INSERT OVERWRITE TABLE with dynamic partition

2018-10-24 Thread Tanvi Thacker
A logical explanation could be:- In the first query, you are telling hive which partition to overwrite, so a step which actually deletes the partition data and overwrites it with the query result, knows that which partition to delete and there is an empty result/file to move. but for the second qu

Question about INSERT OVERWRITE TABLE with dynamic partition

2018-10-23 Thread anci_...@yahoo.com
Dears, I found an interesting thing. When inserting a NULL result into a partition which already contained some records, there was a difference in the results between using static partition INSERT and using dynamic partition INSERT. See the example below: Partition '20180101

Re: Re: Question about OVER clause

2018-09-27 Thread anci_...@yahoo.com
Thanks,but the article talked too little to help. Actually, it only told me that by using this code we would accumulate all records of which v_date was less than or equal to current row. But the question is, what will happen by the code below? (the field v_date is a string value with the format

Re: Question about OVER clause

2018-09-21 Thread Alan Gates
This article might be helpful. It's for SQL Server, but the semantics should be similar. https://www.sqlpassion.at/archive/2015/01/22/sql-server-windowing-functions-rows-vs-range/ Alan. On Wed, Sep 19, 2018 at 6:47 AM 孙志禹 wrote: > Dears, >What is the difference between *ROW BETWEEN* and *

Question about OVER clause

2018-09-19 Thread 孙志禹
Dears, what is the difference between ROWS BETWEEN and RANGE BETWEEN when using an OVER clause? I found it difficult to get an answer about this for Hive. I hope there will be a more detailed help article about the OVER clause on Confluence. Thanks!
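A small experiment can make the ROWS versus RANGE question concrete. The sketch below uses Python's built-in sqlite3 module as a stand-in, on the assumption that SQLite's standard-SQL frame semantics match what Hive does here (not verified against Hive); it needs SQLite 3.25 or later. The table name sales and the amounts are hypothetical; v_date is the column name mentioned in the thread.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (v_date TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("2018-09-01", 10), ("2018-09-02", 20), ("2018-09-02", 30), ("2018-09-03", 40)],
)

# ROWS counts physical rows up to the current one (amount is added to the
# ORDER BY only as a tie-breaker so the result is deterministic).
# RANGE includes every row whose v_date is <= the current row's v_date,
# so rows sharing a date (peers) get the same running total.
query = """
SELECT v_date, amount,
       SUM(amount) OVER (ORDER BY v_date, amount
                         ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS rows_sum,
       SUM(amount) OVER (ORDER BY v_date
                         RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS range_sum
FROM sales
ORDER BY v_date, amount
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

The two running totals differ only on the duplicated date: the ROWS frame stops at the current physical row, while the RANGE frame extends to all rows with the same v_date.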

Re: Question about efficiency of SELECT DISTINCT

2018-07-02 Thread Furcy Pin
Hi, They are rigorously equivalent. You can see this with the following queries: CREATE TABLE t1 (a INT, b INT, c INT) ; EXPLAIN SELECT DISTINCT a,b,c FROM t1 ; EXPLAIN SELECT a,b,c FROM t1 GROUP BY a,b,c ; Both queries will return the exact same query plan: Stage-0 Fetch Operator

Question about efficiency of SELECT DISTINCT

2018-07-02 Thread 孙志禹
Dear all, does CODE1 below have better efficiency than CODE2? Thanks! CODE1: select distinct a,b,c from table1 CODE2: select a,b,c from table1 group by a,b,c
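The two statements in the question can at least be checked for returning the same rows. The sketch below does that with Python's sqlite3 module as a stand-in for Hive; it says nothing about Hive's query plans or run time, and the sample rows are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table1 (a INT, b INT, c INT)")
conn.executemany(
    "INSERT INTO table1 VALUES (?, ?, ?)",
    [(1, 1, 1), (1, 1, 1), (1, 2, 3), (2, 2, 2), (1, 2, 3)],
)

# CODE1 and CODE2 from the question; results are sorted because neither
# statement guarantees an output order.
distinct_rows = sorted(conn.execute("SELECT DISTINCT a, b, c FROM table1").fetchall())
grouped_rows = sorted(conn.execute("SELECT a, b, c FROM table1 GROUP BY a, b, c").fetchall())
print(distinct_rows == grouped_rows)
```

Whether the two forms also cost the same in Hive is a question for EXPLAIN on the actual cluster.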

Re: ODBC-hiveserver2 question

2018-02-24 Thread Jörn Franke
HDFS support depends on the version. A long time it was not supported. > On 23. Feb 2018, at 21:08, Andy Srine wrote: > > Team, > > Is ADD JAR from HDFS (ADD JAR hdfs:///hive_jars/hive-contrib-2.1.1.jar;) > supported in hiveserver2 via an ODBC connection? > > Some relevant points: > I am abl

Re: ODBC-hiveserver2 question

2018-02-23 Thread Andrew Sears
Add JAR works with HDFS, though perhaps not with ODBC drivers.ADD JAR hdfs://:8020/hive_jars/hive-contrib-2.1.1.jar should work (depending on your nn port and confirm this file exists)Alternative syntaxADD JAR hdfs:/hive_jars/hive-contrib-2.1.1.jarThe ODBC driver could be having an issue with the

Re: ODBC-hiveserver2 question

2018-02-23 Thread Jörn Franke
Add jar works only with local files on the Hive server. > On 23. Feb 2018, at 21:08, Andy Srine wrote: > > Team, > > Is ADD JAR from HDFS (ADD JAR hdfs:///hive_jars/hive-contrib-2.1.1.jar;) > supported in hiveserver2 via an ODBC connection? > > Some relevant points: > I am able to do it in H

ODBC-hiveserver2 question

2018-02-23 Thread Andy Srine
Team, Is ADD JAR from HDFS (ADD JAR hdfs:///hive_jars/hive-contrib-2.1.1.jar;) supported in hiveserver2 via an ODBC connection? Some relevant points: - I am able to do it in Hive 2.1.1 via JDBC (beeline), but not via an ODBC client. - In Hive 1.2.1, I can add a jar from the local node,

Re: Hive update question

2018-02-13 Thread balajee venkatesh
Overwrite your table by taking left join with another table. Let me know if you want any syntactical help. On 14-Feb-2018 12:54 PM, "Andy Srine" wrote: > Hi Team, > > Whats the best way to do an update on one table from another? Variations > of this syntax below doesn't seem to work: > > UPDATE

Hive update question

2018-02-13 Thread Andy Srine
Hi Team, What's the best way to do an update on one table from another? Variations of the syntax below don't seem to work: UPDATE table_x SET column_1 = B.column_1 FROM table_y B WHERE column_2 = B.column_2 ; Thanks, Andy
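The reply in this thread suggests overwriting the table with the result of a left join. A rough sketch of that SELECT, run in Python's sqlite3 module as a stand-in for Hive, is shown below. The table and column names come from the question; the sample values are made up, and the assumption is that in Hive a query of this shape would feed an INSERT OVERWRITE of table_x.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE table_x (column_1 TEXT, column_2 INTEGER);
CREATE TABLE table_y (column_1 TEXT, column_2 INTEGER);
INSERT INTO table_x VALUES ('old-a', 1), ('old-b', 2), ('old-c', 3);
INSERT INTO table_y VALUES ('new-a', 1), ('new-c', 3);
""")

# Take column_1 from table_y when a matching row exists, otherwise keep
# the existing value from table_x.
rewritten = conn.execute("""
SELECT COALESCE(B.column_1, A.column_1) AS column_1, A.column_2
FROM table_x A
LEFT JOIN table_y B ON A.column_2 = B.column_2
ORDER BY A.column_2
""").fetchall()
print(rewritten)
```

Two caveats of this sketch: a NULL column_1 in a matching table_y row leaves the old value in place, and duplicate column_2 values in table_y would multiply rows.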

Re: Question on accessing LLAP as data cache from external containers

2018-02-02 Thread Gopal Vijayaraghavan
on. Here are the slides describing how that is plugged in LLAP from Hadoop Summit 2015. https://www.slideshare.net/Hadoop_Summit/llap-longlived-execution-in-hive/21 The flag in question is hive.llap.execution.mode - the most common use-case imagined for it was something like the mode=map, where o

Re: Question on accessing LLAP as data cache from external containers

2018-01-31 Thread Sungwoo Park
Thanks for the link. My question was how to access LLAP daemon from Containers to retrieve data for Hive jobs. For example, a Hive job may start Tez containers, which then retrieve data from LLAP running concurrently. In the current implementation, this is unrealistic (because every task can be

Re: Question on accessing LLAP as data cache from external containers

2018-01-29 Thread Jörn Franke
Are you looking for something like this: https://hadoop.apache.org/docs/r2.4.1/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html To answer your original question: why not implement the whole job in Hive? Or use Oozie to orchestrate some parts in MR and some in Hive. > On 30. Jan 2

Question on accessing LLAP as data cache from external containers

2018-01-29 Thread Sungwoo Park
Hello all, I wonder if an external YARN container can send requests to LLAP daemon to read data from its in-memory cache. For example, YARN containers owned by a typical MapReduce job (e.g., TeraSort) could fetch data directly from LLAP instead of contacting HDFS. In this scenario, LLAP daemon ju

Log4j PID Daily Rolling File Appender Question

2017-08-01 Thread Vlad Gudikov
Hi community, As of https://issues.apache.org/jira/browse/HIVE-11309 a pid converter has been added. But the question is that we can add %pid to appender.DRFA.filePattern (it works and creates logs like hive.log.2017-05-31.21847@beta1) but can't add %pid to appender.DRFA.fileName (it looks

Re: question on setting up llap

2017-05-10 Thread Yi Cheng
I have installed python 2.7.13 Still the same error. I am having 3 node. (a,b,c) When the SliderAppMaster is running on a, this can work normally. But when the SliderAppMaster is started on b or c, this will throw the error above. 2017-05-10 10:56 GMT-07:00 Gopal Vijayaraghavan : > > > for the

Re: question on setting up llap

2017-05-10 Thread Gopal Vijayaraghavan
> for the slider 0.92, the patch is already applied, right? Yes, except it has been refactored to a different place. https://github.com/apache/incubator-slider/blob/branches/branch-0.92/slider-agent/src/main/python/agent/NetUtil.py#L44 Cheers, Gopal

Re: question on setting up llap

2017-05-10 Thread Yi Cheng
I am going to upgrade Python from 2.6.6 to 2.7.13 for the slider 0.92, the patch is already applied, right? the patch file, it is following, adding some try blocks: diff --git slider-agent/src/main/python/agent/main.py slider-agent/src/main/python/agent/main.py index 1932a37..2671777 100644 --- s

Re: question on setting up llap

2017-05-09 Thread Gopal Vijayaraghavan
> NetUtil.py:60 - [Errno 8] _ssl.c:492: EOF occurred in violation of protocol The error is directly related to the SSL verification error - TLSv1.0 vs TLSv1.2. JDK8 defaults to v1.2 and Python 2.6 defaults to v1.0. Python 2.7.9 + the patch in 0.92 might be needed to get this to work. AFAIK, t

Re: question on setting up llap

2017-05-09 Thread Yi Cheng
the jira's error is [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:581) it is a bit different, what i get is NetUtil.py:60 - [Errno 8] _ssl.c:492: EOF occurred in violation of protocol 2017-05-09 17:35 GMT-07:00 Gopal Vijayaraghavan : > > > ERROR 2017-05-09 22:04:56,469 N

Re: question on setting up llap

2017-05-09 Thread Gopal Vijayaraghavan
> ERROR 2017-05-09 22:04:56,469 NetUtil.py:62 - SSLError: Failed to connect. > Please check openssl library versions. … > I am using hive 2.1.0, slider 0.92.0, tez 0.8.5 AFAIK, this was reportedly fixed in 0.92. https://issues.apache.org/jira/browse/SLIDER-942 I'm not sure if the fix in that

question on setting up llap

2017-05-09 Thread Yi Cheng
Hi: I have a problem setting up the LLAP engine. When the LLAP container starts, it repeatedly reports that it can't connect to the HTTPS SliderAppMaster. The exception is as follows: INFO 2017-05-09 22:04:56,437 Registry.py:69 - AM Host = usplselux186.a.com, AM Secured Port = 33067

Re: hive on spark - version question

2017-03-18 Thread yuxh
3 ??: "user@hive.apache.org"; : Re: hive on spark - version question :( gettin' no love on this one. any SME's know if Spark 2.1.0 will work with Hive 2.1.0 ? That JavaSparkListener class looks like a deal breaker to me, alas. thanks in advance. Cheers, Stephen.

Re: hive on spark - version question

2017-03-17 Thread Stephen Sprague
y other apache big data user > groups, provides extremely poor support for users. The answers you usually > get are not even hints to a solution. Their answers usually translate to > "there is nothing I am willing to do about your problem. If I did, I should > get paid" in many

Re: hive on spark - version question

2017-03-17 Thread hernan saab
a solution. Their answers usually translate to "there is nothing I am willing to do about your problem. If I did, I should get paid" in many cryptic ways. If you ask your question to the Spark group they will take you to the Hive group and viceversa (I can almost guarantee it based on pre

Re: hive on spark - version question

2017-03-17 Thread Stephen Sprague
that will derailed you and cause you to waste a lot >> of time. >> >> In addition, this group, as well as many other apache big data user >> groups, provides extremely poor support for users. The answers you usually >> get are not even hints to a solution. Their answers u

Re: hive on spark - version question

2017-03-17 Thread Edward Capriolo
g I am willing to do about your problem. If I did, I should > get paid" in many cryptic ways. > > If you ask your question to the Spark group they will take you to the Hive > group and viceversa (I can almost guarantee it based on previous > experiences) > > But in hind

Re: hive on spark - version question

2017-03-17 Thread hernan saab
extremely poor support for users. The answers you usually get are not even hints to a solution. Their answers usually translate to "there is nothing I am willing to do about your problem. If I did, I should get paid" in many cryptic ways. If you ask your question to the Spark group they

Re: hive on spark - version question

2017-03-17 Thread Stephen Sprague
:( gettin' no love on this one. any SME's know if Spark 2.1.0 will work with Hive 2.1.0 ? That JavaSparkListener class looks like a deal breaker to me, alas. thanks in advance. Cheers, Stephen. On Mon, Mar 13, 2017 at 10:32 PM, Stephen Sprague wrote: > hi guys, > wondering where we stand w

hive on spark - version question

2017-03-13 Thread Stephen Sprague
hi guys, wondering where we stand with Hive On Spark these days? i'm trying to run Spark 2.1.0 with Hive 2.1.0 (purely coincidental versions) and running up against this class not found: java.lang.NoClassDefFoundError: org/apache/spark/JavaSparkListener searching the Cyber i find this: 1. h

[Question] How can I estimate a table size in HIVE without query?

2017-02-15 Thread Beth Lee
Good evening~ I'm Jin. I want to calculate the table size without querying in HIVE or using Hadoop. Because I don't have any permission about HIVE and Hadoop without selecting data in database so I can't use show properties, etc. That is the rule of our company. How can I do this? (For example

Question regarding Logjam vulnerability

2017-01-18 Thread Rishabh Patel
Hello, Is Hive 1.2.1 or greater affected by the Logjam vulnerability? The following link provides more information about the scenario. https://access.redhat.com/articles/1456263 -- Regards, *Rishabh Patel*

Re: Question about partition pruning when there's a type mismatch

2016-11-29 Thread Anthony Hsu
Thanks for the tips, Gopal. I stepped through the code in a debugger and found that in the case of String = String, the predicate was pushed down to the SQL query on the metastore side, whereas in the case of String = Int, the SQL filter pushdown failed, so GenericUDFOPEqual gets evaluated and retu

Re: Question about partition pruning when there's a type mismatch

2016-11-28 Thread Gopal Vijayaraghavan
> I'm wondering why Hive tries to scan all partitions when the quotes are > omitted. Without the quotes, shouldn't 2016-11-28-00 get evaluated as an > arithmetic expression, then get cast to a string, and then partitioning > pruning still occur? The order of evaluation is different - String =

Question about partition pruning when there's a type mismatch

2016-11-28 Thread Anthony Hsu
Hi, Some of our Hive tables are partitioned on a STRING column "datepartition" whose format is "-mm-dd-hh". Usually, when users want to read only a specific day of data, they'll specify WHERE datepartition = '2016-11-28-00', for example. However, sometimes users accidentally forget the quotes
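The arithmetic behind the missing-quotes case is easy to check. The lines below only show what the unquoted literal evaluates to as an integer expression (Python is used as a calculator here); how Hive then casts and compares the two sides is not modelled.

```python
# Without quotes, 2016-11-28-00 is read as subtraction, not as a date string.
unquoted = 2016 - 11 - 28 - 0
quoted = "2016-11-28-00"

print(unquoted)
# Even rendered as text, the number is not the partition value.
print(str(unquoted) == quoted)
```

So the predicate ends up comparing the STRING partition column with the number 1977, which is the type mismatch discussed in this thread.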

Hive + Kerberos question

2016-08-24 Thread Bear Giles
I have a question about Hive + Kerberos. Perhaps I'm missing something, perhaps it's an oversight, perhaps it's a bug. I can get a TGT ticket using kinit, but it's easier for me to get one using JAAS since there's no dependency on an external command and I can nuke th

Re: Question : Is there a Automated Hive Database/Schema Designer

2016-07-22 Thread Umesh Prasad
Reposting .. Thanks & Regards Umesh Prasad On Thu, Jul 21, 2016 at 8:04 AM, Umesh Prasad wrote: > Hi All, >Does hive a Automated Database Desginer or has anyone tried building it > ? Something which is equivalent to Vertica's DDB and Microsoft SQL > server's Automated Partitioning Design

Question : Is there a Automated Hive Database/Schema Designer

2016-07-20 Thread Umesh Prasad
Hi All, Does Hive have an Automated Database Designer, or has anyone tried building one? Something equivalent to Vertica's DDB and Microsoft SQL Server's Automated Partitioning Design in Parallel Databases. References are: 1. Automated Partitioning Design in Parallel Database Systems ( http

Re: hive.aux.jars.path question

2016-07-06 Thread Fan Xu
Yes, it is a workaround not a fix. From: Igor Kravzov Sent: Wednesday, July 6, 2016 3:02:44 PM To: user@hive.apache.org Subject: Re: hive.aux.jars.path question It worked. Thanks. Still not sure if it is right way to do it. On Wed, Jul 6, 2016 at 5:45 PM, Fan

Re: hive.aux.jars.path question

2016-07-06 Thread Igor Kravzov
hive.apache.org > *Subject:* Re: hive.aux.jars.path question > > Just copied. Didn't help. Do I need to restart nodes? > > On Wed, Jul 6, 2016 at 5:20 PM, Fan Xu wrote: > >> Got similar issues. >> >> I have tried to point hive.aux.jars.path to a local/hdfs locations

Re: hive.aux.jars.path question

2016-07-06 Thread Fan Xu
Yes, we need to restart hiveserver in Ambari From: Igor Kravzov Sent: Wednesday, July 6, 2016 2:36:41 PM To: user@hive.apache.org Subject: Re: hive.aux.jars.path question Just copied. Didn't help. Do I need to restart nodes? On Wed, Jul 6, 2016 at 5:20 PM

Re: hive.aux.jars.path question

2016-07-06 Thread Igor Kravzov
th work, would it be a > bug? > > The only way I made it work is to add my jar to {hive_home}/lib/ > > > Thanks, > > > Fan > > -- > *From:* Igor Kravzov > *Sent:* Wednesday, July 6, 2016 2:13:07 PM > *To:* user@hive.apache.org &g

Re: hive.aux.jars.path question

2016-07-06 Thread Fan Xu
From: Igor Kravzov Sent: Wednesday, July 6, 2016 2:13:07 PM To: user@hive.apache.org Subject: hive.aux.jars.path question Hi guys, HDP 2.4.2 distribution I want to add elasticsearch-hadoop-2.3.2.jar connector to hive path using hive.aux.jars.path property. Added it to custom hive-site.xml

hive.aux.jars.path question

2016-07-06 Thread Igor Kravzov
Hi guys, HDP 2.4.2 distribution I want to add elasticsearch-hadoop-2.3.2.jar connector to hive path using hive.aux.jars.path property. Added it to custom hive-site.xml Copied jar to /user/hive/aux_jars folder in HDFS. I specified path for the property as /user/hive/aux_jars/elasticsearch-hadoop

Re: JsonSerDe file format question

2016-06-08 Thread Igor Kravzov
Found the issue. Looks like rows should be separated by new line. On Wed, Jun 8, 2016 at 2:55 PM, Igor Kravzov wrote: > I am merging multiple JSON file in a bigger one before saving it to HDFS. > So merged file looks like this > > {"id":160889136,"url":" > http://twitter.com/PatrocinarBRA/status

Re: JsonSerDe file format question

2016-06-08 Thread Igor Kravzov
There are 3 files with 3 JSON records each. But when I run select * from testtable; it returns me only first row from each one of files instead of 9. On Wed, Jun 8, 2016 at 2:55 PM, Igor Kravzov wrote: > I am merging multiple JSON file in a bigger one before saving it to HDFS. > So merged file l

JsonSerDe file format question

2016-06-08 Thread Igor Kravzov
I am merging multiple JSON file in a bigger one before saving it to HDFS. So merged file looks like this {"id":160889136,"url":" http://twitter.com/PatrocinarBRA/statuses/740301352052654080";, ..}{"id":160889137,"url":" http://twitter.com/tchiagoolimpio/statuses/740301352253825024 ",...}{"id":1608
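The fix reported in this thread is that rows should be separated by a new line. One way to produce such a file from back-to-back JSON objects is sketched below in Python; the helper name split_concatenated_json and the sample records are hypothetical, and the snippet assumes the merged file fits in memory.

```python
import json

def split_concatenated_json(text):
    """Split back-to-back JSON objects ({...}{...}) into a list of objects."""
    decoder = json.JSONDecoder()
    pos, records = 0, []
    while pos < len(text):
        # raw_decode does not skip leading whitespace, so do it here.
        while pos < len(text) and text[pos].isspace():
            pos += 1
        if pos >= len(text):
            break
        obj, pos = decoder.raw_decode(text, pos)
        records.append(obj)
    return records

# Hypothetical merged content with no separator between records.
merged = (
    '{"id":1,"url":"http://example.com/a"}'
    '{"id":2,"url":"http://example.com/b"} '
    '{"id":3,"url":"http://example.com/c"}'
)

# One JSON document per line, ready to be written to the merged file.
ndjson = "\n".join(
    json.dumps(record, separators=(",", ":"))
    for record in split_concatenated_json(merged)
)
print(ndjson)
```

Writing ndjson to the merged file gives one record per line, which matches the layout the thread reports as working.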

Re: Question on Implementing CASE in Hive Join

2016-05-03 Thread Kishore A
t; 18 101 CN Tax Y > 18 101 All Tax Y Smith > > 19 101 CA Tax Y > 19 101 All Tax Y Smith > > 20 101 USA

RE: Question on Implementing CASE in Hive Join

2016-04-27 Thread Kishore A
ll--Tax---X--John > > 101-All-TaxY--Smith > > 142-US-Pus--X--John > > 142-All--Pus--Y--Smith-- > > > > > > 2. > > Requirement > > > > “A left join B” g

Re: Question on Implementing CASE in Hive Join

2016-04-27 Thread Kishore A
a > > > > *where* a.*type* = b.*type* > > *and* a.code like *case* b.code *when* 'ALL' *then* '%' > *else* b.code *end* > > *and* a.indicator like *case* b.indicator *when* 'ALL' *then* '%'

RE: Question on Implementing CASE in Hive Join

2016-04-20 Thread Markovitz, Dudu
b.indicator end ; Dudu From: Kishore A [mailto:kishore.atmak...@gmail.com] Sent: Wednesday, April 20, 2016 5:04 PM To: user@hive.apache.org Subject: Re: Question on Implementing CASE in Hive Join Hi Dudu, Thank you for sending queries around this. I have run these queries and below are the

Re: Question on Implementing CASE in Hive Join

2016-04-20 Thread Kishore A
= b.code > > > > *where* b.code != 'ALL' > > *and* b.indicatior = 'ALL' > > > > *union* *all* > > > > *select* b.code > >,b.*value* > > > > *from*b > >

RE: Question on Implementing CASE in Hive Join

2016-04-19 Thread Markovitz, Dudu
' union all select b.code, b.value from b left join a on a.type = b.type where b.code = 'ALL' and b.indicatior = 'ALL' ; From: Kishore A [mailto:kishore.atmak...@gmail.com] Se

Re: Question on Implementing CASE in Hive Join

2016-04-19 Thread Kishore A
mailto:kishore.atmak...@gmail.com] > *Sent:* Tuesday, April 19, 2016 2:29 PM > *To:* user@hive.apache.org > *Subject:* Question on Implementing CASE in Hive Join > > > > Hi, > > > > I have a scenario to implement to cases in Hive Joins. I need to implement > cas

RE: Question on Implementing CASE in Hive Join

2016-04-19 Thread Markovitz, Dudu
Before dealing with the technical aspect, can you please explain what is the point of using LEFT JOIN without selecting any field from table A? Thanks Dudu From: Kishore A [mailto:kishore.atmak...@gmail.com] Sent: Tuesday, April 19, 2016 2:29 PM To: user@hive.apache.org Subject: Question on

Question on Implementing CASE in Hive Join

2016-04-19 Thread Kishore A
Hi, I have a scenario to implement to cases in Hive Joins. I need to implement case on the value on which join condition to be applied. Table A Code// Type// Indicator// Value// A 1 XYZ John B 1 PQR Smith C 2 XYZ John C 2 PQR

Re: Query question.

2016-03-12 Thread Mich Talebzadeh
That select (count(*) from Table_X tx where tx.blah = 0.75 will return a single value y and 0.5 * y will be a single value.. Where does sum come into it. It is normally applied to GROUP BY statements. Can you provide the original SQL if you have (Oracle, Sybase whatever)? HTH Dr Mich Talebzade

Query question.

2016-03-12 Thread ws
I need to rewrite something like this for Hive: ### select x.A, sum(0.5 * (select (count(*) from Table_X tx where tx.blah = 0.75))) from Table_X x ### Thanks in advance for any pointers.

A simple question

2016-01-16 Thread Mich Talebzadeh
We always assume that the strength of Hive comes from its support for a dialect very close to ANSI SQL, which lets users write queries with a gentle learning curve. Additionally, we like to assume that Hive's strength is about Schema on Read. However, after having some discussion on it, we still n
