Hi Praveen / All,
I also have a requirement similar to the one explained (by Praveen) below:
distinct rows on a single column with corresponding data from other columns.
@hive.apache.org
Subject: Hive Join with distinct rows
Hi Praveen / All,
I also have a requirement similar to the one explained (by Praveen) below:
distinct rows on a single column with corresponding data from other columns.
http://mail-archives.apache.org/mod_mbox/hive-user/201211.mbox/%3ccahmb8ta
in a world of unstructured and un-clean data.
** **
-Marcin
** **
*From:* Sunita Arvind [mailto:sunitarv...@gmail.com]
*Sent:* Tuesday, July 30, 2013 11:00 AM
*To:* user@hive.apache.org
*Subject:* Hive Join with distinct rows
** **
Hi Praveen / All,
I also have
Thanks Mark, I do understand that how Hive works with Distinct keyword.
What I was looking for is a solution for my requirement in Hive, I am not
an expert in SQL, hence looking for suggestions
On Fri, Nov 9, 2012 at 9:54 AM, Mark Grover grover.markgro...@gmail.comwrote:
Hi Praveen,
Let's
I see. I re-read your first email and you would like to query select all
the unique ID's in T1 which are not in T2
Query 1 seems to be doing just fine so I would say that's the way to go. I
personally use IS operator when comparing something with NULLs instead of
=.
There are some optimizations
But I think Hive should support distinct on single column along with
fetching corresponding data from other columns mentioned in the query.
Something like
Select distinct(col1), col2, col3 from TB1
For example
hive SELECT col1, col2 FROM t1;
1 3
1 3
1 4
2 5
-- Selects distinct col1, col2 tuple
Hi Praveen,
Let's take an example:
(from
https://cwiki.apache.org/Hive/languagemanual-select.html#LanguageManualSelect-ALLandDISTINCTClauses
)
-- Print out contents of the table
hive SELECT col1, col2 FROM t1;
1 3
1 3
1 4
2 5
-- Selects distinct col1, col2 tuple
hive SELECT DISTINCT col1, col2