Hey Dalia,
A: Both are good; it depends on what kind of data you are processing through
them. For many rows and billions of columns you can use HBase, and if you need to
update data on a regular basis you can also use HBase. With Hive you can store
data and query it easily with SQL-like syntax. For more
Hey Vikas,
I want to develop a medical API ...
I want to ask whether Hive-HBase integration performance is good or not,
because some blogs say that Hive queries are faster.
Finally, are there any tutorials on using the Java API of Hive and HBase?
Date: Thu, 19 Jan 2012 13:36:30
Yup, Hive and HBase integration is good to use. You can find those APIs in the
book*: the definitive guide *
2012/1/19 Dalia Sobhy dalia.mohso...@hotmail.com
Hey Vikas,
I want to develop a medical API ...
I want to ask whether Hive Hbase Integration performance is good or not,
because I found
Hi Dalia,
Hive and HBase integration is not a good choice if you want to
perform anything in real time. Hive is suitable for batch processing.
Hive is a data warehouse that works on top of Hadoop and provides SQL
like functionality. When you fire a Hive query it first gets converted
into a
Hi Michael
Please find some pointers inline
1) Is there a point to compress both SequenceFile and TextFile formats ?
[Bejoy] For TextFile, you definitely need to compress if you plan to
store a large volume of data. You can compress SequenceFiles as well with LZO.
2)
Hi there,
today I've received a spam e-mail and wondered how they could have caught my
address.
I knew that all posts get archived at http://mail-archives.apache.org, but
usually the
sender address will be anonymised like hadoop hive hadooph...@gmail.com
Unfortunately, any other e-mail
Martin, I've just been maintaining a completely separate email account for
mailing lists for this reason.
On 19/01/2012, at 4:45, Martin Kuhn martin.k...@affinitas.de wrote:
Hi there,
today I've received a spam e-mail and wondered how they could have caught my
Hi,
I have two tables with 8 buckets each on the same key and want to join them.
I ran explain extended and got the plan produced by Hive, which shows that a
map-side join is a possible plan.
I then set in my script the hive.optimize.bucketmapjoin option to true and
reran the explain extended
Hi Avrila
AFAIK the bucketed map join is not the default in Hive and it happens only
when the value is set to true. It could be because the same value is already
set in the hive configuration xml file. To cross confirm the same could you
explicitly set this to false
(set
Corrected a few typos in previous mail
Hi Avrila
AFAIK the bucketed map join is not default in hive and it happens only
when the configuration parameter hive.optimize.bucketmapjoin is set to true.
You may be getting the same execution plan because hive.optimize.bucketmapjoin
We are running this query:
select name, sum_id
from (
select name, players,
array(player1, player2, player3, player4, player5, player6, player7,
player8) arr
from (
select name,
get_json_object(roster_json, '$.memberList.playerId') players,
get_json_object(roster_json,
Best Regards,
Mark Sun
| ebay| APD - China | Office: +86-21-2891 3771 | Cell Phone: +86-13023281237
Enable everyone (eBay, Inc Customers) to courageously make more faster
winning decisions leveraging data analytics products that are trustworthy,
integrated and reliable.
Hey guys,
We found an issue that looks like a bug (Hive 0.8, Cloudera's distribution).
SELECT count(1)
FROM table1 a LEFT OUTER JOIN table2 b
ON ( a.key1 = b.key1 AND a.key2 = b.KEY2)
fails with java.lang.IllegalArgumentException: Can not create a Path
from an empty string ( full stack