Bucketed table info

2016-11-30 Thread Vijay Ramachandran
Hi. If I have an ORC table bucketed and sorted on a column, where does Hive keep the mapping from column value to bucket? Specifically, if I know the column value and need to find the specific HDFS file, is there an API to do this? Related, is there any documentation on how the read path works f
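
For context on the question above: as far as I know, Hive does not store a value-to-bucket lookup anywhere; the bucket is recomputed from a hash of the column value. The sketch below assumes the older bucketing scheme (sign bit masked off, then modulo the bucket count) and an INT bucketing column, whose hash is the value itself. The function names are hypothetical, not a Hive API, and the `000002_0` style file name is only the common convention; the actual names depend on how the table was written. Newer Hive versions can use a different hash, so treat this as an illustration rather than a reference.

```python
# Minimal sketch, NOT a Hive API: helper names are made up for illustration.
# Assumes the older Hive bucketing scheme and an INT bucketing column.

INT_MAX = 0x7FFFFFFF  # same bit pattern as Java's Integer.MAX_VALUE


def bucket_for_int(value: int, num_buckets: int) -> int:
    """Return the bucket index for a 32-bit integer column value.

    For INT columns the hash is assumed to be the value itself; the sign
    bit is masked off so the result of the modulo is never negative.
    """
    return (value & INT_MAX) % num_buckets


def bucket_file_name(bucket: int) -> str:
    """Return the conventional file name for a bucket (an assumption:
    real names depend on the job that wrote the table)."""
    return f"{bucket:06d}_0"


if __name__ == "__main__":
    b = bucket_for_int(42, 8)
    print(b, bucket_file_name(b))
```

Other column types (strings, bigints) hash differently, so the same masking and modulo step would need the matching hash function in front of it.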

RE: on duplicate update equivalent?

2016-09-23 Thread Vijay Ramachandran
Dimensions change, and I'd rather do an update than recreate a snapshot. On 23-Sep-2016 17:23, "Markovitz, Dudu" wrote: > If these are dimension tables, what do you need to update there? > Dudu > *From:* Vijay Ramachandran [mailto:vi...@li

Re: on duplicate update equivalent?

2016-09-23 Thread Vijay Ramachandran
On Fri, Sep 23, 2016 at 3:47 PM, Mich Talebzadeh wrote: > What is the use case for UPSERT in Hive? The functionality does not exist > but there are other solutions. > > Are we talking about a set of dimension tables with primary keys that need > to be updated (existing rows) or inserted (new rows)

on duplicate update equivalent?

2016-09-23 Thread Vijay Ramachandran
Hello. Is there a way to write a query with behaviour equivalent to MySQL's "on duplicate update"? That is, try to insert, and if the key exists, update the row instead? Thanks.
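
To make the requested semantics concrete, here is a small sketch of "insert, or update if the key exists" over in-memory rows. It is not Hive code; the `upsert` function and the sample rows are hypothetical. It mirrors the usual workaround of rebuilding the table from the old rows plus the incoming rows, with the incoming row winning on a key collision.

```python
# Minimal sketch of upsert semantics; names and data are illustrative only.

def upsert(existing, incoming, key):
    """Merge incoming rows into existing rows by key.

    Rows whose key is new are appended; rows whose key already exists
    have their columns overwritten by the incoming values.
    """
    merged = {row[key]: dict(row) for row in existing}
    for row in incoming:
        # Start from the old row (if any) and overlay the incoming columns.
        merged[row[key]] = {**merged.get(row[key], {}), **row}
    return list(merged.values())


if __name__ == "__main__":
    old = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
    new = [{"id": 2, "name": "B"}, {"id": 3, "name": "c"}]
    for row in upsert(old, new, "id"):
        print(row)
```

In SQL terms this corresponds to overwriting the target with a join or union of the current snapshot and the new batch, which is one of the "other solutions" the replies in this thread allude to.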