Incremental import from PostgreSQL to Hive having issues

2012-04-12 Thread Roshan Pradeep
Hi, I want to import the updated data from my source (PostgreSQL) into Hive, based on a column (lastmodifiedtime) in PostgreSQL. The command I am using: /app/sqoop/bin/sqoop import --hive-table users --connect jdbc:postgresql:// --table users --username XXX --password YY --hive-home /app/hiv

RE: Doubt related to INDEX in Hive

2012-04-12 Thread binhnt22
An index in Hive is nothing like an index in an RDBMS. It is useful for summary queries such as sum or count with GROUP BY on the indexed columns. For details, read http://www.slideshare.net/NikhilDeshpande/indexed-hive Best regards, Nguyen Thanh Binh (Mr) Cell phone: (+84)98.226.0622 From: Bha

Doubt related to INDEX in Hive

2012-04-12 Thread Bhavesh Shah
Hello, I have just started using INDEX in Hive. I have never used it before. I just want to ask one thing: if I use INDEX in my task, does it change performance? I mean, if I start using it, what about the performance of Hive? Does it improve? --

Re: Hive equivalent of row_number()

2012-04-12 Thread Alex Kozlov
http://www.quora.com/Hive-computing/How-are-SQL-type-analytic-and-windowing-functions-accomplished-in-Hadoop-Hive -- Alex K On Thu, Apr 12, 2012 at 1:43 PM, Saurabh S wrote: > > I have a table with three columns, A, B, and Score,

Hive equivalent of row_number()

2012-04-12 Thread Saurabh S
I have a table with three columns, A, B, and Score, where A and B are some items, and Score is some kind of affinity between A and B. There are N items each of A and B, so the total number of rows in the table is N^2. Is there a way to fetch "top 5 items in B" for each item in A
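For illustration only, the "top 5 items in B for each item in A" logic can be sketched outside Hive in plain Python. This is not a Hive query and not taken from the thread; the function name `top_n_per_group` and the sample rows are hypothetical. It only shows the grouping-then-ranking idea that a row_number()-style function would express.

```python
import heapq
from collections import defaultdict


def top_n_per_group(rows, n=5):
    """Return {a: [(b, score), ...]} keeping the n highest-scoring B items per A.

    rows is an iterable of (a, b, score) tuples (hypothetical layout that
    mirrors the three columns A, B, and Score described in the question).
    """
    grouped = defaultdict(list)
    for a, b, score in rows:
        # Collect every (B, Score) pair under its A item.
        grouped[a].append((b, score))
    # For each A, keep only the n pairs with the largest Score, best first.
    return {
        a: heapq.nlargest(n, pairs, key=lambda pair: pair[1])
        for a, pairs in grouped.items()
    }


if __name__ == "__main__":
    sample = [
        ("a1", "b1", 0.9),
        ("a1", "b2", 0.5),
        ("a1", "b3", 0.7),
        ("a2", "b1", 0.1),
        ("a2", "b2", 0.4),
    ]
    for a, best in top_n_per_group(sample, n=2).items():
        print(a, best)
```

The same shape (partition by A, order by Score descending, keep the first 5 rows of each partition) is what a per-group row number would give inside a query engine.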

RE: Graph/Plotting in Hive/Hadoop

2012-04-12 Thread karanveer.singh
Are there any usage examples of the histogram UDF that you can share, please? Regards, -Original Message- From: Edward Capriolo [mailto:edlinuxg...@gmail.com] Sent: 12 April 2012 01:33 To: user@hive.apache.org Subject: Re: Graph/Plotting in Hive/Hadoop The histogram UDF can be used to create data from GNU-