Reporting deadlink at GettingStarted.

2013-02-21 Thread 치민 박
Hello Hive guys. I found a deadline in the GettingStarted document. https://cwiki.apache.org/confluence/display/Hive/GettingStarted But have no way to fix it. so i'm reporting deadline and giving a link which maybe a correct link. wget http://www.grouplens.org/system/files/ml-data.tar+0.gz =>

Re: unbalanced transaction calls

2013-02-21 Thread Hemanth Yamijala
Hi, We are running into the same problem as well. Is there any clue what could be wrong ? Thanks hemanth On Wed, Feb 6, 2013 at 1:51 AM, James Warren < james.war...@stanfordalumni.org> wrote: > As part of our daily workflow, we're running a few hundred hive > queries that are coordinated throu

Re: hive 0.10.0 doc

2013-02-21 Thread Lefty Leverenz
> > Can someone point me to the apache docs for hive 0.10.0? Now you can use the Hive wiki for all documentation except javadocs: - https://cwiki.apache.org/confluence/display/Hive/Home I've just added two docs that weren't originally in the wiki, and for everything else the wiki versions ar

Re: ROW_NUMBER() equivalent in Hive

2013-02-21 Thread Ashutosh Chauhan
Hi Stephen, As I indicated in my previous email, check out file ql/src/test/queries/ clientpositive/ptf_general_queries.q it has plenty of example queries demonstrating the functionality which is available. If you are interested in hive src changes which has enabled this feature.. you may want to

Re: ROW_NUMBER() equivalent in Hive

2013-02-21 Thread Stephen Boesch
Hi Ashutosh, I am interested / reviewing your windowing feature. Can you be more specific about which (a) tests and (b) src files constitute your additions (there are lots of files there ;) ) thanks stephen boesch 2013/2/21 Ashutosh Chauhan > Kumar, > > If you are willing to be on bleedi

please remove me

2013-02-21 Thread Erik Thorson
Can you please take me off the mailing list. Erik Thorson Varick Media Management Lead Engineer 212.337.4796 201.694.1122 [cid:925415C8-B8A6-494F-86E5-213E94DA91FA] [cid:44B259FA-6E9B-447C-8D2E-3BB0CA6A1D7B] [cid:53F1B684-5196-4372-A160-3945F706F949]

Re: ROW_NUMBER() equivalent in Hive

2013-02-21 Thread kumar mr
Owen, it's for entire table. the sample TD query looks like below, SELECT columnA ,columnB , columnC , columnD , columnX ,ROW_NUMBER() OVER (PARTITION BY columnA, columnB, columnC ORDER BY columnX DESC, columnY DESC) AS rank FROM table a Rega

Re: ower of hive table

2013-02-21 Thread Abdelrahman Shettia
Hi All , The ownership of hive table is controlled by the ownership of its hdfs dir. The table dir on hdfs ownership can be changed by running the following command. hadoop fs -chown $user:$grp -R $table_dir $table_dir: can be obtained by running in hive shell . > desc extended 'table_name' ;

Re: ROW_NUMBER() equivalent in Hive

2013-02-21 Thread Ashutosh Chauhan
Kumar, If you are willing to be on bleeding edge, this and many other partitioning and windowing functionality some of us are developing in a branch over at: https://svn.apache.org/repos/asf/hive/branches/ptf-windowing Check out this branch, build hive and than you can have row_number() functional

Re: ROW_NUMBER() equivalent in Hive

2013-02-21 Thread Owen O'Malley
What are the semantics for ROW_NUMBER? Is it a global row number? Per a partition? Per a bucket? -- Owen On Wed, Feb 20, 2013 at 11:33 PM, kumar mr wrote: > Hi, > > This is Kumar, and this is my first question in this group. > > I have a requirement to implement ROW_NUMBER() from Teradata in

Re: Hive 0.7.1 Query hands

2013-02-21 Thread Jarek Jarcec Cecho
Hi sir, the root cause of your issues seems to be java.io.EOFException, that based on the java doc description means the following: "Signals that an end of file or end of stream has been reached unexpectedly during input." What is the health status of the box with ip 10.6.0.55? Isn't it by an

Re: Adding comment to a table for columns

2013-02-21 Thread Chunky Gupta
Hi Bejoy, I checked and didn't find anywhere using extended and formatted simultaneously to describe table. It is like :- DESCRIBE [EXTENDED|FORMATTED] table_name[DOT col_name ( [DOT field_name] | [DOT '$elem$'] | [DOT '$key$'] | [DOT '$value$'] )* ] Everywhere it says i can use only one at a ti

Re: Using HiveJDBC interface

2013-02-21 Thread Aditya Rao
Thanks for the tips. I would think #2 works well when you are setting hiveconf variables that are isolated to your query. I have instances in my scripts where I need to set hadoop properties before executing a query. For example setting the number of reducers using set mapred.reduce.tasks=50 With

Re: Adding comment to a table for columns

2013-02-21 Thread bejoy_ks
Hi Gupta Try out DESCRIBE EXTENDED FORMATTED I vaguely recall a operation like this. Please check hive wiki for the exact syntax. Regards Bejoy KS Sent from remote device, Please excuse typos -Original Message- From: Chunky Gupta Date: Thu, 21 Feb 2013 17:15:37 To: ; ; Reply-To:

Re: Adding comment to a table for columns

2013-02-21 Thread Chunky Gupta
Hi Bejoy, Bhaskar I tried using FORMATTED, but it will not give me comments which I have put while creating table. Its output is like :- col_namedata_type comment cstring from deserializer timestring

Hive 0.7.1 Query hands

2013-02-21 Thread Павел Мезенцев
Hello! I use Hive 0.7.1 over Hadoop 0.20.2 (CHD3u3) on 70 nodes cluster. I have a trouble with query like this: *FROM* ( *SELECT* *id*, {expressions} *FROM* table1 *WHERE* day='2013-02-16' *AND* ({conditions1})*UNION* *ALL* *SELECT* *id*, {expressions} *FROM* table2 *WHERE* day='2013-02-16'

RE: Adding comment to a table for columns

2013-02-21 Thread Bhaskar, Snehalata
Try using 'describe formatted' command i.e. describe formatted test Thanks and regards, Snehalata Deorukhkar From: Chunky Gupta [mailto:chunky.gu...@vizury.com] Sent: Thursday, February 21, 2013 4:47 PM To: user@hive.apache.org Subject: Adding comment to a table for columns Hi, I am using th

Re: Adding comment to a table for columns

2013-02-21 Thread bejoy_ks
Hi Gupta You can the describe output in a formatted way using DESCRIBE FORMATTED ; Regards Bejoy KS Sent from remote device, Please excuse typos -Original Message- From: Chunky Gupta Date: Thu, 21 Feb 2013 16:46:30 To: Reply-To: user@hive.apache.org Subject: Adding comment to a tab

Adding comment to a table for columns

2013-02-21 Thread Chunky Gupta
Hi, I am using this syntax to add comments for all columns :- CREATE EXTERNAL TABLE test ( c STRING COMMENT 'Common class', time STRING COMMENT 'Common time', url STRING COMMENT 'Site URL' ) PARTITIONED BY (dt STRING ) LOCATION 's3://BucketName/' Output of Describe Extended table is like :- (O

Re: Running Hive on multi node

2013-02-21 Thread bejoy_ks
Hi Hamad Fully distributed is a proper cluster where all demons are not on the same machine. You can have hadoop installed in three modes - Stand Alone - Pseudo Distributed (all daemons in same machine) and - Fully Distributed Regards Bejoy KS Sent from remote device, Please excuse typos --

Re: Running Hive on multi node

2013-02-21 Thread Hamza Asad
what do u mean by fully Distributed? On Thu, Feb 21, 2013 at 2:58 PM, wrote: > ** > Hi > > Hive uses the hadoop installation specified in HADOOP_HOME. If your hadoop > home is configured for fully distributed operation it'll utilize the > cluster itself. > Regards > Bejoy KS > > Sent from remot

Re: Running Hive on multi node

2013-02-21 Thread bejoy_ks
Hi Hive uses the hadoop installation specified in HADOOP_HOME. If your hadoop home is configured for fully distributed operation it'll utilize the cluster itself. Regards Bejoy KS Sent from remote device, Please excuse typos -Original Message- From: Hamza Asad Date: Thu, 21 Feb 2013