Re: Hive on Windows w/Hadoop 2.2.0

2014-02-04 Thread Lefty Leverenz
That's from the first sentence of Getting Started, which needs to be rewritten: DISCLAIMER: Hive has only been tested on unix(linux) and mac systems using > Java 1.6 for now - although it may very well work on other similar > platforms. It does not work on Cygwin. What should it say instead? --

Hive queries for disk usage analysis

2014-02-04 Thread Mungre,Surbhi
Hello All, We are doing some analysis for which we need to determine things like size of the largest row or size of the largest column. By size, I am referring to disk space usage. Does HIVE provide any functions to run such queries? Thanks, Surbhi Mungre Software Engineer www.cerner.com

Re: GenericUDF Testing in Hive

2014-02-04 Thread Raj Hadoop
I want to do a simple test like this - but not working - select ComplexUDFExample(List("a", "b", "c"), "b") from table1 limit 10; FAILED: SemanticException [Error 10011]: Line 1:25 Invalid function 'List' On Tuesday, February 4, 2014 2:34 PM, Raj Hadoop wrote: How to test a Hive Generi

Re: GenericUDF Testing in Hive

2014-02-04 Thread Raj Hadoop
How to test a Hive GenericUDF which accepts two parameters List, T List -> Can it be the output of a collect set. Please advise. I have a generic udf which takes List, T. I want to test it how it works through Hive. On Monday, January 20, 2014 5:19 PM, Raj Hadoop wrote:   The following

RE: Hive on Windows w/Hadoop 2.2.0

2014-02-04 Thread Eric Hanson (BIG DATA)
HDP runs on Windows: http://hortonworks.com/products/hdp-windows/#install I don't know if it uses Cygwin or not but everything needed to run Hive on Windows should be in the install package. Eric From: Ian Jackson [mailto:ian_jack...@trilliumsoftware.com] Sent: Tuesday, February 4, 2014 8:55 A

Hive on Windows w/Hadoop 2.2.0

2014-02-04 Thread Ian Jackson
Reading the Wiki (cwiki.apache.org/confluence/display/Hive/GettingStarted), it states "It does not work on Cygwin." Do further details exist or hints to make work on Windows?

Re: Index not getting used for the queries

2014-02-04 Thread Thilina Gunarathne
Thanks Peter. It helped. That property combined with setting the property 'hive.optimize.index.filter' to 'true' got the index working. thanks, Thilina On Mon, Feb 3, 2014 at 6:12 PM, Peter Marron < peter.mar...@trilliumsoftware.com> wrote: > Hi, > > > > Not sure if it is relevant to your prob

collect_list on two columns of the same row

2014-02-04 Thread ZORAIDA HIDALGO SANCHEZ
Dear all, I need to call to one of my UDFs that receives two arrays as a parameters. Each of these arrays represents the result of collect_list(col) after doing group by. Does the first position of the first array correspond with the first position to the second array? Por instace, having this

Re: Optimising mappers for number of nodes

2014-02-04 Thread Lefty Leverenz
Actually that's mapred.max.split.size. Hive doesn't have a configuration parameter named "hive.max.split.size". -- Lefty On Mon, Feb 3, 2014 at 10:59 AM, Prasanth Jayachandran < pjayachand...@hortonworks.com> wrote: > Hi > > hive.max.split.size can be tuned to decrease the number of mappers. >

Issues in retrieval of hive data-reg

2014-02-04 Thread Selvi. rceg
I am doing my project on big data. I had installed *HADOOP 1.2.1* and *HIVE - 0.11.0* in *UBUNTU 11.10* . I have created tables in hive and structured the contents in the table using hive. Now I need to retrieve the structured data. Can you please help me to retrieve the structured data? On trying