Re: Using BI tools with Hive?

2009-04-28 Thread Min Zhou
This is a interesting topic, any further discussion? On Sat, Apr 18, 2009 at 11:22 PM, Ashish Thusoo wrote: > we are dealing with both of these as far as micro strategy is concerned. > For that jdbc does not work and we need an odbc driver instead of a jdbc > driver. We had tried with the jdbc od

Re: Truncate Table

2009-04-28 Thread Frederick Oko
I had commented a while back on the trust-HDFS removal in HIVE-126 as at hi5 we had always been writing to HDFS directly instead of via Hive CLI load. We agreed a config to reenable this behavior would be good so as to not require coupling of Hive into the data load process tho I agree HIVE-91 exte

Re: Hive Application

2009-04-28 Thread Edward Capriolo
On Tue, Apr 28, 2009 at 2:45 PM, Matt Pestritto wrote: > Thanks for your comments all.  In the short-term, I will probably use simple > bash scripting because we have an immediate need.  I'll work on a long-term > solution. > > Thanks > -Matt > > On Tue, Apr 28, 2009 at 2:04 PM, Prasad Chakka wro

Re: GEO-IP as User Defined Function

2009-04-28 Thread Edward Capriolo
Making it dynamically configurable seems a bit complicated. I will consider publishing myself.

Re: Hive Application

2009-04-28 Thread Matt Pestritto
Thanks for your comments all. In the short-term, I will probably use simple bash scripting because we have an immediate need. I'll work on a long-term solution. Thanks -Matt On Tue, Apr 28, 2009 at 2:04 PM, Prasad Chakka wrote: > > I copy and paste pieces into the CLI. > FYI, there hive –e “

Re: Hive Application

2009-04-28 Thread Prasad Chakka
> I copy and paste pieces into the CLI. FYI, there hive -e "hive ql command" or hive -f "file name" options. I suppose you want more but you can use these write a simple framework that patches up all the different steps in your program. From: John Warden Reply-T

Re: GEO-IP as User Defined Function

2009-04-28 Thread Dhruba Borthakur
We cannot put GPL code into Hive... licenses are incompatible. You can make it a dynamically configurable parameter. If the relevant classes in the CLASSPATH then they will be invoked. Otherwise, the stubs (built into hive) can throw an exception. A customer can download the maxmind stuff into his

Re: Truncate Table

2009-04-28 Thread Prasad Chakka
By having partition level metadata and not deriving it, quite a few things can be achieved * partitions can be external (i.e. Location of a partitions need not be relative to the table) * partitions level schemas so that a table's schema can evolve * partitions can be disabled without

Re: GEO-IP as User Defined Function

2009-04-28 Thread Prasad Chakka
I think it will be very useful to put in Hive (or rather a contrib in hive) but I don't think GPL is compatible with Apache. So it will be a no go putting the maxmind code/data in Hive. Prasad From: Edward Capriolo Reply-To: Date: Tue, 28 Apr 2009 07:42:48 -0

GEO-IP as User Defined Function

2009-04-28 Thread Edward Capriolo
Hey all, You may all be familiar with geo-ip from maxmind. http://www.maxmind.com/app/api. GNU General Public License (GPL) I am running a process where I have to geo locate IP addresses. I think this would be a good UDF. Right now I am using an external map reduce process to be inserted back into

RE: Hive Application

2009-04-28 Thread Ashish Thusoo
We implemented something in house called databee which basically does what John mentioned but Kettle is an open source alternative that does that - I think Pentaho acquired them.. Ashish From: John Warden [john.war...@gmail.com] Sent: Monday, April 27, 20