Re: question about partition table in hive

2013-09-13 Thread Jagat Singh
n said earlier , experiment like crazy - and share please - it > will make all of us better as well ! > > > Thanks > > sanjay > > From: ch huang > Reply-To: "user@hive.apache.org" > Date: Thursday, September 12, 2013 6:55 PM > To: "user@hive.apache

Re: question about partition table in hive

2013-09-13 Thread Sanjay Subramanian
quot; mailto:user@hive.apache.org>> Subject: question about partition table in hive hi,all: i use flume collect log data and put it in hdfs ,i want to use hive to do some caculate, query based on timerange,i want to use parttion table , but the data file in hdfs is a big fi

Re: question about partition table in hive

2013-09-13 Thread Dean Wampler
Flume might be able to invoke Hive to do this as the data is ingested, but I don't know anything about Flume. Brent has a nice blog post describing many of the details of partitioning. http://www.brentozar.com/archive/2013/03/introduction-to-hive-partitioning/ We also cover them in our book. The

Re: question about partition table in hive

2013-09-13 Thread Stephen Sprague
and have you done any analysis on this yet using the Hive documentation that's publicly available? if you show some initiative yourself you're more likely to get others to join your cause. :) So what have you tried before asking us for help? On Thu, Sep 12, 2013 at 6:55 PM, ch huang wrote: >

Re: question about partition table in hive

2013-09-13 Thread Nitin Pawar
You will need to define a partition column like date or hour something like this. Then configure flume to rollover filee/directories based on your partition column. You will need some kind of cron which will check for the new data being available into a directory or file and then add it as partitio

question about partition table in hive

2013-09-12 Thread ch huang
hi,all: i use flume collect log data and put it in hdfs ,i want to use hive to do some caculate, query based on timerange,i want to use parttion table , but the data file in hdfs is a big file ,how can i put it into pratition table in hive?