n said earlier , experiment like crazy - and share please - it
> will make all of us better as well !
>
>
> Thanks
>
> sanjay
>
> From: ch huang
> Reply-To: "user@hive.apache.org"
> Date: Thursday, September 12, 2013 6:55 PM
> To: "user@hive.apache
quot;
mailto:user@hive.apache.org>>
Subject: question about partition table in hive
hi,all:
i use flume collect log data and put it in hdfs ,i want to use hive to
do some caculate, query based on timerange,i want to use parttion table ,
but the data file in hdfs is a big fi
Flume might be able to invoke Hive to do this as the data is ingested, but
I don't know anything about Flume.
Brent has a nice blog post describing many of the details of partitioning.
http://www.brentozar.com/archive/2013/03/introduction-to-hive-partitioning/
We also cover them in our book. The
and have you done any analysis on this yet using the Hive documentation
that's publicly available?
if you show some initiative yourself you're more likely to get others to
join your cause. :)
So what have you tried before asking us for help?
On Thu, Sep 12, 2013 at 6:55 PM, ch huang wrote:
>
You will need to define a partition column like date or hour something like
this.
Then configure flume to rollover filee/directories based on your partition
column.
You will need some kind of cron which will check for the new data being
available into a directory or file and then add it as partitio
hi,all:
i use flume collect log data and put it in hdfs ,i want to use hive
to do some caculate, query based on timerange,i want to use parttion table ,
but the data file in hdfs is a big file ,how can i put it into pratition
table in hive?