Re: Re: Re: multiple tables join with only one huge table.

2011-08-14 Thread Daniel,Wu
A simple use case: for retailer data kept for 10 years, that's 10 * 365 = 3650 records in the calendar dimension. If there are 8000 stores and 8000 products, the sales table will have 8000 * 8000 * 3650 = 233,600,000,000 records in total if we have one record for each product/day/store
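The row-count estimate in this message can be checked with a short Python sketch. The figures are the ones quoted in the message; the function and constant names are made up for illustration, and the one-row-per-product/day/store density is the poster's stated assumption, not a measured value.

```python
# Cardinalities quoted in the message (leap days ignored, as in the original).
YEARS = 10
DAYS_PER_YEAR = 365
STORES = 8000
PRODUCTS = 8000


def calendar_rows(years=YEARS, days_per_year=DAYS_PER_YEAR):
    """Rows in the calendar dimension: one per day."""
    return years * days_per_year


def sales_rows(stores=STORES, products=PRODUCTS, days=None):
    """Rows in the sales fact table, assuming one row per product/day/store."""
    if days is None:
        days = calendar_rows()
    return stores * products * days


print(f"calendar dimension: {calendar_rows():,} rows")  # 3,650
print(f"sales fact table:   {sales_rows():,} rows")     # 233,600,000,000
```

The fact table is the full cross product of the three dimensions, so it is the only huge table in the join; the dimensions themselves stay small.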

failed when create an index with partitioned by clause

2011-08-14 Thread Daniel,Wu
create table part (a int, b int) PARTITIONED by (c int);
create index part_idx on table part(b,c) AS 'org.apache.hadoop.hive.ql.index.compact.CompactIndexHandler' WITH DEFERRED REBUILD partitioned by (a);
hive> create index part_idx on table part(b,c) AS

Re: how to load data to partitioned table

2011-08-14 Thread hadoopman
Something else I've noticed when loading LOTS of historical data: if you can load, say, a month of data at a time, try to load just THAT month of data and only that month. I've been able to load several years of data (depending on the data) in a single load; however, there have been

Re: failed when create an index with partitioned by clause

2011-08-14 Thread John Sichi
The wiki docs are incorrect here. CREATE INDEX does not yet support a PARTITIONED BY clause; that was added in the spec to support HIVE-1499, which hasn't been implemented yet. For now, the index partitioning always follows the table partitioning exactly. JVS On Aug 14, 2011, at 3:22 AM,

Re: how to load data to partitioned table

2011-08-14 Thread hadoopman
DISTRIBUTE BY and CLUSTER BY didn't resolve all the issues I've seen with very large data sets. I mean, I'm loading a couple of terabytes in a dataset and running into some rather interesting problems. I noticed, however, loading a month or two at a time (and making sure they were from the

wants to create a JIRA (request): multiple tables join with only one huge table.

2011-08-14 Thread Daniel,Wu
Hi everyone, I'd like to create a change request (or JIRA, not sure). Do you think it's feasible? I searched the documentation on how to contribute, but couldn't find how to create a request. Could anyone point me to the document? At 2011-08-14 17:08:26,Daniel,Wu hadoop...@163.com