Hello,

I am trying to create a table that is bucketed  and sorted by various
columns. My table is created as a sequence file, and I'm populating it with
the LOAD DATA command. However, I just came across this wiki page (
http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL/BucketedTables) which
says that the data will NOT be bucketed when inserted into the table. It
gives an example of using the CLUSTER BY command in a SELECT statement to
insert the data into the table.

Is it possible to somehow get the same effect by using the LOAD DATA
command? Or do I have to create a separate bucketed and non-bucketed table
for my data and move it around like the example in the link above indicates?

Thanks,
Ryan

Reply via email to