"DESCRIBE EXTENDED table_name" does not show that compression is turned on.
---------------------------------------------------------------------------
Key: HIVE-2250
URL: https://issues.apache.org/jira/browse/HIVE-2250
Project: Hive
Issue Type: Bug
Components: CLI, Diagnosability
Affects Versions: 0.7.0
Environment: RHEL, Full Cloudera stack
Reporter: Travis Powell
Priority: Critical
Commands executed in this order:
user@node # hive
hive> SET hive.exec.compress.output=true;
hive> SET io.seqfile.compression.type=BLOCK;
hive> CREATE TABLE table_name ( [...] ) ROW FORMAT DELIMITED FIELDS TERMINATED
BY '\t' STORED AS SEQUENCEFILE;
hive> CREATE TABLE staging_table ( [...] ) ROW FORMAT DELIMITED FIELDS
TERMINATED BY '\t';
hive> LOAD DATA LOCAL INPATH 'file:///root/input/' OVERWRITE INTO TABLE
staging_table;
hive> INSERT OVERWRITE TABLE table_name SELECT * FROM staging_table;
(Map reduce job to change to sequence file...)
hive> DESCRIBE EXTENDED table_name;
Detailed Table Information Table(tableName:table_name,
dbName:benchmarking, owner:root, createTime:1309480053, lastAccessTime:0,
retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:session_key,
type:string, comment:null), FieldSchema(name:remote_address, type:string,
comment:null), FieldSchema(name:canister_lssn, type:string, comment:null),
FieldSchema(name:canister_session_id, type:bigint, comment:null),
FieldSchema(name:tltsid, type:string, comment:null), FieldSchema(name:tltuid,
type:string, comment:null), FieldSchema(name:tltvid, type:string,
comment:null), FieldSchema(name:canister_server, type:string, comment:null),
FieldSchema(name:session_timestamp, type:string, comment:null),
FieldSchema(name:session_duration, type:string, comment:null),
FieldSchema(name:hit_count, type:bigint, comment:null),
FieldSchema(name:http_user_agent, type:string, comment:null),
FieldSchema(name:extractid, type:bigint, comment:null),
FieldSchema(name:site_link, type:string, comment:null), FieldSchema(name:dt,
type:string, comment:null), FieldSchema(name:hour, type:int, comment:null)],
location:hdfs://hadoop2/user/hive/warehouse/benchmarking.db/table_name,
inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat,
outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat,
compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null,
serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,
parameters:{serialization.format= , field.delim=
*** SEE ABOVE: Compression is set to FALSE, even though contents of table is
compressed.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira