[ https://issues.apache.org/jira/browse/HIVE-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13507271#comment-13507271 ]
Harsh J commented on HIVE-2250:
-------------------------------

If we don't really make use of the IS_COMPRESSED attribute of a table, should we just get rid of it (or at least not print it in the {{describe extended/formatted}} output, which causes great confusion as it is always certainly {{No}})?

> "DESCRIBE EXTENDED table_name" shows inconsistent compression information.
> --------------------------------------------------------------------------
>
>                 Key: HIVE-2250
>                 URL: https://issues.apache.org/jira/browse/HIVE-2250
>             Project: Hive
>          Issue Type: Bug
>          Components: CLI, Diagnosability
>    Affects Versions: 0.7.0
>         Environment: RHEL, Full Cloudera stack
>            Reporter: Travis Powell
>            Assignee: subramanian raghunathan
>            Priority: Critical
>         Attachments: HIVE-2250.patch
>
>
> Commands executed in this order:
> user@node # hive
> hive> SET hive.exec.compress.output=true;
> hive> SET io.seqfile.compression.type=BLOCK;
> hive> CREATE TABLE table_name ( [...] ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS SEQUENCEFILE;
> hive> CREATE TABLE staging_table ( [...] ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t';
> hive> LOAD DATA LOCAL INPATH 'file:///root/input/' OVERWRITE INTO TABLE staging_table;
> hive> INSERT OVERWRITE TABLE table_name SELECT * FROM staging_table;
> (Map reduce job to change to sequence file...)
> hive> DESCRIBE EXTENDED table_name;
> Detailed Table Information Table(tableName:table_name,
> dbName:benchmarking, owner:root, createTime:1309480053, lastAccessTime:0,
> retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:session_key,
> type:string, comment:null), FieldSchema(name:remote_address, type:string,
> comment:null), FieldSchema(name:canister_lssn, type:string, comment:null),
> FieldSchema(name:canister_session_id, type:bigint, comment:null),
> FieldSchema(name:tltsid, type:string, comment:null), FieldSchema(name:tltuid,
> type:string, comment:null), FieldSchema(name:tltvid, type:string,
> comment:null), FieldSchema(name:canister_server, type:string, comment:null),
> FieldSchema(name:session_timestamp, type:string, comment:null),
> FieldSchema(name:session_duration, type:string, comment:null),
> FieldSchema(name:hit_count, type:bigint, comment:null),
> FieldSchema(name:http_user_agent, type:string, comment:null),
> FieldSchema(name:extractid, type:bigint, comment:null),
> FieldSchema(name:site_link, type:string, comment:null), FieldSchema(name:dt,
> type:string, comment:null), FieldSchema(name:hour, type:int, comment:null)],
> location:hdfs://hadoop2/user/hive/warehouse/benchmarking.db/table_name,
> inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat,
> outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat,
> compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null,
> serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,
> parameters:{serialization.format= , field.delim=
> *** SEE ABOVE: Compression is set to FALSE, even though contents of table is
> compressed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
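
Since the metastore flag reported by DESCRIBE EXTENDED does not reflect how the data was written, one way to check the files themselves is to read the SequenceFile header, which records the compression settings. The following is a minimal Python sketch, not part of Hive or of the attached patch. It assumes the usual SequenceFile header layout (magic "SEQ", a version byte, key and value class names, a compression flag, a block-compression flag, then the codec class name when compression is on), handles only header versions 5 and 6, and assumes each class name is shorter than 128 bytes so its length prefix is a single byte. The function name sequence_file_compression is hypothetical.

```python
import io
import struct


def _read_short_string(stream):
    """Read a Hadoop Text-style string: a length prefix, then UTF-8 bytes.

    Assumption: the length fits in one signed byte (0..127), which is the
    single-byte case of Hadoop's variable-length integer encoding.
    """
    prefix = stream.read(1)
    if len(prefix) != 1:
        raise ValueError("truncated header")
    (length,) = struct.unpack("b", prefix)
    if length < 0:
        raise ValueError("length prefix outside the single-byte case")
    data = stream.read(length)
    if len(data) != length:
        raise ValueError("truncated header")
    return data.decode("utf-8")


def sequence_file_compression(header: bytes) -> dict:
    """Report the compression settings stored in a SequenceFile header.

    `header` is the first few hundred bytes of a data file found under the
    table's location.
    """
    stream = io.BytesIO(header)
    magic = stream.read(4)
    if len(magic) != 4 or magic[:3] != b"SEQ":
        raise ValueError("not a SequenceFile")
    version = magic[3]
    if version < 5:
        raise ValueError("this sketch only handles header versions 5 and 6")

    key_class = _read_short_string(stream)
    value_class = _read_short_string(stream)

    flags = stream.read(2)
    if len(flags) != 2:
        raise ValueError("truncated header")
    compressed = flags[0] != 0
    block_compressed = flags[1] != 0

    # The codec class name is only present when compression is enabled.
    codec = _read_short_string(stream) if compressed else None

    return {
        "version": version,
        "key_class": key_class,
        "value_class": value_class,
        "compressed": compressed,
        "block_compressed": block_compressed,
        "codec": codec,
    }
```

The header bytes could be obtained, for example, by piping `hadoop fs -cat` on one of the table's data files through `head -c 512` and passing the result to the function. For a table populated as in the report (hive.exec.compress.output=true and io.seqfile.compression.type=BLOCK), one would expect both flags to come back true even though DESCRIBE EXTENDED prints compressed:false.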