Hi,
Would there be a performance difference query1 vs query2?

*query1 :*
cc = LOAD '$TBL_CLEARCODE'
USING
org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_description
cf_data:cq_category cf_data:cq_dqtimestamp cf_data:cq_checkarray',
'-loadKey true  ')
as (key,description,category,ActiveStagTmStamp,transformArray);

*query2:*
cc = LOAD '$TBL_CLEARCODE'
USING
org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_description
cf_data:cq_category cf_data:cq_dqtimestamp cf_data:cq_checkarray',
'-loadKey true  *-maxTimestamp $CORR_DATE*')
as (key,description,category,ActiveStagTmStamp,transformArray);

The only difference between between the two querys is the -maxTimestamp
parameter in query2.

Regards,
Krishna

Reply via email to