Team,
I need edit permission to Confluence HBaseBulkLoad
<https://cwiki.apache.org/confluence/display/Hive/HBaseBulkLoad>. Some
distribution like HDP has enabled 'Tez' as execution engine which by
default. So HFile generation may not work properly and will throw the
following exception.
{code}
Caused by: java.io.IOException: wrong key class:
org.apache.hadoop.io.LongWritable
is not class org.apache.hadoop.hive.ql.io.HiveKey
at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2332)
at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2384)
at org.apache.hadoop.mapreduce.lib.partition.TotalOrderPartitioner.
readPartitions(TotalOrderPartitioner.java:306)
at org.apache.hadoop.mapreduce.lib.partition.TotalOrderPartitioner.setConf(
TotalOrderPartitioner.java:88)
... 27 more
{code}
Some ref, TEZ-2741 <https://issues.apache.org/jira/browse/TEZ-2741>
*Document fix:* Set map-reduce engine when generating HFiles.
*set hive.execution.engine=mr;*
--
Thank you,
*Karthik Palanisamy*
Bangalore, *India*
Mobile : +91 9940089181
Skype : karthik.p01