fengpod commented on pull request #1494: URL: https://github.com/apache/kylin/pull/1494#issuecomment-737018145
When a large data is written to hbase cluster at the same time,the cluster load will become very high,which will affect the query performance. This pr allows data to be written data to hadoop hdfs when doing step “Convert Cuboid Data to HFile”,and then hfile will be transferred to the hbase cluster by dist-cp。This pr adds a new step " HFile Distcp To HBase" between “Convert Cuboid Data to HFile” and "Load HFile to HBase Table" 。As look like this: ![image](https://user-images.githubusercontent.com/4843586/100835711-013fae00-34a9-11eb-8de8-e69228ba0991.png) ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org