Hi,
1. The file 'updateTableMapping.json' indicates the path for the update Hive
table mapping file, the format is json, for example:
{
"KYLIN_4485_QA.KYLIN_ACCOUNT": {
"database": "KYLIN_4485_PROD",
"tableName": "KYLIN_ACCOUNT"
},
"KYLIN_4485_QA.KYLIN_CAL_DT": {
"database": "KYLIN_4485_PROD"
},
"KYLIN_4485_QA.KYLIN_CATEGORY_GROUPINGS": {
"database": "KYLIN_4485_PROD"
},
"KYLIN_4485_QA.KYLIN_COUNTRY": {
"database": "KYLIN_4485_PROD"
},
"KYLIN_4485_QA.KYLIN_SALES": {
"database": "KYLIN_4485_PROD"
}
}
If the database and tables are the same between two clusters, it doesn't need
this file.
2. According to the error messages you showed, kylin already starts two threads
to use 'distcp' command to copy hdfs files but it failed, the error messages
seems to related to the env of hadoop, we tested this function on CDH 5 and HDP
2, all worked fine.
Can you check the env of the hadoop you used, something like missing some jars.
Best regards,
Zhichao Zhang