[ https://issues.apache.org/jira/browse/KYLIN-5650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiaoxiang Yu resolved KYLIN-5650. --------------------------------- Resolution: Fixed > In the cloud environment, there is a probability that the dictionary metadata > file will be read abnormally during building job, resulting in incorrect > query results. > --------------------------------------------------------------------------------------------------------------------------------------------------------------------- > > Key: KYLIN-5650 > URL: https://issues.apache.org/jira/browse/KYLIN-5650 > Project: Kylin > Issue Type: Bug > Components: Tools, Build and Test > Affects Versions: 5.0-alpha > Reporter: Zhiting Guo > Assignee: Zhiting Guo > Priority: Major > Fix For: 5.0-beta > > Attachments: In the cloud environment, there is a probability that > the dictionary metadata file will be read abnormally during building job, > resulting in incorrect query results..pdf > > > Checked the dictionary, there are no duplicate values. Checked the execution > plan of the build dictionary step, there is no problem. Checked the steps of > building a flat table and found that there was a problem in the step of flat > table encoding dictionary. > The reason for the error is that the encoding is not performed after > repartition according to the dictionary column. As shown in the figure, there > is no repartition, and the encode column appears in the plan. > There are also the following logs: > {code:java} > 2023-03-26T20:26:30,868 INFO [logger-thread-0] dict.NGlobalDictHDFSStore : > Commit from > s3a://datalake-kc-s3-prd-bj/kylin/kcprodYcHG_kylin/datalake_kylin/dict/global_dict/GDT.GDT_CMPLYA_FCT_DIST_RESLT/IS_STAT/working > to > s3a://datalake-kc-s3-prd-bj/kylin/kcprodYcHG_kylin/datalake_kylin/dict/global_dict/GDT.GDT_CMPLYA_FCT_DIST_RESLT/IS_STAT/version_1679862387539 > 2023-03-26T20:31:14,501 INFO [logger-thread-0] dict.NGlobalDictionaryV2 : > getMetaInfo versions.length is 12 > 2023-03-26T20:31:14,547 INFO [logger-thread-0] dict.NGlobalDictHDFSStore : > because metaFiles.length is 0, metaInfo is null > 2023-03-26T20:31:14,547 INFO [logger-thread-0] dict.NGlobalDictionaryV2 : > getMetaInfo metadata is null : [true]{code} > This is on s3, after renaming the dictionary directory, no metadata file is > queried. However, if the meta is not obtained in the code and no error is > reported, it is not reasonable to encode directly without repartition. In > short, the result is that the encoding of the dictionary column on the flat > table fails. -- This message was sent by Atlassian Jira (v8.20.10#820010)