xiarixiaoyao opened a new pull request #2634:
URL: https://github.com/apache/hudi/pull/2634


   ## *Tips*
   - *Thank you very much for contributing to Apache Hudi.*
   - *Please review https://hudi.apache.org/contributing.html before opening a 
pull request.*
   
   ## What is the purpose of the pull request
   
   fixed the bug that: Failed to query real-time view use hive/spark-sql when 
hudi mor table contains dateType
   test step:
   step1: prepare raw DataFrame with DateType, and insert it to HudiMorTable
   
   df_raw.withColumn("date", lit(Date.valueOf("2020-11-10")))
   
   merge(df_raw, "bulk_insert", "huditest.bulkinsert_mor_10g")
   
   step2: prepare update DataFrame with DateType, and upsert into HudiMorTable
   
    df_update = sql("select * from 
huditest.bulkinsert_mor_10g_rt").withColumn("date", 
lit(Date.valueOf("2020-11-11")))
   
   merge(df_update, "upsert", "huditest.bulkinsert_mor_10g")
   
   step3: use hive-beeeline/ spark-sql query mor_rt table
   
   use beeline/spark-sql   execute   statement select * from 
huditest.bulkinsert_mor_10g_rt where primary_key = 10000000;
   
   then the follow error will occur:
   
   java.lang.ClassCastExceoption: org.apache.hadoop.io.IntWritable cannot be 
cast to org.apache.hadoop.hive.serde2.io.DateWritableV2
   
   ## Brief change log
   when hudi read log file and convert avro INT type record to 
writable,logicalType is not respected which lead the dateType will cast to 
IntWritable。so cast avro INT type  to writable,  logicalType must be  considered
   
   ## Verify this pull request
   Existing UT tests
   ## Committer checklist
   
    - [ ] Has a corresponding JIRA in PR title & commit
    
    - [ ] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to