Jonathan Vexler created HUDI-5263:
-------------------------------------

             Summary: Setting partitioned by (partition_path) with 
nonpartitioned keygenerator in spark-sql will cause the column to be null
                 Key: HUDI-5263
                 URL: https://issues.apache.org/jira/browse/HUDI-5263
             Project: Apache Hudi
          Issue Type: Bug
          Components: spark-sql
            Reporter: Jonathan Vexler


When creating the table, for example: 
{code:java}
create table hudi_cow_pt_tbl (
  id bigint,
  name string,
  ts bigint,
  dt string,
  hh string
) using hudi
tblproperties (
  type = 'cow',
  primaryKey = 'id',
  preCombineField = 'ts',
  hoodie.table.keygenerator.class = 'org.apache.hudi.keygen.NonpartitionedKeyGenerator'
 )
partitioned by (dt) {code}
Combining partitioned by (dt) with NonpartitionedKeyGenerator causes dt to 
always read back as null. I don't know whether the value is stored as null or 
only reads as null. If this is an implementation limitation and the only fix 
is to fail table creation, I think that is preferable to the current behavior.
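
Until this combination is rejected or handled, one way to sidestep it is to keep the key generator and the partitioning clause consistent with each other. The two definitions below are a minimal sketch, not a confirmed fix. The table names hudi_cow_nonpt_tbl and hudi_cow_dt_tbl are made up for illustration, and the second definition assumes that Hudi selects a partition-aware key generator on its own when the keygenerator property is omitted.
{code:sql}
-- Option 1 (assumes a non-partitioned table is what was intended):
-- keep NonpartitionedKeyGenerator and drop the partitioned by clause,
-- so dt is an ordinary data column.
create table hudi_cow_nonpt_tbl (
  id bigint,
  name string,
  ts bigint,
  dt string,
  hh string
) using hudi
tblproperties (
  type = 'cow',
  primaryKey = 'id',
  preCombineField = 'ts',
  hoodie.table.keygenerator.class = 'org.apache.hudi.keygen.NonpartitionedKeyGenerator'
);

-- Option 2 (assumes a table partitioned by dt is what was intended):
-- keep partitioned by (dt) and do not override the key generator.
create table hudi_cow_dt_tbl (
  id bigint,
  name string,
  ts bigint,
  dt string,
  hh string
) using hudi
tblproperties (
  type = 'cow',
  primaryKey = 'id',
  preCombineField = 'ts'
)
partitioned by (dt);
{code}
Neither variant changes the behavior reported here. They only avoid pairing a partition column with a key generator that produces no partition path.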


