Jonathan Vexler created HUDI-5263:
-------------------------------------

             Summary: Setting partitioned by (partition_path) with nonpartitioned keygenerator in spark-sql will cause the column to be null
                 Key: HUDI-5263
                 URL: https://issues.apache.org/jira/browse/HUDI-5263
             Project: Apache Hudi
          Issue Type: Bug
          Components: spark-sql
            Reporter: Jonathan Vexler

When a table is created with both a partitioned by clause and the non-partitioned key generator, for example:

{code:java}
create table hudi_cow_pt_tbl (
  id bigint,
  name string,
  ts bigint,
  dt string,
  hh string
) using hudi
tblproperties (
  type = 'cow',
  primaryKey = 'id',
  preCombineField = 'ts',
  hoodie.table.keygenerator.class = 'org.apache.hudi.keygen.NonpartitionedKeyGenerator'
)
partitioned by (dt)
{code}

the dt column is always null when the records are read back. I don't know whether the data is stored as null or only reads as null.

If this is due to implementation issues and the only fix would be to fail the table creation, I think that is preferable to the current behavior.