tooptoop4 commented on issue #1955:
URL: https://github.com/apache/hudi/issues/1955#issuecomment-679356547


   Perfect, that is the behaviour I expect. Perhaps the default should be a 
global index, or the documentation should be updated?
   
   Coming from an RDBMS background, I expect the PK to be unique at table 
level, not at partition level. Reading the configs below, it is not clear that 
the Hudi default is different, and I'm sure this will trip up many newcomers 
to Hudi:
   
   "RECORDKEY_FIELD_OPT_KEY (Required): **Primary key** field(s). Nested fields 
can be specified using the dot notation eg: a.b.c. When using multiple columns 
as primary key use comma separated notation, eg: "col1,col2,col3,etc". Single 
or multiple columns as primary key specified by KEYGENERATOR_CLASS_OPT_KEY 
property.
   Default value: "uuid"
   
   PARTITIONPATH_FIELD_OPT_KEY (Required): Columns to be used for 
**partitioning** the table. To prevent partitioning, provide empty string as 
value eg: "". Specify partitioning/no partitioning using 
KEYGENERATOR_CLASS_OPT_KEY. If synchronizing to hive, also specify using 
HIVE_PARTITION_EXTRACTOR_CLASS_OPT_KEY.
   Default value: "partitionpath""
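   
   To make the difference concrete, here is a minimal toy model in plain 
Python (not Hudi code). The `upsert` helper and the sample records are 
hypothetical. It only illustrates how key uniqueness differs when the lookup 
is scoped to a partition versus the whole table. The option keys in 
`hudi_options` are the ones I believe select a global index, but please check 
them against the configuration reference for your Hudi version.

```python
# Toy model only: a "table" is a dict keyed by whatever the index uses
# to identify an existing record. This is NOT how Hudi is implemented.

def upsert(table, records, global_index):
    """Insert or overwrite records, returning the updated table."""
    for rec in records:
        if global_index:
            # Global index: the record key alone identifies a record.
            lookup = rec["uuid"]
        else:
            # Partition-scoped index: (partition path, record key) identifies it.
            lookup = (rec["partitionpath"], rec["uuid"])
        table[lookup] = rec
    return table


# Same record key arriving in two different partitions.
records = [
    {"uuid": "k1", "partitionpath": "2020/08/01", "value": 1},
    {"uuid": "k1", "partitionpath": "2020/08/02", "value": 2},
]

partition_scoped = upsert({}, records, global_index=False)
global_scoped = upsert({}, records, global_index=True)

print(len(partition_scoped))  # two rows: the key is unique per partition only
print(len(global_scoped))     # one row: the key is unique across the table

# Assumed write options for requesting a global index (verify for your version).
hudi_options = {
    "hoodie.datasource.write.recordkey.field": "uuid",
    "hoodie.datasource.write.partitionpath.field": "partitionpath",
    "hoodie.index.type": "GLOBAL_BLOOM",
}
```

   The toy model simply overwrites the earlier row in the global case; which 
partition the surviving record ends up in is a separate question that this 
sketch does not try to answer.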
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

