tooptoop4 commented on issue #1955:
URL: https://github.com/apache/hudi/issues/1955#issuecomment-679356547
perfect, that is how I expect. perhaps the default should be global index?
or documentation should be updated?
From coming from RDBMS background the PK is unique at table level not at
partition level but reading below configs it is not clear that hudi default is
different and I'm sure will trip up many newcomers to hudi:
"RECORDKEY_FIELD_OPT_KEY (Required): **Primary key** field(s). Nested fields
can be specified using the dot notation eg: a.b.c. When using multiple columns
as primary key use comma separated notation, eg: "col1,col2,col3,etc". Single
or multiple columns as primary key specified by KEYGENERATOR_CLASS_OPT_KEY
property.
Default value: "uuid"
PARTITIONPATH_FIELD_OPT_KEY (Required): Columns to be used for
**partitioning** the table. To prevent partitioning, provide empty string as
value eg: "". Specify partitioning/no partitioning using
KEYGENERATOR_CLASS_OPT_KEY. If synchronizing to hive, also specify using
HIVE_PARTITION_EXTRACTOR_CLASS_OPT_KEY.
Default value: "partitionpath""
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org