unical1988 commented on PR #728:
URL: https://github.com/apache/incubator-xtable/pull/728#issuecomment-3267486110
@the-other-tim-brown after adjusting for Month... partition cases, it seems
that an Iceberg error arises: `java.lang.IllegalStateException: Duplicate key
timestamp (attempted merging values
PartitionValue(partitionField=InternalPartitionField(sourceField=InternalField(name=timestamp,
parentPath=file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e.c000.snappy.parquet,
schema=InternalSchema(name=timestamp,
dataType=InternalType.TIMESTAMP(name=timestamp), comment=null,
isNullable=false, fields=null, recordKeyFields=[],
metadata={TIMESTAMP_PRECISION=MILLIS}), defaultValue=null, fieldId=null,
path=file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e.c000.snappy.parquet.timestamp,
pathParts=[file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e,
c000, snappy, parquet
, timestamp]), partitionFieldNames=[year, month], transformType=MONTH,
transformOptions={}), range=Range(minValue=1735516800000,
maxValue=1735516800000)) and
PartitionValue(partitionField=InternalPartitionField(sourceField=InternalField(name=timestamp,
parentPath=file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e.c000.snappy.parquet,
schema=InternalSchema(name=timestamp,
dataType=InternalType.TIMESTAMP(name=timestamp), comment=null,
isNullable=false, fields=null, recordKeyFields=[],
metadata={TIMESTAMP_PRECISION=MILLIS}), defaultValue=null, fieldId=null,
path=file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e.c000.snappy.parquet.timestamp,
pathParts=[file:/C:/Users/AppData/Local/Temp/junit-16256388177344238019/year=2025/month=09/part-00000-1e1d2c53-0322-49b6-a181-ef70bcb6bd4e,
c000, snappy, parquet, timestamp]), partitionFieldNames
=[year, month], transformType=MONTH, transformOptions={}),
range=Range(minValue=20995200000, maxValue=20995200000)))` which apparently
relates to the partitionColumn name being duplicated, but as we discussed the
partition column is timestamp and not year and month, could you check it ?
The non partitioned data case works fine as well as when the data is
partitioned by Year only
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]