[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-601916430 That is because i copied and pasted the folder to a new location for troubleshooting. The actual creation times match the commit times pretty much. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services
[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-594824906 It's not easily reproducible. It seems to be a sporadic thing that happens from time to time; from the 95 tables we ingest (every 4 hours) for the last couple of weeks, the error has occurred 4 or 5 times. The jobs that created these zero bytes files seem to complete successfully and without any error. In the Spark logs for the job, things seem ok (this is the only mention of `.clean` files): `20/02/21 21:11:13 INFO S3NativeFileSystem2: rename s3://bucket/path/to/table/.hoodie/20200221211054.clean.inflight s3://bucket/path/to/table/.hoodie/20200221211054.clean` The s3 `.hoodie` folder looks like (ignore the dates on the files, i copied and pasted from the original folder): ![image](https://user-images.githubusercontent.com/14202231/75920973-2ec1d300-5e15-11ea-9794-ab297fdf9b9f.png) and the archived folder looks like: ![image](https://user-images.githubusercontent.com/14202231/75921034-49944780-5e15-11ea-9c86-5d090c668fcc.png) This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services
[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-593578500 I'm not sure if this is a separate issue or not (seems very similar but not identical), so am leaving a comment here. Somehow, we have an empty `.clean` file (the commit was successful). This has caused: `20/03/02 18:17:56 ERROR HoodieCommitArchiveLog: Failed to archive commits, .commit file: 20200221211054.clean java.io.IOException: Not an Avro data file` Note this was created by a production release (0.5.0) version of Hudi. We updated our Hudi version to 0.5.1 (and 0.5.2) and it's still not able to handle it. I guess we can delete the 0 byte offending `.clean` file and everything will work. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services