[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-03-20 Thread GitBox
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits 
error when writing data to MOR/COW table
URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-601916430
 
 
   That is because i copied and pasted the folder to a new location for 
troubleshooting. The actual  creation times match the commit times pretty much.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-03-04 Thread GitBox
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits 
error when writing data to MOR/COW table
URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-594824906
 
 
   It's not easily reproducible. It seems to be a sporadic thing that happens 
from time to time; from the 95 tables we ingest (every 4 hours) for the last 
couple of weeks, the error has occurred 4 or 5 times. The jobs that created 
these zero bytes files seem to complete successfully and without any error.
   
   In the Spark logs for the job, things seem ok (this is the only mention of 
`.clean` files):
   `20/02/21 21:11:13 INFO S3NativeFileSystem2: rename 
s3://bucket/path/to/table/.hoodie/20200221211054.clean.inflight 
s3://bucket/path/to/table/.hoodie/20200221211054.clean`
   
   The s3 `.hoodie` folder looks like (ignore the dates on the files, i copied 
and pasted from the original folder): 
   
![image](https://user-images.githubusercontent.com/14202231/75920973-2ec1d300-5e15-11ea-9794-ab297fdf9b9f.png)
   and the archived folder looks like:
   
![image](https://user-images.githubusercontent.com/14202231/75921034-49944780-5e15-11ea-9c86-5d090c668fcc.png)
   
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [incubator-hudi] bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-03-02 Thread GitBox
bwu2 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits 
error when writing data to MOR/COW table
URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-593578500
 
 
   I'm not sure if this is a separate issue or not (seems very similar but not 
identical), so am leaving a comment here. Somehow, we have an empty `.clean` 
file (the commit was successful). This has caused:
   
   `20/03/02 18:17:56 ERROR HoodieCommitArchiveLog: Failed to archive commits, 
.commit file: 20200221211054.clean
   java.io.IOException: Not an Avro data file`
   
   Note this was created by a production release (0.5.0) version of Hudi. We 
updated our Hudi version to 0.5.1 (and 0.5.2) and it's still not able to handle 
it. I guess we can delete the 0 byte offending `.clean` file and everything 
will work. 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services