zherenyu831 edited a comment on issue #1798: URL: https://github.com/apache/hudi/issues/1798#issuecomment-657193426
@umehrot2 @vinothchandar Sorry for lately reply. Here is my snapshot of spark ui. First query I used, files processed by resolveRelation was 950 ``` spark.read.format("org.apache.hudi").load("s3://daas-hudi-test/paylite_payment_read/orders_v6/data/*/*/*").count() ``` and second I used below query, and files processed by resolveRelation was 4750 ``` spark.read.format("org.apache.hudi").load("s3://daas-hudi-test/paylite_payment_read/orders_v6/data/*/*/*/*").count() ``` since we are using spark stream to write data into the table, so the file size will be changed a little when second query run. <img width="1665" alt="スクリーンショット 2020-07-12 17 51 22" src="https://user-images.githubusercontent.com/52404525/87242515-61857300-c468-11ea-9e23-a874afed66b8.png"> <img width="1666" alt="スクリーンショット 2020-07-12 17 51 28" src="https://user-images.githubusercontent.com/52404525/87242522-6a764480-c468-11ea-89f1-f865875783fe.png"> ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org