[GitHub] [hudi] rubenssoto commented on issue #2484: [SUPPORT] Hudi Write Performance

2021-01-25 Thread GitBox
rubenssoto commented on issue #2484: URL: https://github.com/apache/hudi/issues/2484#issuecomment-767143513 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [hudi] rubenssoto commented on issue #2484: [SUPPORT] Hudi Write Performance

2021-01-25 Thread GitBox
rubenssoto commented on issue #2484: URL: https://github.com/apache/hudi/issues/2484#issuecomment-767173123 Great, thank you for the explanationits makes sense. If I understand this code right, Hudi will order by partition key and record key, so if I have an unpartitioned table

[GitHub] [hudi] rubenssoto commented on issue #2484: [SUPPORT] Hudi Write Performance

2021-01-25 Thread GitBox
rubenssoto commented on issue #2484: URL: https://github.com/apache/hudi/issues/2484#issuecomment-767155405 Do you mean an Order By before df.write.format('hudi').save() ? This is an automated message from the Apache

[GitHub] [hudi] rubenssoto commented on issue #2484: [SUPPORT] Hudi Write Performance

2021-01-25 Thread GitBox
rubenssoto commented on issue #2484: URL: https://github.com/apache/hudi/issues/2484#issuecomment-767143513 Hello, I changed the option hoodie.datasource.write.row.writer.enable and took only 21 minutes, 30% faster, great