Timothy Brown created HUDI-7237:
-----------------------------------

             Summary: Minor Improvements to Schema Handling in Delta Sync
                 Key: HUDI-7237
                 URL: https://issues.apache.org/jira/browse/HUDI-7237
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: Timothy Brown

There are two minor issues that we have run into while running DeltaStreamer in production:

1. The schema is fetched more often than necessary, which can put unnecessary load on schema providers or increase file system reads.
2. SchemaProviders that return null target schemas on empty batches cause null schema values to be written to commits, which leads to unexpected issues later.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
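The two items in the description above could be illustrated with a minimal Java sketch. It is not taken from the Hudi codebase: `SchemaSource`, `CountingSource`, `CachingSchemaSource`, and `resolveTargetSchema` are hypothetical names, schemas are represented as plain strings, and falling back to the last committed schema is only one assumed way of avoiding a null schema in a commit.

```java
final class SchemaHandlingSketch {

    /** Hypothetical stand-in for a schema provider; not Hudi's actual API. */
    interface SchemaSource {
        String getSourceSchema();
        String getTargetSchema(); // may return null, e.g. on an empty batch
    }

    /** Demo source that counts how often the target schema is requested. */
    static final class CountingSource implements SchemaSource {
        private final String source;
        private final String target;
        int targetFetches = 0;

        CountingSource(String source, String target) {
            this.source = source;
            this.target = target;
        }

        public String getSourceSchema() { return source; }

        public String getTargetSchema() {
            targetFetches++;
            return target;
        }
    }

    /** Item 1: ask the underlying source for each schema at most once. */
    static final class CachingSchemaSource implements SchemaSource {
        private final SchemaSource delegate;
        private String source;
        private boolean sourceLoaded;
        private String target;
        private boolean targetLoaded; // flag needed because null is a valid result

        CachingSchemaSource(SchemaSource delegate) { this.delegate = delegate; }

        public String getSourceSchema() {
            if (!sourceLoaded) {
                source = delegate.getSourceSchema();
                sourceLoaded = true;
            }
            return source;
        }

        public String getTargetSchema() {
            if (!targetLoaded) {
                target = delegate.getTargetSchema();
                targetLoaded = true;
            }
            return target;
        }
    }

    /** Item 2: never hand a null target schema to the commit (assumed fallback). */
    static String resolveTargetSchema(SchemaSource provider, String lastCommittedSchema) {
        String target = provider.getTargetSchema();
        return target != null ? target : lastCommittedSchema;
    }

    public static void main(String[] args) {
        // Simulates an empty batch: the provider has no target schema to report.
        CountingSource raw = new CountingSource("{\"type\":\"string\"}", null);
        SchemaSource cached = new CachingSchemaSource(raw);
        cached.getTargetSchema();
        cached.getTargetSchema();
        System.out.println("target fetches: " + raw.targetFetches);
        System.out.println("resolved: " + resolveTargetSchema(cached, "previous-schema"));
    }
}
```

In this sketch the wrapped source is queried once no matter how many callers ask for the schema, and an empty batch resolves to the previously committed schema instead of null. Whether the actual fix caches per sync round or per provider instance is not stated in the issue.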