Timothy Brown created HUDI-7237:
-----------------------------------

             Summary: Minor Improvements to Schema Handling in Delta Sync
                 Key: HUDI-7237
                 URL: https://issues.apache.org/jira/browse/HUDI-7237
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: Timothy Brown


There are two minor items that we have run into while running DeltaStreamer in 
production:
1. The schema is fetched more often than necessary, which can put unnecessary 
load on schema providers or increase file system reads.

2. SchemaProviders that return a null target schema on empty batches cause null 
schema values to be written to commits, leading to unexpected issues later.
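
The two items above can be illustrated with a minimal sketch. It is written in Python for brevity and is not Hudi's actual SchemaProvider API or the fix proposed for this ticket: the class name `CachingSchemaProvider`, its methods, and the `fetch_source`/`fetch_target` callables are all hypothetical. It assumes that fetching each schema at most once per sync round is acceptable, and that reusing the last non-null target schema is a reasonable fallback when a batch is empty.

```python
from typing import Callable, Optional


class CachingSchemaProvider:
    """Hypothetical wrapper (not a Hudi class) around two schema fetchers."""

    def __init__(self,
                 fetch_source: Callable[[], str],
                 fetch_target: Callable[[], Optional[str]]) -> None:
        self._fetch_source = fetch_source
        self._fetch_target = fetch_target
        self._source: Optional[str] = None
        self._target_fetched = False
        self._last_good_target: Optional[str] = None

    def source_schema(self) -> str:
        # Item 1: hit the underlying provider only once per sync round.
        if self._source is None:
            self._source = self._fetch_source()
        return self._source

    def target_schema(self) -> Optional[str]:
        # Item 1: fetch at most once per sync round.
        if not self._target_fetched:
            fetched = self._fetch_target()
            self._target_fetched = True
            # Item 2: a null result (e.g. from an empty batch) does not
            # overwrite the last known non-null target schema.
            if fetched is not None:
                self._last_good_target = fetched
        return self._last_good_target

    def refresh(self) -> None:
        # Assumed to be called at the start of each sync round so that
        # genuine schema changes are still picked up.
        self._source = None
        self._target_fetched = False
```

In this sketch, repeated calls to `target_schema()` within one round trigger a single fetch, and a round whose fetch returns `None` still yields the schema from the previous round rather than a null value.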

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
