Smotrov commented on issue #10435: URL: https://github.com/apache/datafusion/issues/10435#issuecomment-2104707611
The issue seems to be with how DataFusion breaks the file into pieces.

```Rust
use datafusion::prelude::{SessionConfig, SessionContext};

// Disable repartitioning of file scans so each file is read as a whole.
let session_config = SessionConfig::new().with_repartition_file_scans(false);
let ctx = SessionContext::new_with_config(session_config);
```

I've found that if repartitioning is disabled, it works flawlessly. So I suspect something is wrong here in the case of ZStd:

https://github.com/apache/datafusion/blob/9f0e0164c73c834260f842f0ee942593707730bd/datafusion/core/src/datasource/physical_plan/json.rs#L164

After splitting the file into 10 slices, it decodes some of them but fails on the others.
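To illustrate the suspicion, here is a minimal sketch of byte-range splitting. It assumes the repartitioned scan divides the file into roughly equal byte ranges; `split_into_ranges` is a hypothetical helper written for this comment, not DataFusion's actual code. The point is that the slice boundaries are chosen from the file length alone, so for a ZStd-compressed file they would land at arbitrary offsets inside the compressed stream, and a decoder started at such an offset generally cannot make sense of the bytes.

```rust
/// Hypothetical helper (not part of DataFusion): split `file_len` bytes into
/// at most `n` contiguous half-open ranges `(start, end)` of roughly equal size.
fn split_into_ranges(file_len: u64, n: u64) -> Vec<(u64, u64)> {
    assert!(n > 0, "need at least one partition");
    // Ceiling division so the ranges cover the whole file.
    let chunk = (file_len + n - 1) / n;
    let mut ranges = Vec::new();
    let mut start = 0;
    while start < file_len {
        // The last range is clamped to the end of the file.
        let end = (start + chunk).min(file_len);
        ranges.push((start, end));
        start = end;
    }
    ranges
}
```

Calling `split_into_ranges(1_000, 10)` gives ten 100-byte ranges. With uncompressed newline-delimited JSON, a reader can recover from such a cut by skipping to the next newline; with a compressed stream there is no comparable resynchronisation point at an arbitrary byte offset, which would be consistent with only some slices decoding.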
