alamb commented on issue #14144: URL: https://github.com/apache/datafusion/issues/14144#issuecomment-2676880600
> [@cj-zhukov](https://github.com/cj-zhukov) [@alamb](https://github.com/alamb) hi, I've just hit this one. judging file content by an extension doesn't feel fully right to me (as you can put there basically anything). yes if there is an content mismatch (expected vs actual) then there will be an undesired result (although I would expect some parsing error or something). nevertheless enforcing untrustworthy extensions doesn't seem to bring much value in that regard. in our case we intentionally omit the extensions completely in the storage and have other ways how the content info gets propagated (to the point where register_xyz gets picked). thanks for the great lib I agree that ensuring there is some way to override the format used to interpret files (e.g. reading a `.dat` file as parquet) is a good feature. @devhprl if there is something preventing you from doing that, please do file an issue with a reproducer and we can help make sure the APIs support your usecase -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
