alamb commented on issue #14144:
URL: https://github.com/apache/datafusion/issues/14144#issuecomment-2676880600

   > [@cj-zhukov](https://github.com/cj-zhukov) 
[@alamb](https://github.com/alamb) hi, I've just hit this one. judging file 
content by an extension doesn't feel fully right to me (as you can put there 
basically anything). yes if there is an content mismatch (expected vs actual) 
then there will be an undesired result (although I would expect some parsing 
error or something). nevertheless enforcing untrustworthy extensions doesn't 
seem to bring much value in that regard. in our case we intentionally omit the 
extensions completely in the storage and have other ways how the content info 
gets propagated (to the point where register_xyz gets picked). thanks for the 
great lib
   
   I agree that ensuring there is some way to override the format used to 
interpret files (e.g. reading a `.dat` file as parquet) is a good feature. 
@devhprl if there is something preventing you from doing that, please do file 
an issue with a reproducer and we can help make sure the APIs support your 
usecase


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to