[issue40841] Provide mimetypes.sniff API as stdlib

Dong-hee Na Tue, 28 Jul 2020 09:35:43 -0700


Dong-hee Na <donghee.n...@gmail.com> added the comment:


> I think that both functions for detecting file type, by name and by content

I think so too, mime sniffing would not be a way to alternate the method based 
on the file extension. Both APIs should be provided.

> should not we add also the code for detecting the text encoding based on 
> other algorithms used in browsers

I already add the code for text encoding detection based on the whatwg standard 
so if this API is landed, yes text encoding detection will be supported.(e.g 
utf-16be)
IMHO, there would be use-cases since today python is used a lot for text data 
handling (for example crawling, data pre-processing) 

There would be the question that the standard for the browser is appropriate 
for the python stdlib module.
My answer is that the whatwg standard could be the one of best standards to 
follow if make the decision to provide mime sniffing.

The standard handle mime types that are widely used in the real world not only 
for browser but also HTTP server or else.

One of the big stress to maintain mime-types detection is that considering how 
many mime-types should be supported.
Luckily, whatwg can be the strong standard to make the decision.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<https://bugs.python.org/issue40841>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue40841] Provide mimetypes.sniff API as stdlib

Reply via email to