On Mon Oct 19 10:36:51 EDT 2009, rogpe...@gmail.com wrote: > 2009/10/19 erik quanstrom <quans...@quanstro.net>: > > why try that hard? just call it utf-8. i can't think of > > any browsers that would have a problem with that today. > > the instance of the problem that i had was when > adding an attachment to a upas mail. > file -m is useful when the attachment might be > binary.
/sys/src/cmd/upas/marshal/marshal.c:/^body already scans the whole file. it could never call something that's not ascii ascii. unfortunately it could be fooled by a bucky bit that's not utf-8, since it doesn't check for valid utf-8. it would be better to at least have a flag to file that tells it to read the whole file and to have file always return the character set to avoid distributing various and sundry hacks about the system. - erik