Re: [XHR2] overrideMimeType

Maciej Stachowiak Sun, 29 Jul 2007 08:26:58 -0700


On Jul 28, 2007, at 11:38 PM, Jonas Sicking wrote:

Maciej Stachowiak wrote:
On Jul 27, 2007, at 12:09 PM, Jonas Sicking wrote:
Anne van Kesteren wrote:
I've been looking at overrideMimeType implementations in Geckoand WebKit and it seems like they differ a bit. In Gecko it hasto be invoked before send(), but in WebKit it would work if youinvoke it just before getting responseXML or responseText.Neither implementation seems to do any input checks.If you have any opinion on how it should be specified I supposenow would be the time to air your thoughts.
Of course I prefer the mozilla way :)
It does seem fairly complicated to allow it to be set after thedownload is finished though. You do have the stream storedin .reponseBody, but at that point all encoding information hasbeen lost. For HTML parsing (which I hope the spec will support inthe future) there are a pile of rules used to guess the encoding,all of which would be useful to use, but can't be used if all youhave access to is the unencoded responseBody.
Why would the encoding information be lost? The only sources ofencoding info are the responseText itself and http headers, both ofwhich the XMLHttpResponse needs to provide anyway.
ResponseText is not the raw byte stream gotten off the wire, it isalready decoded into utf16 using whatever algorithm we define fordetermining the encoding. HTML decoding is a lot more complicatedsince you have to first guess an encoding, then start to parse thedocument, but if you find a
<meta http-equiv="Content-Type" content="text/html; charset=?">
Where charset is different from what you guessed, you have torestart from the beginning using the charset defined in the meta tag.
Yes, it would definitely be possible for the implementation to keeparound the raw byte stream and either lazily decode responseText, orkeep both the utf16 responseText and the raw byte stream around.

A third possibility is to remember what encoding you used whendecoding and turn the UTF-16 back into the original bytes, though Isuppose that wouldn't work if you hit encoding errors originally.

It is a bit quirky behavior though since setting overrideMimeTypecould then change the encoding and therefor both responseXML andresponseText.

If XHR2 offers responseBody with a raw byte array of some kind, itwill be required for implementations to keep the raw bytes aroundanyway.


Regards,
Maciej

Re: [XHR2] overrideMimeType

Reply via email to