Re: unicode URIs

Judson Valeski Mon, 14 May 2001 11:57:37 -0700
bringing this back onto the newsgroup and cc'ing rick.

Andreas Otte wrote:

>Hi!
>
>Judson Valeski wrote:
>
>>We decided on the following proposal. dougt, rpotts, chak, dmose, gagan, valeski, 
>and nhotta attended the meeting.
>>
>>URI's would accept, and store, only UTF8 encoded strings. Protocols not able to 
>handle UTF8 (HTTP for example), would access the charset attribute (proposed) off of 
>nsIURI to convert back to the original string. The charset would be set by the URI 
>creator as they have the best charset context. Is nsIURI the right
>>place for the charset attribute?
>>
>
>I think it is. Also get away with the char representation of the uri
>components. Use strings instead.
> 
>
>>The current ASCII % encoding would be removed from the internal URI representation. 
>Again, this encoding would be pushed out to the protocol level.
>>
>
>So we will have a two levels of %-enconding? I don't think the
>%-encoding can be removed completly. The first level applies to all URIs
>and masks reserved chars as the current stuff does. On a second level
>non ascii chars can be encoded as the protocol needs it.
>
As I understood it, there would be no % encoding at all unless a 
protocol (say HTTP) needed it, in which case, it would pull the UTF8 
data out of the uri object, and % encoding it on it's own.

Jud
Re: unicode URIs

Reply via email to