To comment on the following update, log in, then open the issue:
http://www.openoffice.org/issues/show_bug.cgi?id=74002
                 Issue #|74002
                 Summary|%xx notation in a hyperlink does not work properly
               Component|framework
                 Version|OOo 2.1
                Platform|All
                     URL|
              OS/Version|All
                  Status|NEW
       Status whiteboard|
                Keywords|
              Resolution|
              Issue type|DEFECT
                Priority|P3
            Subcomponent|code
             Assigned to|tm
             Reported by|tora





------- Additional comments from [EMAIL PROTECTED] Mon Jan 29 23:26:46 -0800 2007 -------
Problem
A hyperlink does not work under certain circumstances.
If the URL of a hyperlink contains characters in %xx and/or &#xx; notation 
that are encoded in something other than UTF-8, unexpected garbage characters 
may appear.

Example
Look at the following URL encoded in EUC-JP:
http://xxx/%A5%BB%A5%F3%A5%C8%A1%A6%A5%D8%A5%EC%A5%F3%A5%BA%BB%B3 

There are 9 Japanese characters in it:
%A5%BB
%A5%F3
%A5%C8
%A1%A6
%A5%D8
%A5%EC
%A5%F3
%A5%BA
%BB%B3 
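
As a cross-check, the escapes can be decoded as EUC-JP with Python's standard
library. This is only an illustrative sketch: the variable names are
assumptions, and the host part is the placeholder used above.

```python
from urllib.parse import unquote

url = "http://xxx/%A5%BB%A5%F3%A5%C8%A1%A6%A5%D8%A5%EC%A5%F3%A5%BA%BB%B3"

# Take only the last path segment, which carries the escaped text.
path = url.rsplit("/", 1)[1]

# Interpret the escaped bytes as EUC-JP; errors="strict" raises on bad input
# instead of silently inserting replacement characters.
decoded = unquote(path, encoding="euc_jp", errors="strict")

print(decoded)
print(len(decoded))  # number of characters after decoding
```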

According to Section 3.9, "Unicode Encoding Forms" [1], the following byte 
sequences in that URL can be mistaken for well-formed UTF-8:
%C8%A1
%D8%A5
%F3%A5%BA%BB
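
The scan below is a rough sketch of how such accidental matches can be found.
It walks the escaped bytes and reports every multi-byte run that a strict
UTF-8 decoder accepts as exactly one character. The helper names are
hypothetical, and this is not the code OOo actually uses.

```python
from urllib.parse import unquote_to_bytes

ESCAPED = "%A5%BB%A5%F3%A5%C8%A1%A6%A5%D8%A5%EC%A5%F3%A5%BA%BB%B3"


def decodes_to_one_char(chunk: bytes) -> bool:
    """True if chunk is a single well-formed UTF-8 sequence."""
    try:
        return len(chunk.decode("utf-8", errors="strict")) == 1
    except UnicodeDecodeError:
        return False


def wellformed_utf8_runs(data: bytes):
    """Return (offset, bytes) for each multi-byte run that passes as UTF-8."""
    runs = []
    i = 0
    while i < len(data):
        # Try the longest possible sequence first (4, then 3, then 2 bytes).
        for length in (4, 3, 2):
            chunk = data[i:i + length]
            if len(chunk) == length and decodes_to_one_char(chunk):
                runs.append((i, chunk))
                i += length
                break
        else:
            # No multi-byte sequence starts here; move on by one byte.
            i += 1
    return runs


for offset, chunk in wellformed_utf8_runs(unquote_to_bytes(ESCAPED)):
    print(offset, "".join("%%%02X" % b for b in chunk))
```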

In the attached example file, the sequences mentioned above are forcibly 
converted into characters.

One possible solution:
 1. Try to convert the string in %xx and/or &#xx; notation into UCS-2.
 2a. If a conversion error occurs, leave the string untouched.
 2b. If the conversion into UCS-2 succeeds, substitute the converted string 
     for the original.

For the example above, the first byte, %A5, is not a valid leading byte in 
UTF-8 (it can only appear as a continuation byte), so the string cannot be 
converted into UCS-2. Therefore the string, i.e. the whole URL, should remain 
in its original form.
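
A minimal sketch of this all-or-nothing approach, assuming Python rather than
the OOo code base. The function name is hypothetical, only %xx escapes are
handled (not &#xx;), and the result is a Python str rather than a UCS-2
buffer.

```python
from urllib.parse import unquote_to_bytes


def decode_percent_escapes(url: str) -> str:
    """Decode %xx escapes only if the whole URL is well-formed UTF-8."""
    raw = unquote_to_bytes(url)
    try:
        # Step 1: attempt a strict conversion of the whole string.
        return raw.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        # Step 2a: conversion failed, so leave the URL untouched.
        return url


# EUC-JP escapes: %A5 cannot start a UTF-8 sequence, so nothing is converted.
euc_jp_url = "http://xxx/%A5%BB%A5%F3%A5%C8%A1%A6%A5%D8%A5%EC%A5%F3%A5%BA%BB%B3"
print(decode_percent_escapes(euc_jp_url) == euc_jp_url)

# UTF-8 escapes: the conversion succeeds and the decoded text is substituted.
utf8_url = "http://xxx/%E5%B1%B1"
print(decode_percent_escapes(utf8_url))
```

Decoding the string as a whole, instead of sequence by sequence, is what
prevents runs such as %C8%A1 from being converted in isolation.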

Acknowledgements
toumatsu first reported this phenomenon at the OOo FAQ site [2], and then 
kimotomasaya and M.Kamataki confirmed it.

[1] http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf#G7404 
[2] http://oooug.jp/faq/index.php?faq%2F4%2F227

---------------------------------------------------------------------
Please do not reply to this automatically generated notification from
Issue Tracker. Please log onto the website and enter your comments.
http://qa.openoffice.org/issue_handling/project_issues.html#notification

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

