DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=11960>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=11960

Apache default config doesn't include uk langauge in mod-negotitaion, and has 
some charset issues.

           Summary: Apache default config doesn't include uk langauge in
                    mod-negotitaion, and has some charset issues.
           Product: Apache httpd-2.0
           Version: HEAD
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: Major
          Priority: Other
         Component: Runtime Config
        AssignedTo: bugs@httpd.apache.org
        ReportedBy: [EMAIL PROTECTED]


I've posted a patch to dev@ and got no single responce in 4 days. Therefore I
post it here.

I've discovered several problems regarding charsets and langauges setup in 
apache
default config. It also appears in apache-1.3-HEAD, I'm not sure if I should
create bugreport there as well.

1) httpd-2.0//docs/conf/httpd-std.conf.in doesn't include "uk" langauge 
definition.

patch:
--- httpd-2.0/docs/conf/httpd-std.conf.in       15 Jul 2002 20:17:25 -0000      
1.13
+++ httpd-2.0/docs/conf/httpd-std.conf.in       23 Aug 2002 07:35:33 -0000
@@ -692,7 +692,8 @@
 # Note 2: The example entries below illustrate that in some cases 
 # the two character 'Language' abbreviation is not identical to 
 # the two character 'Country' code for its country,
-# E.g. 'Danmark/dk' versus 'Danish/da'.
+# E.g. 'Danmark/dk' versus 'Danish/da' or 'Ukraine/ua' versus
+# 'Ukrainian/uk' (the latter is, sometimes, a source of confusion).
 #
 # Note 3: In the case of 'ltz' we violate the RFC by using a three char
 # specifier. There is 'work in progress' to fix this and get
@@ -704,7 +705,7 @@
 # Portugese (pt) - Luxembourgeois* (ltz)
 # Spanish (es) - Swedish (sv) - Catalan (ca) - Czech(cz)
 # Polish (pl) - Brazilian Portuguese (pt-br) - Japanese (ja)
-# Russian (ru) - Croatian (hr)
+# Russian (ru) - Croatian (hr) - Ukrainian (uk)
 #
 AddLanguage da .dk
 AddLanguage nl .nl
@@ -731,6 +732,7 @@
 AddLanguage tw .tw
 AddLanguage zh-tw .tw
 AddLanguage hr .hr
+AddLanguage uk .uk
 
 #
 # LanguagePriority allows you to give precedence to some lanuages


2) Language priority doesn't include all languages.
I dare not to change it, but perhaps it should correcpont to "AddLanguage"
order. At least "uk" should be added there.

3) URL ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets is now
invalid. therefore following change is requred on lines 769 and 806.
-# See ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets for
+# See http://www.iana.org/assignments/character-sets for

4) KOI8-U charset is not defined, although it's widely used, and even has a RFC.
also utf-8 and utf-7 are defined twice -- in upper and lower case.
it also changes .ua binding to KOI8-U, which I guess is the right thing to be.
patch (based on http://www.iana.org/assignments/character-sets information):
@@ -780,26 +782,31 @@
 AddCharset ISO-2022-KR .iso2022-kr .kis
 AddCharset ISO-2022-CN .iso2022-cn .cis
 AddCharset Big5        .Big5       .big5
-# For russian, more than one charset is used (depends on client, mostly):
+# For Russian, more than one charset is used (depends on client, mostly):
 AddCharset WINDOWS-1251 .cp-1251   .win-1251
 AddCharset CP866       .cp866
-AddCharset KOI8-r      .koi8-r .koi8-ru
-AddCharset KOI8-ru     .koi8-uk .ua
+AddCharset KOI8-r      .koi8-r
+# both Russian and Ukrainian (probably other cyrillic-based languages)
+AddCharset KOI8-ru     .koi8-uk  .koi8-ru
+# widely-used Ukrainian encoding, RFC2319
+AddCharset KOI8-U      .koi8-u .ua
+
+# Unicode
 AddCharset ISO-10646-UCS-2 .ucs2
 AddCharset ISO-10646-UCS-4 .ucs4
-AddCharset UTF-8       .utf8
+AddCharset UTF-7        .utf7
+AddCharset UTF-8        .utf8
+AddCharset UTF-16       .utf16
 
 # The set below does not map to a specific (iso) standard
 # but works on a fairly wide range of browsers. Note that
 # capitalization actually matters (it should not, but it
 # does for some browsers).
 #
-# See ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets
+# See http://www.iana.org/assignments/character-sets
 # for a list of sorts. But browsers support few.
 #
 AddCharset GB2312      .gb2312 .gb 
-AddCharset utf-7       .utf7
-AddCharset utf-8       .utf8
 AddCharset big5        .big5 .b5
 AddCharset EUC-TW      .euc-tw
 AddCharset EUC-JP      .euc-jp

If you have suggestion or just think that this patch is wrong, please contact 
me.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to