We have encountered similar problems with applications that are not
i18n-aware, especially with interoperability between applications that use
different encodings.
I don't know much about the ICQ protocol; maybe someone could point out
whether it sends encoding information. Working on the assumption that ICQ
does not, messages would of course be treated as iso8859-1 (or rather, most
apps will not even care; these days we're luckier because programmers no
longer strip the 8th bit of a character).
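
To make the failure concrete, here is a minimal Python sketch of what
happens when cp1251 bytes are read as iso8859-1. It is only an illustration
under my assumption above that no encoding information travels with the
message; the sample word and all variable names are made up and have nothing
to do with the actual ICQ transport code.

```python
# Hypothetical scenario: a Windows ICQ user types a Russian word and the
# client puts cp1251 bytes on the wire with no encoding label attached.
original = "Привет"
wire_bytes = original.encode("cp1251")

# A receiver that assumes iso8859-1 never fails (every byte maps to some
# character), but the text it produces is wrong.
misread = wire_bytes.decode("iso8859-1")

# With the sender's real encoding the text is recovered, and from Unicode
# it can be re-encoded for a koi8-r recipient.
recovered = wire_bytes.decode("cp1251")
koi8_bytes = recovered.encode("koi8-r")
```

The point is that the iso8859-1 guess is silent: nothing raises an error, the
bytes survive untouched, and only the interpretation is wrong.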
Hackee brought up an interesting workaround: allow users to select the
encoding at the application level, since it can't be done at the protocol
level when the encoding information is missing. However, I disagree with the
suggestion to "right click on the ICQ agent, and set preferred encoding". It
should be done at the individual message level: when viewing or composing a
message, you select the encoding that the message should be viewed in or
converted to for sending.
This is similar to what browsers and email clients do. When an HTML page
does not declare its encoding, IE uses a default encoding, but the user can
override it through the "View->Encoding" menu.
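
A rough Python sketch of the per-message selection, with a default that the
user can override much like the browser menu. The function names and the
DEFAULT_ENCODING constant are hypothetical, invented for this example; no
existing client or transport is assumed to expose them.

```python
# Assumed fallback when the user has not picked an encoding for a message.
DEFAULT_ENCODING = "iso8859-1"

def view_message(raw: bytes, encoding: str = DEFAULT_ENCODING) -> str:
    """Decode an incoming legacy message with the encoding chosen for it."""
    # errors="replace" keeps the message viewable after a wrong choice,
    # so the user can simply pick another encoding and look again.
    return raw.decode(encoding, errors="replace")

def compose_message(text: str, encoding: str = DEFAULT_ENCODING) -> bytes:
    """Encode an outgoing message into the encoding chosen for it."""
    # Characters the target encoding cannot represent become "?".
    return text.encode(encoding, errors="replace")
```

For example, view_message(raw, "cp1251") would show a message from a Windows
user correctly, while compose_message(text, "koi8-r") would prepare a reply
for a recipient whose client expects koi8-r.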
wil
----- Original Message -----
From: "Eugene Tyurkin" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Sunday, April 08, 2001 4:39 PM
Subject: [JDEV] Non english messages problem (solution)
> Hello, everybody. Here's a log of my conversation about the subject in
> [EMAIL PROTECTED]. I want to know what YOU think about it.
>
> ----------------------------------------------------------------------
> 08 Apr 2001 14:39:53 Hackee Hmm... Maybe someone know who can help me
> (actually not only me but everyone who doesn't use english as primary
> language)
> 08 Apr 2001 14:40:09 uberjay what's the question?
> 08 Apr 2001 14:42:10 Hackee The question is: when you encode something to
> unicode you must say what charset do you use for correct decoding, am I
> right?
> 08 Apr 2001 14:43:24 uberjay I don't think so
> 08 Apr 2001 14:44:28 uberjay The nice thing about unicode is you can mix
> languages in a single file
> 08 Apr 2001 14:45:16 Hackee Well, I use koi8-r charset, my friend using
> cp1251. Jabber use unicode. What should I do?
> 08 Apr 2001 14:46:29 uberjay well, with jabber I would think it would
> encode everything in unicode, and decode everything in unicode so it
> wouldn't be as much of a problem...but maybe I'm missing something
> 08 Apr 2001 14:46:43 uberjay I guess you'd need some client support...
> 08 Apr 2001 14:47:18 uberjay are you running linux?
> 08 Apr 2001 14:47:39 Hackee The beauty of unicode is that when I correctly
> encode message in cp1251 to unicode, I can easily decode it to koi8-r from
> unicode and see what the message was about in my native charset
> 08 Apr 2001 14:47:46 Hackee Yes I'm using linux
> 08 Apr 2001 14:48:20 uberjay right...ok, I'm not familiar with
> cp1251...what is it?
> 08 Apr 2001 14:48:46 Hackee It's a windows russian charset
> 08 Apr 2001 14:49:01 Hackee And koi8-r is a linux russian charset
> 08 Apr 2001 14:49:51 uberjay ok
> 08 Apr 2001 14:50:23 uberjay and you want to be able to use jabber linux
> <-> windows while speaking in russian?
> 08 Apr 2001 14:50:32 Hackee I use gabber to communicate with my friends
> who use ICQ
> 08 Apr 2001 14:50:50 Hackee Sure I want :)
> 08 Apr 2001 14:51:08 uberjay heh, ok. maybe I'm in over my head. :) Does
> ICQ encode in Unicode?
> 08 Apr 2001 14:51:40 Hackee Nope. But icq agent from jabber project does
> 08 Apr 2001 14:51:59 uberjay ah, that's cool
> 08 Apr 2001 14:52:04 Hackee 8)
> 08 Apr 2001 14:52:31 Hackee But it doesn't care what charset is coming to
> it
> 08 Apr 2001 14:52:57 Hackee It's just using latin1, which doesn't support
> russian
> 08 Apr 2001 14:53:09 uberjay ick
> 08 Apr 2001 14:53:19 uberjay so there's a couple problems
> 08 Apr 2001 14:53:50 Hackee That's what I'm talking about
> 08 Apr 2001 14:54:19 uberjay one of them might be that It's hard to tell
> what charset an ICQ message is coming in with, so the transport doesn't
> know how to translate it to unicode
> 08 Apr 2001 14:54:57 Hackee Yes!
> 08 Apr 2001 14:55:08 uberjay the other is, when your outgoing message
> (encoded in unicode) goes from gabber to the icq transport, how does it
> know to translate it from unicode to cp1251?
> 08 Apr 2001 14:55:29 uberjay this sounds difficult...perhaps you should
> get him to run jabber :)
> 08 Apr 2001 14:55:57 Hackee No how. Other side can't read anything from me
> in russian
> 08 Apr 2001 14:56:09 Hackee Aha, and speak only english
> 08 Apr 2001 14:56:27 Hackee It's not a solution...
> 08 Apr 2001 14:56:31 uberjay hmm
> 08 Apr 2001 14:57:04 uberjay unless ICQ has some way to tell each other
> what character encoding the user is using, I don't see an easy way to fix
> that...
> 08 Apr 2001 14:57:44 Hackee The easiest way is to tell the agent a
> preferred encoding; for example, I know that messages coming to me from
> ICQ will always be in cp1251
> 08 Apr 2001 14:57:56 uberjay ok, that works
> 08 Apr 2001 14:58:15 Hackee So messages can be coded and decoded correctly
> 08 Apr 2001 14:58:24 uberjay is that possible with gabber though?
> (currently)
> 08 Apr 2001 14:58:50 Hackee Nope. Just because it's a server side problem
> 08 Apr 2001 14:59:07 uberjay right, so you'd have to run a modified
> version of icq-t
> 08 Apr 2001 14:59:45 Hackee Sure, but I can't use my own server for
> communicating with other
> 08 Apr 2001 15:00:25 uberjay you can run your own jabberd...
> 08 Apr 2001 15:01:21 Hackee no, I can't. I just can use client, not
> server. Anyway it must be done for everybody not just for me, am I right?
> ;)
> 08 Apr 2001 15:01:41 uberjay right :)
> 08 Apr 2001 15:02:51 uberjay what would be nice is if you could
> right-click on the ICQ agent in gabber and select a default coding system,
> and have that preference control what the icq transport did server side
> 08 Apr 2001 15:03:21 Hackee Yes! But this must be done in protocol layer
> so jabber will be used by everyone, not only by english speaking people
> 08 Apr 2001 15:03:58 uberjay I don't know enough (well, anything) about
> the jabber protocol to say if this is supported, but it needs to be. of
> course, if everyone just uses jabber, then everything can be unicode, then
> we have no problems :)
> 08 Apr 2001 15:05:02 Hackee Sure, the only problem is to tell the same
> what we speaking about to jabber server developers
> 08 Apr 2001 15:05:12 Hackee 8)
> 08 Apr 2001 15:05:23 uberjay heh, yeah
> 08 Apr 2001 15:05:33 uberjay btw, in case you haven't seen this:
> ftp://ftp.ilog.fr/pub/Users/haible/utf8/Unicode-HOWTO.html
> 08 Apr 2001 15:05:38 uberjay linux unicode howto
> 08 Apr 2001 15:05:46 uberjay it has some useful information
> 08 Apr 2001 15:06:47 uberjay UTF-8 seems to be the most used right now
> 08 Apr 2001 15:07:02 Hackee I know a little about unicode, 8) I use it to
> write crossplatform applications that supports russian interface in both
> sides - linux and windows
> 08 Apr 2001 15:07:17 uberjay well then
> 08 Apr 2001 15:07:35 uberjay that's cool
> 08 Apr 2001 15:07:52 Hackee Thanks ;)
> 08 Apr 2001 15:08:20 uberjay I really need to sleep...good luck. maybe you
> should send an email to the jabber development list?
> 08 Apr 2001 15:08:31 uberjay they might have some more insight into the
> problem
> 08 Apr 2001 15:08:57 Hackee I'll try. Thanks for conversation. Good night
> (here is 15:08 already ;)
> 08 Apr 2001 15:09:54 uberjay hehe, I'm at 03:08 right now
> 08 Apr 2001 15:10:09 Hackee You're late ;)
> 08 Apr 2001 15:10:15 uberjay yes :)
> 08 Apr 2001 15:11:02 uberjay has left
> --------------------------------------------------------------
>
> That's it. Sorry for the long letter, but I guess it makes the subject
> easier to understand.
>
>
> _______________________________________________
> jdev mailing list
> [EMAIL PROTECTED]
> http://mailman.jabber.org/listinfo/jdev
>