On Dec 03, Piotr Strzyzewski via db-wg <db-wg@ripe.net> wrote:

> As the UTF-8 topic was briefly discussed during DB-WG session at RIPE87
> in Rome, I would like to propose moving forward with it. If that means a
> topic for first (?) interim meeting, let it be. Let me know please if
> this works for you. Thanks in advance.

In Rome I talked a bit with Edward about this.
Background: I am the author of the whois client used by all Linux 
distributions.

I fully agree that switching to UTF-8 is desirable, but we cannot just 
change the encoding of port 43 without major side effects.
Since version 5.5.4 (December 2019), the client assumes that the output
of whois.ripe.net is Latin 1 and then transcodes it to the system
encoding.
Receiving unexpected UTF-8 would cause mojibake.
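
To illustrate the failure mode, here is a minimal Python sketch of what
happens when a client that assumes Latin 1 receives UTF-8 bytes. The
sample string is made up for the example, and this is not the code of
the whois client itself.

```python
# Hypothetical sample value; any non-ASCII text behaves the same way.
original = "Zürich"

# What a server that has switched to UTF-8 would put on the wire.
wire_bytes = original.encode("utf-8")

# What a client that still assumes Latin 1 makes of those bytes:
# each byte of the two-byte UTF-8 sequence becomes its own character.
misread = wire_bytes.decode("iso-8859-1")

# What a client that knows the real encoding gets back.
correct = wire_bytes.decode("utf-8")

print(misread)
print(correct)
```

Decoding as Latin 1 never fails, since every byte value maps to some
character, so the client has no way to notice the mistake.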

My suggestion is to add a new query "command line" option to specify the 
desired encoding (limiting it to either ISO-8859-1 or UTF-8), as 
supported by other whois servers.
-C is the most common choice, but maybe it would be better to use
--charset, so as not to use up a single-letter option.
See https://github.com/rfc1036/whois/blob/next/servers_charset_list .
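
As a rough sketch of how a client could use such an option, assuming
the server accepted "--charset NAME" in front of the query: the
functions build_query and decode_reply below are hypothetical names
made up for this example, and the option itself is only the proposal
above, not something whois.ripe.net is known to support. Nothing is
sent over the network here.

```python
# The two encodings the proposal would allow.
ALLOWED_CHARSETS = {"ISO-8859-1", "UTF-8"}

# Encoding assumed when no charset is requested (today's behaviour).
DEFAULT_CHARSET = "ISO-8859-1"


def build_query(term, charset=None):
    """Return the bytes of one whois request line.

    With charset=None the query is sent unchanged, so an old server
    sees nothing new. The "--charset" flag is the hypothetical option.
    """
    if charset is None:
        line = term
    elif charset in ALLOWED_CHARSETS:
        line = f"--charset {charset} {term}"
    else:
        raise ValueError(f"unsupported charset: {charset}")
    # A whois request is a single line terminated by CR LF (RFC 3912).
    # This sketch only handles ASCII query terms.
    return line.encode("ascii") + b"\r\n"


def decode_reply(raw, charset=None):
    """Decode a reply using the charset that was requested."""
    return raw.decode(charset or DEFAULT_CHARSET)


print(build_query("AS3333", "UTF-8"))
print(build_query("AS3333"))
```

Because the encoding is requested explicitly, clients that do not send
the option keep receiving Latin 1 and keep working as before.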

Then, in a few years, it will be much easier to switch the default from
Latin 1 to UTF-8.

-- 
ciao,
Marco
