Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-20 Thread Osamu Aoki
Hi,

 On Saturday 02 Oct 2004 12:50:25 +0200, Osamu Aoki wrote:
  Who said Debian has moved completely to UTF-8 ?
   man, aptitude
I thought these are not UTF ready.  See BTS.
  
  I wish these were.  So if you do, you need multiple locales.
  
  I suggest creating custom meny for xterm started with en_US.ISO-8859-1.

I retract this comment.  It looks like UTF-8 issues has been resolved as
I used to experience.  I can see man pages in UTF-8 for English OK as a
quick look.  Even ja_JP.UTF-8 looks good.

But there may be monir issues left for man.

aptitude now has no issues with CJK with UTF-8 since it was disabled and
forced to use english.

Oh, you may want to check pager with update-alternatives.  

pager - status is manual.
 link currently points to /usr/bin/lv
/bin/more - priority 50
/usr/bin/less - priority 77
 slave pager.1.gz: /usr/share/man/man1/less.1.gz
/usr/bin/w3m - priority 25
 slave pager.1.gz: /usr/share/man/man1/w3m.1.gz
/usr/bin/lv - priority 80
 slave pager.1.gz: /usr/share/man/man1/lv.1.gz
Current `best' version is /usr/bin/lv.

Osamu


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Vincent Lefevre
On 2004-09-30 10:18:45 -0500, Frederick B. Henry Jr. wrote:
 On Thursday 30 Sep 2004 16:39:20 +0200, Vincent Lefevre wrote:
  It may be a problem with your terminal. I've attached a small file
  containing characters in UTF-8. Could you save it and cat it in your
  terminal to see if there are problems?
 
 In aterm, it has problems; in (u)xterm, none.  So perhaps it is a problem
 with my aterm setup.  In aterm, TERM is set to rxvt; in (u)xterm TERM is
 set to xterm.

Yes, there is a problem with your aterm setup (or perhaps aterm
doesn't support UTF-8, I don't know). So, just do your tests with
(u)xterm to solve the problems with man.

 And despite the fact that the file you attached displays fine in
 (u)xterm, I still get the funny chars in man pages.

Does my file display fine with the pager you're using with man?
(Warning if you're using less, as some versions have problems
with UTF-8.)

-- 
Vincent Lefèvre [EMAIL PROTECTED] - Web: http://www.vinc17.org/
100% accessible validated (X)HTML - Blog: http://www.vinc17.org/blog/
Work: CR INRIA - computer arithmetic / SPACES project at LORIA


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Vincent Lefevre
On 2004-10-01 22:52:49 -0500, Frederick B. Henry Jr. wrote:
 I note some differences with your output...just not sure how to
 interpret them. :(

It seems to be OK. You have the ASCII hyphen, and the correct UTF-8
sequence for the copyright symbol. If you do man cat and look at
the COPYRIGHT section at the end, can you see the copyright symbol
between Copyright and 2004? If not, this is probably a problem
with your pager.

-- 
Vincent Lefèvre [EMAIL PROTECTED] - Web: http://www.vinc17.org/
100% accessible validated (X)HTML - Blog: http://www.vinc17.org/blog/
Work: CR INRIA - computer arithmetic / SPACES project at LORIA


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Martin Dickopp
[EMAIL PROTECTED] (Frederick B. Henry Jr.) writes:

 I note some differences with your output...just not sure how to
 interpret them. :(

They are likely not to be significant; you just seem to have a different
version of the man page.

If you type

PAGER=cat man cat

do you see a coypright symbol in the Copyright (C) 2004 Free Software
Foundation, Inc. line (i.e., in the place where I have written (C)
here)? If you type

printf \\302\\251\\n

do you see a copyright symbol? If you type

printf \\342\\200\\220\\n

do you see a small hyphen (i.e. something somewhat smaller than what you
get when you hit the - key)?

If your answer to the second and third question is yes, try to pipe the
output of each command to a pager, e.g.

   printf \\342\\200\\220\\n | less

In place of less, try each pager which you also tried with man.

Martin


-- 
   ,--.  ,= ,-_-. =.
  / ,- )Martin Dickopp, Dresden, Germany((_/)o o(\_))
  \ `-'http://www.zero-based.org/`-'(. .)`-'
   `-.   \_/


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Osamu Aoki
Hi,

On Thu, Sep 30, 2004 at 12:03:49AM -0500, Frederick B. Henry Jr. wrote:
 Greetings, 
 
 Context: Debian unstable. I have the ucs fonts package installed, as well
 as several classical polytonic Greek fonts, which render very well in
 Mozilla Firefox.  I can start an xterm thusly: 
...
 Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:

Who said Debian has moved completely to UTF-8 ?
 man, aptitude
  I thought these are not UTF ready.  See BTS.

I wish these were.  So if you do, you need multiple locales.

I suggest creating custom meny for xterm started with en_US.ISO-8859-1.

More on it in my Debian Reference around 9.7.9 Example for a
multilingual X window system:
   http://www.nl.debian.org/doc/manuals/reference/ch-tune.en.html#s-x-cjk
Or get it from testing debian-reference-en package and follow menu under
Help (Debian menu).

Cheers,

Osamu
PS: I am thinking UTF-8 transition helper script ...


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Frederick B. Henry Jr.
On Saturday 02 Oct 2004 12:43:08 +0200, Martin Dickopp wrote:

 If you type
 
 PAGER=cat man cat
 
 do you see a coypright symbol in the Copyright (C) 2004 Free Software
 Foundation, Inc. line (i.e., in the place where I have written (C)
 here)? If you type
 
 printf \\302\\251\\n
 
 do you see a copyright symbol? If you type
 
 printf \\342\\200\\220\\n
 
 do you see a small hyphen (i.e. something somewhat smaller than what you
 get when you hit the - key)?
 
 If your answer to the second and third question is yes, try to pipe the
 output of each command to a pager, e.g.
 
printf \\342\\200\\220\\n | less
 
 In place of less, try each pager which you also tried with man.

If I use the above commands in uxterm, all is well.  I see the copyright
symbol and the small hyphen.  Of course, aterm is a no go, but that is not
a surprise.  

So I tried the following pagers: cat, less, more, most.  (I was enjoying
most for the colorization)

It ultimately does seem to be a PAGER problem, since with every pager but
most, when used in a uxterm, the man pages render perfectly well with no
odd symbols.  It seems the culprit was most.   

As a final minor issue, not critical for me by any means, when I use a
working pager like less to view the UTF-8-demon.txt file, the only text
that gives a problem (boxes) is Amharic Ethiopian.  

Thank you, as well as the other respondents, for all your patient
assistance.

Cheers,

Fred Henry, Jr.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Frederick B. Henry Jr.
On Saturday 02 Oct 2004 10:52:28 +0200, Vincent Lefevre wrote:

 On 2004-10-01 22:52:49 -0500, Frederick B. Henry Jr. wrote:
  I note some differences with your output...just not sure how to
  interpret them. :(
 
 It seems to be OK. You have the ASCII hyphen, and the correct UTF-8
 sequence for the copyright symbol. If you do man cat and look at
 the COPYRIGHT section at the end, can you see the copyright symbol
 between Copyright and 2004? If not, this is probably a problem
 with your pager.


Yes, I'm pretty certain now that the issue was the most pager.

Cheers,

Fred Henry, Jr.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-02 Thread Martin Dickopp
[EMAIL PROTECTED] (Frederick B. Henry Jr.) writes:
 As a final minor issue, not critical for me by any means, when I use a
 working pager like less to view the UTF-8-demon.txt file, the only text
 that gives a problem (boxes) is Amharic Ethiopian.  

Same here. It seems the X11 fonts do not (yet?) provide glyphs for that
language.

Martin


-- 
   ,--.  ,= ,-_-. =.
  / ,- )Martin Dickopp, Dresden, Germany((_/)o o(\_))
  \ `-'http://www.zero-based.org/`-'(. .)`-'
   `-.   \_/


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-01 Thread Frederick B. Henry Jr.
On Friday 01 Oct 2004 01:52:13 +0200, martin f krafft wrote:

 also sprach Martin Dickopp [EMAIL PROTECTED] [2004.10.01.0135 +0200]:
   On Thursday 30 September 2004 13:03, Frederick B. Henry Jr. wrote:
   Greetings,
   Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
   whenever I use any pager (more, less, most) to read a man page I get
   strange chars, e.g.:
 
 what terminal emulator do you use? do you get the problem if you use
 rxvt-unicode?

The problem occurs regardless of which terminal emulator I use.  I
typically use aterm, but uxterm, unicode rxvt etc does not help.  

Fred Henry, Jr.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-10-01 Thread Frederick B. Henry Jr.
On Friday 01 Oct 2004 01:54:19 +0200, Martin Dickopp wrote:

 [EMAIL PROTECTED] (Frederick B. Henry Jr.) writes:
 
  Context: Debian unstable. [...] Since switching my locale to
  en_US.UTF-8 (dpkg-reconfigure locales),
 
 I think dpkg-reconfigure locales takes of that, but just to be sure,
 could you verify that /etc/locale.gen contains a line
 
   en_US.UTF-8 UTF-8

It does indeed. 

 If you re-run locale-gen as root, does that emit any error messages?

No errors.

  whenever I use any pager (more, less, most) to read a man page I get
  strange chars, e.g.:
 
  man procmailrc (PAGER=/usr/bin/most) yields: 
 
  delivering and [EMAIL PROTECTED] recipes..  
 
  That intercalating pattern -- [EMAIL PROTECTED] -- is throughout the man page.
 
 To find out if it's a problem of man or the pager, use cat as pager.
 In particular, what does
 
   PAGER=cat man cat | hexdump -C | grep ^06

[snip your hexdump] 

Here is the output of your command on my box:

[EMAIL PROTECTED]:~$  PAGER=cat man cat | hexdump -C | grep ^06
Reformatting cat(1), please wait...
0600  20 47 72 61 6e 6c 75 6e  64 20 61 6e 64 20 52 69  | Granlund and Ri|
0610  63 68 61 72 64 20 4d 2e  20 53 74 61 6c 6c 6d 61  |chard M. Stallma|
0620  6e 2e 0a 0a 52 08 52 45  08 45 50 08 50 4f 08 4f  |n...R.RE.EP.PO.O|
0630  52 08 52 54 08 54 49 08  49 4e 08 4e 47 08 47 20  |R.RT.TI.IN.NG.G |
0640  42 08 42 55 08 55 47 08  47 53 08 53 0a 20 20 20  |B.BU.UG.GS.S.   |
0650  20 20 20 20 52 65 70 6f  72 74 20 62 75 67 73 20  |Report bugs |
0660  74 6f 20 3c 62 75 67 2d  63 6f 72 65 75 74 69 6c  |to bug-coreutil|
0670  73 40 67 6e 75 2e 6f 72  67 3e 2e 0a 0a 43 08 43  |[EMAIL PROTECTED]...C.C|
0680  4f 08 4f 50 08 50 59 08  59 52 08 52 49 08 49 47  |O.OP.PY.YR.RI.IG|
0690  08 47 48 08 48 54 08 54  0a 20 20 20 20 20 20 20  |.GH.HT.T.   |
06a0  43 6f 70 79 72 69 67 68  74 20 c2 a9 20 32 30 30  |Copyright .. 200|
06b0  34 20 46 72 65 65 20 53  6f 66 74 77 61 72 65 20  |4 Free Software |
06c0  46 6f 75 6e 64 61 74 69  6f 6e 2c 20 49 6e 63 2e  |Foundation, Inc.|
06d0  0a 20 20 20 20 20 20 20  54 68 69 73 20 69 73 20  |.   This is |
06e0  66 72 65 65 20 73 6f 66  74 77 61 72 65 3b 20 73  |free software; s|
06f0  65 65 20 74 68 65 20 73  6f 75 72 63 65 20 66 6f  |ee the source fo|

I note some differences with your output...just not sure how to interpret them. :(

Fred Henry, Jr.



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Andrea Vettorello
On Thu, 30 Sep 2004 00:03:49 -0500, Frederick B. Henry Jr.
[EMAIL PROTECTED] wrote:
 Greetings,
 
 Context: Debian unstable. I have the ucs fonts package installed, as well
 as several classical polytonic Greek fonts, which render very well in
 Mozilla Firefox.  I can start an xterm thusly:
 
 xterm -u8 -fn \
 '-misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1'
 
 It displays the UTF-8-demo.txt file nicely, with the sole exception of
 Amharic Ethiopian, which is displayed as outline boxes per character.
 
 Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:
 
 man procmailrc (PAGER=/usr/bin/most) yields:
 
 delivering and [EMAIL PROTECTED] recipes..

[...]

Can most (your pager) support UTF8? Have you tried to use less instead?


Andrea



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Frederick B. Henry Jr.
On Thursday 30 Sep 2004 08:43:40 +0200, Andrea Vettorello wrote:

 On Thu, 30 Sep 2004 00:03:49 -0500, Frederick B. Henry Jr.
 [EMAIL PROTECTED] wrote:
  
  Context: Debian unstable. I have the ucs fonts package installed, as well
  as several classical polytonic Greek fonts, which render very well in
  Mozilla Firefox.  I can start an xterm thusly:
  
  xterm -u8 -fn \
  '-misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1'
  
  It displays the UTF-8-demo.txt file nicely, with the sole exception of
  Amharic Ethiopian, which is displayed as outline boxes per character.
  
  Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
  whenever I use any pager (more, less, most) to read a man page I get
  strange chars, e.g.:
  
  man procmailrc (PAGER=/usr/bin/most) yields:
  
  delivering and [EMAIL PROTECTED] recipes..
 
 [...]
 
 Can most (your pager) support UTF8? Have you tried to use less instead?

See above.  I have tried three pagers: more, less, most.  I thought that if
the xterm had the proper and useable unicode font it should not be a
problem.  I have tried those three pagers, and they all yield the same
result.  I have tried uxterm and mlterm as well--same result.  Please
correct me if my font selection is way off.  

Regards,

Fred henry, Jr. 


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Arne Gtje ()
On Thursday 30 September 2004 13:03, Frederick B. Henry Jr. wrote:
 Greetings,
 Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:

AFAIK, 'man' does not support UTF-8. Bad luck. :)

Take a look here when the page is online again.

http://www.jw-stumpel.nl/stestu

Cheers
Arne
-- 
Arne Gtje () [EMAIL PROTECTED] 
(Spam catcher.  Address might change in future!)
PGP/GnuPG key: 1024D/685D1E8C
Fingerprint: 2056 F6B7 DEA8 B478 311F  1C34 6E9F D06E 685D 1E8C
Key available at wwwkeys.pgp.net.   Encrypted e-mail preferred.


pgpJOEsr8X98T.pgp
Description: PGP signature


Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Frederick B. Henry Jr.
On Thursday 30 Sep 2004 15:06:56 +0800, Arne G?tje (?) wrote:

 AFAIK, 'man' does not support UTF-8. Bad luck. :)
 
 Take a look here when the page is online again.
 
 http://www.jw-stumpel.nl/stestu

Thanks Arne,

I'll save the url.  Ironic, given the intent of unicode, that man pages of
all things would be marginalized.  I can still read them...I guess it's a
minor annoyance :)  

fbhjr





-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Vincent Lefevre
On 2004-09-30 00:03:49 -0500, Frederick B. Henry Jr. wrote:
 Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:
 
 man procmailrc (PAGER=/usr/bin/most) yields: 
 
 delivering and [EMAIL PROTECTED] recipes..  

In an UTF-8 sequence, you shouldn't have a ^P; this is strange.
Now, Debian doesn't use the non-ASCII hyphen to make searching in
man pages easier (see /etc/groff/man.local). So, this seems to be
a bug in the procmailrc page (or perhaps the non-ASCII hyphen is
normal in this special context).

On my machine, I don't see any problem, perhaps because I have
uncommented the last two lines of

.  \ Many UTF-8 man pages use - instead of \- for dashes such as those
.  \ in command-line options. This is a bug in those pages, but if you want
.  \ all hyphens to be rendered as the ASCII-compatible HYPHEN-MINUS
.  \ anyway, then uncomment this.
.  \ if '\*[.T]'utf8' \
.  \   char - \N'45'

(more precisely, I copied both files from /etc/groff to $HOME/etc/groff,
uncommented these lines and set GROFF_TMAC_PATH to $HOME/etc/groff).

BTW, I've tried after unsetting GROFF_TMAC_PATH, and this non-ASCII
hyphen (shorter than the ASCII one) displays without any problem (in
a UXTerm, i.e. XTerm with UTF-8 locales and UTF-8 fonts).

 Is there some specific font I should be using? How do I get a clean
 man page in an xterm/aterm/rxvt/mlterm with a UTF-8 locale?

If you use UXTerm, the fonts are already set up:
/etc/X11/app-defaults/XTerm contains

*VT100.utf8Fonts.font2: -misc-fixed-medium-r-normal--8-80-75-75-c-50-iso10646-1
*VT100.utf8Fonts.font:  
-misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1
*VT100.utf8Fonts.font3: -misc-fixed-medium-r-normal--14-130-75-75-c-70-iso10646-1
*VT100.utf8Fonts.font4: -misc-fixed-medium-r-normal--13-120-75-75-c-80-iso10646-1
*VT100.utf8Fonts.font5: -misc-fixed-medium-r-normal--18-120-100-100-c-90-iso10646-1
*VT100.utf8Fonts.font6: -misc-fixed-medium-r-normal--20-200-75-75-c-100-iso10646-1

-- 
Vincent Lefèvre [EMAIL PROTECTED] - Web: http://www.vinc17.org/
100% accessible validated (X)HTML - Blog: http://www.vinc17.org/blog/
Work: CR INRIA - computer arithmetic / SPACES project at LORIA


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Frederick B. Henry Jr.
On Thursday 30 Sep 2004 09:39:50 +0200, Vincent Lefevre wrote:

 In an UTF-8 sequence, you shouldn't have a ^P; this is strange.
 Now, Debian doesn't use the non-ASCII hyphen to make searching in
 man pages easier (see /etc/groff/man.local). So, this seems to be
 a bug in the procmailrc page (or perhaps the non-ASCII hyphen is
 normal in this special context).

 On my machine, I don't see any problem, perhaps because I have
 uncommented the last two lines of
 
 .  \ Many UTF-8 man pages use - instead of \- for dashes such as those
 .  \ in command-line options. This is a bug in those pages, but if you want
 .  \ all hyphens to be rendered as the ASCII-compatible HYPHEN-MINUS
 .  \ anyway, then uncomment this.
 .  \ if '\*[.T]'utf8' \
 .  \   char - \N'45'
 

After uncommenting the above last two lines in both /etc/groff/man.local
and /etc/groff/mdoc.local, I still get the strange characters, which appear
not to be limited to the hyphen issue.  Man netstat, for example, at the
very top of the page:

Linux [EMAIL PROTECTED] Manual 

in aterm as well as uxterm.  My /etc/app-defaults/XTerm has the same font
config lines as yours.  Again, this is in *any* pager and with many man
pages that I have tested.

Another solution which someone was kind enough to send me was to set LC_ALL
to C before every invocation of man, which works, but seems kludgy.  I
suppose if man is ultimately incapable of handling a utf-8 locale then that
may have to be the solution (?).   (BTW, in my locale output the only var
unset is LC_ALL -- all the others are set to en_US.UTF-8)

Cheers,

Fred Henry, Jr.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Vincent Lefevre
On 2004-09-30 08:54:01 -0500, Frederick B. Henry Jr. wrote:
 Another solution which someone was kind enough to send me was to set
 LC_ALL to C before every invocation of man, which works, but
 seems kludgy.

This just deactivates non-ASCII characters, which should be replaced
by ASCII characters.

 I suppose if man is ultimately incapable of handling a utf-8 locale
 then that may have to be the solution (?). (BTW, in my locale output
 the only var unset is LC_ALL -- all the others are set to
 en_US.UTF-8)

It may be a problem with your terminal. I've attached a small file
containing characters in UTF-8. Could you save it and cat it in your
terminal to see if there are problems?

-- 
Vincent Lefèvre [EMAIL PROTECTED] - Web: http://www.vinc17.org/
100% accessible validated (X)HTML - Blog: http://www.vinc17.org/blog/
Work: CR INRIA - computer arithmetic / SPACES project at LORIA
20  !#$%'()*+,-./0123456789:;=?
40 @ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_
60 `abcdefghijklmnopqrstuvwxyz{|}~
A0 
C0 
E0 


Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Frederick B. Henry Jr.
On Thursday 30 Sep 2004 16:39:20 +0200, Vincent Lefevre wrote:

 On 2004-09-30 08:54:01 -0500, Frederick B. Henry Jr. wrote:
  Another solution which someone was kind enough to send me was to set
  LC_ALL to C before every invocation of man, which works, but
  seems kludgy.
 
 This just deactivates non-ASCII characters, which should be replaced
 by ASCII characters.
 
  I suppose if man is ultimately incapable of handling a utf-8 locale
  then that may have to be the solution (?). (BTW, in my locale output
  the only var unset is LC_ALL -- all the others are set to
  en_US.UTF-8)
 
 It may be a problem with your terminal. I've attached a small file
 containing characters in UTF-8. Could you save it and cat it in your
 terminal to see if there are problems?

In aterm, it has problems; in (u)xterm, none.  So perhaps it is a problem
with my aterm setup.  In aterm, TERM is set to rxvt; in (u)xterm TERM is
set to xterm.

And despite the fact that the file you attached displays fine in
(u)xterm, I still get the funny chars in man pages.  So it seems that even
if aterm has some problems with that file, and xterm does not, man pages
are still not displayed perfectly in (u)xterm.  I'm thinking it is less a
terminal issue than a man issue.  As I mentioned before, I can see in my
(u)xterm Greek, Thai, Georgian, Cyrillic and even Runes (only the Amharic
Ethiopian is all boxes).  Seems like unicode is partially working at the
least.   

Thanks again for your help!

Fred Henry, Jr.   



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Martin Dickopp
Arne Gtje [EMAIL PROTECTED] writes:

 On Thursday 30 September 2004 13:03, Frederick B. Henry Jr. wrote:
 Greetings,
 Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:

 AFAIK, 'man' does not support UTF-8. Bad luck. :)

It supports UTF-8 just fine here (Debian unstable).

Martin


-- 
   ,--.  ,= ,-_-. =.
  / ,- )Martin Dickopp, Dresden, Germany((_/)o o(\_))
  \ `-'http://www.zero-based.org/`-'(. .)`-'
   `-.   \_/



Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread martin f krafft
also sprach Martin Dickopp [EMAIL PROTECTED] [2004.10.01.0135 +0200]:
  On Thursday 30 September 2004 13:03, Frederick B. Henry Jr. wrote:
  Greetings,
  Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
  whenever I use any pager (more, less, most) to read a man page I get
  strange chars, e.g.:

what terminal emulator do you use? do you get the problem if you use
rxvt-unicode?

-- 
Please do not CC me when replying to lists; I read them!
 
 .''`. martin f. krafft [EMAIL PROTECTED]
: :'  :proud Debian developer, admin, and user
`. `'`
  `-  Debian - when you have better things to do than fixing a system
 
Invalid/expired PGP subkeys? Use subkeys.pgp.net as keyserver!


signature.asc
Description: Digital signature


Re: UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-30 Thread Martin Dickopp
[EMAIL PROTECTED] (Frederick B. Henry Jr.) writes:

 Context: Debian unstable. [...] Since switching my locale to
 en_US.UTF-8 (dpkg-reconfigure locales),

I think dpkg-reconfigure locales takes of that, but just to be sure,
could you verify that /etc/locale.gen contains a line

  en_US.UTF-8 UTF-8

If you re-run locale-gen as root, does that emit any error messages?

 whenever I use any pager (more, less, most) to read a man page I get
 strange chars, e.g.:

 man procmailrc (PAGER=/usr/bin/most) yields: 

 delivering and [EMAIL PROTECTED] recipes..  

 That intercalating pattern -- [EMAIL PROTECTED] -- is throughout the man page.

To find out if it's a problem of man or the pager, use cat as pager.
In particular, what does

  PAGER=cat man cat | hexdump -C | grep ^06

print? It prints

0600  6f 72 6e 20 47 72 61 6e  6c 75 6e 64 20 61 6e 64  |orn Granlund and|
0610  20 52 69 63 68 61 72 64  20 4d 2e 20 53 74 61 6c  | Richard M. Stal|
0620  6c 6d 61 6e 2e 0a 0a 52  08 52 45 08 45 50 08 50  |lman...R.RE.EP.P|
0630  4f 08 4f 52 08 52 54 08  54 49 08 49 4e 08 4e 47  |O.OR.RT.TI.IN.NG|
0640  08 47 20 42 08 42 55 08  55 47 08 47 53 08 53 0a  |.G B.BU.UG.GS.S.|
0650  20 20 20 20 20 20 20 52  65 70 6f 72 74 20 62 75  |   Report bu|
0660  67 73 20 74 6f 20 3c 62  75 67 e2 80 90 63 6f 72  |gs to bug...cor|
0670  65 75 74 69 6c 73 40 67  6e 75 2e 6f 72 67 3e 2e  |[EMAIL PROTECTED].|
0680  0a 0a 43 08 43 4f 08 4f  50 08 50 59 08 59 52 08  |..C.CO.OP.PY.YR.|
0690  52 49 08 49 47 08 47 48  08 48 54 08 54 0a 20 20  |RI.IG.GH.HT.T.  |
06a0  20 20 20 20 20 43 6f 70  79 72 69 67 68 74 20 c2  | Copyright .|
06b0  a9 20 32 30 30 34 20 46  72 65 65 20 53 6f 66 74  |. 2004 Free Soft|
06c0  77 61 72 65 20 46 6f 75  6e 64 61 74 69 6f 6e 2c  |ware Foundation,|
06d0  20 49 6e 63 2e 0a 20 20  20 20 20 20 20 54 68 69  | Inc..   Thi|
06e0  73 20 69 73 20 66 72 65  65 20 73 6f 66 74 77 61  |s is free softwa|
06f0  72 65 3b 20 73 65 65 20  74 68 65 20 73 6f 75 72  |re; see the sour|

here. The interesting parts are the hyphen in bug-coreutils
(bytes e2 80 90) and the copyright character in Copyright (C) 2004
(bytes c2 a9).

Martin


-- 
   ,--.  ,= ,-_-. =.
  / ,- )Martin Dickopp, Dresden, Germany((_/)o o(\_))
  \ `-'http://www.zero-based.org/`-'(. .)`-'
   `-.   \_/



UTF-8 locale, strange chars in X terminal pager(s), specifically man pages

2004-09-29 Thread Frederick B. Henry Jr.
Greetings, 

Context: Debian unstable. I have the ucs fonts package installed, as well
as several classical polytonic Greek fonts, which render very well in
Mozilla Firefox.  I can start an xterm thusly: 

xterm -u8 -fn \
'-misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1'

It displays the UTF-8-demo.txt file nicely, with the sole exception of
Amharic Ethiopian, which is displayed as outline boxes per character.

Since switching my locale to en_US.UTF-8 (dpkg-reconfigure locales),
whenever I use any pager (more, less, most) to read a man page I get
strange chars, e.g.:

man procmailrc (PAGER=/usr/bin/most) yields: 

delivering and [EMAIL PROTECTED] recipes..  

That intercalating pattern -- [EMAIL PROTECTED] -- is throughout the man page.

I have tried the same in a utf-8 xterm with the same result.  

Is there some specific font I should be using?  How do I get a clean man
page in an xterm/aterm/rxvt/mlterm with a UTF-8 locale?

output of locale:

LANG=en_US.UTF-8
LC_CTYPE=en_US.UTF-8
LC_NUMERIC=en_US.UTF-8
LC_TIME=en_US.UTF-8
LC_COLLATE=en_US.UTF-8
LC_MONETARY=en_US.UTF-8
LC_MESSAGES=en_US.UTF-8
LC_PAPER=en_US.UTF-8
LC_NAME=en_US.UTF-8
LC_ADDRESS=en_US.UTF-8
LC_TELEPHONE=en_US.UTF-8
LC_MEASUREMENT=en_US.UTF-8
LC_IDENTIFICATION=en_US.UTF-8
LC_ALL=


Any hints or help would be greatly appreciated.

Cheers,

Fred Henry, Jr.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]