Bug#354284: lynx vs. uxterm

2006-02-24 Thread Dan Jacobson
Package: lynx-cur
Version: 2.8.6-17
Severity: normal

There is something wrong in what lynx sends to, e.g., uxterm.
$ lynx -dump http://seba.ulyssis.org/thesis/howto-pinyin.php is fine.
Nothing to do with raw utf-8 vs. &#stuff.

My investigation concludes that lynx is sending e.g., M-G ~ P when it should be
sending M-G M-^P, where M- means high bit set. Instead of setting the
high bit of ^P and sending that, it sends two characters, ~ and P.

Looking at their bits (hex value, then the high nibble),
^P    10  0001
P     50  0101
~     7e  0111
M-^P  90  1001

we see lynx is sending
11000111 01111110 010abcde
instead of
11000111 100abcde
where abcde varies with the particular char being sent.
What a screw up?! Looks like some mis-estimate in your to-utf-8
converter for screen presentation.
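
The byte-level claim above can be checked mechanically. What follows is a
minimal Python sketch, not anything taken from lynx: the names set_high_bit,
is_valid_utf8, expected and observed are made up for illustration, and the
byte values are simply the ones quoted in this report (G = 0x47, ^P = 0x10,
~ = 0x7e, P = 0x50). It suggests that c7 90 is a well-formed two-byte UTF-8
sequence (it decodes to U+01D0), whereas c7 7e 50 is not valid UTF-8, since
0x7e cannot follow a lead byte.

```python
def set_high_bit(byte):
    """Return the byte with bit 7 set (the "M-" notation used above)."""
    return byte | 0x80


def is_valid_utf8(data):
    """Report whether the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Bytes the report says should be sent: M-G followed by M-^P.
expected = bytes([set_high_bit(ord("G")), set_high_bit(0x10)])

# Bytes the report says are actually sent: M-G, then "~" and "P".
observed = bytes([set_high_bit(ord("G")), ord("~"), ord("P")])

if __name__ == "__main__":
    # Show each sequence in hex and whether it is valid UTF-8.
    print(expected.hex(), is_valid_utf8(expected))
    print(observed.hex(), is_valid_utf8(observed))
```

Which character the test page actually contained at that spot is not stated
in the report, so the U+01D0 reading is only what the quoted bytes imply.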

(pstree shows a "xterm--luit" connection.)

None of
character_set=UNICODE (UTF-8)
character_set=Transparent
character_set=Chinese
helped.

 Display and Character Set
  Use locale-based character set(!): [ON_]
  Display character set: [UNICODE (UTF-8)]
  Assumed document character set(!): [utf-8___]
  Raw 8-bit (!): [ON_]
also didn't help

And why does "=" show
  Charset: utf-8
when "o" shows
  Assumed document character set(!): [iso-8859-1__]

-assume_local_charset=UTF-8 no help, -assume_charset=UTF-8 same.
file:///usr/share/doc/HOWTO/en-html/Unicode-HOWTO-4.html no help.

w3m works fine.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Bug#354284: lynx vs. uxterm

2006-02-25 Thread Thomas Dickey
On Sat, Feb 25, 2006 at 12:20:14AM +0100, Dan Jacobson wrote:
> Package: lynx-cur
> Version: 2.8.6-17
> Severity: normal
> 
> There is something wrong in what lynx sends to, e.g., uxterm.
> $ lynx -dump http://seba.ulyssis.org/thesis/howto-pinyin.php is fine.
> Nothing to do with raw utf-8 vs. &#stuff.

if the dump is fine, then it sounds as if you're talking about the interactive
display - I don't see any tildes in that (after setting the display
character set to UTF-8, of course).

> And why does "=" show
>   Charset: utf-8
> when "o" shows
>   Assumed document character set(!): [iso-8859-1__]

lynx.cfg says

# ASSUME_CHARSET changes the handling of documents which do not 
# explicitly specify a charset.  Normally Lynx assumes that 8-bit 
# characters in those documents are encoded according to iso-8859-1 
# (the official default for the HTTP protocol).  When ASSUME_CHARSET 
# is defined here or by an -assume_charset command line flag is in effect, 
# Lynx will treat documents as if they were encoded accordingly. 
# See above on how this interacts with "raw mode" and the Display 
# Character Set. 
# ASSUME_CHARSET can also be changed via the 'o'ptions menu but will 
# not be saved as permanent value in user's .lynxrc file to avoid more chaos. 
# 
#ASSUME_CHARSET:iso-8859-1 
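
The fallback rule described in that comment can be sketched as follows. This
is a simplified Python model of the rule as the comment states it, not Lynx's
actual code; the function name effective_charset and its parameters are
hypothetical. On this reading, a document that declares its own charset is
handled with that charset, and the assumed charset only matters when nothing
is declared, which would be consistent with "=" and "o" showing different
values.

```python
DEFAULT_CHARSET = "iso-8859-1"  # the default named in the lynx.cfg comment


def effective_charset(declared=None, assume_charset=None):
    """Pick the charset for a document under the rule quoted above.

    declared       -- charset the document explicitly specifies, if any
    assume_charset -- value of ASSUME_CHARSET / -assume_charset, if set
    """
    if declared:
        # An explicit charset in the document takes priority.
        return declared
    if assume_charset:
        # Otherwise the configured assumption applies.
        return assume_charset
    # With neither, fall back to the documented default.
    return DEFAULT_CHARSET


if __name__ == "__main__":
    print(effective_charset(declared="utf-8"))
    print(effective_charset())
    print(effective_charset(assume_charset="utf-8"))
```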

> 
> -assume_local_charset=UTF-8 no help, -assume_charset=UTF-8 same.
> file:///usr/share/doc/HOWTO/en-html/Unicode-HOWTO-4.html no help.
> 
> w3m works fine.
-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net




Bug#354284: lynx vs. uxterm

2006-02-25 Thread Dan Jacobson
Starting over.
$ uxterm
$ su -
# su - nobody
$ env
TERM=xterm
HOME=/ #... etc. No LC_ or LANG stuff.
$ export http_proxy="http://localhost:8080/"; #needed for me, WWWOFFLE
$ wget -O - http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90 #Chinese looks great!
$ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
$ export LYNX_CFG=/etc/lynx-cur/lynx-utf.cfg
$ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
Nope. No matter what I do, only 1% of the Chinese characters come thru
OK, all the rest turn into a tilde mess. wget, w3m work fine.
pstree -Al shows
`-xdm---sh-+-icewm
           |-2*[ssh-agent]
           `-xterm---luit---bash-+-pstree
                                 `-xterm---luit---bash---su---bash---su---sh
$ xterm -v
XTerm(202)
Well, OK I'll try again with 209 or whatever next time I go to town to
do my dist-upgrade downloads.





Bug#354284: lynx vs. uxterm

2006-02-26 Thread Thomas Dickey
On Sun, Feb 26, 2006 at 05:50:09AM +0100, Dan Jacobson wrote:
> Starting over.
> $ uxterm
> $ su -
> # su - nobody
> $ env
> TERM=xterm
> HOME=/ #... etc. No LC_ or LANG stuff.

uxterm should be setting the locale (if your shell unsets them, that
would produce the sort of effect you are describing, but is not a bug
in lynx).
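
One way to see whether any locale reaches the shell is to inspect the
environment directly. The following Python sketch is only an illustration:
the helper names effective_ctype_locale and looks_like_utf8 are made up here,
and the lookup order (LC_ALL, then LC_CTYPE, then LANG) is the usual POSIX
precedence for the character-type category rather than anything specific to
lynx or uxterm.

```python
import os


def effective_ctype_locale(env):
    """Return the first non-empty value of LC_ALL, LC_CTYPE, LANG, or None."""
    for name in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(name)
        if value:
            return value
    return None


def looks_like_utf8(locale_name):
    """Guess from a name like 'en_US.UTF-8' whether the codeset is UTF-8."""
    if not locale_name:
        return False
    # Take the part after "." and drop any "@modifier" suffix.
    codeset = locale_name.partition(".")[2].partition("@")[0]
    return codeset.lower().replace("-", "") == "utf8"


if __name__ == "__main__":
    current = effective_ctype_locale(os.environ)
    print(current, looks_like_utf8(current))
```

With an environment like the one shown in the quoted env output (only TERM
and HOME, no LC_ or LANG variables), the first helper returns None.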

> $ export http_proxy="http://localhost:8080/"; #needed for me, WWWOFFLE
> $ wget -O - http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90 #Chinese looks great!
> $ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
> $ export LYNX_CFG=/etc/lynx-cur/lynx-utf.cfg
> $ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
> Nope. No matter what I do, only 1% of the Chinese characters come thru
> OK, all the rest turn into a tilde mess. wget, w3m work fine.

I don't read Chinese, so it would be understandable if I overlook relatively
minor things, but I don't see any tildes.  Comparing with w3m, the glyphs
look the same.  I made screenshots to show the two, put them in

ftp://invisible-island.net/temp/db354284-w3m.png
ftp://invisible-island.net/temp/db354284-lynx.png

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net




Bug#354284: lynx vs. uxterm

2006-02-27 Thread Dan Jacobson
T> uxterm should be setting the locale (if your shell unsets them, that
T> would produce the sort of effect you are describing, but is not a bug in lynx).
$ HOME=/ uxterm #instead of su -, but same effect. OK, will await dist-upgrade.
T>  ftp://invisible-island.net/temp/db354284-w3m.png
T>  ftp://invisible-island.net/temp/db354284-lynx.png
==> PASV ... done.    ==> RETR db354284-w3m.png ...
No such file `db354284-w3m.png'. Same with the other. Anyways all you
need to do is confirm that it looks the same as -dump. No photos
needed. As I'm sure it does, further testing will have to await my
next http://jidanni.org/comp/debian/apt-offline/index_en.html run. OK thanks.





Bug#354284: lynx vs. uxterm

2006-02-28 Thread Thomas Dickey
On Tue, Feb 28, 2006 at 07:40:06AM +0100, Dan Jacobson wrote:
> T> uxterm should be setting the locale (if your shell unsets them, that
> T> would produce the sort of effect you are describing, but is not a bug in lynx).
> $ HOME=/ uxterm #instead of su -, but same effect. OK, will await dist-upgrade.
> T>  ftp://invisible-island.net/temp/db354284-w3m.png
> T>  ftp://invisible-island.net/temp/db354284-lynx.png

sorry (my typo):

Feb 26 14:18  image/png  db354384-lynx.png  9Kb
Feb 26 14:18  image/png  db354384-w3m.png   8Kb

> ==> PASV ... done.    ==> RETR db354284-w3m.png ...
> No such file `db354284-w3m.png'. Same with the other. Anyways all you
> need to do is confirm that it looks the same as -dump. No photos
> needed. As I'm sure it does, further testing will have to await my
> next http://jidanni.org/comp/debian/apt-offline/index_en.html run. OK thanks.

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net

