retitle 673385 lynx-cur: in UTF-8 locales, lynx displays search text at the wrong column if preceded by non-ASCII (multibyte) characters thanks
On 2012-08-15 16:46:03 -0400, Thomas Dickey wrote: > On Wed, Aug 15, 2012 at 01:01:56PM +0200, Vincent Lefevre wrote: > > On 2012-08-15 05:46:53 -0400, Thomas Dickey wrote: > > > On Fri, May 18, 2012 at 10:37:06AM +0200, Vincent Lefevre wrote: > > > > lynx displays search text at the wrong column if preceded by UTF-8 > > > > characters. For instance, consider: > > > > > > This is addressed by setting > > > > > > XHTML_PARSING:true > > > > > > in lynx.cfg > > > > I don't see why a display problem should be affected by parsing. > > The character-set information is given only as an xml processing instruction, > which in turn is used by lynx only when XHTML_PARSING is set. But lynx got the charset information right. Otherwise it couldn't have output the ellipsis characters correctly! Getting incorrect charset information from a (X)HTML file can lead to incorrect characters to be displayed, but certainly not a display consistency problem as reported here. Actually there's *exactly* the same problem with an ASCII XHTML file (here, ASCII refers to the source): in the example, just replace the <p>...</p> line by: <p>……… In <cite>lynx</cite>, search for "foo" by typing: /foo</p> I've retitled the bug, because the "UTF-8" was ambiguous. The problem is not related to the encoding used in the HTML file, but IMHO, to the internal use of UTF-8 for the output to a terminal with UTF-8 locales. I think that lynx assumes that the column is obtained by counting the number of bytes, but in UTF-8 locales, this is wrong due to multibyte characters. -- Vincent Lefèvre <vinc...@vinc17.net> - Web: <http://www.vinc17.net/> 100% accessible validated (X)HTML - Blog: <http://www.vinc17.net/blog/> Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon) -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org