Re: UTF-8 problem in 8.0

2010-01-04 Thread Sean C. Farley

On Sun, 3 Jan 2010, Derek Kulinski wrote:


Hello Sean,

Sunday, January 3, 2010, 9:03:01 AM, you wrote:

The last commit that I recall (r192561) fixed dashes in front of 
options (.Fl macro) but not dashes present within a man page. 
Personally, I just create an alias for man to call 'LANG=C man' to 
protect against examples with non-ASCII (U+002D) dashes.


Read the commit message[1] for a better explanation of unresolved 
issues.



Sean
   1. http://svn.freebsd.org/viewvc/base?view=revision&revision=192561


What caused this to be broken? Everything was fine up until 7.2 (or at 
least I didn't notice this earlier)


Previously (to some commit revision), man (nroff) did not have NLS 
enabled, so the output was in ASCII.


Sean
--
s...@freebsd.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-03 Thread Derek Kulinski
Hello Sean,

Sunday, January 3, 2010, 9:03:01 AM, you wrote:

> The last commit that I recall (r192561) fixed dashes in front of options
> (.Fl macro) but not dashes present within a man page.  Personally, I 
> just create an alias for man to call 'LANG=C man' to protect against 
> examples with non-ASCII (U+002D) dashes.

> Read the commit message[1] for a better explanation of unresolved 
> issues.

> Sean
>1. http://svn.freebsd.org/viewvc/base?view=revision&revision=192561

What caused this to be broken? Everything was fine up until 7.2 (or at
least I didn't notice this earlier)

-- 
Best regards,
 Derekmailto:tak...@takeda.tk

A seminar on Time Travel will be held two weeks ago.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-03 Thread Sean C. Farley

On Sun, 3 Jan 2010, Ulrich Spörlein wrote:


On Fri, 01.01.2010 at 15:47:36 -0800, Derek Kulinski wrote:

Hello,

After installing FreeBSD 8.0 I noticed that some characters aren't 
displayed correctly (it worked fine for 7.2).


Mainly the dash character is most noticeable (though it's possible 
it's more of them).


Example:
env LANG=C man sh
env LANG=pl_PL.UTF-8 man sh
env LANG=en_US.UTF-8 man sh

The first command shows everything fine, the later two seem to show 
some unprintable character in place of every dash (i.e. in the second 
paragraph word "built-in", or the dashes that break up the words at 
the end of the line)


me too, though I'm only setting LC_CTYPE to de_DE.UTF-8 and I don't 
always see it. It must be some combination of xterm/ssh and/or putty 
that breaks this.


On the other hand, I thought there was a commit done, that would 
render - (the dash) always as - (0x2d) so you can copy&paste the 
examples from the manpage into your shell.


But perhaps I was just dreaming this up?


The last commit that I recall (r192561) fixed dashes in front of options 
(.Fl macro) but not dashes present within a man page.  Personally, I 
just create an alias for man to call 'LANG=C man' to protect against 
examples with non-ASCII (U+002D) dashes.


Read the commit message[1] for a better explanation of unresolved 
issues.


Sean
  1. http://svn.freebsd.org/viewvc/base?view=revision&revision=192561
--
s...@freebsd.org___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: UTF-8 problem in 8.0

2010-01-03 Thread Troels Kofoed Jacobsen
On Fri, Jan 01, 2010 at 03:47:36PM -0800, Derek Kulinski wrote:
> Hello,
> 
> After installing FreeBSD 8.0 I noticed that some characters aren't
> displayed correctly (it worked fine for 7.2).
> 
> Mainly the dash character is most noticeable (though it's possible
> it's more of them).
> 
> Example:
> env LANG=C man sh
> env LANG=pl_PL.UTF-8 man sh
> env LANG=en_US.UTF-8 man sh
> 
> The first command shows everything fine, the later two seem to show
> some unprintable character in place of every dash (i.e. in the second
> paragraph word "built-in", or the dashes that break up the words at
> the end of the line)
> 

I've been playing around with locales myself the past days. My analysis:
# env LANG=C locale charmap
US-ASCII
# env LANG=pl_PL.UTF-8 locale charmap
UTF-8
# env LANG=en_US.UTF-8 locale charmap
UTF-8

So the first one prints US-ASCII to your terminal and the latter two
prints UTF-8 characters. My guess is that your terminal is set to
US-ASCII and not UTF-8 and will thus not understand UTF-8 encoded
characters. 

I'm using konsole where this setting can be found in "View" ->
"Character Encoding"

Hope this helps

/Troels Kofoed Jacobsen

> -- 
> Best regards,
>  Derek  mailto:tak...@takeda.tk
> 
> Daddy, why doesn't this magnet pick up this floppy disk?
> 
> ___
> freebsd-stable@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-02 Thread Jonathan Chen
On Sat, Jan 02, 2010 at 07:37:15PM -0800, Derek Kulinski wrote:
> Hello Jonathan,
> 
> Saturday, January 2, 2010, 7:28:07 PM, you wrote:
> 
> >> me too, though I'm only setting LC_CTYPE to de_DE.UTF-8 and I don't
> >> always see it. It must be some combination of xterm/ssh and/or putty
> >> that breaks this.
> 
> > Actually, I've just got en_NZ.UTF8, and I see it on the console as
> > well.
> 
> Put en_NZ.UTF-8 (with the dash), if you set an invalid value it
> switches to default.

Sorry. I actually meant that if I had en_NZ.UTF-8 on the console, some
man pages' characters are mangled as well. If it uses the default of C,
everything is fine.

Cheers.
-- 
Jonathan Chen 
--
   "Lots of folks confuse bad management with destiny"
 - Kin Hubbard
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-02 Thread Derek Kulinski
Hello Jonathan,

Saturday, January 2, 2010, 7:28:07 PM, you wrote:

>> me too, though I'm only setting LC_CTYPE to de_DE.UTF-8 and I don't
>> always see it. It must be some combination of xterm/ssh and/or putty
>> that breaks this.

> Actually, I've just got en_NZ.UTF8, and I see it on the console as
> well.

Put en_NZ.UTF-8 (with the dash), if you set an invalid value it
switches to default.

-- 
Best regards,
 Derekmailto:tak...@takeda.tk

If you have a procedure with ten parameters, you probably missed some.
-- Alan J. Perlis

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-02 Thread Jonathan Chen
On Sun, Jan 03, 2010 at 02:38:32AM +0100, Ulrich Sp??rlein wrote:
> On Fri, 01.01.2010 at 15:47:36 -0800, Derek Kulinski wrote:
> > Hello,
> > 
> > After installing FreeBSD 8.0 I noticed that some characters aren't
> > displayed correctly (it worked fine for 7.2).
> > 
> > Mainly the dash character is most noticeable (though it's possible
> > it's more of them).
> > 
> > Example:
> > env LANG=C man sh
> > env LANG=pl_PL.UTF-8 man sh
> > env LANG=en_US.UTF-8 man sh
> > 
> > The first command shows everything fine, the later two seem to show
> > some unprintable character in place of every dash (i.e. in the second
> > paragraph word "built-in", or the dashes that break up the words at
> > the end of the line)
> 
> me too, though I'm only setting LC_CTYPE to de_DE.UTF-8 and I don't
> always see it. It must be some combination of xterm/ssh and/or putty
> that breaks this.

Actually, I've just got en_NZ.UTF8, and I see it on the console as
well.
-- 
Jonathan Chen 
--
"With sufficient thrust, pigs fly just fine. However, this is not necessarily
a good idea. It is hard to be sure where they are going to land, and it
could be dangerous sitting under them as they fly overhead." -- RFC 1925
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: UTF-8 problem in 8.0

2010-01-02 Thread Ulrich Spörlein
On Fri, 01.01.2010 at 15:47:36 -0800, Derek Kulinski wrote:
> Hello,
> 
> After installing FreeBSD 8.0 I noticed that some characters aren't
> displayed correctly (it worked fine for 7.2).
> 
> Mainly the dash character is most noticeable (though it's possible
> it's more of them).
> 
> Example:
> env LANG=C man sh
> env LANG=pl_PL.UTF-8 man sh
> env LANG=en_US.UTF-8 man sh
> 
> The first command shows everything fine, the later two seem to show
> some unprintable character in place of every dash (i.e. in the second
> paragraph word "built-in", or the dashes that break up the words at
> the end of the line)

me too, though I'm only setting LC_CTYPE to de_DE.UTF-8 and I don't
always see it. It must be some combination of xterm/ssh and/or putty
that breaks this.

On the other hand, I thought there was a commit done, that would render
- (the dash) always as - (0x2d) so you can copy&paste the examples from
the manpage into your shell.

But perhaps I was just dreaming this up?

Bye,
Uli
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"