Re: UTF-8 encoding of changelog files

2007-05-16 Thread Lars Wirzenius
On ke, 2007-05-16 at 23:17 +0100, Dagfinn Ilmari Mannsåker wrote: > That's trivial to check. Just feed it to recode and specify UTF-8 > Hexadecimal input: You might also be interested in isutf8 from moreutils. -- Never underestimate the power of a small tactical Lisp interpreter. -- To UNSUBS

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Dagfinn Ilmari Mannsåker
[EMAIL PROTECTED] (Lennart Sorensen) writes: > So is this valid utf-8? I don't believe so. If it is I have to go > reread the UTF-8 spec again. :) > > 4b c4 99 73 74 75 74 69 73 (K..stutis) > (I found this one in base-config from sarge since I happened to have > that one open in a hex editor

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Russ Allbery
Lennart Sorensen <[EMAIL PROTECTED]> writes: > On Wed, May 16, 2007 at 02:53:09PM -0700, Russ Allbery wrote: >> debian-changelog-file-uses-obsolete-national-encoding is the tag. > I wonder how many packages are triggering that right now. 96. http://lintian.debian.org/reports/Tdebian-changelog-f

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Steve Langasek
On Wed, May 16, 2007 at 06:01:40PM -0400, Lennart Sorensen wrote: > On Wed, May 16, 2007 at 02:53:09PM -0700, Russ Allbery wrote: > > Lennart Sorensen <[EMAIL PROTECTED]> writes: > > Yes. > Hmm, well I haven't found one yet and I think I checked 10 so far, all > of which have non ascii characters

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Adeodato Simó
* Lennart Sorensen [Wed, 16 May 2007 18:01:40 -0400]: > I wonder how many packages are triggering that right now. http://lintian.debian.org/reports/Tdebian-changelog-file-uses-obsolete-national-encoding.html > So is this valid utf-8? I don't believe so. If it is I have to go > reread the UTF-8

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Lennart Sorensen
On Wed, May 16, 2007 at 06:01:40PM -0400, Lennart Sorensen wrote: > I wonder how many packages are triggering that right now. > > So is this valid utf-8? I don't believe so. If it is I have to go > reread the UTF-8 spec again. :) > > 4b c4 99 73 74 75 74 69 73 (K..stutis) > (I found this on

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Lennart Sorensen
On Wed, May 16, 2007 at 02:53:09PM -0700, Russ Allbery wrote: > Lennart Sorensen <[EMAIL PROTECTED]> writes: > Yes. Hmm, well I haven't found one yet and I think I checked 10 so far, all of which have non ascii characters but none of which appeared valid to me. > lintian's is, at least. Many oth

Re: UTF-8 encoding of changelog files

2007-05-16 Thread Russ Allbery
Lennart Sorensen <[EMAIL PROTECTED]> writes: > So should changelog files for debian packages be UTF8 Yes. > and if so are there any that actually are lintian's is, at least. Many others are these days as well. All of my packages have UTF-8 changelog files, although not all actually have non-A

UTF-8 encoding of changelog files

2007-05-16 Thread Lennart Sorensen
I am currently trying to find a valid UTF8 encoded text file to check my terminal settings and such to make sure I have it working right. So to do this I figured I would just find some changelog file that claims to be in UTF8 format and see if it views correctly. Well so far no luck. Every single