Package: pstotext Version: 1.9-1sarge1 Severity: minor Depending on which standard you read, and how you read them, 0xAD can mean one of the following: - An indicator that a program may put in a file to say "I broke this word and line here" displayed just like hyphen/minus (one reading of ISO-8859-1). Under this reading, it must only occur at the end of a line. [KORPELA] - An indicator that a line break MAY be taken here. If the displaying program elects to take the line break, it should display e.g., U+2010. Otherwise, it should display nothing. (Unicode; HTML; other reading of 8859-1). See especially [TR14]'s section on the "soft hyphen"
Which one is correct really doesn't matter, but its fairly clear that the hyphenated version of email should be written as either e-mail or e‐mail, but certainly not as email. [Depending on your email client, the last one may just show as "email". Its not --- its e\255mail] pstotext should never output \255. [KORPELA] http://www.cs.tut.fi/~jkorpela/shy.html [TR14] http://www.unicode.org/unicode/reports/tr14/ -- System Information: Debian Release: 3.1 APT prefers unstable APT policy: (99, 'unstable') Architecture: i386 (i686) Kernel: Linux 2.6.14-2-686-smp Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Versions of packages pstotext depends on: ii gs 8.01-5 Transitional package ii gs-esp [gs] 7.07.1-9 The Ghostscript PostScript interpr ii gs-gpl [gs] 8.01-5 The GPL Ghostscript PostScript int ii libc6 2.3.2.ds1-22 GNU C Library: Shared libraries an -- no debconf information -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]