Package: pstotext
Version: 1.9-1sarge1
Severity: minor

Depending on which standard you read, and how you read them, 0xAD can
mean one of the following:
    - An indicator that a program may put in a file to say "I broke this
      word and line here" displayed just like hyphen/minus (one reading
      of ISO-8859-1). Under this reading, it must only occur at the end
      of a line. [KORPELA]
    - An indicator that a line break MAY be taken here. If the
      displaying program elects to take the line break, it should
      display e.g., U+2010. Otherwise, it should display nothing.
      (Unicode; HTML; other reading of 8859-1). See especially [TR14]'s
      section on the "soft hyphen"

Which one is correct really doesn't matter, but its fairly clear that
the hyphenated version of email should be written as either e-mail or
e‐mail, but certainly not as e­mail. [Depending on your email client,
the last one may just show as "email". Its not --- its e\255mail]

pstotext should never output \255.

[KORPELA] http://www.cs.tut.fi/~jkorpela/shy.html
[TR14] http://www.unicode.org/unicode/reports/tr14/


-- System Information:
Debian Release: 3.1
  APT prefers unstable
  APT policy: (99, 'unstable')
Architecture: i386 (i686)
Kernel: Linux 2.6.14-2-686-smp
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)

Versions of packages pstotext depends on:
ii  gs                          8.01-5       Transitional package
ii  gs-esp [gs]                 7.07.1-9     The Ghostscript PostScript interpr
ii  gs-gpl [gs]                 8.01-5       The GPL Ghostscript PostScript int
ii  libc6                       2.3.2.ds1-22 GNU C Library: Shared libraries an

-- no debconf information


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to