Re: Names of Fields in Control Files

2010-09-29 Thread Bill Allombert
On Sun, Sep 26, 2010 at 01:35:00AM -0700, Russ Allbery wrote:
 Raphael Hertzog hert...@debian.org writes:
  On Sat, 25 Sep 2010, Jonathan Yu wrote:
 
  22:02:40  rra jawnsy: I don't think we say that explicitly, but RFC
  5322 requires it and I can't imagine ever not enforcing that.
  Although you should check with the dpkg maintainers to be sure.
  
  Could we/should we make the Debian Policy more restrictive, and
  specify explicitly that field names must only be ASCII-encoded?
  [...]
  Your comments and feedback on this would be much appreciated.
 
  I think this discussion is theoretical and useless. I hope nobody will
  suggest a field name containing non-ascii characters...
 
 I suspect there might be a communication problem that made this come
 across harsher than it was intended.  But I'll mention that one of the
 things that's sometimes frustrating about trying to nail down the
 specification and standards around Debian's package format is that aspects
 of standardization that would be considered completely routine in, say,
 IETF work are considered theoretical and useless.
 
 If we were standardizing things in any other context, one of the very
 first things we'd do is write an ABNF grammar for Debian control fields,
 which would immediately and unambiguously state the allowed characters for
 each component.

Ideally, there should be a proper dpkg interface documentation, and the policy 
document 
would only need to specify the subset that is mandated by policy.

The current situation when the policy team is in charge of maintaining the dpkg
interface documentation is awkward, especially since policy does not maintain 
the
dpkg interface.

Cheers,
-- 
Bill. ballo...@debian.org

Imagine a large red swirl here. 


-- 
To UNSUBSCRIBE, email to debian-dpkg-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100929150556.gb10...@yellowpig



Re: Names of Fields in Control Files

2010-09-26 Thread Raphael Hertzog
On Sat, 25 Sep 2010, Jonathan Yu wrote:
 22:02:40  rra jawnsy: I don't think we say that explicitly, but RFC
 5322 requires it and I can't imagine ever not enforcing that.
 Although you should check with the dpkg maintainers to be sure.
 
 Could we/should we make the Debian Policy more restrictive, and
 specify explicitly that field names must only be ASCII-encoded?
[...]
 Your comments and feedback on this would be much appreciated.

I think this discussion is theoretical and useless. I hope nobody will
suggest a field name containing non-ascii characters...

I don't think dpkg does any special decoding when reading data. dpkg deals
with bytes and not characters AFAIK.

I'm certainly OK with policy requiring field names to be ASCII.

Cheers,
-- 
Raphaël Hertzog ◈ Debian Developer ◈ [Flattr=20693]

Follow my Debian News ▶ http://RaphaelHertzog.com (English)
  ▶ http://RaphaelHertzog.fr (Français)


-- 
To UNSUBSCRIBE, email to debian-dpkg-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100926075312.gc22...@rivendell.home.ouaza.com



Re: Names of Fields in Control Files

2010-09-26 Thread Russ Allbery
Raphael Hertzog hert...@debian.org writes:
 On Sat, 25 Sep 2010, Jonathan Yu wrote:

 22:02:40  rra jawnsy: I don't think we say that explicitly, but RFC
 5322 requires it and I can't imagine ever not enforcing that.
 Although you should check with the dpkg maintainers to be sure.
 
 Could we/should we make the Debian Policy more restrictive, and
 specify explicitly that field names must only be ASCII-encoded?
 [...]
 Your comments and feedback on this would be much appreciated.

 I think this discussion is theoretical and useless. I hope nobody will
 suggest a field name containing non-ascii characters...

I suspect there might be a communication problem that made this come
across harsher than it was intended.  But I'll mention that one of the
things that's sometimes frustrating about trying to nail down the
specification and standards around Debian's package format is that aspects
of standardization that would be considered completely routine in, say,
IETF work are considered theoretical and useless.

If we were standardizing things in any other context, one of the very
first things we'd do is write an ABNF grammar for Debian control fields,
which would immediately and unambiguously state the allowed characters for
each component.

 I'm certainly OK with policy requiring field names to be ASCII.

I think that's probably the right thing to do.

-- 
Russ Allbery (r...@debian.org)   http://www.eyrie.org/~eagle/


-- 
To UNSUBSCRIBE, email to debian-dpkg-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/87aan452dn@windlord.stanford.edu



Re: Names of Fields in Control Files

2010-09-26 Thread Charles Plessy
Le Sun, Sep 26, 2010 at 01:35:00AM -0700, Russ Allbery a écrit :
 Raphael Hertzog hert...@debian.org writes:
 
  I'm certainly OK with policy requiring field names to be ASCII.
 
 I think that's probably the right thing to do.

Dear all,

how about simply paraphrasing the RFC 822/5832, which our the source of
inspiration ? In that case, the requirement for field names will be to be
printable ASCII characters, except colons.

I propose the following change in the context the patch that I am preparing for
clarifying the Policy's chapter about control files, in bug #593909.

diff --git a/policy.sgml b/policy.sgml
index be0a505..5c72355 100644
--- a/policy.sgml
+++ b/policy.sgml
@@ -2493,9 +2493,11 @@ endif
p
  Each paragraph consists of a series of data fields; each
  field consists of the field name, followed by a colon and
- then the data/value associated with that field.  It ends at
- the end of the line or at the end of the last
- continuation line (see below).  Horizontal whitespace
+ then the data/value associated with that field.  The field
+ name is composed of printable US-ASCII characters (i.e.,
+ characters that have values between 33 and 126, inclusive),
+ except colon.  The field ends at the end of the line or at
+ the end of the last continuation line (see below).  Horizontal 
whitespace
  (spaces and tabs) may occur immediately before or after the
  value and is ignored there; it is conventional to put a
  single space after the colon.  For example, a field might


Cheers,

-- 
Charles Plessy
Tsurumi, Kanagawa, Japan


-- 
To UNSUBSCRIBE, email to debian-dpkg-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100926150735.gb31...@merveille.plessy.net