from:"Martin Duerst"

Re: Atom Bidi Draft Update - Informal Last Call

2007-03-28 Thread Martin Duerst


At 02:25 07/03/23, James M Snell wrote:

It is not year clear if there has been enough independent implementation
of the specification to justify publishing it as a proposed standard.

There is no need for implementations when going to Proposed Standard
(of course, they never hurt). There is a well-defined need when going
from Proposed Standard to Draft Standard, but that's way in the future.

Regards, Martin.


#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Additional 'namespace' attribute on content element?

2007-01-05 Thread Martin Duerst


At 16:13 07/01/05, Asbj\x8F\xD3n Ulsberg wrote:

On Thu, 04 Jan 2007 17:00:29 +0100, Jan Algermissen  
[EMAIL PROTECTED] wrote:

 on the NewsML list, an issue came up that due to they lack of a MIME  
 type for NewsML using NewsML as Atom content is somewhat problematic[1];  
 I think this is the case with most of the more interesting XML  
 applications out there.

The more interesting XML applications out there should get a proper MIME  
type, that's all. Nothing wrong with Atom in this case, imo.

Sorry, wrong.

Using +xml at the end of a Mime Type is a good convention, but it
is only a convention. There is no requirement for a Mime Type that
uses XML to end in +xml. Also, the +xml convention was established
several years later than XML itself.

So I think that Atom is a bit out on it's egde if it says
'if you don't have a +xml Mime Type, you're not XML'.

Regards,Martin.




#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Atom Entry docs

2006-12-13 Thread Martin Duerst


At 13:14 06/12/13, James M Snell wrote:

I think atom.entry and atom-entry are equally ugly; atom.entry would,
however, appear to be more consistent with typical mime conventions.

The dot is used for prefixes like vnd. (vendor) and so on.
In the case of atom entries, atom-entry is more in line with
the convention in other types.

Regards,Martin.

- James

Tim Bray wrote:
 
 [snip]
 (James, did you really mean atom.entry with the ugly dot?)
 
  -Tim
 
 


#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

RE: Atom Entry Documents

2006-12-11 Thread Martin Duerst


At 10:32 06/12/11, Franklin Tse wrote:

 Option B) New Entry media type

   application/atom.entry+xml

+1

+1



#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: [atom-syntax] [RFC 4685] uri in xml namespace

2006-11-10 Thread Martin Duerst


At 07:50 06/11/10, Asbj\x8F\xD3n Ulsberg wrote:

A good example is the XHTML namespace URI:

   http://www.w3.org/1999/xhtml

Yes, this is a good example in general, but misses one important
point. It should say explicitly that the namespace URI is
http://www.w3.org/1999/xhtml. Why? Because somebody by chance
may arrive at this page from another URI, such as
http://Www.w3.org/1999/xhtml (Opera and Firefox convert this to
http://www.w3.org/1999/xhtml, probably due to the Latin/Cyrillic
confusion/scare a while ago, but IE doesn't).

Regards,Martin.



#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Atom Syndication Format To Draft Standard?

2006-10-03 Thread Martin Duerst


At 04:10 06/10/03, Julian Reschke wrote:

Independantly of that, what do we do with all the normative references to 
proposed standards...:

RFC3987: [PROPOSED STANDARD] -- intended standards level of DRAFT incompatible 
with this document's standard level!

I've definitely thought about moving that a level forward, but that
may take another few months or so.

Regards,Martin.


#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Protocol Action: 'Atom Threading Extensions' to Proposed Standard

2006-07-11 Thread Martin Duerst


At 10:43 06/07/10, Robert Sayre wrote:

Hi Lisa,

Thanks for the clarification. You may have missed another question I
recently asked, so I'll repeat it here. I am concerned that purl.org
lists the document author as the owner of the namespace URI, and I
wonder how the IESG came to the conclusion that the namespace is not a
problem. I see Sam Hartman raised the issue. What was the resolution?
Could the draft advance to Draft- or Full-Standard in that namespace?

I looked at that namespace shortly. It seems that it would be easy
to change the owners to make clear that this is owned by the IETF.
This can be done whenever it's needed. A purl namespace in and
by itself isn't any better or worse than a W3C namespace.

Regards,Martin.



#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Link rel test cases

2006-05-30 Thread Martin Duerst


At 06:34 06/04/26, James M Snell wrote:

A couple more link rel test cases:
http://www.snellspace.com/public/linkreltest.xml

See the bottom of: http://www.intertwingly.net/wiki/pie/LinkConformanceTests

Just a short comment: the use of unregistered,
(see http://www.iana.org/assignments/urn-namespaces)
made-up urn namespaces such as linkreltest in
idurn:linkreltest:feed/id
doesn't really match well with the fact that
atom is an IETF format, and tests are supposed
to be correct and conforming by themselves.

Regards,   Martin.


#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Link rel test cases

2006-05-30 Thread Martin Duerst


At 10:02 06/05/31, James Holderness wrote:

Robert Sayre wrote:
 of Use IRI:Compare, Use String::Compare, or The spec doesn't say, so
 you may use whatever you prefer.

 The URI/IRI specs say use whatever you prefer. If you don't like
 that, have James add it to his revision of 4287. :)

Thanks. That's not how I read RFC3987, but I'm happy to accept your 
interpretation.

Well, RFC 3987 (and 3986) don't *exactly* say use whatever you prefer,
but for the case at hand, what they say comes pretty close.
What they try to do is give advice on what kind of normalization
or comparison function(s) is/are adequate in which case.
Given that it's difficult to immagine every possible case in
advace, the advice they give unfortuanetly has to be somewhat
vague.

BTW, as a co-author of RFC 3987, I of course appreciate any
suggestion on how to make it clearer, and any suggestions for
fixes or changes (should they be necessary). These should be
directed to [EMAIL PROTECTED]

Regards, Martin.



#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Link rel test cases

2006-05-29 Thread Martin Duerst


At 04:37 06/05/27, Robert Sayre wrote:

Huh, I personally feel that the comparison ladder is better. Sort of
like, if it comes down to that, we can't help you, even for atom:id.
The WG definitely felt simple string comparison was needed for
atom:id, so we spent *a lot* of time on that text.

I think for simple IDs, where there is no expectation of resolution,
character-by-character comparison is better. But either way,
the spec is the way it is now.

I don't feel that effort would pay off here, at all. Especially since
a consumer that matched
HTTP://www.IANA.org:80/assignments/relation/alternate would be in
error. That's ridiculous standards weenie stuff, don't you think?

Yes, a content producer that expected
a) HTTP://www.IANA.org:80/assignments/relation/alternate
to always be different from
b) http://www.iana.org/assignments/relation/alternate
would be stupid. But on the other hand, a content producer
that expected a) and b) to always be treated the same would
be ridiculous, too. The conclusion is that a content producer
that uses
HTTP://www.IANA.org:80/assignments/relation/alternate
at all does something wrong.

Regards,Martin.


#-#-#  Martin J. Durst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp   mailto:[EMAIL PROTECTED]

Re: Feed Thread in Last Call

2006-05-17 Thread Martin Duerst



Hello Robert,

It's not the IETF which wants to move this on on Standards Track,
it's (currently) James as an individual submitter. The IESG just
did what it does for all such requests from individuals: Put the
document out for IETF Last Call and tell the relevant mailing lists
about it.

Now it's your (and everybody else's) turn to tell the IESG what
you think about this draft. A short, but clearly argued and nicely
worded mail, with some easy-to-follow pointers to further material,
is all that is needed. This may say okay with Standards Track iff
these things get fixed or should be experimental because ... or
even too controversial for an individual submission, needs a WG.

The time you invest in this email will be a lot better spent than
writing yet another mail to James and this list.

Lisa (the sponsoring AD) and the rest of the IESG will then look
at all the input they get, and will make a decision.

Regards,Martin.

At 07:03 06/05/17, Robert Sayre wrote:

On 5/16/06, Paul Hoffman [EMAIL PROTECTED] wrote:
 At 4:33 AM +0200 5/16/06, Robert Sayre wrote:
 I thought the working group was fairly clear about the dubious value
 and placement of these attributes,

 For the benefit of Lisa, who is the sponsoring AD for this document,
 please list links to those messages.

James changed the document in response to those messages. That should
be enough. Maybe she could ask James about them. It's not my
obligation to spelunk for them, and it's certainly not your place to
start making demands like that.

 So you don't think they're important or needed, and then
 WG doesn't have consensus on them.

 Quite true, but it is true because there has never been a call for
 consensus on the document, and there won't be in the future.

Well, I'm not going to quibble with you about procedural details. But
I have to wonder why they're in the document at all.

Looks like the IETF wants to publish a proposed standard explicitly
designed to break a very popular implementation, with no technical
reason. I think that speaks volumes about the IETF, its management,
and the quality of its individual contributors.

You don't have to listen to the WG, but if one or two WG members are
going to deploy and then standardize whatever they've done, that's an
informational document.

 That is not true. If it is a protocol or a format, standards track is
 also appropriate.

Well, I don't want to standardize some of what James has deployed. It
won't work in Sean's implementation. I'm not sure I can interoperably
implement the parts in question. Your two biggest client implementers
aren't real happy about this. It might be appropriate if you really
stretch, but it's sure not smart.

--

Robert Sayre

Re: Does xml:base apply to type=html content?

2006-04-10 Thread Martin Duerst



At 19:17 06/04/04, Anne van Kesteren wrote:

Quoting James Holderness [EMAIL PROTECTED]:
 If `Content-Location` is not usable or can't be used consistent on a website
 (for example, using it for both Atom and HTML content) I suggest we specify
 something that is consistent with what browsers do. And perhaps try to 
obsolete the relevant header if possible...


 Isn't this something the HTTP WG should be doing?

Just for the record, there is currently no HTTP WG. The mailing
list of the former HTTP WG still exists. As this was an IETF WG,
and is an IETF mailing list (hosted by W3C), you can easily join
and bring this issue up. Here's the necessary data:

List-Id: ietf-http-wg.w3.org
List-Help: http://www.w3.org/Mail/
List-Subscribe: mailto:[EMAIL PROTECTED]
Resent-Message-Id: [EMAIL PROTECTED]
List-Unsubscribe: mailto:[EMAIL PROTECTED]
Resent-Message-Id: [EMAIL PROTECTED]

Regards,Martin.

I guess so. The HTML WG (W3C, same concept) should be doing a lot of things as
well. That doesn't mean it actually happens...


--
Anne van Kesteren
http://annevankesteren.nl/

Re: Datatype for IRIs in RELAX NG

2006-03-21 Thread Martin Duerst



At 02:08 06/03/20, Elliotte Harold wrote:

I would recommend against using xsd:anyURI for IRIs. A URI is much more 
restrictive than an IRI, and one of the easiest things for a schema 
validator to check about an xsd:anyURI is that it only contains URI-legal 
ASCII characters.


This is indeed one of the easiest things, but it would be TOTALLY
wrong.

http://www.w3.org/TR/xmlschema-2/datatypes.html#anyURI says, among else:

   The mapping from anyURI values to URIs is as defined by the URI reference
   escaping procedure defined in Section 5.4 Locator Attribute of [XML
   Linking Language] (see also Section 8 Character Encoding in URI References
   of [Character Model]). This means that a wide range of internationalized
   resource identifiers can be specified when an anyURI is called for, and
   still be understood as URIs per [RFC 2396], as amended by [RFC 2732],
   where appropriate to identify resources.

If there is confusion in other venues about this issue, please help
to make sure it gets fixed.


Regards,Martin.

Re: Atom syndication schema

2006-03-19 Thread Martin Duerst



At 18:49 06/03/17, Bjoern Hoehrmann wrote:

* Martin Duerst wrote:
When looking with a microscope, you will find some little
differences, because xs:anyURI was described before the IRI
spec (RFC 3987) was approved. These differences are:

1) xs:aryURI also allows spaces and a few other ASCII characters
that are not allowed in URIs nor in IRIs (but the IRI spec has
an escape hatch for such cases).
2) The IRI spec contains many more details than the xs:anyURI
description, in particular also some requirements re.
normalization. However, some of the requirements in this
area of the IRI spec may be lowered or removed in the future
because we have received feedback from implementers that
there are difficulties to implement these.

I agree with Martin that it would be incorrect to use xsd:anyURI here.

Sorry, but I never said that it would be incorrect to use
xsd:anyURI. I personally think that it should be okay to
use xsd:anyURI. The differences are microscopic, and they should
become even smaller, or hopefully go away completely, over time.
It does not make sense to perpetuate minor differences for
something that was and is supposed to be one and the same
thing.

Regards,Martin.

Re: Atom syndication schema

2006-03-17 Thread Martin Duerst



At 00:42 06/03/17, Norman Walsh wrote:
/ Thomas Broyer [EMAIL PROTECTED] was heard to say:
| RFC 3987 says (section 1.2 Applicability):
|For example, XML schema [XMLSchema] has an explicit type
|anyURI that includes IRIs and IRI references. Therefore, IRIs
|and IRI references can be in attributes and elements of type
|anyURI.

| So, actually, it seems that the Atom RNC could say atomUri = xs:anyURI.

Yes, I think that's the case.

Some details (from the primary editor of RFC 3987 :-):
From a mile high viewpoint, and even from much lower,
it is definitely the case. From the viewpoint of intent, it is
also definitely the case.

When looking with a microscope, you will find some little
differences, because xs:anyURI was described before the IRI
spec (RFC 3987) was approved. These differences are:

1) xs:aryURI also allows spaces and a few other ASCII characters
   that are not allowed in URIs nor in IRIs (but the IRI spec has
   an escape hatch for such cases).
2) The IRI spec contains many more details than the xs:anyURI
   description, in particular also some requirements re.
   normalization. However, some of the requirements in this
   area of the IRI spec may be lowered or removed in the future
   because we have received feedback from implementers that
   there are difficulties to implement these.

Regards,Martin.

Re: Atom syndication schema

2006-03-14 Thread Martin Duerst



At 08:42 06/03/15, David Powell wrote:


Not sure if this is a known bug, but I just noticed that the RelaxNG
grammar doesn't accept atomCommonAttributes (eg xml:lang) on the
atom:name and atom:uri and atom:email elements used within
Person constructs.

For atom:uri and atom:email at least, not having xml:lang may
be seen as a feature. While these often contain pieces from one
language or another, they are not really in a language.

Regards,   Martin.

Re: ACE - Atom Common Extensions Namespace

2005-10-03 Thread Martin Duerst



At 16:45 05/10/02, James M Snell wrote:

http://www.w3.org/2005/Atom-extensions works for me... assuming, of 
course, that Those-Who-Officially-Assign-Such-Things go along with it.


Tim and Paul know who to contact.

The original .../ace URI was just a working thing pitched with full 
knowledge that it would likely change to something better.


Good. I wanted to comment that I don't like ACE because the term
is also used in the context of Internationalized Domain Names
(where it stands for ASCII-compatible encoding).

Regards,Martin.

Re: ACE - Atom Common Extensions Namespace

2005-10-03 Thread Martin Duerst



At 07:04 05/10/03, Walter Underwood wrote:

--On October 2, 2005 9:35:28 AM +0200 Anne van Kesteren
[EMAIL PROTECTED] wrote:

 Having a file and folder of the same name is not technically possible.
(Although
 you could emulate the effect of course with some mod_rewrite.)

Namespaces aren't files, only names.

Yes. But the W3C insists on having an actual file there, just
for documentation at least, or ideally for some machine-readable
information (schema,...).

Also, some filesystem implementations do allow a file and a folder
with the same name.

Yes. The W3C server could certainly be tweaked to allow that,
but it would be easier not to have to do that.


Regards,   Martin.

Re: If you want Fat Pings just use Atom!

2005-08-24 Thread Martin Duerst



At 22:58 05/08/23, Antone Roundy wrote:

On Monday, August 22, 2005, at 09:54  PM, A. Pagaltzis wrote:
 For this application, I would do just
 that, in which case, as a bonus, non-UTF-8 streams would get to
 avoid resending the XML preamble over and over and over.

Of course, if you do that, you won't be able to keep signatures for 
entries originally published in an encoding other than the one you've chosen.


Wrong, unfortunately. XML Signature requires to transcode
to UTF-8 before signing, exactly to protect against such
problems.

If one were to want to signal an encoding change mid-stream, how might 
that work with what's been proposed thus far?


You can't change an encoding midstream in an XML entity.
You can use different encodings for different external
entities, though.

Regrads,  Martin.

Re: If you want Fat Pings just use Atom!

2005-08-22 Thread Martin Duerst



At 02:15 05/08/23, A. Pagaltzis wrote:

Using a character which is illegal in XML and can never be part
of a well-formed document as a separator is a clever way to avoid
having to do *any* parsing *whatsoever*. You just scan the stream
for the character and start over when you see it, end of story.
No need to keep state or look for patterns or anything else.

Well, modulo character encoding issues, that is. An FF will
look differently in UTF-16 than in ASCII-based encodings.

Regards,   Martin.

Re: Major backtracking on canonicalization

2005-07-07 Thread Martin Duerst



At 03:12 05/07/08, Paul Hoffman wrote:

We are signing the bits only, not some interpretation of the bits. That 
is true for the xml:base, the xml:lang, the xml:something-else, and so on.


Just a clarification that I may have made previously: XML Canonicalization
(both kinds) convert to UTF-8 from whatever encoding your document or
fragment is, so essentially, we are signing characters, not bits.


Regards,Martin.

Re: Clearing a discuss vote on the Atom format

2005-07-01 Thread Martin Duerst



At 10:26 05/07/01, Paul Hoffman wrote:

To be added near the end of Section 5.1 of atompub-format:

Section 6.5.1 of [W3C.REC-xmldsig-core-20020212] requires support
for Canonical XML. Atom Processors that sign Atom Documents MUST
use Canonical XML.

Hello Paul,

The rest of your changes looked reasonable, but the MUST above
looks too strong to me. What about something like
Atom Processors that sign Atom Documents MUST support the use
of Canonical XML. or even
Atom Processors that sign Atom Documents MUST use Canonical XML
by default; they MAY use other canonicalizations at user request.

The reason for this is to make sure we have interoperability
with a mandatory-to-implement (and default-to-use) canonicalization,
but that we don't disallow other canonicalizations that for one
or the other as of now not yet clear reason may be preferable in
some cases in the future (but in your wording would prohibit
the result to be called Atom at all).

Regards, Martin.

Re: IESG defers discussion of the format document for two weeks

2005-06-24 Thread Martin Duerst



At 03:21 05/06/24, Tim Bray wrote:


On Jun 23, 2005, at 9:12 AM, Eric Scheid wrote:

 If there are problems
 that require repair, I don't want to wait two more weeks to find out
 about them.  This is very disappointing. -Tim



 we can start with these two...

 https://datatracker.ietf.org/public/pidtracker.cgi? 
command=view_commentid= 36536


 https://datatracker.ietf.org/public/pidtracker.cgi? 
command=view_commentid= 36539


Nope, these are non-controversial editorial improvements

Mostly, yes.

which need
not slow us down, they can go into that last draft that we'd have to
do anyhow to fix the errors that have been turned up and put in the
contributors and so on.  Specifically, they want us to make it
clearer that yes, we do have two similar-looking instances of type=
with different rules, and to make it clear that to dereference a IRI
you have to turn it into a URI.  The person who put the defer on
didn't raise these issues.  -Tim

Some care (and some headstart) is nonetheless helpful.
Ted at 
https://datatracker.ietf.org/public/pidtracker.cgi?command=view_commentid=36539

says  This is true for any scheme which does not support IRIs natively.
This is slightly inaccurate; schemes are irrelevant here, it's protocols
that count (which is the word used in most of the rest of the text).


Regards,Martin.

Re: Question on Use Case: Choosing atom:content's type when ArchivingMailing Lists

2005-06-10 Thread Martin Duerst



Hello Andreas,

There was quite some discussion between Keith Moore and me
and a few others about how to serve email in the context of
draft-duerst-archived-at-XX.

See e.g. http://www.imc.org/ietf-822/mail-archive/msg05486.html
and many older threads in the same archive.

The general idea seems to be that conversion to HTML is good for
human consumption with a browser, and for human navigation in an
archive. message/rfc822 is the original, and may be very good for
reuse in a mailer (e.g. for replying), except that mailers don't
support it much at the moment.

atom:content type=message/rfc822 
src=http://www.example.org/lists/list/archive/12345/


looks perfectly okay to me if that's what you want to do.
The average atom software may have difficulties treating
the message in a good way, but special applications could
do very interesting things with it. If the spec prohibits
it, that should maybe be fixed in the long term. To show
that there are good uses for it, and working implementations,
is the best start in that direction.

Regards,Martin.

At 10:19 05/06/09, Andreas Sewe wrote:

I have a question regarding a specific Atom use case: What is the 
preferred way to set up an Atom feed listing the contents of a mailing 
list's archive, given that composite media types (in particular 
message/rfc822) are disallowed for atom:content's type attribute? This 
prevents me from doing something like the following, serving the archived 
mails as-is via HTTP:


atom:content type=message/rfc822 
src=http://www.example.org/lists/list/archive/12345/


But then again there might be something more fundamentally wrong with 
serving mails as message/rfc822 via HTTP than I currently realize -- and 
I should better serve the mails as text/plain or convert them to HTML 
first (which is the way the atom-syntax archive works).


Nevertheless, any advice on this issue would be greatly appreciated.

Regards,

Andreas Sewe

Re: 4.2.7.1 Comparing atom:id

2005-05-23 Thread Martin Duerst



At 16:09 05/05/22, Anne van Kesteren wrote:

Robert Sayre wrote:
 I think the last paragraph of RFC3987, section 5.1 already says that :)
 http://rfc.net/rfc3987.html#s5.1.

That also says that fragment components should be excluded. Is that true 
for Atom?


It says:
   When IRIs are compared to
   select (or avoid) a network action, such as retrieval of a
   representation, fragment components (if any) should be excluded from
   the comparison.

IDs are just identifiers, not used for network action, so this doesn't
apply.

Regards,Martin.

Are we going to refer to that specification and that section from 4.2.7.1 
in a future draft?



--
  Anne van Kesteren
  http://annevankesteren.nl/

Re: Deterministic content models (conflict with Atom Publishing Protocol?)

2005-05-23 Thread Martin Duerst



Hello Thomas,

At 07:34 05/05/22, Thomas Broyer wrote:

I'm sorry to raise this issue back again but...

The Atom Publishing Protocol defines SOAP bindings. This (in my mind) 
means there will be WSDL files over there. WSDL rely on XML Schema which in 
turn are limited to deterministic content models.


Will the APP limit Atom entries in SOAP envelopes content model to fit 
WSDL/XML Schema constraints (actually, the APP will already need to limit 
Atom entries to have a mandatory atom:content I think)?


Or should the Atom Syndication Format use deterministic content models to 
allow XML Schema, thus such uses as WSDL for Web Services?


Are you speaking about potential non-determinism or actual non-determinism?
Is such non-determinism in the core of the Atom format, in potential extensions,
or in payloads (e.g. XHTML,...)?

If there is actual non-determinism now in the Atom core, that would be
a reason for concern. If there is potential non-determinism in the
payloads or extensions, it should be possible to handle that by a more
open content model in XML Schema.

Regards,Martin.

Re: Compulsory feed ID?

2005-05-22 Thread Martin Duerst



At 08:47 05/05/20, Eric Scheid wrote:

On 19/5/05 10:41 PM, Tim Bray [EMAIL PROTECTED] wrote:

 We suggest that there may be WG consensus in favor of changing the
 format spec to make atom:id a required child of atom:feed.  (Text not
 provided, we trust the editors to figure out the correct way to say
 this).  Please indicate agreement or disagreement with this proposition
 in the next day or two.

wth, +1

+1 here too.Regards,   Martin.

Re: PaceOptionalXhtmlDiv

2005-05-16 Thread Martin Duerst

(BI just had another idea:
(B
(BWhy don't we use a totally different element name, rather than div?
(BWhat about e.g. wrapper? This could be pro-forma in the XHTML Namespace,
(Bbut as this isn't handed over to the XHTML rendering engine (as far as
(BI understand), it wouldn't matter. Also, it would be much easier to
(Bmake optional (otherwise we get into the discussion of whether it's
(Ba wrapper div or a real div. Also, it would make things clearer to
(Bpeople who read the source but not the spec. Other names would of
(Bcourse be fine, I don't particularly care about a specific one.
(B
(BSo we would have any of the following:
(B
(BA) No wrapper, none needed, because default namespace set outside
(B
(Batom:summary type="xhtml"
(B xmlns:atom="...Atom NS..."
(B xmlns="...XHTML NS..."
(BThis is one busted-ass document - emyow!/em.
(B/atom:summary
(B
(BB) No wrapper, none needed, because XHTML has prefix
(B
(Batom:summary type="xhtml"
(B xmlns:atom="...Atom NS..."
(B xmlns:html="...XHTML NS..."
(BThis is one busted-ass document - html:emyow!/html:em.
(B/atom:summary
(B
(BC) With wrapper
(B
(Bsummary type="xhtml"
(B xmlns="...Atom NS..."
(Bwrapper xmlns="...XHTML NS..."
(BThis is one busted-ass document - emyow!/em.
(B/wrapper
(B/summary
(B
(BIf there is some support, I'll write a pace.
(B
(BRegards,Martin.
(B
(B
(B
(BAt 01:36 05/05/15, Bill de h$B%b(Bra wrote:
(B
(B -1 to PaceOptionalXhtmlDiv.
(B 
(B If we are dealing with systems where a div wrapper is deemed neccessary, 
(Bthen you want to take steps to miminise people's options.
(B 
(B For example, PaceOptionalXhtmlDiv will help this to occur:
(B 
(B summary type="xhtml"
(B xmlns:atom="...Atom NS..."
(B xmlns="...XHTML NS..."
(BThis is one busted-ass document - emyow!/em.
(B /summary
(B 
(B If you can't trust people not to need the div, then you can't make it 
(Boptional. I unfortunately have a good amount of experience dealing with 
(Bthis kind of thing outside Atompub. The simplest answer is to stop the 
(B'envelope' from using a default namespace (don't bother to debate this with 
(Bme, it's not an imo). We're not doing that with Atom. Failing that, the 
(Bnext thing consideration is to add/enforce a protective scoping barrier 
(Bbetween the envelope and the content. We are doing that with Atom.
(B 
(B [I disagree with Graham btw, and would say xhtml:div is actually fighting 
(Bkluges with kluges, but it works more or less as specified.]

Re: extensions -- Atom NS and unprefixed attributes

2005-05-11 Thread Martin Duerst

Hello Paul,
Many thanks for the citations below, this is extremely clear.
Going back to the original question/pace that you gave a -1,
would that change if in the text IETF Consensus was
replaced with IESG Approval?
Regards, Martin.
At 10:56 05/05/11, Paul Hoffman wrote:

At 4:15 PM +0900 5/10/05, Martin Duerst wrote:
What's the difference between IETF consensus (for which you gave a -1)
and it's up to the IESG (which seems what you think we should let happen)?

 From RFC 2434:

   IESG Approval - New assignments must be approved by the IESG, but
there is no requirement that the request be documented in an
RFC (though the IESG has discretion to request documents or
other supporting materials on a case-by-case basis).

   IETF Consensus - New values are assigned through the IETF
consensus process. Specifically, new assignments are made via
RFCs approved by the IESG. Typically, the IESG will seek
input on prospective assignments from appropriate persons
(e.g., a relevant Working Group if one exists).


--Paul Hoffman, Director
--Internet Mail Consortium

Re: the atom:copyright element

2005-05-10 Thread Martin Duerst

At 01:08 05/05/09, Tim Bray wrote:

On May 7, 2005, at 3:29 PM, Robin Cover wrote:

 The publication of a new Implementation Guideline by the
 Open Archives Initiative (OAI) compels me to suggest once
 again [1], as Norm Walsh [2], Bob Wyman [3], and others have
 done before, that the name 'atom:rights' would be better
 than the (current) element name 'atom:copyright'.

+1
+1 here too.Regards,   Martin.

Re: extensions -- Atom NS and unprefixed attributes

2005-05-10 Thread Martin Duerst

Hello Paul,
What's the difference between IETF consensus (for which you gave a -1)
and it's up to the IESG (which seems what you think we should let happen)?
In my view, IETF consensus is another way of saying the IESG has the
last word. If there is a better way to express this in the spec, then
what would that be?
Or would we say that because the atom namespace appears in an IETF spec,
it's obvious that only the IETF (thus ultimately the IESG) can add stuff,
but we don't have to say so?
Regards,   Martin.
At 11:57 05/05/10, Paul Hoffman wrote:

At 7:22 PM -0700 5/9/05, Tim Bray wrote:
On May 9, 2005, at 6:52 PM, Robert Sayre wrote:


I think we should be clearer that new elements in the Atom namespace
and unprefixed attributes with parent elements in the Atom namespace
can only be added by IETF consensus. Thoughts?

I wonder if there's some standard IETF practice when you define a 
language as regards future change control?

No. Nearly every protocol tries to go its own way. It's a mess.

Generally -1.  This spec defines what Atom 1.0 is, why try to 
micro-manage the future? -Tim

Agree on the -1.

At 10:34 PM -0400 5/9/05, Robert Sayre wrote:
Fair enough. But can just anyone add stuff to the Atom namespace?

If the IESG lets them, yes.

We gotta trust the IESG after the WG shuts down. Fortunately, they have 
earned that trust over time.

--Paul Hoffman, Director
--Internet Mail Consortium

RE: PaceAllowDuplicateIDs

2005-05-08 Thread Martin Duerst

At 00:12 05/05/07, Bob Wyman wrote:
Right. We have abstract feeds and entries and we have concrete feeds
and entries. The abstract feed is the actual stream of entries and updates
to entries as they are created over time. Feed documents are concrete
snapshots of this stream or abstract feed of entries. An abstract entry is
made concrete in entry documents or entry elements. An abstract entry may
change over time and may have one or more concrete instantiations.
Some applications are only interested in being exposed to those
concrete entries that reflect the current or most recent state of the
abstract entries -- these apps would prefer to see no duplicate ids in
concrete feed documents even though these duplicates *will* occur in the
abstract feed. Other applications will require visibility to the entire
stream of changes to abstract entries -- these applications will wish to see
concrete feeds that may contain multiple, differing concrete instantiations
of abstract entries. i.e. they will want the concrete feed to be an accurate
representation of the abstract feed. Two needs, to views...
You say 'some applications' and 'other applications', as if they were on
the same footing. In my view, the 'some applicaitons' (only interested
in latest version) should be the usual case, and the 'other applications'
(interested in more than one version) should be the exception.
Mapping that back to the origin, applications generating feeds that in
one way or another rely on the user getting more than one, or more than
the latest, versions of their entries have made a design error, they
have taken the wrong thing for the 'entry'. If they think that they
have two different kinds of audiences, interested in two different
things, they should publish two feeds. Some people claim that we
need a definition for 'entry' to finish this discussion, but once
we confirm that a feed can only contain one version of an entry with
the same ID, the definition of entry is as clear as we need it to be.
This is just the same as for Web pages. If somebody puts up a Web page
for the current weather, there is nothing in HTTP that will help me
get the past versions of this page. If the publisher thinks that
people may be interested in past weather info, they will make up
separate pages. If we think that it would be valuable to be able
to correlate the entries in both feeds, we should define an
extension for that, not mess around with the basic model.
An extension would be rather easy, we only need two rel values
for links in entries. One rel value could be called permaentry,
the other could be called updatingentry. Maybe a third called
updatingfeed, if there is an updating feed for a single changing
entry. I'm sure there are better names.
The main use I see for documents with multiple entries with the same
ID is archives. Everything else can be handled by the creater doing
the right thing, or by an immediary offering a new feed with versions
of the entry (no guarantee to have all of them, in that case). Even
archives could be handled that way if really needed, but it's difficult
to immagine that everybody will publish an archive feed. We can easily
define an archive top level element, and that problem is solved.
For aggregators, wanting to forward two or more entries with the
same ID for me means that they are simply not doing their job.
Aggregators should aggregate, not just function as GIGO (garbage
in, garbage out) processors.
So it should be clear that I'm -1 on PaceAllowDuplicateIDs.
Regards,Martin.

Re: PaceOptionalFeedLink

2005-05-07 Thread Martin Duerst

At 11:50 05/05/06, Sam Ruby wrote:

Tim Bray wrote:
 +1
 There are people who want to publish feeds without rel=alternate 
links.  I'm against telling people they can't do something they want to do 
without strong reasons, as in loss of interoperability.  I don't see the 
reasons here as strong enough.  -Tim

+1 here, too, since long ago.
FYI: we have an instance proof of this requiring an existing tool to do 
additional work:

   http://www.imc.org/atom-syntax/mail-archive/msg13983.html

Well, yes, but how much work can that possibly be?
Regards,Martin.

Re: Last Call: 'The Atom Syndication Format' to Proposed Standard

2005-05-07 Thread Martin Duerst

At 02:27 05/05/04, Thomas Broyer wrote:

Martin Duerst wrote:
 At 03:33 05/04/29, Alexey Melnikov wrote:
  If the value is text, the content of the Text construct MUST NOT
  contain child elements.  Such text is intended to be presented to
  humans in a readable fashion.  Thus, Atom Processors MAY collapse
  white-space (including line-breaks),
  
  Ok, maybe it is just me, but what does it mean to collapse 
white-space? Does this mean to replace FWS (in RFC 2822 sense) with a 
single space?
 Making this more precise is definitely desirable. But there is also
 an i18n issue: This works fine for languages that use spaces between
 words. It doesn't work for languages that don't have spaces between
 words (Chinese, Japanese, Thai,...). If Text elements are only used
 for short things such as names or titles, that's not a big issue,
 the text in question can just be put on a single line. However,
 when the texts in question are long, it's a serious issue, and
 should be fixed.

My understanding of type=text is that this is just text without any 
formatting.

That's my understanding, too.
Hence, it is not meant to be preformatted text such as text/plain or 
inside an (X)HTML pre.

Yes. But that's exactly where the spacing problems with Chinese/Japanese/Thai
are. There are no such problems for preformatted text, because the line breaking
in the source (as sent) is the same as the line breaking when displayed.
For free-flowing text, however, the line breaks in the source and those in
the display are not (necessarily) the same, and so linebreaks have to be
changed to spaces for Western languages, but to nothing for Chinese/Japanese
(and most possibly to a zero-width non-breaking space for Thai), and the spec
has to say something about this.
Regards,Martin.
This means type=text content is a single paragraph of text. If you 
need paragraphs, lists or any other structural formatting, you have to 
use type=html or type=xhtml with the appropriate content.

I was about to writing a Pace about white-space handling in type=text 
(either using xml:space or an attribute that would have mimic'd the 
white-space CSS property) when I understood/recalled that Text 
Constructs have accessibility in mind (hence their limitation to textual 
contents) and preformatted text is not accessible enough.

--
Thomas Broyer

Re: Last Call: 'The Atom Syndication Format' to Proposed Standard

2005-05-03 Thread Martin Duerst

At 03:33 05/04/29, Alexey Melnikov wrote:
The file can be obtained via
http://www.ietf.org/internet-drafts/draft-ietf-atompub-format-08.txt
 3.1.1.1  Text
If the value is text, the content of the Text construct MUST NOT
contain child elements.  Such text is intended to be presented to
humans in a readable fashion.  Thus, Atom Processors MAY collapse
white-space (including line-breaks),

Ok, maybe it is just me, but what does it mean to collapse white-space? 
Does this mean to replace FWS (in RFC 2822 sense) with a single space?

Making this more precise is definitely desirable. But there is also
an i18n issue: This works fine for languages that use spaces between
words. It doesn't work for languages that don't have spaces between
words (Chinese, Japanese, Thai,...). If Text elements are only used
for short things such as names or titles, that's not a big issue,
the text in question can just be put on a single line. However,
when the texts in question are long, it's a serious issue, and
should be fixed.
  and display the text using
typographic techniques such as justification and proportional fonts.


 4.1.3.3  Processing Model
...
2.  If the value of type is html, the content of atom:content
MUST NOT contain child elements, and SHOULD be suitable for
handling as HTML [HTML].  The HTML markup must be escaped; for

Should the must be changed to MUST here?
Yes, please!
 6.3  Software Processing of Foreign Markup
 
...
When unknown foreign markup is encountered in a Text Contruct or
atom:content element, software SHOULD ignore the markup and process
any text content of foreign elements as though the surrounding markup
were not present.

I reread this paragraph few times and I am still not quite sure what the 
paragraph is trying to say. Is it trying to say if the content of a 
foreign element looks like XML with unrecognized schema - just strip the 
markup and process the text?

Reading this, I got confused because we have both Text Construct
and Text as subtitles. I suggest to change the subtitle Text to
something like Text Construct with type='text' or so. Also, starting
a section with just an example looks weird. Please add an introductory
sentence. Same of course for the parallel subsections.
Regards,Martin.

Re: IRI/URI

2005-04-12 Thread Martin Duerst

At 11:10 05/04/12, Porges wrote:

OK, first-time poster :)

I was just thinking about IRIs recently and thought about a possible
source of ambiguousness. If the URI element can be EITHER an IRI or a
URI, then:

urihttp://example.com/200%25equalsZero/uri

This is both a valid IRI and a valid URI, but if it is considered to
be an IRI it will point to a different location than if it is
considered to be a URI.
Sorry, wrong. No need to worry.
IRI (as URI): http://example.com/200%2525equalsZero
Wrong. Please check section 3.1 of RFC 3987 again.
At the start of Step 2, it says:
For each character in 'ucschar' or 'iprivate',
apply steps 2.1 through 2.3 below. The % character
is neither part of 'ucschar' nor of 'iprivate', so
it doesn't get escaped when converting from an IRI
to an URI.
Regards,Martin.
URI: http://example.com/200%25equalsZero

Reading through the draft, it seems like IRIs might be the only thing
allowed. If so, could this be made clearer - through a renaming of the
element for instance

I'm not sure if this is even a problem, perhaps there's something I've
missed in the stack of protocols sitting out there. Feel free to
castrate me if this is the case, I've gotten myself rather confused
trying to work through this by myself and thought I'd ask some more
knowledgeable people ;)

Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-09 Thread Martin Duerst

At 17:34 05/04/09, Julian Reschke wrote:

Quoting from Andrew Newton's questions relayed by Scott: 
http://www.imc.org/atom-syntax/mail-archive/msg14048.html

 From: Andrew Newton [mailto:[EMAIL PROTECTED] Sent: Tuesday, April 05, 2005 6:25 PM
 To: Scott Hollenbeck
 Cc: [EMAIL PROTECTED]
 Subject: Re: [xml-dir] FW: draft-ietf-atompub-format-07.txt is ready for
 IETF last call

 ...
 2) Section 4.1.3.3 Item 2
The text:
  for example, br as lt;br.
Is this right?  Should it be:
  for example, br as lt;brgt;.

We don't need to escape the  in text content, as far as I can tell.

Correct. From http://www.w3.org/TR/REC-xml/#syntax:
The right angle bracket () MAY be represented using the string gt;,
and MUST, for compatibility, be escaped using either gt; or a character
reference when it appears in the string ]] in content, when that string
is not marking the end of a CDATA section.
I'm slightly surprised to get such a comment from the XML Directorate.
But I guess their main job isn't to check lowly syntax details.
Are you suggesting to escape it anyway for consistency/readability?
Wouldn't be more readable, in my eyes.
I'd like to remind us of another related issue raised a few days ago 
(http://www.imc.org/atom-syntax/mail-archive/msg14045.html). XHTML 
content is currently restricted to XHTML Basic (see 
http://atompub.org/2005/04/04/draft-ietf-atompub-format-07.html#rfc.section.4.1.3.3, 
http://www.w3.org/TR/xhtml-basic/). As far as I can tell, this means *no 
styling*:

- http://www.w3.org/TR/xhtml-basic/#s1.3.1: style attribute not 
supported (but class is), but

I'd propose to go back to XHTML 1.0 Strict instead.
Very good point. A very strong +1.
Regards,Martin.

Re: atom:source interactions with xml:base/xml:lang

2005-04-07 Thread Martin Duerst

+1 to adding these kinds of clarifications and examples to the spec!
Regards,   Martin.
At 08:02 05/04/08, David Powell wrote:


The inheritance of @xml:lang and @xml:base creates a lot of
complexities for an implementation. When they are used in combination
with atom:source, things get particularly bad.

Should we add some notes to the spec to remind people how to correctly
handle base and lang when atom:source-ifying entries?


A couple of the complexities are:

1) An @xml:base attribute declared on an entry might be relative to
the original feed's URI, or to @xml:base attributes on parent
elements. These need to be resolved before they can be copied across
into a second feed.

2) The source feed will have atom:entry as a child of atom:feed, but
the second feed will have atom:source as a child of atom:entry. This
switch in the hierarchy means that if the source feed contains an
@xml:lang tag on its entry, but not on its feed, you need to undeclare
@xml:lang on atom:source. Likewise, any @xml:base on atom:entry need
to be resolved because it can no longer be relative to the base of the
atom:feed element.

Eg:

  atom:feed
atom:titleSource feed/atom:title
...
atom:entry xml:lang=de
  atom:titleGuten Tag/atom:title
  ...
/atom:entry
  atom:feed


needs to be embedded as:

  atom:feed
atom:titleSecond feed/atom:title
...
atom:entry xml:lang=de
  atom:source xml:lang=
atom:titleSource feed/atom:title
...
  /atom:source
  atom:titleGuten Tag/atom:title
  ...
/atom:entry
  atom:feed

--
Dave

Re: PaceFeedIdOrSelf

2005-04-05 Thread Martin Duerst

+1
At 02:59 05/04/05, Antone Roundy wrote:

http://www.intertwingly.net/wiki/pie/PaceFeedIdOrSelf

Re: Why is alternate link a MUST?

2005-04-03 Thread Martin Duerst

Of course I'm also for making an alternate link for a feed a
MAY rather than a MUST.
Regards,   Martin.
At 07:57 05/04/03, David Nesting wrote:

 Why isn't this requirement a may instead of a must? I can see having
 a link with rel=alternate if indeed a alternate version does exist. It
 does not make sense to put in some something misleading if an alternate
 does not exist.

I recently sought out and joined this list precisely because I wanted
to see if this issue had been addressed.  I don't think it's reasonable
to assume there will always be an alternate version of a feed.  If this
remains a MUST, I have no choice but to honor this by using a dummy
value for an alternate page, since I may not have an alternate.

Without any background, it seems to me that the goal here is to require
a link *back* to an HTML page that is presumed to have provided an
alternate link to this Atom resource.  This of course assumes that an
HTML or non-Atom version actually exists, and that resource is independent
of the Atom resource.  (Consider that I may have an HTML version, but
it could be derived from the Atom version using XSLT.  It's not accurate
to consider this an alternate when it's an XML style sheet involved.)

I couldn't find any reference to this issue in the mailing list, aside
from this (thankfully recent) thread.  If it's been addressed before,
could someone point me to the thread in the list archives?

David

--
 == David Nesting WL7RO Fastolfe [EMAIL PROTECTED] http://fastolfe.net/ ==
 fastolfe.net/me/pgp-key A054 47B1 6D4C E97A D882  C41F 3065 57D9 832F AB01

Re: s/url/web/

2005-03-23 Thread Martin Duerst

Hello Dan,
The problem I have with using web is that there is a pars pro
toto (or probably rather the other way) problem here. I.e.
the Web is defined by *all* the resources identified by an URI/IRI,
whereas the element we are trying to name points to just one of
them.
Given all the proposals, my opinion is currently about as follows:
[wrap up and ship:+1]
ref/href: +0.8
uri, iri: +0.5
at, about: +0.2
web, internet: +- 0
url: -0.2   (outdated)
Regards,Martin.
At 03:42 05/03/19, Dan Brickley wrote:

* Bjoern Hoehrmann [EMAIL PROTECTED] [2005-03-18 19:31+0100]
 * Dan Brickley wrote:
  I think the question is which of these is meant by the web:

 I encourage Atom to follow the WebArch REC, let's call it (d),
 http://www.w3.org/TR/webarch/#intro
 [[
 The World Wide Web (WWW, or simply Web) is an information space in which
 the items of interest, referred to as resources, are identified by
 global identifiers called Uniform Resource Identifiers (URI).
 ]]

 Let's ignore that not all IRIs are URIs and let's assume that this is a
 definition of Web, are you saying that this definition is equivalent
 to 'the set of all URIs is the information space Web'? If those are
 not equivalent I am not sure what you are suggesting here.

The definition says, in effect, The Web is an information space in
which we use URIs when identifying things. It does not rule out
other, complementary, identification mechanisms, nor does it equate the
Web with any specific set of entities. It is an over-arching
abstraction, not a set of URI-named things, nor the set of URIs. I guess
it is (and I cringe at the word), a bit like cyberspace. Nobody really
expects there to be real answers to how many things are 'in' cyberspace?,
since that is over-stretching the metaphor. Same with Web, I think.

Dan

Re: IRI references to images

2005-03-22 Thread Martin Duerst

I agree. This allows images to be relative. Please note
that in both cases, fragments are allowed, although I don't
know any image format for which they'd be useful at the moment.
Regards,   Martin.
At 09:38 05/03/19, David Powell wrote:


From draft-06:

 The atom:icon element's content is an IRI [RFC3987] which
 identifies an image which provides iconic visual identification for
 a feed

 The atom:image element's content is an IRI [RFC3987] which
 identifies an image which provides visual identific

I think these should both be IRI references, not IRIs.

--
Dave

Re: Attributes on the xhtml:div wrapper

2005-03-16 Thread Martin Duerst

Mildly put, I was never a big fan of the xhtml:div wrapper.
But if xml:lang is disallowed on the xhtml:div wrapper, this
makes even less sense to me. If Atom processors can handle
(i.e. correctly inherit) xml:lang from atom: elements into
the xhtml: elements as they are required to, I don't see
why they wouldn't be able to handle xml:lang on xhtml:div.
Regards,Martin.
At 14:46 05/03/17, Antone Roundy wrote:

On Wednesday, March 16, 2005, at 03:57  PM, David Powell wrote:
 Should we:

   a) keep the current text.

   b) disallow attributes on the xhtml:div wrapper.

   c) disallow XHTML attributes on the xhtml:div wrapper, but
  allow xml:lang.


 I vote for b).

 Allowing xml:lang on xhtml:div is unnecessary because it can be placed 
on the content construct itself with the same effect.  Allowing attributes 
on a wrapper element which most people would otherwise just throw away adds 
unnecessary complexity and surprise.

I imagine this is what you meant by b), but just to be sure, I'd vote for 
d) disallow attributes other than namespace declarations on the xhtml:div 
wrapper.

Re: Back on type=HTML, going a step farther...

2005-03-07 Thread Martin Duerst

At 08:36 05/03/07, Tim Bray wrote:
I agree also, but for some implementors, HTML is actually easier, if they 
are handing a chunk of bytes to an HTML rendering control, they'd rather 
not reconstruct the syntax from the infoset, they'd rather just take an 
opaque chunk of bytes.  So we really don't have community consensus in 
favor of XHTML, even though it's obviously cleaner. -Tim

I really hope they don't take an opaque chunk of bytes, because if
they do, they'll have no clue about the character encoding.
Regards,Martin.

Re: Back on type=HTML, going a step farther...

2005-03-07 Thread Martin Duerst

At 06:52 05/03/07, Julian Reschke wrote:
Anyway, I recently made a much more modest request that the spec should 
state that XHTML is preferred over HTML (because many recipients will not 
be willing to process tag soup, so in the best case formatting is lost). It 
seems that we can't even get a consensus for that, which is really 
disappointing considering the expectations I had 18 months ago :-(

I made an even more modest request, namely to put XHTML ahead of HTML
in the spec. I still hope we can at least get consensus on that.
Regards,   Martin.

Re: Back on type=HTML, going a step farther...

2005-03-07 Thread Martin Duerst

At 05:37 05/03/08, Julian Reschke wrote:

Tim Bray wrote:
 On Mar 7, 2005, at 10:31 AM, Martin Duerst wrote:

 for some implementors, HTML is actually easier, if they are handing a 
chunk of bytes to an HTML rendering control, they'd rather not reconstruct 
the syntax from the infoset, they'd rather just take an opaque chunk of bytes.

 I really hope they don't take an opaque chunk of bytes, because if
 they do, they'll have no clue about the character encoding.

 Not true; if you can parse the XML to find the atom:content then you 
know the character encoding and can tell your HTML renderer. -Tim

I think Martin was talking about the producer: if all he has is a chunk 
of *bytes*, there's no reliable way top put that into an HTML-typed text 
construct (you'll need characters, not bytes).

It works both ways. If all you get is a chunk of bytes, things won't
work. Of course the Atom processor knows more, but Tim didn't say
that it would pass that info on to the renderer.
Regards,   Martin.

Re: AtomPubIssuesList for 2005/02/22

2005-02-22 Thread Martin Duerst

At 01:44 05/02/23, Paul Hoffman wrote:
This list should still be focused on the format draft. As our document 
editors toil away diligently, we can still offer editorial suggestions. 
This is also a reasonable time to start creating format extensions  and 
talking about them here.

Hello Paul,
Will there be a two-week WG last call for the Atom format
document, as usual with IETF WGs?
Regards,Martin.

Re: link rel=link type=????/

2005-02-22 Thread Martin Duerst

(BAt 04:31 05/02/23, Bob Wyman wrote:
(BAt PubSub, we allow users to subscribe to RSS/Atom entries that contain 
(Buser-specified URIs. (Using a query like: URI:nytimes.com) Users of this 
(Bfeature have often asked us to augment the entries we publish by attaching 
(Bthe URIs we discover to the entries. The idea has been to make it easier 
(Bfor people to build applications that exploit these links and to avoid 
(Bforcing clients to parse the full source entries.
(B
(BSo, we$BCW(Be started adding these $BEM(Binks to URIs$BG(Bto the entries we publish 
(Bon an experimental basis. In order to keep things as simple as possible, 
(Bwhat we$BCW(Be done is simply use the atom link/ element. We$BCS(Be using rel=$BGM(B 
(Bink$BG(Bto indicate that this is just a random link of unknown purpose or 
(Bintent. However, atom requires that a link element have a mime-type. Given 
(Bthat we have no idea what the mime-type for the URI is, what mime-type 
(Bshould we use?
(B
(BAre you talking about the type attribute (as Sam and others have
(Bpointed out, that's no longer required) or the rel attribute.
(BThe text seems to indicate the former, the examples the later.
(B
(BPlease clarify.
(B
(BRegards,Martin.
(B
(B
(B
(BSome examples of links that we$BCW(Be published in the last few minutes follow:
(B
(Blink rel="link" 
(Bhref="http://news.bbc.co.uk/go/click/rss/0.91/public/-/1/hi/world/middle_east/4286129.st"/
(Blink rel="link" href="http://www.nytimes.com/2005/02/22/politics/22memo.html"/
(Blink rel="link" 
(Bhref="http://www.washingtonpost.com/wp-dyn/articles/A43009-2005Feb22.html"/
(Blink rel="link" 
(Bhref="http://www.nytimes.com/2005/02/22/books/22thompson.html"/
(Blink rel="link" href="http://imdb.com/name/nm0588766/"/
(Blink rel="link" 
(Bhref="http://www.denverpost.com/Stories/0,1413,36~78~2723673,00.html"/
(B
(BFor an example feed that contains entries that link to the New York Times, see:
(Bhttp://atom.pubsub.com/32/6c/1bc36c085cdc9ecd23efa73a.xmlhttp://atom.pubsub.com/32/6c/1bc36c085cdc9ecd23efa73a.xml
(B
(BIs there a better $BES(Bel$BG(Bvalue then $BEM(Bink$BG (B What mime-type should we be 
(Busing? Is there a better way to accomplish what we$BCS(Be doing?
(B
(BYour comments would be appreciated. If we$BCW(Be done something completely 
(Binsane here, please be gentle.
(B
(B bob wyman
(B

Re: On PaceXhtmlNamespaceDiv

2005-02-17 Thread Martin Duerst

(BAt 20:44 05/02/17, Bill de h$B".(Bra wrote:
(B 
(B Martin Duerst wrote:
(B  At 19:03 05/02/16, Bill de h$Bec!V(Bra wrote:
(B
(B   The point I'm seeing here is that creating markup using string 
(Bconcats is inherently fragile. No surpise there. Wrt namespaces, fragility 
(Bis eliminated when you stop using  defaults (but there are other 
(Bconsiderations which keep string concat fragile). Use of div covers off the 
(BXHTML case.
(B  Yes, use of div covers that case. But that doesn't mean that if
(B  some people want to use div, everybody has to use div.
(B 
(B It does (I've never said otherwise) - there is a div tax.
(B
(BI'm okay with those paying div tax who think they need a div.
(BI'm not okay with everybody paying div tax for no apparent
(Bgeneral benefit.
(B
(BRegards,Martin.

Re: On PaceXhtmlNamespaceDiv

2005-02-16 Thread Martin Duerst

(BAt 19:03 05/02/16, Bill de h$B%b(Bra wrote:
(B
(B  As long as it's XML and otherwise conformant, I think it's fine.
(B   Probably not. Do you and Julian and Anne and Henri approve?
(B  I don't see how I would want to complain about how you generate
(B  your stuff, as long as the result is following the specs.
(B 
(B The point I'm seeing here is that creating markup using string concats is 
(Binherently fragile. No surpise there. Wrt namespaces, fragility is 
(Beliminated when you stop using  defaults (but there are other 
(Bconsiderations which keep string concat fragile). Use of div covers off the 
(BXHTML case.
(B
(BYes, use of div covers that case. But that doesn't mean that if
(Bsome people want to use div, everybody has to use div.
(B
(BRegards,Martin.

Re: PaceXhtmlNamespaceDiv

2005-02-16 Thread Martin Duerst

(BAt 01:35 05/02/11, Sam Ruby wrote:
(B 
(B Julian Reschke wrote:
(B
(B  Nor am I. The question is what's the best way to enhance the spec. One 
(Balternative suggestion was made by Martin D$BS(Bst in 
(Bhttp://www.imc.org/atom-syntax/mail-archive/msg13531.html:
(B  "Note: It is important to make sure that correct namespace declarations
(B  for XHTML are present. One way to do this is by using an xhtml:div
(B  element as the content of the atom:content element and specifying
(B  the XHTML namespace on that div element. Here are some examples:
(B  ... [use proposed examples]
(B  There are other ways to declare the namespace URI for XHTML content;
(B  this specification does not limit the placement of such declarations
(B  in any way."
(B 
(B My issue with that wording is that it doesn't make it clear whether the 
(Bxhtml:div that is added is to be considered a part of the content or not.
(B
(BFair point.
(B
(B Put another way, how does the consumer know that if a given xhtml:div 
(Belement is part of the content, or was added per the above?
(B 
(B Julian, you previously said "Let's make the spec as clear and simple as 
(Bpossible."  How about this:
(B 
(Bxhtml:div is required.  xhtml:div is not part of the content.
(B 
(B Clear.  Simple.  And difficult to get wrong.
(B
(BHow about the following alternative:
(B
(B xhtml:div is not required. xhtml:div is part of the content.
(B
(BClear. Simple. And difficult to get wrong :-).
(B
(BIf that's not enough, we can add a note, e.g.:
(B
(B Note: A xhtml:div is just an empty wrapper; it will in general
(B   not affect the processing or display of the content.
(B
(B
(BRegards,Martin.

Re: On PaceXhtmlNamespaceDiv (was: Re: Consensus call on last round ofPaces)

2005-02-15 Thread Martin Duerst

[I appologize that this comes late. I was ill last week.]
I'm also still not convinced about this one. It was introduced
with a very good motivation, namely that it would increase the
chance that namespaces would be used correctly. After the changes,
what I understand is left is the following:
Everybody has to use an XHTML div in every atom content of
type XHTML, just to make sure that Sam (any others?) can
realize his dream of atom without namespace prefixes.
This despite the fact that those that want to avoid any and
all namespace prefixes can use a div on their own if they
want. As Sam explained, the div in that case would be part
of the content. That's true, but while adding a div formally
changes the content, it doesn't actually change the content,
because div is just a dummy wrapper, in particular if there
are no other attributes than the default namespace
declaration.
On more general terms, I have observed different choices of
how different namespaces get put together over the last
three if not more years. For the simple case of namespace
B in namespace A, a variety of choices, with a variety of
reasons, has been proposed.
On the outer side (namespace A), the choices range from
foreign namespace stuff can be inserted pretty much anywhere
where it makes sense, just put it in to we have a 'foreignstuff'
element, anything from another namespace has to go in there.
On the inner side (namespace B), the choices again range
from any B element can be used to only a wrapper element
can be used.
As an example, SVG tends to high explicity on both sides,
for the inner side, see included document fragments
(http://www.w3.org/TR/SVG11/conform#ConformingSVGIncludedDocuments),
for the outer side, see foreignObject
(http://www.w3.org/TR/SVG11/extend#ForeignObjectElement).
Most of the motivation for this explicitness in SVG comes from
the fact that without being clear on coordinates/placement/size,
things won't work.
Looking at these cases, it seems to me that what we are doing
with div is really thinly motivated (no need for any special
information such as coordinates), and is also in a way abusing
div, which was created to indicate divisions in HTML documents,
not to serve as a throw-away default namespace holder in foreign
document formats. Actually, instead of div, we could as well
define that we use body, or we could even define that we use
something new like wrapper. Because as it turns out, while this
element is in the XHTML namespace, it's never part of an XHTML
fragment that an XHTML processor would see.
At 05:05 05/02/16, Anne van Kesteren wrote:

Tim Bray wrote:
 PaceXhtmlNamespaceDiv
 This is tough.  There are some people who are really against this and 
who aren't moving.  On the other hand, there are way more who are in favor. 
Reviewing the discussion, the contras are saying that this is sloppy and 
unnecessary and if it solves a problem, that problem really shouldn't be 
there, but they don't seem to be saying it's actively harmful.  But the 
people in favor are arguing that this will reduce errors and improve 
interop.  Also, the Pace was changed in midstream.
 Also, Paul thinks we will get feedback on this from out there in the IETF.

I'm not sure I understand this sentence. Does Paul think that we will
get feedback in the IETF to put it in anyway, so we better already
put it in? Or that we'll get feedback to take that out in the IETF
so we can as well leave it in for the moment? Or what?
Regards,Martin.
 DISPOSITION: Borderline, but accepted.

I'm still not sure how it would improve interop, especially since the 
place the namespace can be defined is optional. I do not think those 
Planet aggregators would handle the following correct for example:

  atom:entry xmlns:atom=... xmlns:b=http://www.w3.org/1999/xhtml;
   atom:content type=XHTML
b:div
 Foo b:br /
 Et cetera.
...

Also, this restriction can be avoided by using |type=application/xml| 
or |type=application/xhtml+xml| I assume? (Although that is probably not 
valid for the TITLE or SUMMARY element (it should be).)


--
  Anne van Kesteren
  http://annevankesteren.nl/

Re: type=HTML

2005-02-08 Thread Martin Duerst

At 23:36 05/02/08, Julian Reschke wrote:
Shouldn't we at least give content producers the hint that producing 
XHTML content is preferred over HTML? (sorry if I'm opening a can of worms here)

I'd be very much in favor of that. The first step is to order the sections
so that XHTML comes first. I have suggested this before. This is very close
to editorial, but can already give some hint.
The second is to add a sentence such as this: Content producers that can 
produce
both XHTML content and HTML content SHOULD produce XHTML content.
Regards,Martin.

Re: PaceXhtmlNamespaceDiv

2005-02-08 Thread Martin Duerst

At 22:59 05/02/08, Julian Reschke wrote:
http://www.intertwingly.net/wiki/pie/PaceXhtmlNamespaceDiv
I have looked at this pace only just very recently, after
following the discussion. So I first want to make sure I
get the current status of this proposal right.
As I currently read it, it does:
1) Require an xhtml:div as the only direct child of content of type XHTML
2) Not limit the placement of namespace declarations in any way above
   and beyond those given in the Namespace spec.
3) Not require any specific namespace prefixes.
If 2) and 3) are correct, I can live with this proposal. Anything
changing 2) or 3), as may have been previously in this Pace, would
get something like a -2 from me. Specs, in particular such fundamental
ones as XML Namespaces, are there to be used as is, not to be tweaked.
As for 1), I could live with it, but the rationale given for it in the
current version says:
Given that even we often forget when writing examples to declare
 the XHTML namespace for XHTML text constructs and content elements,
 it seems likely that people producing actual feeds will forget
 to do so unless the requirement to do so is stated prominently
 and unambiguously.
This doesn't seem to match the proposal, where Namespace declarations
are only used in the examples, but not mentioned in the text.
So while (as said above) I can live with 1), insisting on 1) without
a better rationale doesn't seem to make sense to me.
If the concern expressed in the rationale is important (and I can
agree it is), then addressing this concern can be done in less
constricting ways, i.e. by replacing the requirement for a div
with something like:
Note: It is important to make sure that correct namespace declarations
for XHTML are present. One way to do this is by using an xhtml:div
element as the content of the atom:content element and specifying
the XHTML namespace on that div element. Here are some examples:
... [use proposed examples]
There are other ways to declare the namespace URI for XHTML content;
this specification does not limit the placement of such declarations
in any way.
Regards, Martin.

Re: IRIs

2005-02-01 Thread Martin Duerst

Clarifying some details based on my write-manytimes understanding
of IRIs (RFC 3987).
At 07:15 05/02/01, Robert Sayre wrote:

Graham wrote:

 I don't know much about IRIs. Is it possible to express them as URIs?


My read-the-draft-once understanding is that it is always possible to 
convert an IRI to a URI,

Correct.
 but it may not be possible to convert a URI back to the same IRI it was 
generated from.

Correct.
For example, it is always possible to convert an IRI to ASCII for use in 
HTTP's Request-URI, but it would be better to leave it in Unicode for editing.

Exactly.
There's also a large section of the draft that deals with right-to-left 
IRIs, but I don't understand it.

It's not relevant for a protocol, but it is relevant for input and display
of IRIs that contain Arabic or Hebrew characters.
Regards,Martin.

Re: IRIs (was: Re: Work Queue Rotation #16)

2005-02-01 Thread Martin Duerst

At 08:25 05/02/01, DJWS wrote:

At 22:25 31/01/2005, Graham wrote:
I don't know much about IRIs. Is it possible to express them as URIs?

As far as I understand, and this is my problem : I cannot have a formal 
response on the following.
- there are various way to support a non ASCII IRI (the majority of the 
world needs them - inclunding French language).
- one way is to use Unicode
- another way is to use ToASCII and punycode process (used in IDNs)

There is only one IRI spec (RFC 3987). For backwards compatibility,
it allows the use of punycode for converting domain-name parts
of some URIs. Otherwise, it's just converted to %-encoding
using UTF-8, and then the URI spec (RFC 3986) takes over.
the punycode process :
- is reversible (if it was not, the Domain Name could not be valid).
It is reversible up to equivalent domain names, in particular,
it does not conserve case, if you take stringprep into account.
- is used in IDNs in transcoding the Unicode into ascii for 2LD and above 
level and in adding the prefix xn--.

There is no technical restriction to 2LD and above.
There are some people claiming that it should not be used
for TLDs, but I would disagree. It works for TLDs as well as
for other levels. The difficult problems on the TLD level
are political, but those don't go away with a different
technical solution.
- can obvsiously be used to transcode the TLD of a domain name, but there 
is no specification in RFCs about TLDs being punycoded.

I think rfc 3986 is pretty clear about this, and doesn't
make an exception for TLDs. From http://www.ietf.org/rfc/rfc3986.txt:

   The reg-name syntax allows percent-encoded octets in order to
   represent non-ASCII registered names in a uniform way that is
   independent of the underlying name resolution technology.  Non-ASCII
   characters must first be encoded according to UTF-8 [STD63], and then
   each octet of the corresponding UTF-8 sequence must be percent-
   encoded to be represented as URI characters.  URI producing
   applications must not use percent-encoding in host unless it is used
   to represent a UTF-8 character sequence.  When a non-ASCII registered
   name represents an internationalized domain name intended for
   resolution via the DNS, the name must be transformed to the IDNA
   encoding [RFC3490] prior to name lookup.  URI producers should
   provide these registered names in the IDNA encoding, rather than a
   percent-encoding, if they wish to maximize interoperability with
   legacy URI resolvers.

Yet there are ML.ML domain names already in use (both by Govs agencies 
and in private spaces).

These are just rouge local setups, not conforming to any kind of spec.
I asked the author of the IRI RFC. He reponded to most of my questions. 
But the situation of the TLD is not clear as it is not supported by IE, but 
is supported by plug-in.

In this sense, TLDs are no different from other levels of the
domain name hierarchy.
The case of ATOM is different as not dependent from browsers.
It would be a very bad idea if Atom decided to dereference
URIs or IRIs in a different way from other software.
Also it is likely that in the ATOM definition timeframe the ML.ML 
(multilingual.multilingual) will most probably broadly in use in some areas 
of the world.

It would be good. There is nothing in IRIs or URIs preventing
the use of non-ASCII.non-ASCII domain names (multilingual
in this context is totally inappropriate, although a lot of
people use it that way). It is up to ICANN to decide on new
TLDs, and it's a political problem, not a technical one.
Regards, Martin.

Re: Posted PaceTextRules

2005-02-01 Thread Martin Duerst

At 17:09 05/01/31, Henri Sivonen wrote:

On Jan 31, 2005, at 03:00, Graham wrote:

 Software displaying this text SHOULD remove white-space at the 
beginning and end; collapse white-space between words; and disregard line 
break, tab and other formatting characters. in 3.1.1 (TEXT).

+1 with the minor nit that lone line breaks should be considered 
spaces--not disregarded. Thus: Software displaying this text SHOULD remove 
white-space at the beginning and end; and treat sequences of one or more 
XML white-space characters as a single space.

Except that that's completely inadequate for some Asian languages
(Chinese, Japanese, Thai,...). Point to a spec developed
by experts if you need to rather than trying to reinvent the wheel.
Regards,Martin.

Re: Posted PaceTextRules

2005-02-01 Thread Martin Duerst

At 00:38 05/02/01, Tim Bray wrote:


On Jan 31, 2005, at 5:37 AM, Bjoern Hoehrmann wrote:
 http://www.w3.org/MarkUp/2004/xhtml-faq#xmlspace cites a quite different
 opinion on this matter...

Yes, well that opinion is (a) specific to HTML and (b) wrong.  I'm amazed 
that the W3C allowed that to be published.  -Tim

Would you mind explaining why you think it's wrong?
Regards,Martin.

Re: Questions about -04

2005-02-01 Thread Martin Duerst

Sorry, this was way back, but caught my eye again.
At 11:39 05/01/27, Sam Ruby wrote:

Martin Duerst wrote:
 At 01:51 05/01/26, Asbj n Ulsberg wrote:
  
  On Wed, 12 Jan 2005 16:54:27 -0500, Sam Ruby [EMAIL PROTECTED]
  wrote:
  
   2. Why MUST a feed point to an alternate version. [...]
  
   I'm -1 on removing this restriction.
  
  I thought we came to a sort of consensus that the link should be optional.
  Or was that only for atom:entry? Anyway, I think both of them should be
  optional. That is, I disagree with you, Sam.
 I agree with Asbjoern. Regards,   Martin.

There is consensus that atom:link is not required for atom:entries which
contain content.  That consensus has been reflected in the most recent
drafts.  That is not the question referred to above.
Understood.
There are now, by some counts, ten versions of formats that call
themselves RSS.  Every last one of then has a required channel/link.
Every last one of them.

Relaxing a restriction requires consumers to handle more cases.
How difficult can it possibly be, in this specific case?
My
expectation is that given limited demand for this feature, the more
likely scenario is that consumers will either produce unexpected results
or outright fail for feeds without this data.
What do they need it for in the first place?
Because of this, I would like to request that there be a compelling use
case be found which for feeds for which there can not be a atom:link
defined.
Is there a compelling use case (not it has always been that way,
which doesn't count as an use case) for requiring it?
The use case for not having it is pretty straightforward:
Just a feed.
It's that simple. Feeds can live on their own. They don't
need alternate Web pages, and if they don't have have them,
it's just straightforward design to not require such an
element. Everything else is just cheating ourselves.
Note atom:link is defined as a URI.  While most examples that
we have seen use the HTTP scheme, this is not a requirement.  Given that
this is not a requirement, and given that existing RSS producers have
come out in mass demanding that this restriction be lifted from RSS, I
can only conclude that the burden seems rather low.
So you are saying that
atom:feed
   atom:head
  atom:link rel=alternate
 href=noscheme://nonsense.example.org/this/here/to/keep/Sam/happy
   ...
is better than just leaving that link out? Do we want the above
nonsense to go into the RDF model, and so on? Isn't it much more
straightforward to model the format after actual and potential
realities, rather than to have to cheat?
Regards,Martin.

Re: Feed State [was: Work Queue Rotation #16]

2005-01-31 Thread Martin Duerst

At 08:46 05/02/01, Mark Nottingham wrote:
[[[
x. Managing Feed State

Atom Processors MAY keep state (e.g., metadata in atom:head, entries) 
sourced from Atom Feed Documents and combine them with other Atom Feed 
Documents, in order to facilitate a contiguous view of the contents of the 
feed. The manner in which Atom Feed Documents are combined in order to 
reconstruct a feed (including methods of identifying duplicate entries, 
updating metadata, and dealing with missing entries) is out of the scope of 
this specification, but may be defined by an extension to Atom.
]]]

So, if we drop PaceFeedState, I propose the text above.

Fine with me, except that I'd change the second parenthesis from:
(including methods of identifying duplicate entries, updating metadata,
and dealing with missing entries)
to:
(including methods of updating metadata and dealing with missing entries)
How to identify duplicate entries is clear from the description of the
id attribute, so I don't think it belongs here.
Regards,Martin.

Re: URI canonicalization

2005-01-31 Thread Martin Duerst

I have just looked at the text in question in -05.txt,
and read through the discussion. I'll give my comments
here, but they are not specifically on this mail.
First, for me, the goal of having reproducible id comparison
is most important; this is the basic requirement.
Second, given that there are an amazing number of things
that can be compared differently once you start (the
URI/IRI specs, the scheme-specific specs, and Unicode
give you a lot of rope), character-
by-character comparison is the only way to go.
Third, I don't think that the normalization advice in
3.5.1  Dereferencing Identity Constructs is extremely
important, but I don't mind if it's there.
I have the following actual editing proposals to hopefully
make this part of the spec a bit clearer:
1) switch sections 3.5.1 and 3.5.2 to make clear that
   comparison of IDs is the most important operation.
2) At the start of Comparing Identity Constructs, change the
   sentence Instances of Identity constructs can be compared to
   determine whether an entry or feed is the same as one seen before.
   to To determine whether an entry or feed is the same as one
   seen before, their Identity Constructs are compared.
   This makes it clear that we are talking about here is how
   you do it, rather than here's one way to do it.
3) Change Processors MUST compare Identity constructs on a
   character-by-character basis in a case-sensitive fashion.
   to Processors MUST compare Identity constructs on a
   character-by-character basis. For details, see section
   5.3.1.,  Simple String Comparison, of [RFC3987].
   We may want to add something about case-sensitivity as
   a note, but it should not be in the main text. There are
   way to many other ways that this could go wrong, in particular
   in an Unicode context.
4) Add a sentence saying something like Feeds or Entries
   are identical if their IDs compare identical..
   Seems obvious, but isn't stated anywhere.
5) Add a note saying something like Comparison functions
   provided by many URI classes/implementations make additional
   assumptions about equality that are not true for Identity
   Constructs. Atom processors therefore should use simple
   string functions for comparing Identity Constructs.
   I think such a note could be a good balance to the normalization
   advice.
I understand that in general, we have tried to reduce
implementation advice in the spec as much as possible.
But in my experience, adding such advice or notes is
often a good way to reach better consensus.
Regards,Martin.

At 02:17 05/01/31, Graham wrote:
This controversial text is still in:

Because of the risk of confusion between URIs that would be
equivalent if dereferenced, the following normalization strategy is
strongly encouraged when generating Identity constructs:

o  Provide the scheme in lowercase characters.
o  Provide the host, if any, in lowercase characters.
o  Only perform percent-encoding where it is essential.
o  Use uppercase A-through-F characters when percent-encoding.
o  Prevent dot-segments appearing in paths.
o  For schemes that define a default authority, use an empty
   authority if the default is desired.
o  For schemes that define an empty path to be equivalent to a path
   of /, use /.
o  For schemes that define a port, use an empty port if the default
   is desired.
o  Preserve empty fragment identifiers and queries.
o  Ensure that all portions of the URI are UTF-8 encoded NFC form
   Unicode strings.

For starters its in the Dereferecing section for some reason. Secondly, 
no consensus was reached to include it. Tim shrugged when no proposal 
gained consensus, and included it anyway. Thirdly, the rationale in the 
spec doesn't match any of the even-vaguely-valid ones given on the list. 
Fourthly, none of those rationales were valid. Fifthly, it's micromanaging. 
Of all the things we could go into great detail telling people how to do, 
this doesn't even rate. I've never seen a feed that has any of the problems 
this might solve.

Please delete it.

Graham

Re: PaceIRI status: RFC 3987 and STD 66, RFC 3986, published

2005-01-29 Thread Martin Duerst

Hello Robert,
Thanks for your questions, and for studying IRIs so carefully.
At 09:15 05/01/29, Robert Sayre wrote:
IRIs are a step forward and important to include in the spec, but they 
also worry me. In RFC3987, I read the following:

The approach of defining a new protocol element was chosen instead of
 extending or changing the definition of URIs. This was done in order
 to allow a clear distinction and to avoid incompatibilities with
 existing software.

Yes. That was put in to clearly say that you can't just expect IRIs
to work wherever URIs work without doing anything. Because Atom is
a new protocol, we are in a very good position to do the right thing,
which turns out to be very little.
Do you expect Atom implementors will be using incompatible existing 
software? I think this question should face roughly the same scrutiny that 
PUT/DELETE did.

Yes. If you look at the pace (http://www.intertwingly.net/wiki/pie/PaceIRI,
just updated to take into account the RFC publications, most of this
is described under Impact. Converting an IRI to UTF-8 and doing
%-encoding is a lot easier to program if there is no IRI library
available than to program your own PUT or DELETE requests, in
particular because the former is a completely internal operation,
whereas the later is a network operation. Integrating conversion
to punycode in case there is no IDN support available is a bit
more work, but if you don't do normalization (stringprep,...),
the code footprint should be extremely small. Except for encoding
issues (UTF-8 or UTF-16 vs. UTF-32), you can basically just
copy the code from http://www.rfc-editor.org/rfc/rfc3492.txt.
I'm also worried that the term IRI will cause confusion. After all, the 
catch phrase is not Cool IRIs Don't Change. What can we do minimize confusion?

First, every URI is an IRI, so there is absolutely no need for
any existing URI to change. Second, there is the confusion with
terms. I agree that the term IRI shouldn't be pushed too much,
I expect users e.g. in Japan to just use language like
software x-y-z now accepts URLs/Web addresses with Japanese characters.
But in the spec, we have to make clear that IRIs are allowed, in a way
that implementers actually see it. So that's why I propose to replace
the term URI with the term IRI throughout the spec, but not to change
the attribute names. But I'm open to discussion about this; if you have
some other ideas, please send them to the list.
Regards,Martin.

Re: Issues with draft-ietf-atompub-format-04

2005-01-26 Thread Martin Duerst

At 13:01 05/01/26, Eric Scheid wrote:
It's only clear what's going on when the reader juxtaposes the two sections,
and realises that the concept named 'type' in section [3.1.1] is not the
same concept named 'type' in section [3.5.2]. Without that juxtaposition,
the reader might well never realise that 'type' != 'type' and conflate the
two concepts. Even you made that mistake just now, and you're the editor of
the document. Pity the poor reader.

Looking at it from a usability of specifications p.o.v., it doesn't hurt to
have cross references.
I agree this is a problem. Either we find new names for the attributes
so that each element has a different attribute, or we put pointers
to the other 'type' attribute(s) in each section about a type attribute
(and ideally also a table somewhere that shows all of them).
If we don't do it, confusion will be guaranteed, and we will
know we are the ones to blame.
Regards,Martin.

Re: PaceIRI status

2005-01-25 Thread Martin Duerst

At 17:16 05/01/25, Julian Reschke wrote:
Also; I sypmathize with supporting IRI, but that shouldn't mean it needs 
to replace any usage of URIs

Every URI is an IRI by definition. So all URIs that are in use can be
used with Atom without any problems even if the spec says we use IRIs.
Replacement is definitely not the right term.
(for instance, I also think that the introduction into XMLNS 1.1 was a 
mistake).
Well, I might agree that it was a mistake. Observable behavior for
most implementations (in particular XSLT implementations, where you
most easily can test actual namespace behavior) allows IRIs in
namespaces anyway, so this could just have been a clarification
to the XMLNS 1.0 spec, rather than bumping up the number.
Regards,Martin.

Re: PaceSimpleLanguageTagging status

2005-01-25 Thread Martin Duerst

(BHello Bjoern,
(B
(BFor more details, please see my earlier message at
(Bhttp://www.imc.org/atom-syntax/mail-archive/msg12198.html.
(BPlease comment on the specific points mentioned there.
(B
(BRegards, Martin.
(B
(BAt 16:15 05/01/25, Bjoern Hoehrmann wrote:
(B 
(B * Martin Duerst wrote:
(B   Not yet taken up by the WG, depends on the discussion that comes with
(B this call. Same rules as all the others: there has to be a positive WG
(B consensus that each adds to the base specification.  -Tim
(B  
(B  +1, at least for atom:language inside the header. For elements, well,
(B there _are_ use cases for elements in different languages, so, since it is
(B optional, +1 again.
(B 
(B -1, or better, -2. Inventing things like atom:language when there
(B is xml:lang is just completely useless and superfluous.
(B 
(B Could you clarify how xml:lang solves the problems stated in the Pace?
(B The alternatives to the Pace would seem to be either restrict xml:lang
(B to specific elements or implementations that lose xml:lang information
(B or, in an authoring scenario, do not allow to use it -- i.e., ignoring
(B the problem in the specification. Neither of which is really helped by
(B xml:lang, so your comment seems a bit weird.
(B --
(B Bj$BS(Bn H$BI(Brmann $B%-(B mailto:[EMAIL PROTECTED] $B%-(B http://bjoern.hoehrmann.de
(B Weinh. Str. 22 $B%-(B Telefon: +49(0)621/4309674 $B%-(B http://www.bjoernsworld.de
(B 68309 Mannheim $B%-(B PGP Pub. KeyID: 0xA4357E78 $B%-(B http://www.websitedev.de/

Re: PaceMustBeWellFormed status

2005-01-25 Thread Martin Duerst

At 07:35 05/01/26, Robert Sayre wrote:

Walter Underwood wrote:
 6. Client processing requirements

 Atom feeds served over HTTP MUST be well-formed XML 1.0, as defined in 
Section 2.1 of the XML specification 
http://www.w3.org/TR/REC-xml/#sec-well-formed. Furthermore, the concept 
of XML well-formedness relies on first determining the character encoding 
of the XML document. RFC 3023 defines how to determine the character 
encoding of XML documents served over HTTP.


The first sentence is redundant because all Atom feeds must be 
well-formed. The second sentence is plainly false. The two concepts are 
unrelated.

Could you explain/substantiate your claim that the second sentence
is plainly false? I understand it to be true, and I have implementation
experience with the W3C Markup Validator to back it up.
 5.
 Publishers MUST NOT serve Atom feeds with a media type other than 
application/atom+xml (registered in this Section 8 of document) or one of 
the XML media types defined in RFC 3023 or its successor. In particular, 
text/plain is never an appropriate media type for an Atom feed. When 
retrieving an Atom feed served with a non-XML media type, clients MUST 
reject it as non-well-formed.


We have no business stating this. I will serve Atom feeds as text/plain 
if I want them processed as text documents.

At which time I will claim that it's no longer an Atom feed, it just
looks like one :-). The Atom spec should talk about Atom as Atom;
that somebody might want to look at the Atom document as a text document,
or even as a hex dump, isn't something we should be talking about.
Regards,Martin.

Re: PaceIRI status

2005-01-25 Thread Martin Duerst

At 01:52 05/01/26, Paul Hoffman wrote:

At 9:16 AM +0100 1/25/05, Julian Reschke wrote:
I saw some concerns (with which I agree) that requiring the presence of 
an IDN library is problematic. As far as I can tell, it's likely to be 
ignored by developers of clients that run on somwehat constrained devices.

Just a correction to what Julian wrote: Stringprep may be ignored
or partially ignored, because it requires lots of data, but I don't
think the Unicode-punycode conversion will be ignored.
I would like to hear more from developers whether or not they think this 
is a problem. (I'm asking this wearing my co-chair-looking-for-consensus 
hat, not my author-of-IDN-and-stringprep hat.)

Although I don't have any experience with developing blogging tools,
I have some experience in integrating IDN support into a browser, and
I can report on some other browsers. I hope this breaks the ice for
others to chip in.
1) I have integrated IDN support into libwww, and used it in Amaya
   (that version isn't yet public, but it should be soon).
   Integrating idnkit into libwww was easy (except for the autoconf/
   automake,... files, which I'm leaving to somebody else).
   idnkit of course includes full stringprep.
2) The maker of a widely used browser for Mobile phones has just
   recently announced that they support for internationalized
   domain names at http://www.access.co.jp/english/press/050120.html
   My guess is that they take some shortcuts on stringprep, but
   that's really just a guess. If you really need to know, I may
   be able to ask somebody from that company.
3) Browsers start to integrate blog support. These browsers are at
   the leading edge of development, and usually support IDNs already.
   The examples I know are Opera and Safari. IDNs will just work
   on those browsers, and people will come to expect them to work
   in other contexts.
   I think at least at some stage, Opera didn't implement strinprep
   (or not completely), but I haven't tested lately.
Regards, Martin.

Re: Questions about -04

2005-01-25 Thread Martin Duerst

(BAt 01:51 05/01/26, Asbj$BS(Bn Ulsberg wrote:
(B 
(B On Wed, 12 Jan 2005 16:54:27 -0500, Sam Ruby [EMAIL PROTECTED]
(B wrote:
(B 
(B  2. Why MUST a feed point to an alternate version. [...]
(B 
(B  I'm -1 on removing this restriction.
(B 
(B I thought we came to a sort of consensus that the link should be optional.
(B Or was that only for atom:entry? Anyway, I think both of them should be
(B optional. That is, I disagree with you, Sam.
(B
(BI agree with Asbjoern. Regards,   Martin.

Re: PaceIRI status: RFC 3987 and STD 66, RFC 3986, published

2005-01-25 Thread Martin Duerst

At 09:17 05/01/25, Tim Bray wrote:

If there were no further discussion: It's hard to see how to avoid 
adopting this now that IRIs are standards-track RFC.  -Tim

Some good news:
The IRI spec is now published as RFC 3987 (Proposed Standard,
http://www.ietf.org/rfc/rfc3987.txt).
The update of the URI spec, known as RFC2396bis, is now published
as STD 66, RFC 3986.
Even less reason for not adopting them. Editors, please update
your references. I'll update PaceIRI in a day or two.
Regards,Martin.

Re: AtomAsRDF_PaceAttributesNS

2005-01-24 Thread Martin Duerst

At 06:38 05/01/25, Sam Ruby wrote:

Henry Story wrote:
 We are all working together on the proposal, in an iterative fashion.
 This is very similar to the way one develops  software projects in Agile
 or Extreme programming methodology.  First one starts with a prototype.
 One gets the major pieces in place, and gets general feedback from the 
clients.
 One checks that it works. Then one iteratively works towards a goal of getting
 something that satisfies the clients needs and budget.
 But you are correct to demand some real code. I have branched off the 
particular
 topic in AtomAsRDF_PaceAttributesNS [1], so that those that only want 
to work on
 detailed text, can get going debating and refining that.
 I would be very thankful if someone with more specification experience 
could get
 it into the correct final format.

It is not code that Tim is asking for, but merely words.  The 
specification is but a set of paragraphs, and somebody needs to say insert 
these words there.

As to the words that you have proposed, I have a technical question: if 
you are equating unnamed attributes with attributes in the atom namespace, 
then everybody who looks today for an href attribute would *also* have to 
look for an atom:href attribute.  Multiply this by all of the attributes 
defined in the specification, and IMHO, you have a solution that is *worse* 
than requiring the atom namespace in the first place.  So,

   -1

That's not the way I have understood this. What I thought this to
be is: In Atom, these attributes always are unprefixed. They are
in the Atom namespace just for the purposes of RDF.
 From my perspective, Atom can be one XSLT translation away from RDF/XML, 
and will likely remain such.  We can capture that translation as a 
non-normative appendix to the spec.  We can also look at GRDDL or other 
techniques for making this explicit in the documents.  But what I don't 
want is to make the syntax dramatially worse for people who want to use 
simple XML processing tools.

Agreed. I just saw the text that we would insert into the spec as
a verbal description of one aspect (adding an Atom-namespace prefix)
of that transformation. But I can see that the current text,
Processors should interpret unprefixed attributes in atom namespaced
elements to be in the atom namespace, can easily be interpreted
the way you have done it. I think this can be fixed by changing it
to something like
For the purpose of transformation to or interpretation as RDF,
unprefixed attributes in atom namespaced elements to be interpreted
as being in the atom namespace.
I guess that Henry didn't add the clarification because he saw
the text in the context of an overall AtomAsRDF section, which
would make the context clear enough.
Regards,Martin.

Re: Posted PaceSimpleLanguageTagging

2005-01-17 Thread Martin Duerst

At 17:47 05/01/17, David Powell wrote:
Reading the XML spec, I'm not clear that we're allowed to restrict the
inheritance of xml:lang?

From the spec:

 The intent declared with xml:lang is considered to apply to all
 attributes and content of the element where it is specified, unless
 overridden with an instance of xml:lang on another element within
 that content.
From this text, it is indeed not clear. There is an erratum that
makes this clearer. Please see
http://www.w3.org/XML/xml-V10-3e-errata#E01.
If we allow it to inherit inappropriately, and we restrict it to
certain elements, then this makes the situation even worse, as we'll
have no way of re-declaring or un-declaring xml:lang on elements that
it shouldn't apply to.
I don't understand this. If the spec says that it doesn't apply to
some element, there is no reason to un-declare it in the content,
and of course even less a reason to re-declare it.
Please also note that the current version of the XML spec allows
xml:lang=, i.e. the empty string as a value to say that you don't
have any information about the language. This is handy for cases
like 1) the element is natural language text, but the actual content
is e.g. only math or some other special notation that doesn't
belong to a natural language, and 2) the actual content is
natural language text, but the software doesn't know which
language it is.
As an aside: We should make sure that the Atom spec is very
clear for each element that inherits (e.g. copyright,...)
how to define the absence of such information. As experiece
with xml:lang and other things has shown, having something
like a 'reset' or 'neutral element' is extremely important
every time there is inheritance, in particular for information
that comes from different sources.
I think it is very unlikely that a user would want to include a title,
summary, and content in a different language.
Well, somewhat unlikely, but not impossible. What about e.g. a service
in Japan that takes some English feed and just changes the titles
to Japanese for quicker browsing by a Japanese audience?
If we only allow xml:lang on Text constructs and atom:content, then
this means that the user would typically have to include it on titles,
summaries, and content separately. Would that be ok?
Why have to do that for feeds that are completely or mostly monolingual?
Would it be any better than a dc:language style atom:language?
Yes. It would still be better.
 As for RDF's support of xml:lang, there is a very RDF-specific,
 unfortunate issue that has to be taken into account when
 converting from atom to RDF/XML: RDF/XML at certain points
 cuts the inheritance of xml:lang.

I didn't realize this - where is it?
Ok, here it is: The RDF Graph model doesn't deal with mixed content
(such as typical XHTML), but the data model has a special datatype
that in RDF/XML is indicated by the rdf:parseType=Literal attribute.
The problem is that whenever you use this, RDF/XML requires you to
cut language inheritance.
From
http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/#section-Literals:

For text that may contain markup, use typed literals with type 
rdf:XMLLiteral. If language annotation is required, it must be explicitly 
included as markup, usually by means of an xml:lang attribute. [XHTML] may 
be included within RDF in this way. Sometimes, in this latter case, an 
additional span or div element is needed to carry an xml:lang or lang attribute.


What this means in practice for Atom is that if it were not for this,
you could just use a DTD to declare a default attribute of
rdf:parseType=Literal on things like title.
However, this won't work, because if you say
entry xml:lang='en'
...
titleMy entry title./title
...
/entry
which will look to a parser like
entry xml:lang='en'
...
title rdf:parseType=LiteralMy entry title./title
...
/entry
The XML Literal won't carry the language information.
Even if you put the language information right on title, as so:
entry
...
title  xml:lang='en' rdf:parseType=LiteralMy entry title./title
...
/entry
it still isn't picked up by RDF/XML. What RDF/XML requires you to do
is something like
 entry xml:lang='en'
...
title rdf:parseType=Literalspan xml:lang='en'My entry 
title./span/title
...
/entry
i.e. in order to give the language information to the title in
RDF/XML, you have to introduce a 'dummy' element.
This means that the simple DTD approach won't work, but such
a transform of course can be done by XSLT. It also means that
Henry and friends have some more work to define how Atom
converts/relates to RDF, because they have to define when/how
to introduce dummy elements.
Regards, Martin.

Re: Posted PaceSimpleLanguageTagging

2005-01-17 Thread Martin Duerst

At 05:15 05/01/18, David Powell wrote:
Monday, January 17, 2005, 6:11:22 PM, you wrote:
 Suppose Joi Ito wants to list his name in
 Japanese but still write in English; or the the reverse.

Let's hope he doesn't want to provide a name in more than one language.
Well, I can definitely imagine situations where somebody like him
may want to do that.
It isn't clear what xml:lang should apply to? Does it apply to email
addresses? Aural browsers might use it as a pronunciation hint. So
does a database implementation need an email_lang column in the
table or not?  I think we need to decide on these issues rather than
leave it to implementors to guess.
I think spelling out in the spec which elements take xml:lang
and which ones don't, and which attributes it applies to and
which not, would help. Anybody wants to start a list?
Regards,Martin.

Re: Content datatypes in XSD?

2005-01-17 Thread Martin Duerst

At 04:54 05/01/18, Danny Ayers wrote:

It's been a long time since I looked at XML schemas, but would be
possible to express the content type attribute at all neatly using a
complex type? I imagine the TEXT | HTML | XHTML part would be
straightforward, but doesn't the | pretty/much;anything option screw
things up?
It's not 'pretty much anything', it's a Mime media type.
This can easily be defined with an XML Schema pattern.
The TEXT | HTML | XHTML can also be made part of
that pattern, or can be defined as an enumeration type
and unioned with the media type pattern.
Regards,Martin.
Cheers,
Danny.

5.12.1  type attribute

   atom:content MAY have a type attribute, When present, the value MAY
   be one of TEXT, HTML, or XHTML.  Failing that, it MUST be a
   MIME media type [RFC2045] in which, to use the terminology of Section
   5 of [RFC2045], the top level is a discrete type.  If the type
   attribute is not provided, software MUST behave as though it were
   present with a value of TEXT.

--

http://dannyayers.com

Re: Posted PaceSimpleLanguageTagging

2005-01-17 Thread Martin Duerst

At 05:16 05/01/18, David Powell wrote:


Monday, January 17, 2005, 7:32:48 PM, you wrote:

 There are some fields in Atom which are language-independent or
 neutral and thus it might be useful to explicitly prevent the use of
 xml:lang tags for these elements or simply state that they have no meaning
 if used.

It is the inheritance which I have the biggest problem with.
Why? It seems silly to mark each and every entry (or even worse,
each and every title and content) in a feed as xml:lang=en if
most or all of them are in English. We have other things that
get inherited from the feed to the entries, so I don't see why
this one would be such a problem.
Regards,Martin.

Re: Questions about -04

2005-01-13 Thread Martin Duerst

At 06:54 05/01/13, Sam Ruby wrote:

Norman Walsh wrote:
 2. Why MUST a feed point to an alternate version. What if the feed is all
I publish?

Deja vu:

http://www.imc.org/atom-syntax/mail-archive/msg08836.html

I'm -1 on removing this restriction.
I don't see any clear justification in the above mail thread
for having the restriction. And I think that in a very fundamental
way, it's a mistake; feeds should be architected as independent
technology, not as an adjunct to Web pages.
For example, there is no restriction that every Web page has an
email address (although such an address was on many Web pages
(that was before spam)). Likewise, there is no restriction
that every email or Web page point to a feed. Or that every
chat channel has an associated feed or Web page. Integration
of these communication media is extremely important, but
forcing a link is not the right way to go.
Regards,Martin.

Re: PaceIRI

2005-01-11 Thread Martin Duerst

At 22:33 05/01/11, Danny Ayers wrote:

On Tue, 11 Jan 2005 16:50:56 +0900, Martin Duerst [EMAIL PROTECTED] wrote:

 http://www.intertwingly.net/wiki/pie/PaceIRI

I like it a lot in principle, my only concern is the dependency on an
IDN library (nicely put together Pace, btw).
Thanks!
As noted this might be a
problem on small devices, but I would suspect that a proportion of
big-device implementors might be tempted to skip this step (if they're
anything like me, largely through ignorance of IRIs/IDNs in general).
Is there any way of reducing the impact here? Any deployment
experience from other application domains that can be drawn on?
An IDN library consists of two parts: Nameprep, which is the part
needing the big tables, and punycode encoding, which is the encoding/
compression algorithm that is quite compact. My understanding is that
indeed some implementers are taking shortcuts with nameprep, although
they are definitely not supposed to do so according to the specs.
(One possibility where this would be possible even while strictly
conforming is for IRIs directly input on a mobile device, where
the manufacturer knows what characters can be input and can guarantee
that these don't cause problems with nameprep.)
Regards,Martin.

PaceIRI

2005-01-10 Thread Martin Duerst

I have completed (as far as I'm concerned) PaceIRI.
The proposal is simple, namely to allow IRIs wherever the spec
currently allows URIs. I have given details of what needs to be
changed in the spec for that to happen. If anything is unclear,
I'm glad to help out the editors.
As far as I remember, the main objection on previous discussions
of IRIs was that the IRI spec was not yet approved. But it is
approved now. If there are other objections, please say so.
Regards, Martin.

content for text/ or +xml SHOULD be local? (was: Re: Hash for Links)

2005-01-10 Thread Martin Duerst

At 05:59 05/01/09, Robert Sayre wrote:
http://atompub.org/2004/10/20/draft-ietf-atompub-format-03.html#rfc.section.5.10.2

If the value of type begins with text/ or ends  with +xml, the 
content SHOULD be local; that is to say, no src attribute should be provided.

If you want to suggest language that describes the bad things that could 
happen when people ignore that recommendation, I would be happy to include it.

When I read this, I was somewhat surprised to find a SHOULD.
A should, in IETF terms, is really quite strong. I'd prefer
a wording such as:
In general, content with value of type beginning with text/ or
ending with +xml will be inline; that is to say, no src attribute
is provided.
(and I think inline is the better term; local implies on the user's
machine rather than inside the document)
In both cases, adding something like Such content is usually
short, and in that case, carrying it inline is more efficient.
covers the rationale I know about. There may be other rationales.
Regards, Martin.

Re: PaceIRI

2005-01-10 Thread Martin Duerst

At 16:04 05/01/11, Martin Duerst wrote:

I have completed (as far as I'm concerned) PaceIRI.
Just to help everybody to look at it faster, here is the URI/IRI:
http://www.intertwingly.net/wiki/pie/PaceIRI
Sorry I forgot.
The proposal is simple, namely to allow IRIs wherever the spec
currently allows URIs. I have given details of what needs to be
changed in the spec for that to happen. If anything is unclear,
I'm glad to help out the editors.

As far as I remember, the main objection on previous discussions
of IRIs was that the IRI spec was not yet approved. But it is
approved now. If there are other objections, please say so.


Regards, Martin.

79 matches

Mail list logo