Re: namespace support in POM.xml ?

Bryon Jacob Thu, 22 May 2008 15:00:54 -0700

In particular, could you compare in detail this relaxng method andthe similar xml schema approach using substitution groups andabstract schema types? (note that the substitution group part ofthis is unnecessary but results in easier to read xml)? Afterstudying your blog post I don't see any major differences betweenthe relaxng approach and the xml schema approach beyond thedifferences between relaxng and xmlschema. On the other hand thisis the first time I've seen relaxng.

I haven't written any XSD in a long time, so bear with me a littlebit... and please correct me if there's a better, more concise way tostate things in XSD than I have here, or if there IS a way to do thethings that I claim XSD can't do!

First off, I basically reimplemented the final example from my blogpost using XSD instead of RelaxNG. First, I have a schema for projectfiles:


<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema";
    targetNamespace="http://freedomandbeer.com/project/1.0";
    elementFormDefault="qualified">

<xs:element name="project">
    <xs:complexType>
        <xs:all>
            <xs:element name="name" type="xs:string"/>
            <xs:element name="developers">
                <xs:complexType>
                    <xs:sequence>

<xs:element name="developer" type="xs:string"maxOccurs="unbounded"/>

                    </xs:sequence>
                </xs:complexType>
            </xs:element>
            <xs:element name="extensions">
                <xs:complexType>
                    <xs:sequence>
                        <xs:element name="extension">
                            <xs:complexType>
                                <xs:sequence>

<xs:any minOccurs="0"maxOccurs="unbounded" namespace="##other"/>

                                </xs:sequence>

<xs:attribute name="id"type="xs:string"/>

                            </xs:complexType>
                        </xs:element>
                    </xs:sequence>
                </xs:complexType>
            </xs:element>
        </xs:all>
        <xs:attribute name="id" type="xs:string"/>
    </xs:complexType>
</xs:element>

</xs:schema>

and, I have another schema for my "svn" extension:

<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema";
    targetNamespace="http://freedomandbeer.com/project/ext/svn/1.0";
    elementFormDefault="qualified">

<xs:element name="svnRepository">
    <xs:complexType>
        <xs:sequence>

<xs:element name="url" type="xs:string" minOccurs="1"maxOccurs="1"/>

        </xs:sequence>
    </xs:complexType>
</xs:element>

</xs:schema>

which does correctly validate this XML:

<project
    xmlns="http://freedomandbeer.com/project/1.0";
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance";

xsi:schemaLocation="http://freedomandbeer.com/project/1.0project.xsd

    http://freedomandbeer.com/project/ext/svn/1.0 svnext.xsd"
    id="myproject">
    <name>My Project</name>
    <developers>
        <developer>Bryon</developer>
        <developer>Jasmine</developer>
    </developers>
    <extensions>
        <extension id="source_control">

<svnRepository xmlns="http://freedomandbeer.com/project/ext/svn/1.0"><url>http://svn.freedomandbeer.com/myproject/trunk</url>

            </svnRepository>
        </extension>
    </extensions>
</project>

so, apart from the fact that I think the RelaxNG is MUCH nicer toread, the basic flavor is implementable fairly easily in XSD.

One place where we've found a need to do something that is just notpossible in XSD is where our "core" product uses multiple namespaces,and we want to allow "mixing-in" of things that are OUTSIDE of thosenamespaces -- in XSD, the rule is "One Schema, One Namespace". Wedefine a new namespace every time we version our schema, and add thenew schema elements into the new namespace.

If you look at the line in the project XSD where we use an <xs:any>element to allow for elements outside the schema's namespace to beincluded, you'll see that we declared the namespace to be ##other - inXSD, that means "any element in a namespace other than the one definedby this schema". Let's call this schema version 1. The problem isthat if we were to add another schema that defined a new "version" ofthis schema, and it's own corresponding "version 2" namespace, now the##other from version 1 can include version 2 elements, and any ##otherreferences in version 2 can include version 1 elements, which is notwhat we intend. There is not (to the best of my knowledge) any way to"group" namespaces in a meaningful way to say that an element must ormust not come from one of those...

Another thing that is very easy to do in RelaxNG, and hard to do inXSD, is deal with order and multiplicity in XML docs -- consider apiece of XML like this:


<car>
   <make>Ford</make>
   <model>Mustang</model>
   <year>2007</year>
   <option>Chrome Wheels</option>
   <option>CD Changer</option>
   <option>Sunroof</option>
</car>

but, maybe you'd like to be able to accept this document as well...

<car>
   <make>Ford</make>
   <model>Mustang</model>
   <option>Chrome Wheels</option>
   <option>CD Changer</option>
   <option>Sunroof</option>
   <year>2007</year>
</car>

in RelaxNG, you can say:

start = CAR
MAKE = element make {text}
MODEL = element model {text}
YEAR = element year {text}
OPTION = element option {text}
CAR = element car { MAKE & MODEL & YEAR & OPTION* }

Which means "a car has exactly one each of make, model, and yearelements, and zero or more option elements - all of which can appearin any order". Generally, when your XML document represents an objectgraph, you don't really care what order child elements appear - justwhether they are there or not. There's no reason why the secondversion of the document should be less valid than the first, andadding this openness makes it easier to interoperate with systems thatspeak in the language of your schema.

Trying to get this exact behavior in XSD is hard - there are severaloptions in XSD that almost do what we want, but not quite...

<xs:all> - this allows for all of the sub-elements to appear in anyorder, but each element may only occur 0 or 1 times - not 0 or morelike our option elements.<xs:sequence> - this is the most commonly used - it allows for eachelement to occur any number of times, but the sequence is set, so welose the flexibility to move things around.


The best I can do to truly re-create the RelaxNG schema above in XSD is:

<?xml version="1.0" encoding="UTF-8"?>
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema";
    targetNamespace="http://freedomandbeer.com/cars";>
    <xs:element name="car">
        <xs:complexType>
            <xs:sequence>

<xs:element name="option" type="xs:string"minOccurs="0" maxOccurs="unbounded"/>

                <xs:choice>
                    <xs:sequence>
                        <xs:element name="make" type="xs:string"/>

<xs:element name="option" type="xs:string"minOccurs="0" maxOccurs="unbounded"/>

                        <xs:choice>
                            <xs:sequence>

<xs:element name="model"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="year"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                            <xs:sequence>

<xs:element name="year"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="model"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                        </xs:choice>
                    </xs:sequence>
                    <xs:sequence>
                        <xs:element name="model" type="xs:string"/>

<xs:element name="option" type="xs:string"minOccurs="0" maxOccurs="unbounded"/>

                        <xs:choice>
                            <xs:sequence>

<xs:element name="make"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="year"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                            <xs:sequence>

<xs:element name="year"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="make"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                        </xs:choice>
                    </xs:sequence>
                    <xs:sequence>
                        <xs:element name="year" type="xs:string"/>

<xs:element name="option" type="xs:string"minOccurs="0" maxOccurs="unbounded"/>

                        <xs:choice>
                            <xs:sequence>

<xs:element name="make"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="model"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                            <xs:sequence>

<xs:element name="model"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/><xs:element name="make"type="xs:string"/><xs:element name="option"type="xs:string" minOccurs="0" maxOccurs="unbounded"/>

                            </xs:sequence>
                        </xs:choice>
                    </xs:sequence>
                </xs:choice>
            </xs:sequence>
        </xs:complexType>
    </xs:element>
</xs:schema>

Notice that what we are basically doing is building a DFA that walksvalid documents - because it is a requirement of XSD that what "path"you go down be deterministic. This means that the size of the schemawill actually be exponential in the number of elements in your"arbitrary choice group".

Now, to be fair to XSD, if you simply put a "wrapper" element aroundall of your options, which is a reasonable thing to do in most cases,the schema gets MUCH simpler:


<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema";
    targetNamespace="http://freedomandbeer.com/cars";>

    <xs:element name="car">
        <xs:complexType>
            <xs:all>
                <xs:element name="make" type="xs:string"/>
                <xs:element name="model" type="xs:string"/>
                <xs:element name="year" type="xs:string"/>
                <xs:element name="options" minOccurs="0">
                    <xs:complexType>
                        <xs:sequence>

<xs:element name="option" minOccurs="0"maxOccurs="unbounded"/>

                        </xs:sequence>
                    </xs:complexType>
                </xs:element>
            </xs:all>
        </xs:complexType>
    </xs:element>
</xs:schema>

But, I don't really like my choice of schema language telling me thatI can't do something that is perfectly valid XML... Still, it's areasonable tradeoff to make if you like the tool support that XSDoffers (which is undeniably better than what exists for RelaxNG...)Actually, the reason that came to RelaxNG in the first place wasbecause I had to retroactively design a schema for a legacy systemthat had none - and it did something isomorphic to the "car" examplehere -- except that it had much more than the make/model/year/optionmix to deal with -- the XSD would have had thousands of paths andwould have been totally impossible to maintain.

Anyways - sorry if this has devolved into a generic "why RelaxNG isbetter than XSD" argument... I do think that's very true - but that'snot the intention here. I think that RelaxNG provides a much morerobust way than XSD of dealing with documents that need a high degreeof extensibility -- and it gives you the freedom to define your schemain the most natural way for document authors and consumers tounderstand, without your schema language getting in the way.Additionally, RelaxNG schemas are VERY readable, and provide excellentdocumentation that virtually anyone can quickly grok - I don't findthe same to be true with XSD...

Re: namespace support in POM.xml ?

Reply via email to