Hi!

The problem with this is that there are so many different metadix
formats that it will be impossible to come up with one that covers them
all. For example if I remember correctly how the "alt" works is
different in es-pt and in oc-es. I think it was decided that it was
desirable to have them functioning differently, or at least would
require substantial changes in either language pair to get a unified
format -- changes that without some push (and let's face it, cash) are
not going to get made. 

On the other hand, implementing compound words gives us the chance to
strike while the iron is hot! We can make a (fairly innocuous change --
any language pair that does not have compounding will be unaffected)
before getting a plethora of different options and thus avoiding the
metadix problem for another set of issues.

Btw, thinking about metadix I have some probably unpopular ideas,
thatwould preclude any standardisation. I think that maybe we should not
have one format, but rather many _codified_ formats depending on the
language(group). For example how to include a verb would be different in
Tajik and Dutch, because different things are important. Unnecessary
examples:

<e lm="aanzitten"><par n="z/itten__vblex" prefix="aan"
pp="aangezeten"/></e>

Giving:

    <e lm="aanzitten"><i>aanz</i><par n="aanz/itten__vblex_sep"/></e>
    <e lm="aanzitten"><p><l>z</l><r>aanz</r></p><par
n="z/itten#_aan__vblex_sep"/><p><l><b/>aan</l><r></r></p></e>
    <e lm="aanzitten"><p><l>aangezeten</l><r>aanzitten</r></p><par
n="gesproken__vblex_sep"/></e>

Or in Tajik:

<e lm="харидан"><par n="кард/ан__vblex" stem1="харид" stem2="хар"/></e>

Which would give (after transformation) something like:

    <e lm="харидан"><p><l>харид</l><r>харидан<s n="vblex"/></r></p><par
n="кард/ан__vblex"/></e>
    <e lm="харидан"><p><l>хар</l><r>харидан<s n="vblex"/><s
n="prs"/></r></p><par n="к/ард.ан__vblex"/></e>
    <e lm="харидан"><p><l>нахар</l><r>харидан<s n="vblex"/><s
n="neg"/><s n="prs"/></r></p><par n="к/ард.ан__vblex"/></e>
    <e lm="харидан"><p><l>нахарид</l><r>харидан<s n="vblex"/><s
n="neg"/></r></p><par n="кард/ан__vblex"/></e>
    <e lm="харидан"><p><l>мехарид</l><r>харидан<s
n="vblex"/></r></p><par n="ме.кард/ан__vblex"/></e>
    <e lm="харидан"><p><l>мехар</l><r>харидан<s n="vblex"/><s
n="pri"/></r></p><par n="к/ард.ан__vblex"/></e>
    <e lm="харидан"><p><l>намехарид</l><r>харидан<s n="vblex"/><s
n="neg"/></r></p><par n="кард/ан__vblex"/></e>
    <e lm="харидан"><p><l>намехар</l><r>харидан<s n="vblex"/><s
n="neg"/><s n="pri"/></r></p><par n="к/ард.ан__vblex"/></e>

Fran

PS. Wasn't the election to be organised by Unhammer, Pasquale and Nic ?

El dt 21 de 12 de 2010 a les 06:31 +0100, en/na Mikel Forcada va
escriure:
> Hi Apertiumers,
> 
> Before any more patches to the dictionary format are made, a general 
> agreement should be reached. Remember that we have different dialects of 
> metadix and an unification would be desirable before fiddling anymore 
> with dictionary formats....
> 
> Mikel L. Forcada
> 
> P.S. By the way, as the mandate of the current Project Management 
> Committee has long expired and we haven't been able to run a proper 
> election, I understand I could stage a coup d'etat, put on my BDFL cap, 
> and word the above as a command instead of as an opinion. I'm tempted... 
> anyone interested in
> 
> On 12/19/2010 09:57 PM, Francis Tyers wrote:
> > El dg 19 de 12 de 2010 a les 18:28 +0000, en/na Jimmy O'Regan va
> > escriure:
> >> On 19 Dec 2010, at 17:39, Francis Tyers<fty...@prompsit.com>  wrote:
> >>
> >>
> >>> It would be nice to get this done before Christmas, are there any
> >>> comments ?
> >> It would probably be best to use a character other than '+'. In the
> >> event of the final part of the compound being analysed as a multword
> >> with inner inflection, the queue will be attached to the first part of
> >> the compound. As you're talking about a syntax change anyway, is there
> >> any reason to not insert the break directly?
> > I guess we could use '~' and then change pretransfer.cc to output $^ for
> > '~' instead of '$ ^' for '+'...
> >
> > Anything else ?
> >
> > Fran
> >
> >
> > ------------------------------------------------------------------------------
> > Lotusphere 2011
> > Register now for Lotusphere 2011 and learn how
> > to connect the dots, take your collaborative environment
> > to the next level, and enter the era of Social Business.
> > http://p.sf.net/sfu/lotusphere-d2d
> > _______________________________________________
> > Apertium-stuff mailing list
> > Apertium-stuff@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> 
> 



------------------------------------------------------------------------------
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to