On 21 Dec 2010, at 10:42, Francis Tyers <fty...@prompsit.com> wrote:

> Hi!
>
> The problem with this is that there are so many different metadix
> formats that it will be impossible to come up with one that covers  
> them
> all. For example if I remember correctly how the "alt" works is
> different in es-pt and in oc-es. I think it was decided that it was
> desirable to have them functioning differently, or at least would
> require substantial changes in either language pair to get a unified
> format -- changes that without some push (and let's face it, cash) are
> not going to get made.

Nah. Majority rules. There's one dominant flavour of 'alt', that one  
wins; the other can still be transformed in exactly the same way as it  
is currently without anything being lost, so where's the problem?

'alt' and 'v' are relatively easy - they're not much different to  
'r' (though the 'r'-ness would have to be taken into account, which  
can get confusing). That much could be done by a GCI student.

The slightly difficult parts are prm and sa, and I'd really rather see  
those be extended to a templating mechanism, but haven't been able to  
find a way of specifying the parameters that isn't worse to use than  
just copying a modifying pardefs manually.



> On the other hand, implementing compound words gives us the chance to
> strike while the iron is hot! We can make a (fairly innocuous change  
> --
> any language pair that does not have compounding will be unaffected)
> before getting a plethora of different options and thus avoiding the
> metadix problem for another set of issues.
>
> Btw, thinking about metadix I have some probably unpopular ideas,
> thatwould preclude any standardisation. I think that maybe we should  
> not
> have one format, but rather many _codified_ formats depending on the
> language(group). For example how to include a verb would be  
> different in
> Tajik and Dutch, because different things are important. Unnecessary
> examples:
>

See, the difference is that metadix is a generic set of extensions,  
while you describe a set of language specific extensions. Not that  
there's anything wrong with that (other than that it creates a  
situation where existing knowledge of dix files is less transferable,  
which is not a major obstacle) - you're just not talking about  
metadix, don't mix up the issues.

Both of your examples seem to be predicated on magic happening -  
pulling secondary paradigm names from the ether. You'd either need to  
have hard coded values in the transformation process, or a set of  
forward declarations that define the mappings. Go for the latter, that  
would involve less name calling on my part.


> <e lm="aanzitten"><par n="z/itten__vblex" prefix="aan"
> pp="aangezeten"/></e>
>
> Giving:
>
>    <e lm="aanzitten"><i>aanz</i><par n="aanz/itten__vblex_sep"/></e>
>    <e lm="aanzitten"><p><l>z</l><r>aanz</r></p><par
> n="z/itten#_aan__vblex_sep"/><p><l><b/>aan</l><r></r></p></e>
>    <e lm="aanzitten"><p><l>aangezeten</l><r>aanzitten</r></p><par
> n="gesproken__vblex_sep"/></e>
>
> Or in Tajik:
>
> <e lm="харидан"><par n="кард/ан__vblex" stem1="харид" stem2="хар"/>< 
> /e>
>
> Which would give (after transformation) something like:
>
>    <e lm="харидан"><p><l>харид</l><r>харидан<s n="vblex"/></r></p><par
> n="кард/ан__vblex"/></e>
>    <e lm="харидан"><p><l>хар</l><r>харидан<s n="vblex"/><s
> n="prs"/></r></p><par n="к/ард.ан__vblex"/></e>
>    <e lm="харидан"><p><l>нахар</l><r>харидан<s n="vblex"/><s
> n="neg"/><s n="prs"/></r></p><par n="к/ард.ан__vblex"/></e>
>    <e lm="харидан"><p><l>нахарид</l><r>харидан<s n="vblex"/><s
> n="neg"/></r></p><par n="кард/ан__vblex"/></e>
>    <e lm="харидан"><p><l>мехарид</l><r>харидан<s
> n="vblex"/></r></p><par n="ме.кард/ан__vblex"/></e>
>    <e lm="харидан"><p><l>мехар</l><r>харидан<s n="vblex"/><s
> n="pri"/></r></p><par n="к/ард.ан__vblex"/></e>
>    <e lm="харидан"><p><l>намехарид</l><r>харидан<s n="vblex"/><s
> n="neg"/></r></p><par n="кард/ан__vblex"/></e>
>    <e lm="харидан"><p><l>намехар</l><r>харидан<s n="vblex"/><s
> n="neg"/><s n="pri"/></r></p><par n="к/ард.ан__vblex"/></e>
>
> Fran
>
> PS. Wasn't the election to be organised by Unhammer, Pasquale and  
> Nic ?
>
> El dt 21 de 12 de 2010 a les 06:31 +0100, en/na Mikel Forcada va
> escriure:
>> Hi Apertiumers,
>>
>> Before any more patches to the dictionary format are made, a general
>> agreement should be reached. Remember that we have different  
>> dialects of
>> metadix and an unification would be desirable before fiddling anymore
>> with dictionary formats....
>>
>> Mikel L. Forcada
>>
>> P.S. By the way, as the mandate of the current Project Management
>> Committee has long expired and we haven't been able to run a proper
>> election, I understand I could stage a coup d'etat, put on my BDFL  
>> cap,
>> and word the above as a command instead of as an opinion. I'm  
>> tempted...
>> anyone interested in
>>
>> On 12/19/2010 09:57 PM, Francis Tyers wrote:
>>> El dg 19 de 12 de 2010 a les 18:28 +0000, en/na Jimmy O'Regan va
>>> escriure:
>>>> On 19 Dec 2010, at 17:39, Francis Tyers<fty...@prompsit.com>   
>>>> wrote:
>>>>
>>>>
>>>>> It would be nice to get this done before Christmas, are there any
>>>>> comments ?
>>>> It would probably be best to use a character other than '+'. In the
>>>> event of the final part of the compound being analysed as a  
>>>> multword
>>>> with inner inflection, the queue will be attached to the first  
>>>> part of
>>>> the compound. As you're talking about a syntax change anyway, is  
>>>> there
>>>> any reason to not insert the break directly?
>>> I guess we could use '~' and then change pretransfer.cc to output  
>>> $^ for
>>> '~' instead of '$ ^' for '+'...
>>>
>>> Anything else ?
>>>
>>> Fran
>>>
>>>
>>> --- 
>>> --- 
>>> --- 
>>> --- 
>>> ------------------------------------------------------------------
>>> Lotusphere 2011
>>> Register now for Lotusphere 2011 and learn how
>>> to connect the dots, take your collaborative environment
>>> to the next level, and enter the era of Social Business.
>>> http://p.sf.net/sfu/lotusphere-d2d
>>> _______________________________________________
>>> Apertium-stuff mailing list
>>> Apertium-stuff@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>>
>>
>
>
>
> --- 
> --- 
> --- 
> ---------------------------------------------------------------------
> Lotusphere 2011
> Register now for Lotusphere 2011 and learn how
> to connect the dots, take your collaborative environment
> to the next level, and enter the era of Social Business.
> http://p.sf.net/sfu/lotusphere-d2d
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff

------------------------------------------------------------------------------
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to