Re: [Apertium-stuff] Tagset Standardization

2023-06-07 Thread Francis Tyers via Apertium-stuff
El 2023-06-07 16:19, Daniel Swanson va escriure: Greetings Apertiumers! I've been reminded that derivational morphology exists, which throws a wrench in my desire for full position-independent tags. I've also been reminded that some repos have .udx files which specify a conversion between

Re: [Apertium-stuff] Tagset Standardization

2023-06-07 Thread Daniel Swanson
Greetings Apertiumers! I've been reminded that derivational morphology exists, which throws a wrench in my desire for full position-independent tags. I've also been reminded that some repos have .udx files which specify a conversion between Apertium tags and Universal Dependencies, but as far as

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Daniel Swanson
Yes, most of our tools assume that tags are position independent, but I've come across a handful of languages that treat some tags as position dependent, and I was more hoping to make it official to make it less likely that we run into issues with that. Also, I have an idea for how to make a

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Kevin Brubeck Unhammer
Daniel Swanson čálii: > To be clear, I meant splitting into .  > One of my ideals for the tagset is that every tag be > position-independent, so that the only reason I need to care about > order is because of FST topology (and maybe not even then). Aren't the tags themselves already

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Daniel Swanson
On Tue, Mar 7, 2023 at 6:07 AM Kevin Brubeck Unhammer wrote: > > Daniel Swanson > čálii: > > > Greetings Apertiumers! > > > > This morning I set out to change the Ancient Hebrew analyzer from > > Latin script to Hebrew script (a task I don't wish upon anyone) and in > > the process produced a

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Kevin Brubeck Unhammer
Daniel Swanson čálii: > Greetings Apertiumers! > > This morning I set out to change the Ancient Hebrew analyzer from > Latin script to Hebrew script (a task I don't wish upon anyone) and in > the process produced a search-and-replace tool that understands the > structure of several of our source

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Flammie A Pirinen
Am Mon, Mar 06, 2023 at 03:35:45PM -0500 schrieb Daniel Swanson: > This script could, without too much trouble, be expanded to cover the > rest of our source files, at which point I would like to propose that > we move towards greater standardization of our tagset: >

[Apertium-stuff] Tagset Standardization

2023-03-06 Thread Daniel Swanson
Greetings Apertiumers! This morning I set out to change the Ancient Hebrew analyzer from Latin script to Hebrew script (a task I don't wish upon anyone) and in the process produced a search-and-replace tool that understands the structure of several of our source files: