Re: [Apertium-stuff] Transducer contains initial epsilon loop
Dear All,

This has now been resolved:
https://gitlab.com/zangaphee/CiBantu/-/blob/master/twoc/apertium-nya/apertium-nya.nya.lexc

Your assistance with the process is much appreciated. Ta!

On Sat, May 20, 2023 at 6:59 PM Jonathan Washington wrote:
> This is essentially how spellrelax works in modern language modules.
>
> To see how it's implemented, you can have a look at the spellrelax file and
> Makefile.am for recent language modules, e.g.:
> https://github.com/apertium/apertium-yua
>
> --
> Jonathan

___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
Re: [Apertium-stuff] Transducer contains initial epsilon loop
This is essentially how spellrelax works in modern language modules.

To see how it's implemented, you can have a look at the spellrelax file and Makefile.am for recent language modules, e.g.:
https://github.com/apertium/apertium-yua

--
Jonathan

On Fri, May 19, 2023, 08:55 Zanga Chimombo wrote:
> Thanks for this. I am not sure where to put (which file) the XFST
> rule(s) and the syntax. Are there any examples online that you could
> point me to please?
Re: [Apertium-stuff] Transducer contains initial epsilon loop
Thanks for this. I am not sure which file the XFST rule(s) should go in, or what syntax to use. Are there any examples online that you could point me to, please?

On Sun, May 14, 2023 at 9:00 PM Kevin Brubeck Unhammer wrote:
> > I am looking at this again. Removing the extra tag at the transfer
> > stage seems to be too late down the pipeline (I need the adjective to
> > match the noun which is done by CG). Actually, surely removing the
> > extra tag could be done at the same CG stage?
>
> If you use an xfst rule, that happens on the analyser FST, ie. before CG
> and long before transfer.
Re: [Apertium-stuff] Transducer contains initial epsilon loop
> I am looking at this again. Removing the extra tag at the transfer
> stage seems to be too late down the pipeline (I need the adjective to
> match the noun which is done by CG). Actually, surely removing the
> extra tag could be done at the same CG stage?

If you use an xfst rule, that happens on the analyser FST, i.e. before CG and long before transfer.
Re: [Apertium-stuff] Transducer contains initial epsilon loop
Or should I be using a "tagger definition file" to forbid/enforce such rules relating to tags...? If so, any pointers on how to include this file in my environment? It doesn't seem to be included in my directory...

On Thu, May 4, 2023 at 7:22 AM Zanga Chimombo wrote:
> Alternatively, can CG be restricted to match the last tags in the noun
> and adjective? "" in this case...?
Re: [Apertium-stuff] Transducer contains initial epsilon loop
Alternatively, can CG be restricted to match the last tags in the noun and adjective? "" in this case...?

On Thu, May 4, 2023 at 7:11 AM Zanga Chimombo wrote:
> I am looking at this again. Removing the extra tag at the transfer
> stage seems to be too late down the pipeline (I need the adjective to
> match the noun which is done by CG). Actually, surely removing the
> extra tag could be done at the same CG stage?
> ^timitengo/mtengo$ ^tatiwisi/wisi$
>
> All I need in the example above is for the extra tag "" to be
> removed at CG stage. It could be as simple as a rule to "remove first
> class-tag from a noun that has two class-tags", however, I am only
> seeing examples in *.rlx files where the whole "word" is removed, not
> specific tags within the "word"... Any pointers please?
Re: [Apertium-stuff] Transducer contains initial epsilon loop
I am looking at this again. Removing the extra tag at the transfer stage seems to be too late in the pipeline: I need the adjective to match the noun, and that matching is done by CG. Surely the extra tag could be removed at the same CG stage?

^timitengo/mtengo$ ^tatiwisi/wisi$

All I need in the example above is for the extra tag "" to be removed at the CG stage. It could be as simple as a rule to "remove first class-tag from a noun that has two class-tags". However, the only examples I am seeing in *.rlx files remove the whole "word", not specific tags within the "word"... Any pointers, please?

On Tue, Feb 28, 2023 at 5:19 PM Zanga Chimombo wrote:
> Thanks for the tip. Let me get my head around it.
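For readers following along, the rule described in this message ("remove first class-tag from a noun that has two class-tags") can be illustrated as a plain string transformation on an Apertium-style reading. This is only a sketch of the intended effect, not the xfst rule itself and not how it would be wired into a language module. The tag names `<n>`, `<adj>`, `<cl13>` and `<cl4>` are hypothetical: the actual tags were lost from the archived message, so the example assumes class tags of the form `<clN>`.

```python
import re

# ASSUMPTION: class tags look like <cl1>, <cl13>, ... The real tag names
# were stripped from the archived message, so these are placeholders.
DOUBLE_CLASS_TAG = re.compile(r"<cl\d+>(<cl\d+>)")


def drop_first_class_tag(reading: str) -> str:
    """Drop the first of two adjacent class tags in a reading.

    Readings with zero or one class tag are returned unchanged.
    """
    return DOUBLE_CLASS_TAG.sub(r"\1", reading, count=1)


if __name__ == "__main__":
    # Hypothetical readings modelled on the example in the message.
    print(drop_first_class_tag("^timitengo/mtengo<n><cl13><cl4>$"))
    print(drop_first_class_tag("^tatiwisi/wisi<adj><cl4>$"))
```

With placeholder tags like these, the noun would keep only its last class tag, which is the one the adjective is meant to agree with, while a reading that has a single class tag passes through untouched.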
Re: [Apertium-stuff] Transducer contains initial epsilon loop
Thanks for the tip. Let me get my head around it.

On Tue, Feb 28, 2023 at 5:03 PM Kevin Brubeck Unhammer wrote:
> Hi,
>
> Cf. http://tinodidriksen.com/pisg/OFTC/logs/%23hfst/2023-02-28.log
> perhaps you can make an xfst rule to do the equivalent of
>
> sed 's/\(.*\)/\1/'
>
> ?
Re: [Apertium-stuff] Transducer contains initial epsilon loop
Hi,

Cf. http://tinodidriksen.com/pisg/OFTC/logs/%23hfst/2023-02-28.log
perhaps you can make an xfst rule to do the equivalent of

sed 's/\(.*\)/\1/'

?