Re: [Apertium-stuff] New mode for the Apertium Tagger

2013-09-22 Thread Gang Chen
Hi, all, Thanks for the discussions! I've merged the code of branches/apertium-swpost/apertium into trunk/apertium. Now the Apertium PoS tagger supports both models: HMM and LSW. HMM is the default choice, and its usage is the same as before. LSW is the new part, and its usage is also simple:

Re: [Apertium-stuff] New mode for the Apertium Tagger

2013-09-21 Thread Gang Chen
2013/9/21 Francis Tyers fty...@prompsit.com El dv 20 de 09 de 2013 a les 18:41 +0800, en/na Gang Chen va escriure: 2013/9/20 Mikel L. Forcada m...@dlsi.ua.es El 20/09/13 12:18, Francis Tyers escribió: Did he manage to do the training with no TSX file

Re: [Apertium-stuff] New mode for the Apertium Tagger

2013-09-20 Thread Gang Chen
2013/9/20 Mikel L. Forcada m...@dlsi.ua.es El 20/09/13 12:41, Gang Chen escribió: Yes, Mikel, the new tagger could work without rules. However, it is done by modifying the code (for the experiments), and there is no such a option for whether or not to use the rules in the TSX file

Re: [Apertium-stuff] Tagger training sv-da

2013-09-13 Thread Gang Chen
indicate the directory? Yours, Per Tunedal On Fri, Sep 13, 2013, at 2:54, Gang Chen wrote: Hi, 1) Is it possible to make some kind of Wikipedia dump? This tool works fine for extracting the main text from Wikipedia, http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor

Re: [Apertium-stuff] Ask for help on HMM unsupervised training

2013-06-06 Thread Gang Chen
Hi, Philipe, Fran, I think there are two kinds of words that cause the A new ambiguity class error. (1) the first kind are the words like Mar, that make sense in both cases, whether a dot is appended or not: I ran the command: lt-expand apertium-en-es.es.dix | grep -v __REGEXP__ | grep -v :: |

Re: [Apertium-stuff] Coding Challenge for idea Sliding-window part-of-speech tagger

2013-04-24 Thread Gang Chen
Hi,Mikel, Thank you for your guidance! During the last 2 days, I was mainly focused on reading the paper and writing my application. The good news are that I understand the unsupervised traning alroghtm, which I think is indeed the most mathematically heavy part, and that the first draft of

Re: [Apertium-stuff] Coding Challenge for idea Sliding-window part-of-speech tagger

2013-04-21 Thread Gang Chen
. Best wishes, Gang Chen 2013/4/21 Mikel Forcada m...@dlsi.ua.es Hi Gang, your code seems to work correctly, at least in a few tests I have performed. There is only one thing that I didn't like: the program silently exits unless there is one of the options -r/-f. It should give an error

[Apertium-stuff] Coding Challenge for idea Sliding-window part-of-speech tagger

2013-04-20 Thread Gang Chen
hi, Mlforcada, Fran, I am Gang Chen. I have a great interest in the Apertium GSOC-2013 project Sliding-window part-of-speech tagger. After talking to Fran and other mentors these days and viewing the wiki pages in Apertium, I think I have a better understanding of the platform

Re: [Apertium-stuff] GSOC-2013 Hello Apertiumers

2013-04-18 Thread Gang Chen
Hi, Fran. Thanks for replying! Among the ideas listed on the idea page, I am mostly attracted by: 1.Corpus-based lexicalised feature transfer 2.Sliding-window part-of-speech tagger Cool, have you looked into doing the coding challenges for either of those ? I have looked into the the