Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread 杨伟哲
Thanks so much! I once visited the lttoolbox repo and read the source code of lt-proc.cc, lt-comp.cc, lt-expand.cc, etc. At that time I was not sure whether it was the code I needed, so I only skimmed it, but I still remember where those files are in the repository. Now I'll look more closel

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread Tino Didriksen
It's somewhere in https://github.com/apertium/lttoolbox - I don't know the exact location. The entrypoint that does tokenization is lt-proc, so start from lt-proc.cc and trace execution to somewhere that does tokenization. That's also a good way to learn the codebase. -- Tino Didriksen On Mon,

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread 杨伟哲
Hi Tino and Flammie, Because of a mistake when I sent my earlier email, I am not sure whether you received it, so I am sending it again now. I hope it reaches you this time. These days, I read the Wikipedia description of tokenization and got a general idea of how it works. I al

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-15 Thread gavinwzmails
___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-04 Thread 杨伟哲
> Yes, my code looks very messy this time. Thank you for pointing out my shortcomings.
>
> I will spend time reading the code in the extension readings, trying to understand the various usages of the syntax in the program, understanding the project flow, and getting familiar with the code s

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread 杨伟哲
> Yes, my code looks very messy this time. Thank you for pointing out my shortcomings.
>
> I will spend time reading the code in the extension readings, trying to understand the various usages of the syntax in the program, understanding the project flow, and getting familiar with the code s

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread 杨伟哲
OK! Thanks a lot for your reply and recommendation. I set up the apertium core and lttoolbox environment and downloaded several dictionaries on my computer the other day. Recently I've been getting familiar with their usage and the meaning of each of the options. I have a certain understandin

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread Tino Didriksen
The code for the challenge works. However, it is very far from idiomatic C++ - it's more akin to C with Classes. ICU causes a little of this, but things like malloc(), #define, and having variables first have no home in C++. And how is one supposed to build the code? Also, mixing I/O is generally a
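As a rough illustration of the idiom points raised in the message above (this is not code from the thread, from the challenge submission, or from lttoolbox; the names `split_on_delimiter` and `kDelimiter` are hypothetical and it assumes a C++17 compiler), a small C++ sketch might use a `constexpr` constant instead of `#define`, a container instead of `malloc()`, variables declared where they are first used, and a function that does no I/O of its own:

```cpp
#include <cstddef>
#include <string>
#include <string_view>
#include <vector>

// A typed constexpr constant takes the place of a #define macro.
constexpr char kDelimiter = ' ';

// Pure function: takes text and returns tokens. It performs no I/O,
// so reading input and printing results stay with the caller.
std::vector<std::string> split_on_delimiter(std::string_view text) {
    std::vector<std::string> tokens;  // owns its memory; no malloc()/free()
    std::size_t start = 0;
    while (start < text.size()) {
        // Variables are declared at the point where they are first needed.
        const std::size_t end = text.find(kDelimiter, start);
        const std::size_t stop =
            (end == std::string_view::npos) ? text.size() : end;
        if (stop > start) {  // skip empty pieces between repeated delimiters
            tokens.emplace_back(text.substr(start, stop - start));
        }
        start = stop + 1;
    }
    return tokens;
}
```

A caller would read a line with `std::getline`, pass it to `split_on_delimiter`, and print the result separately, which keeps the splitting logic testable on its own. Splitting on a single ASCII space is only a stand-in here and is not the Unicode-aware tokenisation the project is about.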

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread Flammie A Pirinen
Hi, I am on holiday this week with low internet availability, so only a few quick points. Firstly, I strongly recommend joining the #apertium IRC channel; I think even non-mentors will have useful clues. For the tokenisation problem I think the main resource is to understand various Unicode technical report

[Apertium-stuff] Willingness to participate in the project

2020-02-26 Thread 杨伟哲
Hi Francis and Flammie, I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project, and I’m currently writing the proposal. I have completed the code challenge listed in the project and put it on Pastebin[2]. However, I’m not quite clear where this project starts. An