[Apertium-stuff] Wiki account registration

2020-03-16 Thread Scoop Gracie
I think we need a better way to register wiki accounts. There is nothing
stopping a spambot from sending an email to Apertium-stuff with a username
and getting a wiki account (because we would think it was a person).
Obviously, we would catch this if it happened rapidly, but in the lead up
to GSoC, doing it once a day or so with different (possibly spoofed)
addresses wouldn't look suspicious, and even after we knew about the
attack, how would we know which future messages were bots?
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Daniel Swanson
A randomly generated password for Katherinew has been sent to
katheri...@nyu.edu. It can be changed on the change password page upon
logging in.


On Tue, Mar 17, 2020 at 12:50 AM Katherine Wang  wrote:

> Hi Daniel,
> 2. You can probably tell from my last name that I am familiar with
> Chinese (but I have limited knowledge) so unfortunately, my skills would
> only lie in French-English.
> 1. If you think I could join your team,
> username: katherine ? or katherinew ? I don't really have a preference,
> thank you very much.
>
> Best,
> Katherine Wang
>
> On Mon, Mar 16, 2020 at 11:34 PM Daniel Swanson <
> awesomeevildu...@gmail.com> wrote:
>
>> Hi Katherine,
>>
>> What would you like as your username?
>>
>> Two things to note about your proposed project:
>> 1. We do pretty much everything on Github rather than SourceForge now, so
>> the link you want is https://github.com/apertium/apertium-fra-eng
>> 
>> 2. We generally prefer to focus on low-resource language pairs where we
>> can reasonably hope to do better than, say Google. Are there any other
>> languages you are familiar with that you could pair with one of those?
>>
>> Daniel
>>
>> On Mon, Mar 16, 2020 at 9:49 PM Katherine Wang 
>> wrote:
>>
>>> Name: Katherine Wang
>>> E-mail address: katheri...@nyu.edu
>>>
>>> Interested in the Following Projects. *Would like to connect with the 
>>> mentor leading the unreleased language pair [fra-eng] please.*
>>>
>>> 1. Adopt an unreleased language pair [fra-eng]
>>> (https://svn.code.sf.net/p/apertium/svn/incubator/apertium-fra-eng/)
>>>
>>> Skills: Basic Machine Learning (Python)
>>>
>>> 2. Learning distributed representations for Apertium modules
>>>


Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Katherine Wang
Hi Daniel,
2. You can probably tell from my last name that I am familiar with
Chinese (but I have limited knowledge) so unfortunately, my skills would
only lie in French-English.
1. If you think I could join your team,
username: katherine ? or katherinew ? I don't really have a preference,
thank you very much.

Best,
Katherine Wang

On Mon, Mar 16, 2020 at 11:34 PM Daniel Swanson 
wrote:

> Hi Katherine,
>
> What would you like as your username?
>
> Two things to note about your proposed project:
> 1. We do pretty much everything on Github rather than SourceForge now, so
> the link you want is https://github.com/apertium/apertium-fra-eng
> 
> 2. We generally prefer to focus on low-resource language pairs where we
> can reasonably hope to do better than, say Google. Are there any other
> languages you are familiar with that you could pair with one of those?
>
> Daniel
>
> On Mon, Mar 16, 2020 at 9:49 PM Katherine Wang  wrote:
>
>> Name: Katherine Wang
>> E-mail address: katheri...@nyu.edu
>>
>> Interested in the Following Projects. *Would like to connect with the mentor 
>> leading the unreleased language pair [fra-eng] please.*
>>
>> 1. Adopt an unreleased language pair [fra-eng]
>> (https://svn.code.sf.net/p/apertium/svn/incubator/apertium-fra-eng/)
>>
>> Skills: Basic Machine Learning (Python)
>>
>> 2. Learning distributed representations for Apertium modules
>>


Re: [Apertium-stuff] Gsoc 2020

2020-03-16 Thread Daniel Swanson
A randomly generated password for Shrey1608 has been sent to
modishrey...@gmail.com. It can be changed on the change password page upon
logging in.


On Tue, Mar 17, 2020 at 12:46 AM Shrey Modi  wrote:

> Hello Daniel
> I am applying for gsoc 2020 and i am working on one of the ideas so can i
> get a wiki account?
> I would like the username shrey1608


[Apertium-stuff] Gsoc 2020

2020-03-16 Thread Shrey Modi
Hello Daniel
I am applying for gsoc 2020 and i am working on one of the ideas so can i
get a wiki account?
I would like the username shrey1608


Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Thank you for your help.

On Tue, Mar 17, 2020 at 9:29 AM Daniel Swanson 
wrote:

> A randomly generated password for Shubham16 has been sent to
> iamsds...@gmail.com. It can be changed on the *change password* page upon
> logging in.
>
>
> On Mon, Mar 16, 2020 at 11:58 PM Shubham Dikshit 
> wrote:
>
>> Hi,
>> I would like my username to be shubham16 or shubham1011
>>
>> On Tue, Mar 17, 2020 at 9:17 AM Daniel Swanson <
>> awesomeevildu...@gmail.com> wrote:
>>
>>> Hi Shubham,
>>>
>>> What would you like your username to be?
>>>
>>> Daniel
>>>
>>> On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit 
>>> wrote:
>>>
>>>> Hi,
>>>> I have applied to Apertium in GSOC 2020 with the title of the project:
>>>> Indian Language Parsing.
>>>> And would like to request a Wiki account.
>>>> Thank you


Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Daniel Swanson
A randomly generated password for Shubham16 has been sent to
iamsds...@gmail.com. It can be changed on the *change password* page upon
logging in.


On Mon, Mar 16, 2020 at 11:58 PM Shubham Dikshit 
wrote:

> Hi,
> I would like my username to be shubham16 or shubham1011
>
> On Tue, Mar 17, 2020 at 9:17 AM Daniel Swanson 
> wrote:
>
>> Hi Shubham,
>>
>> What would you like your username to be?
>>
>> Daniel
>>
>> On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit 
>> wrote:
>>
>>> Hi,
>>> I have applied to Apertium in GSOC 2020 with the title of the project:
>>> Indian Language Parsing.
>>> And would like to request a Wiki account.
>>> Thank you


Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Hi,
I would like my username to be shubham16 or shubham1011

On Tue, Mar 17, 2020 at 9:17 AM Daniel Swanson 
wrote:

> Hi Shubham,
>
> What would you like your username to be?
>
> Daniel
>
> On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit 
> wrote:
>
>> Hi,
>> I have applied to Apertium in GSOC 2020 with the title of the project:
>> Indian Language Parsing.
>> And would like to request a Wiki account.
>> Thank you


Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Daniel Swanson
Hi Shubham,

What would you like your username to be?

Daniel

On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit 
wrote:

> Hi,
> I have applied to Apertium in GSOC 2020 with the title of the project:
> Indian Language Parsing.
> And would like to request a Wiki account.
> Thank you


[Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Hi,
I have applied to Apertium in GSOC 2020 with the title of the project:
Indian Language Parsing.
And would like to request a Wiki account.
Thank you


Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Daniel Swanson
Hi Katherine,

What would you like as your username?

Two things to note about your proposed project:
1. We do pretty much everything on Github rather than SourceForge now, so
the link you want is https://github.com/apertium/apertium-fra-eng
2. We generally prefer to focus on low-resource language pairs where we can
reasonably hope to do better than, say Google. Are there any other
languages you are familiar with that you could pair with one of those?

Daniel

On Mon, Mar 16, 2020 at 9:49 PM Katherine Wang  wrote:

> Name: Katherine Wang
> E-mail address: katheri...@nyu.edu
>
> Interested in the Following Projects. *Would like to connect with the mentor 
> leading the unreleased language pair [fra-eng] please.*
>
> 1. Adopt an unreleased language pair [fra-eng] 
> (https://svn.code.sf.net/p/apertium/svn/incubator/apertium-fra-eng/)
>
> Skills: Basic Machine Learning (Python)
>
> 2. Learning distributed representations for Apertium modules
>


[Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Katherine Wang
Name: Katherine Wang
E-mail address: katheri...@nyu.edu

Interested in the Following Projects. *Would like to connect with the
mentor leading the unreleased language pair [fra-eng] please.*

1. Adopt an unreleased language pair [fra-eng]
(https://svn.code.sf.net/p/apertium/svn/incubator/apertium-fra-eng/)

Skills: Basic Machine Learning (Python)

2. Learning distributed representations for Apertium modules


Re: [Apertium-stuff] Working on the bn-en language pair

2020-03-16 Thread Hèctor Alòs i Font
In my own experience, I wouldn't recommend working on two unreleased pairs
in the same project. Most likely, neither of them will reach the level of
quality required for release. Of course, if there is a mentor who thinks
differently, it is better to follow his/her wise advice ;)
The Hindi-Bengali pair, if I remember correctly, was already the subject of
a GSoC a few years ago. It's a very Apertium-ish pair. It immediately came
to my mind when I read your proposal for a Bengali-English translator. It
would be great to get it released (if possible, in both directions). (The
same would go for Bengali-Assamese, Bengali-Odia or any other
Bengali-Indo-Aryan language.)
Hèctor

On Mon, 16 Mar 2020 at 20:06, Sourabh Raj wrote:

> By looking at some of the previous proposals and their work plan, this is
> a draft for my work plan. I have decided to work on the en-bn and
> bn-hi(bengali - hindi) pairs. Any feedback would be greatly appreciated.
>
> Thanking you,
> Sourabh
>
>
> On Sun, Mar 15, 2020 at 8:29 PM Sevilay Bayatlı 
> wrote:
>
>> You also have to work on the coding challenge here
>> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Adopt_a_language_pair#Coding_challenge
>>
>> best,
>>
>> Sevilay
>>
>> On Sun, Mar 15, 2020 at 5:55 PM Saurabh Rai  wrote:
>>
>>> Hello Sourabh,
>>> For planning a Proposal you can have a look at the previous proposals by
>>> the students.
>>>
>>> http://wiki.apertium.org/wiki/Category:Student_proposals_for_the_Google_Summer_of_Code
>>> And have a look at this page as well.
>>> http://wiki.apertium.org/wiki/Top_tips_for_GSOC_applications
>>>
>>> On Sun, Mar 15, 2020, 8:18 PM Sourabh Raj  wrote:
>>>
>>>> Hi,
>>>>
>>>> I have been working on the English-Bengali pair. How should my work
>>>> plan be for this pair? I have already started with reading the
>>>> recommended wikis, the documentation and have started working on the
>>>> dictionaries.


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread 杨伟哲
Thanks so much!

I once visited the repo of lttoolbox and read the source code of lt-proc.cc,
lt-comp.cc, lt-expand.cc, etc. But at that time, I was not sure whether it
was the code I needed, so I only read it roughly. But I still remember their
location in the repository. Now I'll look more closely and try to find out
the specific code that implements tokenization and where it fits into the
ICU. I think this will help improve my proposal.

Sincerely,

Weizhe

On Mon, Mar 16, 2020 at 11:44 PM Tino Didriksen 
wrote:

> It's somewhere in https://github.com/apertium/lttoolbox - I don't know
> the exact location.
>
> The entrypoint that does tokenization is lt-proc, so start from lt-proc.cc
> and trace execution to somewhere that does tokenization. That's also a good
> way to learn the codebase.
>
> -- Tino Didriksen
>
>
> On Mon, 16 Mar 2020 at 16:00, 杨伟哲  wrote:
>
>> Hi Tino and Flammie,
>>
>> Due to my mistake in sending the email before, I am not sure whether you
>> have received the email I sent, so I'm sending the email to you again now.
>> Hope you can receive it.
>>
>> These days, I read the Wikipedia description of tokenization and got a
>> general idea of how it works. I also learn some ICU syntax every day. In
>> the meantime, I'm also searching for information on how to handle
>> tokenized Unicode vocabularies.
>>
>> Recently I have been reading the "further reading"[1] of my proposed
>> project[2], which is about HFST. The code is a bit hard to understand. But
>> my task is "Update lttoolbox to be fully Unicode compliant with regards to
>> medication to alphabetical symbols". May I know exactly how tokenization
>> is implemented in lttoolbox and the specific code that I'm going to update?
>>
>> Regards,
>>
>> Weizhe
>>
>> [1] https://github.com/hfst/hfst/blob/master/tools/src/hfst-tokenize.cc
>>
>> [2]
>> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
>>
>


Re: [Apertium-stuff] Working on the bn-en language pair

2020-03-16 Thread Sourabh Raj
By looking at some of the previous proposals and their work plan, this is a
draft for my work plan. I have decided to work on the en-bn and
bn-hi(bengali - hindi) pairs. Any feedback would be greatly appreciated.

Thanking you,
Sourabh


On Sun, Mar 15, 2020 at 8:29 PM Sevilay Bayatlı 
wrote:

> You also have to work on the coding challenge here
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Adopt_a_language_pair#Coding_challenge
>
> best,
>
> Sevilay
>
> On Sun, Mar 15, 2020 at 5:55 PM Saurabh Rai  wrote:
>
>> Hello Sourabh,
>> For planning a Proposal you can have a look at the previous proposals by
>> the students.
>>
>> http://wiki.apertium.org/wiki/Category:Student_proposals_for_the_Google_Summer_of_Code
>> And have a look at this page as well.
>> http://wiki.apertium.org/wiki/Top_tips_for_GSOC_applications
>>
>> On Sun, Mar 15, 2020, 8:18 PM Sourabh Raj  wrote:
>>
>>> Hi,
>>>
>>> I have been working on the English-Bengali pair. How should my work plan
>>> be for this pair? I have already started with reading the
>>> recommended wikis, the documentation and have started working on the
>>> dictionaries.


Draft proposalGSOC2020 (1).docx
Description: MS-Word 2007 document


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread Tino Didriksen
It's somewhere in https://github.com/apertium/lttoolbox - I don't know the
exact location.

The entrypoint that does tokenization is lt-proc, so start from lt-proc.cc
and trace execution to somewhere that does tokenization. That's also a good
way to learn the codebase.

-- Tino Didriksen


On Mon, 16 Mar 2020 at 16:00, 杨伟哲  wrote:

> Hi Tino and Flammie,
>
> Due to my mistake in sending the email before, I am not sure whether you
> have received the email I sent, so I'm sending the email to you again now.
> Hope you can receive it.
>
> These days, I read the Wikipedia description of tokenization and got a
> general idea of how it works. I also learn some ICU syntax every day. In
> the meantime, I'm also searching for information on how to handle
> tokenized Unicode vocabularies.
>
> Recently I have been reading the "further reading"[1] of my proposed
> project[2], which is about HFST. The code is a bit hard to understand. But
> my task is "Update lttoolbox to be fully Unicode compliant with regards to
> medication to alphabetical symbols". May I know exactly how tokenization
> is implemented in lttoolbox and the specific code that I'm going to update?
>
> Regards,
>
> Weizhe
>
> [1] https://github.com/hfst/hfst/blob/master/tools/src/hfst-tokenize.cc
>
> [2]
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
>


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-16 Thread 杨伟哲
Hi Tino and Flammie,

Due to my mistake in sending the email before, I am not sure whether you
have received the email I sent, so I'm sending the email to you again now.
Hope you can receive it.

These days, I read the Wikipedia description of tokenization and got a
general idea of how it works. I also learn some ICU syntax every day. In the
meantime, I'm also searching for information on how to handle tokenized
Unicode vocabularies.

Recently I have been reading the "further reading"[1] of my proposed
project[2], which is about HFST. The code is a bit hard to understand. But my
task is "Update lttoolbox to be fully Unicode compliant with regards to
medication to alphabetical symbols". May I know exactly how tokenization is
implemented in lttoolbox and the specific code that I'm going to update?

Regards,

Weizhe

[1] https://github.com/hfst/hfst/blob/master/tools/src/hfst-tokenize.cc

[2]
http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
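
To make the question above concrete, here is a minimal sketch of what
"Unicode-aware" splitting on alphabetic symbols can look like. It is not
lttoolbox code (lttoolbox is C++, and its actual tokenization logic is not
shown anywhere in this thread); the names is_word_char and tokenize are
hypothetical, and treating Unicode general categories L* (letters) and M*
(combining marks) as word characters is an assumption made only for
illustration.

```python
import unicodedata
from typing import List


def is_word_char(ch: str) -> bool:
    # Assumption for this sketch: letters (L*) and combining marks (M*)
    # belong to a word; everything else is a separator or punctuation.
    return unicodedata.category(ch)[0] in ("L", "M")


def tokenize(text: str) -> List[str]:
    # Hypothetical helper: group runs of word characters into tokens,
    # emit each other non-space character as its own token.
    tokens: List[str] = []
    current: List[str] = []
    for ch in text:
        if is_word_char(ch):
            current.append(ch)
            continue
        if current:
            tokens.append("".join(current))
            current = []
        if not ch.isspace():
            tokens.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


print(tokenize("Hello, wörld!"))  # ['Hello', ',', 'wörld', '!']
```

Counting combining marks as word characters keeps a decomposed sequence such
as "o" followed by U+0308 inside one token; a check based only on ASCII
letters would split the word at that point.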

On Thu, Mar 5, 2020 at 12:12 PM 杨伟哲  wrote:

> Yes, my code looks very messy this time. Thank you for pointing out my
> shortcomings.
>
> I will spend time reading the code in the extension readings, trying to
> understand the various usages of the syntax in the program, understanding
> the project flow, and getting familiar with the code style. After that,
> I'll modify my code. Definitely, I will strive to integrate myself into
> apertium as soon as possible.
>
> Many thanks,
>
> Weizhe
>
>
> On Tue, Mar 3, 2020 at 9:33 PM Tino Didriksen 
> wrote:
>
>> The code for the challenge works. However, it is very far from idiomatic
>> C++ - it's more akin to C with Classes. ICU causes a little of this, but
>> things like malloc(), #define, and having variables first have no home in
>> C++. And how is one supposed to build the code? Also, mixing I/O is
>> generally a bad idea. What this says to me is that you've coded a bit of
>> C89 before, but no C99 or C++, and not used a build system.
>>
>> As for what to do next, the wiki pages say what project you're meant to
>> extend, both on the main ideas page and the coding challenge page. You even
>> quoted that part in your mail. So look at that project's code and see if
>> you can understand the flow.
>>
>> -- Tino Didriksen
>>
>>
>> On Thu, 27 Feb 2020 at 06:45, 杨伟哲  wrote:
>>
>>> Hi Francis and Flammie,
>>>
>>> I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project,
>>> and I’m currently writing the proposal.
>>>
>>> I have completed the coding challenge listed in the project, which has
>>> been put on Pastebin[2]. However, I’m not quite clear on where to start
>>> with this project. I would much appreciate it if you could point me
>>> somewhere (e.g. a GitHub repo related to this project) to get started. I
>>> will also try to learn and solve issues there if possible.
>>>
>>> Bio: I’m a Chinese undergraduate in Software Engineering. In my freshman
>>> year, I joined the high-performance computing center[3] of the university
>>> as a research assistant. Through research and learning during that period,
>>> I have gained a deep understanding of software architecture and open
>>> source projects.
>>>
>>>
>>> [1]
>>> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
>>>
>>> [2] https://github.com/GavinWz/Apertium
>>>
>>> [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm
>>>
>>>
>>> Regards,
>>>
>>> Weizhe Yang
>>>


[Apertium-stuff] GSOC - Working on UD Annotatrix

2020-03-16 Thread Jonathan Pan
Hi,

My name is Jonathan Pan. I am interested in working on UD Annotatrix for
GSOC. I have contributed a little bit before, back in GCI 2017, but that
was a while ago.

I've started working on the coding challenges and for the proof of concept
for graphing with d3, I put together a rough example (creating deprels):
https://github.com/JPJPJPOPOP/d3-graph.

I've been trying to find an issue regarding the server version, but I
haven't really been able to find one? Is the server version basically the
client, but with more features?

I already have some tasks that I am interested in working on, but what are
some of the higher priority tasks? I will probably ask more on IRC
regarding the specifics of some of the tasks.

Thanks,
Jonathan Pan