Re: [Apertium-stuff] GSoC--Apertium Website Development

2020-03-03 Thread Mohit Kumar Verma
Hi Jonathan,

As I am interested in the project: Apertium Website Improvements,
I want to know about the coding challenges for that project.
On clicking the link, it takes us to the projects page of Apertiu HTML
tools repo
and apertium apy repo. There are a lot of questions as a coding challenge.
Do we need to solve all of them or any one or two?? Also, Do we have to
submit
them before we send the letters or we have to give the link in the letters??

On Mon, Feb 24, 2020 at 8:49 AM Jonathan Washington <
jonathan.n.washing...@gmail.com> wrote:

> Hi Mohit,
>
> As with all GSoC projects, the best way to show your interest in the
> project and your ability to complete it is to start working on it.  In
> this case, this would mean starting by tackling a few open issues in
> apertium-html-tools
> (https://github.com/apertium/apertium-html-tools/issues) and
> apertium-apy (https://github.com/apertium/apertium-apy/issues).
>
> --
> Jonathan
>
> вс, 23 февр. 2020 г. в 18:49, Mohit Kumar Verma :
> >
> > Hi,
> >
> > My name is Mohit Kumar Verma currently studying in NIT Hamirpur,
> > Himachal Pradesh, India. I would like to work with you in further
> > developing the Apertium Website and adding new features. By browsing the
> > website, it seems that in the past someone started the work but left it
> > unfinished. I would like to continue the work and make the website such
> > that it is appealing to the eyes and gets the work done in minimum data
> > consumption.
> >
> > Thank You.
> > ___
> > Apertium-stuff mailing list
> > Apertium-stuff@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] [Question] GSoC20 || Alternative format for XML files in Apertium

2020-03-03 Thread Amr Mohamed Hosny Anwar
Hi Mohammad,

Each task has a coding challenge that can be found by clicking on the "read 
more..." link at the end of the description 
http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Plain-text_formats_for_Apertium_data

I believe you will need to understand the format that is currently used for the 
language resources files like monolingual and bilingual dictionaries.
You should start by working on the coding challenge and asking for feedback 
then things will get clearer inshallah.

For the IRC channel, try to login whenever you are online.
Good luck with the project and don't forget to say hi on the IRC (AMR-KELEG).

All the best,
Amr


On Feb 29, 2020 12:02 PM, Mohammad Ahmad  wrote:
Hello Apertium team.

My name is Mohammad. I am an Egyptian student enrolled at Alexandria 
University, Faculty of Engineering.

I am interested in joining your GSoC internship program, specifically the 
project "Light alternative format for all XML files in an Apertium language 
pair".
After reading the project description i got the idea that it is just two 
text-parsing scripts (convert XML to another format and vice versa).
However the project difficulty is marked "Hard", did i misunderstand the task?
Can you please elaborate on what am i supposed to do? and if possible how can i 
start preparing myself for the project?

I joined the IRC channel under the nickname (Leviro). However, this is my first 
time using it and i won't be able to check it regularly.
I would prefer if you contact me through E-Mail. 
(mohammadahmad01...@gmail.com)

Thanks in advance. Regards,
Mohammad.

___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread 杨伟哲
>
> Yes, my code looks very messy this time. Thank you for pointing out my
> shortcomings.
>
> I will spend time reading the code in the extension readings, trying to
> understand the various usages of the syntax in the program, understanding
> the project flow, and getting familiar with the code style. After that,
> I'll modify

my code. Definitely, I will strive to integrate myself into apretium as
> soon as

possible.
>
> Many thanks,
>
> Weizhe
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread 杨伟哲
OK! Thanks a lot for your reply and recommendation.

I configured the apertium core and litoolbox environment and downloaded
several dictionaries on my computer the other day. Recently I've been
getting familiar with their usage and the meaning of each of the options.

I have a certain understanding of the composition of Unicode code, and now
I am also studying the grammar of ICU and making some progress.

As for IRC, I will always keep an eye on the communication on the channel.

Best regards,

--Weizhe

On Tue, Mar 3, 2020 at 9:10 PM Flammie A Pirinen  wrote:

> Hi,
>
> I am this week on hliday with low internet availability so only few
> quick points. Firstly I strogly recommend joining #apertium IRC channel,
> I think even non-mentors will have useful clues. For the tokenisation
> problem I think the main resource is to understand various unicode
> technical reports that describe tokenisations and a C++ library like
> ICU, and then how apertium currently does tokenisations and how this
> projects code will interact, especially for the last point many other
> people in IRC know it better  than me.
>
> Regards,
>
> On Thu, Feb 27, 2020 at 01:45:09PM +0800, 杨伟哲 wrote:
> > Hi Francis and Flammie,
> >
> > I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project.
> > And
> > currently I’m writing the proposal.
> >
> > I have completed the code challenge listed in the project, which has been
> > put
> > on Pastebin[2]. However, I’m not quite clear where this project starting
> > with.
> > And I will be much appreciate if you could list somewhere (e.g. GitHub
> repo
> > related to this project) for me to get started with. I will also try to
> > learn
> > and solve issues there if possible.
> >
> > Bio: I’m Chinese undergraduate in Software Engineering. In my freshman
> > year, I
> > joined the high-performance computing center[3] of the university as a
> > research
> > assistant. Through research and learning during the period, I have a deep
> > understanding of software architecture and open source projects.
> >
> >
> > [1]
> >
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
> >
> > [2] https://github.com/GavinWz/Apertium
> >
> > [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm
> >
> >
> > Regards,
> >
> > Weizhe Yang
>
>
> > ___
> > Apertium-stuff mailing list
> > Apertium-stuff@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
> --
> Regards, Flammie 
> (Please note, that I will often include my replies inline instead of
> top or bottom of the mail)
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread Tino Didriksen
The code for the challenge works. However, it is very far from idiomatic
C++ - it's more akin to C with Classes. ICU causes a little of this, but
things like malloc(), #define, and having variables first have no home in
C++. And how is one supposed to build the code? Also, mixing I/O is
generally a bad idea. What this says to me is that you've coded a bit of
C89 before, but no C99 or C++, and not used a build system.

As for what to do next, the wiki pages say what project you're meant to
extend, both on the main ideas page and the coding challenge page. You even
quoted that part in your mail. So look at that project's code and see if
you can understand the flow.

-- Tino Didriksen


On Thu, 27 Feb 2020 at 06:45, 杨伟哲  wrote:

> Hi Francis and Flammie,
>
> I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project.
> And
> currently I’m writing the proposal.
>
> I have completed the code challenge listed in the project, which has been
> put
> on Pastebin[2]. However, I’m not quite clear where this project starting
> with.
> And I will be much appreciate if you could list somewhere (e.g. GitHub repo
> related to this project) for me to get started with. I will also try to
> learn
> and solve issues there if possible.
>
> Bio: I’m Chinese undergraduate in Software Engineering. In my freshman
> year, I
> joined the high-performance computing center[3] of the university as a
> research
> assistant. Through research and learning during the period, I have a deep
> understanding of software architecture and open source projects.
>
>
> [1]
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
>
> [2] https://github.com/GavinWz/Apertium
>
> [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm
>
>
> Regards,
>
> Weizhe Yang
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread Flammie A Pirinen
Hi,

I am this week on hliday with low internet availability so only few
quick points. Firstly I strogly recommend joining #apertium IRC channel,
I think even non-mentors will have useful clues. For the tokenisation
problem I think the main resource is to understand various unicode
technical reports that describe tokenisations and a C++ library like
ICU, and then how apertium currently does tokenisations and how this
projects code will interact, especially for the last point many other
people in IRC know it better  than me.

Regards,

On Thu, Feb 27, 2020 at 01:45:09PM +0800, 杨伟哲 wrote:
> Hi Francis and Flammie,
> 
> I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project.
> And
> currently I’m writing the proposal.
> 
> I have completed the code challenge listed in the project, which has been
> put
> on Pastebin[2]. However, I’m not quite clear where this project starting
> with.
> And I will be much appreciate if you could list somewhere (e.g. GitHub repo
> related to this project) for me to get started with. I will also try to
> learn
> and solve issues there if possible.
> 
> Bio: I’m Chinese undergraduate in Software Engineering. In my freshman
> year, I
> joined the high-performance computing center[3] of the university as a
> research
> assistant. Through research and learning during the period, I have a deep
> understanding of software architecture and open source projects.
> 
> 
> [1]
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
> 
> [2] https://github.com/GavinWz/Apertium
> 
> [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm
> 
> 
> Regards,
> 
> Weizhe Yang


> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff


-- 
Regards, Flammie 
(Please note, that I will often include my replies inline instead of
top or bottom of the mail)


signature.asc
Description: PGP signature
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


[Apertium-stuff] Robust Tokenization idea for GSoC 2020

2020-03-03 Thread Aditya Hari
Hello

I am an Indian sophomore student currently pursuing a degree in Computer
Science.

I would like to work with Apertium for GSoC 2020. In particular, I would
like to take up the "Robust Tokenization"

project. I completed the coding challenge listed on the wiki (link to code)
 and have set up Apertium
on my system and have acquainted myself with the basics of how Apertium
works, including the structure of the dictionaries and lttoolbox.

I have a deep interest in open source and NLP and am familiar with both. I
find the work that Apertium is doing thoroughly interesting and am
desperate to contribute to it. I would greatly appreciate some advice on
how to write the proposal and the next steps to take towards making myself
familiar with what I would be expected to do.

Thanks
K Aditya
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff