Re: [Apertium-stuff] Fwd: Re: Regarding Coding challenge of 1.3 of GSOC idea

2019-03-26 Thread shashank tiwari
Dear Francis,
I think I phrased my question badly. I tweaked the code here and there and
managed to get the coding challenge working. My question is about Unicode
compliance in lttoolbox: what exactly has to be changed, and where? For
example, we currently rely on alphabets, so characters outside the alphabet
are treated as stop characters. Would we have to change that, or something
else? This is where I am getting confused.

Thanks
Shashank

On Tue, 26 Mar 2019, 23:17 Francis Tyers wrote:

>
>
> Original Message
> Subject: Re: [Apertium-stuff] Regarding Coding challenge of 1.3 of GSOC
> idea
> Date: 2019-03-26 17:40
> From: shashank tiwari
> To: Francis Tyers
>
> Dear Francis,
>   I have finished the coding challenge successfully. What I cannot
> work out is the part that asks us to make lttoolbox fully Unicode
> compliant. Any help would be appreciated.
>
> On Tue, 26 Mar 2019, 23:08 Francis Tyers wrote:
>
> > On 2019-03-17 08:09, shashank tiwari wrote:
> >> I have an issue with it: I can read the Unicode data when it is in a
> >> file, but not when it comes from the terminal. Here is the code I
> >> wrote:
> >>
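> >> #include <iostream>
> >> #include <locale>
> >> #include <string>
> >>
> >> int main()
> >> {
> >>     std::locale user("");
> >>     std::locale unicode("en_US.UTF8");
> >>     const auto str = std::string(u8"This! Is a tešt тест ** % test.");
> >>
> >>     // Both facets convert between wchar_t and a locale's multibyte encoding.
> >>     auto & decoder = std::use_facet<std::codecvt<wchar_t, char, std::mbstate_t>>(unicode);
> >>     auto & encoder = std::use_facet<std::codecvt<wchar_t, char, std::mbstate_t>>(user);
> >>
> >>     auto inmb = std::mbstate_t();
> >>     auto outmb = std::mbstate_t();
> >>     auto * next = str.data();
> >>     const auto * endptr = str.data() + str.size();
> >>     for (auto * ptr = str.data(); ptr < endptr; ptr = next)
> >>     {
> >>         // Decode one character from the UTF-8 input.
> >>         wchar_t value;
> >>         wchar_t * unusedA;
> >>         decoder.in(inmb, ptr, endptr, next, &value, &value + 1, unusedA);
> >>         if (next == ptr) break;  // no progress: invalid input
> >>
> >>         // Re-encode it in the user's locale for printing.
> >>         char buffer[4];
> >>         char * endbuffer;
> >>         const wchar_t * unusedB;
> >>         encoder.out(outmb, &value, &value + 1, unusedB, &buffer[0],
> >>                     &buffer[4], endbuffer);
> >>
> >>         // The operands of this statement are garbled in the archive; only
> >>         // the " : " separator survives. Printing the re-encoded character
> >>         // and its alphabetic flag is an assumption.
> >>         std::cout << std::string(buffer, endbuffer) << " : "
> >>                   << std::isalpha(value, unicode) << std::endl;
> >>     }
> >>     return 0;
> >> }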
> >>
> >> Any suggestions would be appreciated :)
> >>
> >
> > Check your locale ?
> >
> > F.
>
>
>
>
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>


[Apertium-stuff] Regarding Idea 1.3:Update lttoolbox to be fully Unicode compliant with regards to alphabetical symbols.

2019-03-23 Thread shashank tiwari
Hi all, I have a few questions about Idea 1.3, where lttoolbox has to be
made Unicode compliant with regard to alphabetical symbols. I have
completed the coding challenge for it. I wanted to ask the mentors what
changes are needed, and where.

Thank you for your time and consideration. Regards

Shashank


Re: [Apertium-stuff] Proposal

2019-03-21 Thread shashank tiwari
Thanks :)

On Fri, Mar 22, 2019 at 2:01 AM Ilnar Salimzianov wrote:

> Done for both.
>
> Ilnar
>
> On 3/21/19 10:11 PM, shashank tiwari wrote:
> > I also need username and password for the wiki. username:picklerick
> >

Re: [Apertium-stuff] Proposal

2019-03-21 Thread shashank tiwari
I also need username and password for the wiki. username:picklerick

On Fri, 22 Mar 2019, 00:07 Ilnar Salimzianov,  wrote:

>
> On 3/21/19 9:22 PM, Daniyar Nariman via Apertium-stuff wrote:
> > Hi Ilnar, I did not receive any message with a temporary password. Can
> > you please check it one more time?
> Hey Daniyar,
>
> Now it should be in your inbox/spam folder.
>
> The username is slightly different.
>
> Sorry about that.
>
> I.
> >
> > username: nariman9119
> >
> > mail: n.dani...@innopolis.ru
> >
> > 
> > *From:* Ilnar Salimzianov 
> > *Sent:* Thursday, March 21, 2019 9:09:07 PM
> > *To:* apertium-stuff@lists.sourceforge.net
> > *Subject:* Re: [Apertium-stuff] Proposal
> >
> >
> > On 3/21/19 5:15 PM, Mohit Raj wrote:
> > > hi all
> > > could anyone please guide me to create account on Apertium Wiki
> > >
> > Hi Mohit!
> >
> > Which username would you like to have on the wiki?
> >
> > Ilnar (selimcan on IRC)
> > > On Thu, Mar 21, 2019 at 1:13 AM Mohit Raj wrote:
> > >
> > > Hi Hector
> > >
> > > There is a good deal of literature and a number of published
> > > magazines in Magahi, and it has been introduced as an optional
> > > subject at the higher secondary level by the state education board.
> > >
> > > Thanks
> > >
> > > On Wed, 20 Mar 2019, 21:03 Hèctor Alòs i Font <hectora...@gmail.com> wrote:
> > >
> > > Hi Mohit,
> > > Magahi seems an excellent choice for Apertium, as an
> > > under-resourced language, but I wonder how standardised it is.
> > > Could you clarify?
> > > Hèctor
> > >
> > >
> > > On Wed, 20 Mar 2019, 17:20, Mohit Raj wrote:
> > >
> > > Got it
> > >
> > > On Wed, 20 Mar 2019, 16:52 Sevilay Bayatlı <sevilaybaya...@gmail.com> wrote:
> > >
> > > Hi,
> > >
> > > Here is how to start:
> > > http://wiki.apertium.org/wiki/Getting_started_with_induction_tools
> > > You also have to get an Apertium wiki account to write your
> > > proposal.
> > >
> > > best,
> > >
> > > Sevilay
> > >
> > >
> > > On Wed, Mar 20, 2019 at 12:52 PM Mohit Raj <mohiit...@gmail.com> wrote:
> > >
> > > Hi all,
> > > Here is my proposal for GSOC.
> > >
> > > I am Mohit Raj, doing my postgraduate degree (4th semester)
> > > in linguistics at Dr. B.R. Ambedkar University,
> > > K.M.I, Agra.
> > >
> > >
> > > My area of interest is Machine Translation and
> > > Natural Language Processing. Previously I have
> > > completed courses on XML, Python programming,
> > > Language Technologies and Machine Translation. I
> > > have worked on the development of a parser for
> > > Magahi, in collaboration with my classmate Neerav
> > > Mathur, for course projects. I took part in the
> > > following workshops:
> > >
> > >
> > > 1. 9th IASNLP-2018: IIIT-Hyderabad Advanced School
> > > on Natural Language Processing
> > >
> > > 2. SOIL-Tech: Towards Digital India at JNU, New Delhi
> > >
> > > 3. Hands-on workshop on Statistical Machine
> > > Translation with Moses at K.M.I, Agra
> > >
> > >
> > > During the Machine Translation workshop, Atul Kumar
> > > Ojha introduced us to the rule-based machine translation
> > > system Apertium, and during that time he also told
> > > us about GSOC.
> > >
> > >
> > > I am interested in working on the English-Magahi
> > > language pair for machine translation. Magahi
> > > belongs to the Indo-Aryan language family and it is also
> > > my native language. I have been advised that in
> > > machine translation, the morphological analyser plays an
> > > important role in improving the system's performance
> > > for a morphologically rich language like Magahi. So I
> > > am interested in developing a morphological analyser for Magahi.
> > >
> > >
> > > Please give me your feedback; it would be
> > > greatly appreciated.
> > >
> > >
> > > Thanks,
> > >
> > >
> > > Mohit Raj
> > >
> > >
> > > 

Re: [Apertium-stuff] Account on Apertium wiki

2019-03-19 Thread shashank tiwari
I need an account too. IRC nick: picklerick

On Fri, Mar 1, 2019 at 9:43 AM Jonathan Washington <
jonathan.n.washing...@gmail.com> wrote:

> Done!
>
> --
> Jonathan
>
> On Thu, 28 Feb 2019 at 23:02, Daniel Swanson wrote:
>
>> I would also like a wiki account for GSoC.
>>
>> Username: popcorndude
>>
>> Daniel
>>
>> On Thu, Feb 28, 2019 at 12:09 AM ashwath s via Apertium-stuff <
>> apertium-stuff@lists.sourceforge.net> wrote:
>>
>>> thanks
>>>
>>> On Thu, 28 Feb, 2019, 09:56 Ilnar Salimzianov, 
>>> wrote:
>>>
>>>> You should've received temporary passwords now, for usernames 'Pks_12'
>>>> and 'ashwaths', respectively.
>>>>
>>>> Upon login, temporary passwords can be changed at [1].
>>>>
>>>> Best,
>>>>
>>>> Ilnar
>>>>
>>>> [1] http://wiki.apertium.org/wiki/Special:ChangePassword
>>>>
>>>> On 2/28/19 6:49 AM, ashwath s via Apertium-stuff wrote:
>>>> > username:- ashwath s
>>>> >
>>>> > On Thu, 28 Feb, 2019, 01:49 Ilnar Salimzianov wrote:
>>>> >
>>>> > On 2/27/19 10:19 PM, ashwath s via Apertium-stuff wrote:
>>>> > > Hey, could you also give me an account? I've found a few
>>>> > > grammatical mistakes in some documents which I could correct.
>>>> >
>>>> > Hi,
>>>> >
>>>> > great!
>>>> >
>>>> > Same thing: I need to know the username you'd like to have on the
>>>> > wiki so that I can register you.
>>>> >
>>>> > Best,
>>>> >
>>>> > Ilnar
>>>> >
>>>> > > On Thu, 28 Feb, 2019, 00:48 Ilnar Salimzianov <il...@selimcan.org> wrote:
>>>> > >
>>>> > > Hi Pranesh,
>>>> > >
>>>> > > what do you want to do on the wiki?
>>>> > >
>>>> > > And which username would you like to have?
>>>> > >
>>>> > > If you're on IRC [1] (a must have if you're applying for Google
>>>> > > Summer of Code by Apertium!), contact me, i.e. selimcan.
>>>> > >
>>>> > > Best,
>>>> > >
>>>> > > Ilnar
>>>> > >
>>>> > > On 2/27/19 8:15 AM, Pranesh Saha wrote:
>>>> > > > Hello,
>>>> > > > Please add my account to the wiki.
>>>> > > > Pranesh Saha
>>>> > >
>>>> > > --
>>>> > > GPG: 0xF3ED6A19


[Apertium-stuff] Regarding Coding challenge of 1.3 of GSOC idea

2019-03-17 Thread shashank tiwari
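I have an issue with it: I can read the Unicode data when it is in a file,
but not when it comes from the terminal. Here is the code I wrote:

#include <iostream>
#include <locale>
#include <string>

int main()
{
    std::locale user("");
    std::locale unicode("en_US.UTF8");
    const auto str = std::string(u8"This! Is a tešt тест ** % test.");

    // Both facets convert between wchar_t and a locale's multibyte encoding.
    auto & decoder = std::use_facet<std::codecvt<wchar_t, char, std::mbstate_t>>(unicode);
    auto & encoder = std::use_facet<std::codecvt<wchar_t, char, std::mbstate_t>>(user);

    auto inmb = std::mbstate_t();
    auto outmb = std::mbstate_t();
    auto * next = str.data();
    const auto * endptr = str.data() + str.size();
    for (auto * ptr = str.data(); ptr < endptr; ptr = next)
    {
        // Decode one character from the UTF-8 input.
        wchar_t value;
        wchar_t * unusedA;
        decoder.in(inmb, ptr, endptr, next, &value, &value + 1, unusedA);
        if (next == ptr) break;  // no progress: invalid input

        // Re-encode it in the user's locale for printing.
        char buffer[4];
        char * endbuffer;
        const wchar_t * unusedB;
        encoder.out(outmb, &value, &value + 1, unusedB, &buffer[0],
                    &buffer[4], endbuffer);

        // The operands of this statement are cut off in the archive; only
        // the " : " separator survives in the quoted copy. Printing the
        // re-encoded character and its alphabetic flag is an assumption.
        std::cout << std::string(buffer, endbuffer) << " : "
                  << std::isalpha(value, unicode) << std::endl;
    }
    return 0;
}

Any suggestions would be appreciated :)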


[Apertium-stuff] Regarding getting started with induction tools

2019-03-09 Thread shashank tiwari
Sir,
   I wanted to install lttoolbox, but the link below doesn't work:
http://wiki.apertium.org/wiki/Getting_started_with_induction_tools
Can you give me more guidance?

Thanks
Shashank Tiwari


Re: [Apertium-stuff] Current GSOC ideas

2019-01-28 Thread shashank tiwari
Hello, I'd love to work on 1.6. Any recommendations on how to get started
with it?

On Mon, 28 Jan 2019, 22:43 Francis Tyers wrote:

> Here is my run-down on the current GSOC ideas page:
>
>  1.1 Anaphora resolution for machine translation
>
> Nice project idea, but not sure in 3 months.
>
>  1.2 Bring a released language pair up to state-of-the-art quality
>
> Always needed
>
>  1.3 Robust tokenisation in lttoolbox
>
> Up for grabs, we need this
>
>  1.4 Adopt an unreleased language pair
>
> Always needed
>
>  1.5 Extend lttoolbox to have the power of HFST
>
> I think getting this one is unlikely and requires more than 3 months.
>
>  1.6 Robust recursive transfer
>
> Keep, this would be really great. I got asked to run a workshop on
> Apertium recently and then unasked when they found out that the
> formalisms didn't actually create parse trees :)
>
>  1.7 Extend weighted transfer rules
>
> There is ongoing work in this, it would need to be supervised carefully:
>
> https://github.com/sevilaybayatli/apertium-ambiguous
>
> I would say a nice project would be to really use this on a new
> language pair.
>
>  1.8 Improvements to the Apertium website
>
> Not sure
>
>  1.9 User-friendly lexical selection training
>
> I think getting this one is unlikely and requires more than 3 months.
> It has also been tried several times without luck.
>
>  1.10 Light alternative format for all XML files in an Apertium language pair
>
> I'm not sure about this one.
>
>  1.11 Bilingual dictionary enrichment via graph completion
>
> There is code for this; it was a GSOC project last year but wasn't
> merged. I'm not sure how well it works.
>
>  1.12 UD and Apertium integration
>
> This is a very useful project. If we can take advantage of UD corpora
> we can make supervised taggers for around 70% of our languages.
>
>  1.13 Add weights to lttoolbox
>
> This was done last year. A nice project would be to actually make use
> of it.
>
>  1.14 Improving language pairs mining Mediawiki Content Translation postedits
>  1.15 Unsupervised weighting of automata
>
> Open
>
>  1.16 Improvements to UD Annotatrix
>
> This is a really useful tool.
>
>  1.17 apertium-separable language-pair integration
>
> Agree, but I think that it should not just be apertium-separable, but
> perhaps something like "upgrade a language pair to use all the latest
> apertium tricks"
>
>  1.18 Create FST-based module for disambiguating
>
> I like this idea, but I'm not sure three months is enough time without
> someone who really knows what they are doing with both the FST library
> and apertium.
>
>  1.19 Python API/library for Apertium
>
> This was mostly done, right? I think this is still a really important
> project.
>
>  1.20 TIPP functionality for Apertium
>
> Not sure
>
> There is a lot of functionality that is not widely used and that could
> really improve the performance of language pairs.
>
> * apertium-separable
> * weights in lttoolbox
> * weighted transfer
>
> Fran
>
>


Re: [Apertium-stuff] Fwd: Re: Gsoc 2019

2019-01-27 Thread shashank tiwari
After comparing the ideas list with the projects completed in GSoC 2018,
these are the ideas that have not been implemented:
Anaphora resolution for machine translation
Robust tokenisation in lttoolbox
Robust recursive transfer
Improvements to the Apertium website
User-friendly lexical selection training
Unsupervised weighting of automata
Create FST-based module for disambiguating
Python API/library for Apertium
TIPP functionality for Apertium
Light alternative format for all XML files in an Apertium language pair
Add weights to lttoolbox
UD and Apertium integration


On Sun, Jan 27, 2019 at 11:59 PM Francis Tyers  wrote:

>
>
> Original Message
> Subject: Re: Gsoc 2019
> Date: 2019-01-27 17:30
> From: ashwath s
> To: Francis Tyers
>
> Can you tell me which projects from last year are incomplete?
>
> On Sat, 26 Jan, 2019, 21:56 Francis Tyers wrote:
> > On 2019-01-26 11:26, ashwath s wrote:
> >> Hey is apertium going to apply for gsoc 2019 ?
> >
> > It is planning to, yes.
> >
> > Fran
>
>