[Apertium-stuff] Stepping down from my position in the PMC

2023-05-29 Thread Tanmai Khanna
me along the way - I will forever be grateful. :) With this, I would request you to update this change on the wiki ( https://wiki.apertium.org/wiki/Project_Management_Committee), and remove me from the PMC emailing list. Regards, *तन्मय खन्ना * *Tanmai Khanna

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Tanmai Khanna
Telugu is a fairly decently resources language now, and has some well performing neural translators, like Hindi. We can still consider working with it in Apertium but first must figure out what our goal is, with it. If there's no particular goal then a low resource language would be preferable. T

Re: [Apertium-stuff] Capitalization Handling

2022-12-23 Thread Tanmai Khanna
Merry Christmas for wordbound blanks indeed! On Fri, Dec 23, 2022, 03:10 Daniel Swanson wrote: > Greetings Apertiumers! > > I have two updates to report: > > First, I have rewritten the postgenerator (again), this time as part > of apertium-separable (and so not breaking the old one, unlike last

Re: [Apertium-stuff] strange behaviour of apertium

2022-07-15 Thread Tanmai Khanna
rk in the past? As I understand it, once you analyse the form, it goes through the pipeline, after transfer it becomes a unit enclosed by ^...$ which then goes through the generator. Is that relevant for your question? Tanmai Khanna On Sat, Jul 16, 2022, 08:21 Amba Kulkarni wrote: > > Dea

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
My point isn't just about appearances though it's also about what's the more democratic thing to do. To at least give a proper platform for the non PMC people to say something. Moreover, I'm really confused about all this talk of "bureaucracy". It's one vote, which was supposed to happen anyway. N

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
Yeah, but you're going to be elected to the PMC, you can't vote for no vote. Same as me, we shouldn't be deciding that we don't need a vote. Violates many principles :p On Wed, Apr 27, 2022, 16:28 Kevin Brubeck Unhammer wrote: > Xavi Ivars čálii: > > > But also, voting for just to confirm (or a

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
ed seems a bit useless. > > Maybe if someone outside the PMC gave their opinion, voting would make > more sense. But so far, it's been only the ones in the PMC (+ Sushain + > Daniel), everyone agreeing. > > > -- > Xavi Ivars > < http://xavi.ivars.me > > > El

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
d we have a possible outcome that > can be achieved by unanimous consent. I say we take it and get on with the > business of forming the Foundation, which the (new) PMC can delegate the > legwork of to anyone. > > -- Tino Didriksen > > > On Tue, 26 Apr 2022 at 17:41, Tanm

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-26 Thread Tanmai Khanna
than N. Washington, Kevin Brubeck > Unhammer, Mikel L. Forcada, Tanmai Khanna, Tino Didriksen, Xavi Ivars > > - President: Francis, Tino > > Given that exactly 7 for PMC would avoid the need for an election, I'm > happy to forego the whole thing and let Francis continue in th

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-22 Thread Tanmai Khanna
before the weekend: > > - No change to the census > > - 4 candidates for the PMC: Francis Tyers, Tanmai Khanna, Tino Didriksen, > Xavi Ivars > > - 2 candidates for President: Francis, Tino > > - Auditor: Daniel Swanson > > -- Tino Didriksen > > > On Tue, 19 Apr

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-19 Thread Tanmai Khanna
Maybe in this new PMC we should strictly follow at least a once a month meeting. Even if there's nothing to discuss or no progress made, it helps to meet and keeps things moving. Just a thought On Tue, Apr 19, 2022, 22:59 Francis Tyers via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wro

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-19 Thread Tanmai Khanna
Hi, I'd like to apply to be part of the PMC again. It's been great to see the management side of things with Apertium and would really love to see through our plan to create a formal organisation. Regards, Tanmai Khanna On Tue, Apr 19, 2022, 16:51 Tino Didriksen wrote: > G'da

Re: [Apertium-stuff] Cleaning Parallel Corpus

2021-04-29 Thread Tanmai Khanna
spanish corpus. You'd lose some sentences to train on but that would be negligible and the remaining corpus would be aligned. Just a thought *तन्मय खन्ना * *Tanmai Khanna* On Thu, Apr 29, 2021 at 6:23 PM VIVEK VICKY wrote: > > On Thu, Apr 29, 2021 at 3:35 PM Kevin Brubeck Unhamm

Re: [Apertium-stuff] Bitrotted releases redux

2021-03-31 Thread Tanmai Khanna
That's great! Do let me know when the main website is updated. Thanks, *तन्मय खन्ना * *Tanmai Khanna* On Wed, Mar 31, 2021 at 5:42 PM Tino Didriksen wrote: > This is less critical than I thought. I just re-tested all existing > packages, and they run. So I can update the mai

Re: [Apertium-stuff] An easy tool to report bad translations and propose alternatives

2020-12-05 Thread Tanmai Khanna
rules. I know a TM exists in Apertium already, but I'm not sure if it's being used on apertium.org, plus user given translations can definitely improve TMs for language pairs that don't have a lot of parallel corpora to learn TMs from. *तन्मय खन्ना * *Tanmai Khanna* On Sat, Dec 5,

Re: [Apertium-stuff] A talk evaluating Apertium

2020-10-16 Thread Tanmai Khanna
refully and all the factors should be considered before comparing multiple systems, without that, it's easy to arrive at false conclusions. My two cents :) *तन्मय खन्ना * *Tanmai Khanna* On Thu, Oct 15, 2020 at 9:27 PM Mikel L. Forcada wrote: > Dear Apertiumers: > > here's

Re: [Apertium-stuff] Need help to decide language pairs for examples of markup handling

2020-09-12 Thread Tanmai Khanna
Thanks a lot for your suggestions guys! :)) *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 10, 2020 at 7:57 AM Samuel Sloniker wrote: > > > On Wed, Sep 9, 2020, 02:34 Tanmai Khanna wrote: > >> (Accepting snazzier title suggestions). >> > Maybe

[Apertium-stuff] Need help to decide language pairs for examples of markup handling

2020-09-09 Thread Tanmai Khanna
xamples, that'll be great as well. Any help will be sincerely appreciated :)) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Update about superblanks in transfer

2020-09-04 Thread Tanmai Khanna
such as macros, or within .. blocks. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 5:59 PM Tanmai Khanna wrote: > Are the changes being implemented going to alter the behavior of the >> punctuation marks that are not analyzed as tokens? >> > > Ye

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-04 Thread Tanmai Khanna
m transfer rules now, but that is by design. Hèctor, let me know if this solved your issue :) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Fri, Sep 4, 2020 at 12:15 PM Hèctor Alòs i Font wrote: > Missatge de Tanmai Khanna del dia dv., 4 de > set. 2020 a les 9:22: > >>

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
l people agree that after wordbound blanks, we should stop handling blank positions in transfer rules. If this isn't acceptable, we can discuss other possible solutions :) *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
u can analyse them as individual LUs and then using apertium-separable you can combine them into one LU. Finally, the space between l and ér shouldn't appear in the rule output and it is because of an issue that's still being fixed. But it'll be fine soon :) *तन्मय खन्ना * *Tanmai

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
Hèctor can you check the page on beta now? The hyphen and the superscript issues are solved. Of course, there's now a space between l and ér. If that's a big problem we can discuss other solutions. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 2020 at 8:09 PM Tino Didriksen wrote:

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
h that it applies the individual markups on the final MWE. If this is done then both "idag" and "i dag" will be recognised and the italics will apply on the entire word. If this isn't acceptable or too much of an inconvenience, then I can modify the analyser. *तन्मय खन्

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
Oh I see the hyphen thing. That should've been fixed after the latest commit. Will check it out. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 2020 at 12:34 PM Tanmai Khanna wrote: > Hey, > As of now, the analyser sees wordbound blanks as normal blanks, and so > when t

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
t on the ér. So yeah, fundamentally, as of now *it's not possible to have markup on part of an LU.* It's possible though if you keep I & ér as separate LUs. Also Hèctor, is the space after hyphen issue still there? Looks fine to me. *तन्मय खन्ना * *Tanmai Khanna* On Wed, Sep 2, 202

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
le output. So it might not be possible to strictly control the position of the blanks in the output as was done earlier, but that was pretty much the intention of the change. *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apert

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Also, I agree with Tino, if the punctuation is important for the context then it should probably be analysed as a token. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 3:16 PM Tanmai Khanna wrote: > Hèctor what I mean is, if you don't want a space in the output of rules >

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
ould remove the . This is because if the input blank is an empty string then that isn't counted as a blank. Does that work? *तन्मय खन्ना *

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hey, It solved them Franco-Japanese issue as well :D Can you check the diff once and see if there's any more issues Hèctor? (After updating apertium). *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 2:35 PM Tanmai Khanna wrote: > Hi Hèctor, > I'm dealing with the iss

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
, the spaces will go away. I'm still evaluating some other issues. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 1:21 PM Hèctor Alòs i Font wrote: > > > Missatge de Tanmai Khanna del dia dg., 30 d’ag. > 2020 a les 9:49: > >> My guess is, the transfer rule f

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-29 Thread Tanmai Khanna
discuss other options. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 12:09 PM Tanmai Khanna wrote: > Hèctor, > No worries I'll look into this. Can you send the input sentences? I want > to see the transfer rules that are applying to the erroneous parts. They > might need

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-29 Thread Tanmai Khanna
Hèctor, No worries I'll look into this. Can you send the input sentences? I want to see the transfer rules that are applying to the erroneous parts. They might need some changing. तन्मय खन्ना Tanmai Khanna From: Hèctor Alòs i Font Sent: Sunday, August 30,

[Apertium-stuff] Update about superblanks in transfer

2020-08-29 Thread Tanmai Khanna
ker, interchunk, and postchunk. If you have any questions, suggestions, comments, etc., I'll be happy to respond to them. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net http

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
ule output. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 3:05 PM Kevin Brubeck Unhammer wrote: > Tanmai Khanna > čálii: > > > So what I'll try to do, is after the blanks are collected, lets say X is > > the number of source LUs in the pattern and Y is the

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
Again, remember these aren't wordbound blanks or block tags, just superblanks, like , or other tags that aren't hard breaks or wordbound. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:33 PM Tanmai Khanna wrote: > Or, if we want to give the user complete control

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
paces between their output LUs. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:31 PM Tanmai Khanna wrote: > So what I'll try to do, is after the blanks are collected, lets say X is > the number of source LUs in the pattern and Y is the number of output LUs. > If X = Y th

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
the user denotes. If X > Y, then we can print the first Y blanks and then flush the remaining. After this the option will become useless. Does that sound good? *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:28 PM Francis Tyers wrote: > El 2020-08-27 09:54, Kevin Brubeck Unh

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 1:20 PM Kevin Brubeck Unhammer wrote: > Tanmai Khanna > čálii: > > > I always thought that's the default behaviour. That if some blanks aren't > > explicitly printed in the transfer rules then they're f

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
I always thought that's the default behaviour. That if some blanks aren't explicitly printed in the transfer rules then they're flushed. I'll check it out, but it should be that. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 1:27 AM Kevin Brubeck Unhammer wro

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
hem when a rule is matched, but that is a decision that can be taken in the future. For now, blanks in transfer rules work for any superblanks that still exist in the stream. Hope that answered your question :)) *तन्मय खन्ना * *Tanmai Khanna* On Wed, Aug 26, 2020 at 11:43 PM Kevin Brubeck Unham

[Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
nd Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Tanmai Khanna
My contributions are documented here <https://wiki.apertium.org/wiki/User:Khannatanmai/Wordbound_blanks#Features> . *तन्मय खन्ना * *Tanmai Khanna* On Sat, Aug 22, 2020 at 5:52 PM Tino Didriksen wrote: > As for previous years, I will run a code collection for GSoC changes. > >

Re: [Apertium-stuff] Testing markup reordering in Apertium

2020-07-22 Thread Tanmai Khanna
Hey Xavi, Postgeneration has been modified to deal with wordbound blanks. It deals with wordbound blanks in one-one, one-many, many-one and many-many rules in postgeneration. (Pull Request <https://github.com/apertium/lttoolbox/pull/102>) Regards, *तन्मय खन्ना * *Tanmai Khanna* On Tue,

Re: [Apertium-stuff] Testing markup reordering in Apertium

2020-07-20 Thread Tanmai Khanna
before the generation step. Since after generation the tokenisation of the input is lost (as the LUs are removed), tf-close adds a delimiter so that the reformatter knows the span of the format. Hope that answers your question. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Tue, Jul 21, 2020

[Apertium-stuff] Testing markup reordering in Apertium

2020-07-20 Thread Tanmai Khanna
t will be by the end of the project though. If you have any questions, suggestions, I'd be glad to respond to them on this thread. If you need help testing this on your language pair you can contact us on the IRC. Same if you find any bugs, or

[Apertium-stuff] Starring Apertium repositories

2020-07-18 Thread Tanmai Khanna
y asking people who come across our software to star the repo and share it or something. These are just my personal views and I'm happy to be corrected :) Thanks, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourc

Re: [Apertium-stuff] finding difficulty in pairing languages

2020-06-21 Thread Tanmai Khanna
/wiki/IRC Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Mon, Jun 22, 2020 at 8:48 AM annie rajan via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wrote: > Hello, > I am Annie, from Goa, India. I did some work on morph analyses for Konkani > language and

[Apertium-stuff] Anaphora Resolution 1.0.0

2020-06-21 Thread Tanmai Khanna
languages as well. Any clarifications can be asked here or on the IRC. I'd also appreciate it if you guys let me know about any feature requests you might have from a module like this. :) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___

Re: [Apertium-stuff] Apertium's Wider Use & Secondary Tags

2020-06-14 Thread Tanmai Khanna
if one person in the PMC objects to something it shouldn’t happen, and I’m sure a group of experts and I can find something we all agree on as a design decision. Peace ✌🏼 Tanmai Khanna Sent from my iPhone > On 14-Jun-2020, at 18:02, Samuel Sloniker wrote: > >  > If we d

Re: [Apertium-stuff] Apertium's Wider Use & Secondary Tags

2020-06-13 Thread Tanmai Khanna
roposal. None of what I'm saying here is new, but somehow this discussion is stagnating more than most projects I've seen in action. If people are still not convinced about any benefits that can come out of such a modification, then I guess I'll choose the best solution that provid

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-26 Thread Tanmai Khanna
increase, as detailed in a previous mail, just lt-trim takes ~4 seconds and a weighted dictionary takes ~8 seconds. I guess these facts are enough for us to at least provide our language developers with the option to disable trimming, and by propagating the surface form through secondary tags, it&

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Tanmai Khanna
Here's a timing test for weighted dictionaries. On apertium-eng-kaz: 1. lt-trim analyser.bin bidix.bin analyser-found.bin Time: real 0m4.257s user 0m4.120s sys 0m0.131s 2. lt-trim analyser.bin bidix.bin analyser-found.bin lt-print -H analyser.bin > analyser.att lt-print -H analyser-found.b

[Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Tanmai Khanna
I missed, please add them here so people can make a more informed decision. *Another possible solution that was discussed was to continue trimming multiwords since they present a unique disadvantage, but to eliminate trimming for words without spaces. *Any other possible solution can be discussed

Re: [Apertium-stuff] Secondary Tag Prefixes

2020-05-08 Thread Tanmai Khanna
, for which secondary tags are necessary. I request the community to bombard this project with their skepticism, their doubts, their suggestions, their criticism, and I promise there will be thorough discussion and we will achieve an acceptable compromise on all fronts. What I would hate for this p

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-30 Thread Tanmai Khanna
I have to ask, what kind of abuse of power do you have in mind? On Fri, May 1, 2020 at 8:15 AM Samuel Sloniker wrote: > That way, we don't have to complicate things now. > > On Thu, Apr 30, 2020 at 7:44 PM Samuel Sloniker > wrote: > >> Another possibility is to allow the Assembly to amend the b

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-29 Thread Tanmai Khanna
If the main issue is an abuse of power and having adequate checks and balances, doesn't the assembly of committers hold the power to overturn PMC decisions? Which would include removals I guess. Tanmai On Wed, Apr 29, 2020 at 10:26 PM Samuel Sloniker wrote: > Okay. Maybe at least have a group a

Re: [Apertium-stuff] How do I get a list of lemmas for nouns

2020-04-23 Thread Tanmai Khanna
Hi, How about you try this: lt-expand apertium-swe.swe.dix | grep -E "[^<:>]+:[^<:>]+" | sed -E 's/[^<:>]+:([^<:>]+).*/\1/g' | sed -E 's/\p{No}//g' | uniq Just a small addition to Daniel's earlier command, to delete all superscripts before removing duplicates. Hopefully you don't need superscript

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-21 Thread Tanmai Khanna
>> tag??). >> >> I guess this sort of worry is the sort of thing you're keeping track of >> so that it can be worked on? >> >> -- >> Jonathan >> >> On Mon, Apr 20, 2020, 14:52 Tanmai Khanna >> wrote: >> >>> In a nutsh

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
Hey Francis, I agree that it does seem like a solution searching for a problem if we look at it in isolation. But it's important to look at this in the context of eliminating trimming. Chronologically, this project was first about and still is, about eliminating dictionary trimming. Modification to

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
lem, rather than a problem description in search >> >> of a solution. >> > >> > To me the most obvious thing to do with it is to put markup >> > information in secondary tags as a way of solving the superblank >> > reordering problem. >> >

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
this project, I believe these documents provide enough as a proof of concept for this modification. I invite your comments, complaints and feedback regarding this modification. Thanks and Regards, Tanmai Khanna On Mon, Mar 30, 2020 at 12:21 AM Tanmai Khanna wrote: > Just to clarify, in this

Re: [Apertium-stuff] Fwd: Announcing a multilingual COVID-19 health and safety poster-maker

2020-04-20 Thread Tanmai Khanna
Translated it to Hindi. Tanmai On Mon, Apr 20, 2020 at 12:54 PM Mikel L. Forcada wrote: > Dear Apertiumers: > > Translation Commons is seeking help to get a COVID-19 health poster > translated into as many languages as possible. > > If you would like to help, please check the information Fran a

Re: [Apertium-stuff] Lexical Selection

2020-04-03 Thread Tanmai Khanna
To show the translated text with multiple options due to polysemy would require Apertium to preserve all the polysemous forms of a word in the Lexical Unit. While this isn't possible as of now, we're working on a project to try and extend the Apertium stream such that we can include an arbitrary am

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
to be backwards compatible. :) Tanmai On Sun, Mar 29, 2020 at 11:59 PM Tanmai Khanna wrote: > Instead of looking at this as modifying or extending the apertium stream > format, we could look at this as making tags more versatile by creating a > new kind of tags which have a feature:value p

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
n the pipe. Tanmai On Sun, Mar 29, 2020 at 8:53 PM Tanmai Khanna wrote: > Hi Mikel, > >> (0) No change should be made without proper regression testing. I think >> we all agree on that! >> > Definitely, and this is something I'll add in the proposal. > >&

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
Tino listed already. Hope this made the rationale behind trimming and modifying the stream clear. It would be great to hear from Sergio, as these discussions are how we're going to make this project a success for Apertium. Thanks and Regards, Tanmai Khanna El 29/3/20 a les 13:31, Tanm

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
l your comments and suggestions, I will make a robust proposal and it will be formalised before it is implemented. Thanks and Regards, Tanmai Khanna On Sun, Mar 29, 2020 at 3:52 PM Mikel L. Forcada wrote: > Folks: > > The elders in Apertium will not be surprised if I voiced my opposition

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
I apologise, it seems like the link got removed when the message sent. Here it is: http://wiki.apertium.org/wiki/User:Khannatanmai/GSoC2020Proposal_Trimming Thanks Tanmai On Sun, Mar 29, 2020 at 3:11 PM Tanmai Khanna wrote: > Hey guys, > Here's a draft proposal for this project. A

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
Hey guys, Here's a draft proposal http://wiki.apertium.org/wiki/User:Khannatanmai/GSoC2020Proposal_Trimming> for this project. Any comments will be appreciated :) Thanks, Tanmai On Sun, Mar 29, 2020 at 12:52 PM Tanmai Khanna wrote: > Hi Hèctor, > A fundamental motivation for th

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
mation in the stream will be a huge problem, this will allow each program to access a lot more information and open up possibilities that we haven't even thought of yet. At the very least, it will help us to eliminate trimming. Thanks and Regards, Tanmai Khanna On Sun, Mar 29, 2020 at 10:39

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-28 Thread Tanmai Khanna
n Sun, Mar 29, 2020 at 2:29 AM Scoop Gracie wrote: > Or = > > On Sat, Mar 28, 2020, 13:58 Scoop Gracie wrote: > >> That sounds like a great idea to me. Maybe could even become ? >> >> On Sat, Mar 28, 2020, 13:51 Tanmai Khanna >> wrote: >> >>>

[Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-28 Thread Tanmai Khanna
any pros, cons, suggestions - to the idea, to the syntax, anything. Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Tanmai Khanna
Hey I have one doubt, The examples given for mistranslation, I didn't quite understand how sentiment analysis would fix those. Also what about languages for which a SentiWordNet doesn't exist? Thanks and Regards, Tanmai On Fri, Mar 27, 2020 at 3:56 PM Rajarshi Roychoudhury < rroychoudhu...@gmail.

Re: [Apertium-stuff] PMC election: Proclamation of the candidates.

2020-03-25 Thread Tanmai Khanna
=0>. > Nobody has found anything to amend. > > The following people have indicated they want to run for PMC members: > > - Sushain K. Cherivirala > - Tino Didriksen > - Mikel L. Forcada > - Scoop Gracie (pseudonym) > - Xavi Ivars > - Tanmai Khanna > - Francis

Re: [Apertium-stuff] Working around monodix trimming

2020-03-22 Thread Tanmai Khanna
the morphological >> dictionary and use an algorithm such as OSTIA (1) to learn morphological >> analyses for word endings. >> >> Mikel >> >> (1) Oncina, J., Garcia, P., Vidal, E., IEEE Trans Patt Recog Mach Intell >> 15:5 (1993)448-458. >> >>

Re: [Apertium-stuff] Working around monodix trimming

2020-03-21 Thread Tanmai Khanna
rget dictionaries. A "guessing" dictionary would try to > assign a morphological analysis to an unknown word by looking at the > morphology of known words in the dictionary... > > This would be easy if one could, e.g. match suffixes to morphology in a > suffixing language. &

[Apertium-stuff] Working around monodix trimming

2020-03-21 Thread Tanmai Khanna
-editability. If you have any significant pros, cons, or suggestions to add for this project, you're requested to reply to this thread so that if I work on this project, I can do it fully informed. Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai*

Re: [Apertium-stuff] Apertium PMC elections

2020-03-18 Thread Tanmai Khanna
>>>>> va escriure: >>>>> >>>>>> Oh, okay. There's a similar scenario with GCI; I want to be a student >>>>>> again this year. Could we keep a vacancy on the PMC from the start of GCI >>>>>> until it ended? &g

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-13 Thread Tanmai Khanna
r, these people have indicated they want to run for PMC members: >> > - Jonathan Washington >> > - Francis Tyers >> > - Tino Didriksen >> > - Scoop Gracie (pseudonym) >> > - Tanmai Khanna >> > - Xavi Ivars >> > ...with Mikel L. Forcada as a

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-05 Thread Tanmai Khanna
Since it seems like there are enough people to help conduct the election, I’ll just put in my name for the candidacy formally. Thanks Tanmai Sent from my iPhone > On 05-Mar-2020, at 23:28, Scoop Gracie wrote: > >  > We now have three volunteers for the board and four candidates. I suppose >

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-04 Thread Tanmai Khanna
tlı wrote: > >  > I help to run the election. > > Sevilay > >> On Thu, 5 Mar 2020, 06:15 Tanmai Khanna, wrote: >> I volunteer to help conduct the election. >> >> Tanmai >> >> Sent from my iPhone >> >>>> On 05-Mar-2020, at 02

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-04 Thread Tanmai Khanna
I volunteer to help conduct the election. Tanmai Sent from my iPhone > On 05-Mar-2020, at 02:40, Diogo wrote: > >  > I must say the fact that the second list was generated by me, but I'm not > eligible to vote according to those lists is quite ironic > > A quarta, 4/03/2020, 22:01, Tino Did

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
terface, the available languages for >> translation are beyond my knowledge. I am not sure if I am right, but >> Hindi/Bengali is probably not one of the languages to which an English word >> can be translated to. Correct me if I am wrong >> >> >> >>>

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
Hi, I have a few questions about this: 1. How would you analyse the sentiment of the source text? Considering the language pairs that Apertium deals with are low resource languages. 2. As Tino mentions, is there a problem of sentiment loss in Apertium? Any examples of this? 3. Doesn't the sentiment

Re: [Apertium-stuff] Idiom translation module

2020-01-16 Thread Tanmai Khanna
"Is there anything this wouldn't cover?" Idioms may be contiguous but need not be frozen. They can take arguments and can be modified based on TAM and GNP. But that can be handled by separable if I'm not wrong. So yeah, seems like we have this handled. As constructions get larger and more compl

Re: [Apertium-stuff] Null-subject resolution

2019-11-04 Thread Tanmai Khanna
Hey Hèctor, I'm in the process of implementing conditional anaphora resolution where you can define multiple anaphors. We can define verbs as a type of anaphor and give the rules the appropriate scores to guess a reference(subject in this case). This should work decently because Anaphora resolut

[Apertium-stuff] Anaphora Resolution for Apertium

2019-08-25 Thread Tanmai Khanna
h the irc (nick: khannatanmai), or even open issues on the repo. Thanks again, Tanmai Khanna -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Anaphora Resolution in Nightly

2019-08-12 Thread Tanmai Khanna
Hi, I've sent a PR: https://github.com/apertium/apertium/pull/55 with the changes required to make the ref tag clip-able(?). Really glad that the module works well! :) Thanks, Tanmai Khanna On Tue, Aug 13, 2019 at 1:36 AM Hèctor Alòs i Font wrote: > Thanks, Tino and Tanmai. Great

Re: [Apertium-stuff] Anaphora Resolution for multiple language pairs

2019-08-03 Thread Tanmai Khanna
clitic and > proclitic) in order to elect between the Catalan pronouns "el" and "ho". > > Hèctor > > Missatge de Tanmai Khanna del dia dv., 2 d’ag. > 2019 a les 17:48: > >> Hi all, >> The Anaphora Resolution module is currently being test

[Apertium-stuff] Anaphora Resolution for multiple language pairs

2019-08-02 Thread Tanmai Khanna
can work on adapting the tool for the language pair :) Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai* apertium-eng-spa.spa-eng.arx Description: Binary data ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.

Re: [Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-31 Thread Tanmai Khanna
this information to the proposal, if I decide to include/not include Zero Anaphora Resolution in the project as I don't want to be too ambitious either. I will also include analysis in the paper as to why the method of Saliency Scores will be the best method given the current situation

Re: [Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-29 Thread Tanmai Khanna
the first case) > * The singers that sing sing well. > Both "sing" should be p3pl in Spanish/Catalan, currently they are not > ("Los cantantes que canta canta bien"). > > 3. Let's accept that we will deal only with the 3rd person. It is too > complicated t

[Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-29 Thread Tanmai Khanna
m.org/wiki/User:Khannatanmai If anyone has any comments, suggestions, criticism, ideas, I would really appreciate if you let me know as it'll help me make a stronger proposal and a better tool for Apertium during GSoC 2019. Thanks and Regards, Tanmai Khanna IRC: khannatanmai --

[Apertium-stuff] GSoC 2019 Proposal First Draft

2019-03-16 Thread Tanmai Khanna
Hi, Here's the first draft of the proposal for the Anaphora Resolution project. If any of the mentors have feedback on it I would really appreciate it as I can make the required changes and submit the final one. Proposal: http://wiki.apertium.org/wiki/User:Khannatanmai Thanks and Regards, T

Re: [Apertium-stuff] Creation of an account

2019-03-04 Thread Tanmai Khanna
Hi, I also need an account on wiki, with the username: khannatanmai. Thanks! Tanmai Khanna On Mon, Mar 4, 2019 at 9:51 PM Ilnar Salimzianov wrote: > Hi Sandy, > > please tell which username you'd like to have on the wiki. > > Best, > > selimcan > > On 3/4/19 10:

[Apertium-stuff] GSoC Project: Anaphora Resolution

2019-03-04 Thread Tanmai Khanna
project as I would really like to contribute to Apertium this summer. Please advise. Thanks and Regards, Tanmai Khanna IRC Id: khannatanmai -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourc

[Apertium-stuff] Creation of an account on wiki

2019-03-03 Thread Tanmai Khanna
Hi, I'd also like an account on the wiki to start my proposal for GSoC. I'll be working on Anaphora Resolution. Thanks and Regards, Tanmai Khanna IRC Nick: khannatanmai ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge