Re: [Apertium-stuff] New release fra-cat

2018-10-03 Thread Kartik Mistry
On Tue, Sep 18, 2018 at 11:23 PM Hèctor Alòs i Font
 wrote:
> I have prepared a new version of apertium-fra-cat. It is basically the one 
> that was ready in April, but that could not be released because of problems 
> in apertium-separable. The new one incorporates a few improvements from the 
> new French-Occitan translator and has passed testvoc.
>
> Tino, could you please package it? Thanks in advance!

Hi,

Is there any update on the release? It would be great to have this in Debian
and then in Wikipedia's Content Translation tool, as the current upstream
version has an issue with apertium-separable, as Hèctor pointed out.

I have also filed an issue on GitHub:
https://github.com/apertium/apertium-fra-cat/issues/3

-- 
Kartik Mistry | કાર્તિક મિસ્ત્રી
kartikm.wordpress.com


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Wikipedia corpus

2018-05-12 Thread Kartik Mistry
On Sat, May 12, 2018 at 2:51 PM, Hèctor Alòs i Font
 wrote:
> I'd like to create a French Wikipedia corpus, but I wouldn't like to
> download the whole Wikipedia dump. I'm not sure I have enough disk space for
> decompressing it. Is there somewhere maybe a 10% dump?

This can be useful too: https://dumps.wikimedia.org/other/contenttranslation/
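
On the disk space worry: a compressed dump can usually be read as a stream, so it never has to be decompressed to disk. Below is a minimal sketch in Python, assuming a standard `.xml.bz2` pages dump in which `<page>` and `</page>` sit on their own lines. The function name `iter_pages` and its `limit` parameter are made up for illustration, and the line-based matching is a simplification rather than a real XML parser.

```python
import bz2
from typing import Iterator


def iter_pages(path: str, limit: int) -> Iterator[str]:
    """Yield up to `limit` raw <page>...</page> blocks from a .xml.bz2 dump."""
    count = 0
    buffer = []
    inside = False
    # bz2.open decompresses on the fly; nothing is written to disk.
    with bz2.open(path, "rt", encoding="utf-8") as dump:
        for line in dump:
            if "<page>" in line:
                # Start collecting a new page.
                inside = True
                buffer = []
            if inside:
                buffer.append(line)
            if inside and "</page>" in line:
                inside = False
                yield "".join(buffer)
                count += 1
                if count >= limit:
                    # Stop early: the rest of the dump is never read.
                    return
```

Calling `iter_pages(path, 10000)` would give a sample from the start of the dump, which is not a random 10% but may be enough for a first corpus.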

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] How should I start contributing to Apertium

2017-10-16 Thread Kartik Mistry
On Tue, Oct 17, 2017 at 10:42 AM, paridhi kothari  wrote:
> I agree that both are well resourced languages. Also I was going through the
> documentation and the wiki pages which made me realise I would need to know
> how to write Urdu etc. Spoken Urdu is very similar to Hindi. Hindi borrows a
> ton from it but written Urdu is very difficult. So I can't work on this pair
> either.

Thanks for your interest!

A Hindi<->Urdu pair exists and you can contribute there. Alternatively,
AFAIK someone has worked on initializing apertium-gu (Gujarati), so you
could work on a Hindi<->Gujarati pair based on it. I can help with the
grammar and with building the dictionary (there are free dictionaries
available for building a monolingual dictionary for Gujarati).

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Apertium updated in ContentTranslation

2016-10-05 Thread Kartik Mistry
Hello all,

Wikimedia's ContentTranslation tool[1] now contains Machine Translation
support for many new language pairs (plus the latest Apertium build* and
packages), and it also includes Kevin Brubeck Unhammer's work[2]. It
took somewhat longer than we expected, but we're finally done with it.

Many thanks to Tino Didriksen, Kevin Brubeck Unhammer and Francis
Tyers for all their help in the process and for setting an example of a
nice upstream :)

[1] https://www.mediawiki.org/wiki/Content_translation
[2] https://blog.wikimedia.org/2016/06/01/scandinavian-wikipedias-content-translation

* Same as in Debian testing.

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] apertium-swe-nor 0.2.0 released!

2016-06-07 Thread Kartik Mistry
On Wed, Jun 8, 2016 at 2:21 AM, Kevin Brubeck Unhammer
 wrote:
> The pair is already testable from https://apertium.org and it seems
> Kartik and Tino are hard at work packaging stuff so it should be in
> Content Translation for testing Quite Soon™.

Great work!

And, we should see it in Content Translation soon!

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] Apertium in Google Summer of Code

2016-03-01 Thread Kartik Mistry
On Tue, Mar 1, 2016 at 2:24 PM, Francis Tyers  wrote:
> Apertium is in the Google Summer of Code again this year! :D

Congrats!!

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Fwd: WikimediaAnnounce-l Digest, Vol 69, Issue 2

2015-12-04 Thread Kartik Mistry
nced
Wikimedians.[11]

* Wikitherapy <https://meta.wikimedia.org/wiki/Grants:IEG/Wikitherapy>:
An exploration of both the accessibility of Wikimedia editing and the
transformative power of participation, this project will bring Wikimedia
projects including Commons, Wiktionary, and Wikisource to institutional
therapy settings by developing outreach and training materials that
specifically meet the needs of therapy patients.[12]

* Senior Citizens Write in Sanskrit Wikipedia
<https://meta.wikimedia.org/wiki/Grants:IEG/Senior_Citizens_Write_in_Sanskrit_Wikipedia>:
This project will engage speakers of one of the oldest languages of the
world to contribute to an untapped Wikipedia community by partnering with
Sree Somnath Sanskrit University and promoting community activities to
engage active editors and improve content quality.[13]

Online Community Organizing - 1 project funded

* Growing Kannada language Wikimedia Projects with a digital library
<https://meta.wikimedia.org/wiki/Grants:IEG/Growing_Kannada-language_Wikimedia_projects_with_a_digital_library>:
By engaging the online community of Pustaka Sanchaya, a crowdsourced
database of Kannada literature and publications, this project will seek to
recruit new editors who can create and expand content on Kannada Wikimedia
projects using the references and citations available through Pustaka
Sanchaya.[14]

Research - 1 project funded

* Editor Behavior Analysis
<https://meta.wikimedia.org/wiki/Grants:IEG/Editor_Behaviour_Analysis>:
A user-friendly mechanism for generating dynamic and interactive data
visualizations is the goal of this project which will enable researchers to
better understand editing behaviour including type of articles edited by
new editors and devices and tools used for editing.[15]

You can read more about this round in the IEG committee’s post on the
Wikimedia Foundation blog.[16]

The next open call is scheduled for March 1-31, 2016.

Congratulations to the successful grantees!

   1. <https://meta.wikimedia.org/wiki/Grants:IEG>
   2. <https://meta.wikimedia.org/wiki/Grants:IEG/Batch_uploader_for_small_GLAM_project>
   3. <https://meta.wikimedia.org/wiki/Grants:IEG/Pan-Scandinavian_Machine-assisted_Content_Translation>
   4. <https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Validation_via_References>
   5. <https://meta.wikimedia.org/wiki/Grants:IEG/Wikimaps_Warper_2.0>
   6. <https://meta.wikimedia.org/wiki/Grants:IEG/Wiki_needs_pictures>
   7. <https://meta.wikimedia.org/wiki/Grants:IEG/Semi-automatically_generate_Categories_for_Vietnamese_Wikipedia>
   8. <https://meta.wikimedia.org/wiki/Grants:IEG/Proofreading_semiautomatically_the_Catalan_Wikipedia_with_LanguageTool>
   9. <https://meta.wikimedia.org/wiki/Grants:IEG/Editing_Maithili_Wikipedia>
   10. <https://meta.wikimedia.org/wiki/Grants:IEG/Increase_Awareness_of_and_participation_in_Indic_language_Wikipedias_in_Colorado>
   11. <https://meta.wikimedia.org/wiki/Grants:IEG/Motivational_and_educational_video_to_introduce_Wikimedia>
   12. <https://meta.wikimedia.org/wiki/Grants:IEG/Wikitherapy>
   13. <https://meta.wikimedia.org/wiki/Grants:IEG/Senior_Citizens_Write_in_Sanskrit_Wikipedia>
   14. <https://meta.wikimedia.org/wiki/Grants:IEG/Growing_Kannada-language_Wikimedia_projects_with_a_digital_library>
   15. <https://meta.wikimedia.org/wiki/Grants:IEG/Editor_Behaviour_Analysis>
   16. <https://blog.wikimedia.org/2015/12/04/ieg-funds-fourteen-projects/>

*Marti Johnson*
*Program Officer*
*Individual Grants*
*Wikimedia Foundation <http://wikimediafoundation.org/wiki/Home>*
+1 415-839-6885
Skype: Mjohnson_WMF

Imagine a world in which every single human being can freely share
<http://youtu.be/ci0Pihl2zXY> in the sum of all knowledge.  Help us make it
a reality!
Support Wikimedia <https://donate.wikimedia.org/>


-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Stale processes by Apertium-APY

2015-08-05 Thread Kartik Mistry
Hi all,

Apertium-APY leaves lots of stale processes behind, which increases memory
usage on the server. See the bug for the Wikimedia cluster at
https://phabricator.wikimedia.org/T107270

I discussed this with Tino on IRC, and possible solutions are:
1. Forcefully kill and restart a child after X requests, to avoid leaks
2. Reap unused children down to one remaining, if they're idle for Y minutes

Should I file a separate bug so that this can be fixed upstream too?
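
To make the two ideas concrete, here is a rough sketch of the bookkeeping in Python. It is not APY's actual code: `ChildPool`, `Child` and all their fields are hypothetical names, the "processes" are plain records rather than real pipelines, and X and Y appear as `max_requests` and `max_idle_seconds`.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Child:
    """Bookkeeping for one translation pipeline process (hypothetical)."""
    pid: int
    requests: int = 0
    last_used: float = 0.0


@dataclass
class ChildPool:
    max_requests: int          # "X": recycle a child after this many requests
    max_idle_seconds: float    # "Y" minutes, expressed in seconds
    clock: Callable[[], float] = time.monotonic
    children: List[Child] = field(default_factory=list)
    killed: List[int] = field(default_factory=list)
    _next_pid: int = 1

    def _spawn(self) -> Child:
        child = Child(pid=self._next_pid, last_used=self.clock())
        self._next_pid += 1
        self.children.append(child)
        return child

    def _kill(self, child: Child) -> None:
        # A real server would terminate and wait() on the process here.
        self.children.remove(child)
        self.killed.append(child.pid)

    def handle_request(self) -> int:
        """Serve one request; return the pid of the child that served it."""
        child = self.children[0] if self.children else self._spawn()
        child.requests += 1
        child.last_used = self.clock()
        pid = child.pid
        if child.requests >= self.max_requests:
            # Idea 1: recycle the child so any leaked memory is released.
            self._kill(child)
            self._spawn()
        return pid

    def reap_idle(self) -> None:
        """Idea 2: kill idle children, but always keep one alive."""
        now = self.clock()
        idle = [c for c in self.children
                if now - c.last_used >= self.max_idle_seconds]
        for child in idle:
            if len(self.children) <= 1:
                break
            self._kill(child)
```

In a real server `reap_idle` would presumably run from a periodic timer, and `_kill` would have to wait on the terminated process so that it does not linger as a zombie.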

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] apertium removed from Debian stretch

2015-07-14 Thread Kartik Mistry
On Sun, Jun 21, 2015 at 2:29 PM, Kartik Mistry  wrote:
> On Sun, Jun 21, 2015 at 2:27 PM, Paul Wise  wrote:
>>> They're in experimental, I should upload them to unstable once I fix
>>> hfst lintian issues.
>>
>> Any luck with that?
>
> hfst/cg3/lttoolbox - uploaded.
> apertium - by tonight :)

Updates:
* A shiny new Apertium is in unstable. Language packages are being updated slowly.
* HFST is still in NEW.
* I will try to finish the language pairs/packages as soon as possible.
Some depend on HFST, so they will have to wait until it is accepted.

Thanks to Tino for all the nice work!

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] apertium removed from Debian stretch

2015-06-21 Thread Kartik Mistry
On Sun, Jun 21, 2015 at 2:27 PM, Paul Wise  wrote:
>> They're in experimental, I should upload them to unstable once I fix
>> hfst lintian issues.
>
> Any luck with that?

hfst/cg3/lttoolbox - uploaded.
apertium - by tonight :)

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



Re: [Apertium-stuff] apertium removed from Debian stretch

2015-05-30 Thread Kartik Mistry
On Sun, May 31, 2015 at 11:31 AM, Mikel L. Forcada  wrote:
> Should we try to have a more recent and working version of Apertium in
> Debian? How far are we from that?

They're in experimental, I should upload them to unstable once I fix
hfst lintian issues.

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Fwd: [Wikimania-l] Wikimania 2015 Call for submissions

2015-02-12 Thread Kartik Mistry
Hi,

Since Apertium is being used in a Wikimedia project, Content
Translation, it would be nice to give a talk about Apertium and Content
Translation (and about how we can work together better in the future,
etc.!).

Wikimania is fun. Let me know.

~ Kartik

-- Forwarded message --
From: Ivan Martínez 
Date: Mon, Jan 19, 2015 at 11:34 PM
Subject: [Wikimania-l] Wikimania 2015 Call for submissions
To: "Wikimania general list (open subscription)"



Dear all:

We would like to invite submissions[0] proposing presentations, panels,
tutorials and workshops for Wikimania 2015 to be held in Mexico City
in July 2015.

Note that the deadline is February 28; we hope to have final decisions
about the program by early April 2015.

Additionally, we would be delighted to have any additional volunteers for
the program committee[1], which will be finalized shortly.

[0] – https://wikimania2015.wikimedia.org/wiki/Submissions

[1] – https://wikimania2015.wikimedia.org/wiki/Programme_Committee

Warmly,
Deror Avi, James D. Forrester, and Ivan Martinez
Wikimania 2014 Program Committee
wikimania2015 (at) gmail (dot) com

We have created the largest collection of shared knowledge. Help
protect Wikipedia, donate now:
https://donate.wikimedia.org

___
Wikimania-l mailing list
wikimani...@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimania-l



-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Fwd: [New post] Apertium and Wikimedia: A collaboration that powers the Content Translation tool

2014-11-14 Thread Kartik Mistry
From: 
https://blog.wikimedia.org/2014/11/14/apertium-and-wikimedia-a-collaboration-that-powers-the-content-translation-tool/

-- Forwarded message --
From: Wikimedia blog 
Date: Sat, Nov 15, 2014 at 12:11 AM
Subject: [New post] Apertium and Wikimedia: A collaboration that
powers the Content Translation tool

Apertium and Wikimedia: A collaboration that powers the Content Translation tool

Many readers of this blog know about the Content Translation
initiative. This project, developed by the Language Engineering team
of the Wikimedia Foundation, brings together machine translation and
rich text editing to provide a quick method to create Wikipedia
articles by translating them from another language.

Content Translation uses Apertium as its machine translation back-end.
Apertium is a freely licensed open source project and was our first
choice for this stage of development. The first version of Content
Translation focused on the Spanish-Catalan language pair, and one of
the reasons for this choice was the maturity of Apertium's machine
translation for those languages.

However, with growing needs to support more language pairs in the
newer versions of Content Translation, it became essential that the
machine translation continue to be reliable, and that the back-end be
stable and up-to-date. To ensure this stability, we needed to use the
latest updates released by the Apertium upstream project maintainers,
and we needed to use Apertium as a separate service. Prior to this
set-up, the Apertium service was being provided from within the
Content Translation server (cxserver).

The Content Translation tool is currently hosted on Wikimedia’s beta
servers. To set up the independent Apertium service, it was important
to use the latest released stable packages from Apertium, but they
were not available for the current versions of Ubuntu and Debian. This
became a significant blocker, because use of third party package
repositories is not recommended for Wikimedia’s server environments.

After discussion with Wikimedia’s Operations team and Apertium project
maintainers, it was decided that the Apertium packages would be built
for the Wikimedia repository. In addition to the Apertium base
packages, individual packages for supporting the language pairs and
other service packages were built, tested and included in the
Wikimedia repository. Alexandros Kosiaris (from the Wikimedia
Operations team), reviewed and merged these packages and the patches
for their inclusion in the repository. The Apertium service was then
puppetized for easy configuration and management on the Wikimedia beta
cluster.

Meanwhile, to make Apertium more accessible for Ubuntu and Debian
users, Kartik Mistry (from the Wikimedia Language Engineering team)
also started working closely with the Apertium project maintainers, to
make sure that the Debian packages were up-to-date in the main
repository. Going forward, once the updated packages are included in
Ubuntu’s next Long Term Support (LTS) version, we plan to remove these
packages from the internal Wikimedia repository.

The Content Translation tool has since been updated and now supports
Catalan, Portuguese and Spanish machine translation, using the updated
Apertium service through cxserver. We hope our users will benefit from
the faster and more reliable translation experience.

We would like to thank Tino Didriksen, Francis Tyers and Kevin Brubeck
Unhammer from the Apertium project, and Alexandros Kosiaris and
Antoine Musso from the Wikimedia Operations and Release Engineering
teams respectively, for their continued support and guidance.

Runa Bhattacharjee, and Kartik Mistry, Wikimedia Language Engineering team

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com



[Apertium-stuff] Request for commit rights of apertium SVN

2014-10-09 Thread Kartik Mistry
Hi,

As discussed on IRC, I'm requesting commit rights to the Apertium SVN repository.

Why
---
I've been working closely with Tino on Debian packaging of the latest
Apertium (the core and some language pairs have already been pushed to
Debian experimental). In the process, I've spotted minor issues, which
have been fixed with the help of the really helpful Apertium devs. I
don't want to bother them with small issues and would rather commit the
fixes myself.

I work with Wikimedia Foundation, where we are building Content
Translation which uses Apertium as MT backend. Expect more reports on
this :)

Scope
-----
I'll mostly be fixing issues related to packaging and APY, and sometimes
issues related to language pairs.

Needed information
------------------
Name: Kartik Mistry
SF Username: kartik_m

Thanks!

-- 
Kartik Mistry/કાર્તિક મિસ્ત્રી | IRC: kart_
{kartikm, 0x1f1f}.wordpress.com
