Re: [Apertium-stuff] Important Information: Apertium PMC Election

2013-03-24 Thread Kevin Brubeck Unhammer
> of the PMC. Another volunteer here :) -- Kevin Brubeck Unhammer GPG: 0x766AC60C

[Apertium-stuff] Android ideas (was: Google joins Apertium in providing offline translation on Android.)

2013-04-08 Thread Kevin Brubeck Unhammer
ilt-in? If so, it might be useful to have a button that uses the built-in service where Android makes it available. My main wish for Android would be support for HFST's dictionaries; I read http://lists.puimula.org/pipermail/libvoikko/2012-August/000442.html as HFST being used for spellers on And

Re: [Apertium-stuff] HFST .lexc

2013-04-08 Thread Kevin Brubeck Unhammer
rom now on @D.NUMxrel.pres@ means: only allow this analysis if the 'NUMxrel' feature is 'pres' These features are called "flag diacritics" for no good reason. They can be useful for expressing long-distance relationships. I don't know what @D.lim@ does,

Re: [Apertium-stuff] Apertium PMC election: election board

2013-04-08 Thread Kevin Brubeck Unhammer
rcada Mikel L. Forcada > 15 nordfalk Jacob Nordfalk > 16 sanmarf Felipe Sánchez Martínez > 17 selimcan Ilnar Salimzyan > 18 sortiz Sergio Ortiz > 19 spectre360 Francis Tyers > 20 tunedal Per Tunedal > 21 unhammer Kevin Brubeck Unhammer > 22 xavivars Xavi I

Re: [Apertium-stuff] Problem with lt-proc -c -t output

2013-04-09 Thread Kevin Brubeck Unhammer
regular lttoolbox and run $ lt-proc -c -t /home/karunakar/scl/skt_gen/bin/skt_gen.bin < input ? Do you have a UTF-8 locale installed and set? What does $ export|grep LANG give ? -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Apertium PMC election: election board

2013-04-09 Thread Kevin Brubeck Unhammer
el has pointed out on IRC, the decision is the > election board's. It didn't occur to me that we have committers who don't follow apertium-stuff, guess I was wrong :) Jernej agreed to keeping the census open a bit more, I do too – Xavi? Should we say until the 16th? So s

Re: [Apertium-stuff] Apertium PMC election: election board

2013-04-10 Thread Kevin Brubeck Unhammer
Francis Tyers writes: > On Tue, 09 Apr 2013 at 22:20 +0200, Kevin Brubeck Unhammer > wrote: >> So sf.net considers it spam to send messages to all committers, but is >> there any other place than IRC we should be spamming^Wmessaging? > > If you prepare

Re: [Apertium-stuff] "Interface for creating tagged corpora" GSOC 13

2013-04-12 Thread Kevin Brubeck Unhammer
o ./configure --enable-utf before compiling, does that help? [...] > For the other task I still have to learn a bit more about how the > TSX files are structured, and how the rules (grammar constraints) are > built http:/

[Apertium-stuff] Last day to propose yourself for PMC Candidate (was: Last day to include yourself in the Census)

2013-04-21 Thread Kevin Brubeck Unhammer
Final call to add yourselves to http://wiki.apertium.org/wiki/PMC_election#Candidates ! -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] GSoC Idea

2013-04-27 Thread Kevin Brubeck Unhammer
da, which is processed by an XSLT script into two different bidixes before compilation. Most entries would be the same, but some would be marked nn-only or nb-only. This kind of thing happens in a lot of apertium pairs, and should be no trouble to set up. -- Kevin Brubeck Unhammer Written wi

Re: [Apertium-stuff] GSoC Idea

2013-04-30 Thread Kevin Brubeck Unhammer
't work before adding new stuff (actually, I would first run a big corpus through the CG with --trace and delete any unused rules, to make it easier to deal with). But if you work on no→da first, the da CG would not be useful yet. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] ConfirmAccount on Apertium Wiki

2013-06-06 Thread Kevin Brubeck Unhammer
+1, if it's possible to give several people the responsibility of approving so that registrations happen quickly even if people go off the grid (I've been waiting for confirmation for two years on the thinkpadwiki :-/). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] ConfirmAccount on Apertium Wiki

2013-06-06 Thread Kevin Brubeck Unhammer
"Bernard Chardonneau" writes: >> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) >> Date: Thu, 06 Jun 2013 15:38:27 +0200 >> From: Kevin Brubeck Unhammer >> To: apertium-stuff@lists.sourceforge.net >> Reply-To: apertium-stuff@lists.sourcef

Re: [Apertium-stuff] ConfirmAccount on Apertium Wiki

2013-06-06 Thread Kevin Brubeck Unhammer
> in its tracks on message boards I have run in the past (dramatic difference > between dozens of posts to clean up daily versus one or two weekly). stopforumspam.com seems to be down, but https://www.mediawiki.org/wiki/Manual:Combating_spam#IP_address_blacklists

Re: [Apertium-stuff] CG XML format

2013-07-04 Thread Kevin Brubeck Unhammer
User:David_Nemeskey/CG_XML_brainstorming ? -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] CG XML format

2013-07-05 Thread Kevin Brubeck Unhammer
are XML id's; I wish they weren't, since for some reason XML id's are terribly limited in what characters they can contain, e.g. neither @, →, ←, $, nor &entity; are allowed (CG tags need to be able to have @ or → in them). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Help with macros and superblanks

2013-07-16 Thread Kevin Brubeck Unhammer
ion would be to do what is intuitive: to allow as a > parameter to macros. The correct++ resolution would be to allow > anything that can appear in to be used as a parameter to a > macro. Is there a good reason why we can't pass anything to a macro? If people try to pass

Re: [Apertium-stuff] Help with macros and superblanks

2013-07-16 Thread Kevin Brubeck Unhammer
Kevin Brubeck Unhammer writes: > "Jimmy O'Regan" > writes: > >> [Readding list cc] >> >> On 13 July 2013 07:12, Mikel Forcada >> wrote: >>> Sergio, Jimmy, all: >>> >>> Thanks for your help. I am, however, still a b

Re: [Apertium-stuff] /seg__refl

2013-07-17 Thread Kevin Brubeck Unhammer
in nb-nn to turn it into "vere" since "befinne seg" doesn't really work in nn. (The pardef needs more work if you ever want to keep the reflexive pronoun, as the TODO comment tries to say.) -- Kevin Brubeck Unhammer

Re: [Apertium-stuff] new top-level SVN module for monolingual language packs

2013-08-24 Thread Kevin Brubeck Unhammer
Francis Tyers writes: > Hello Apertiumers! > > I'd like to canvass opinions on creating a new top-level SVN module for > monolingual language packs. +1 -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Old fashoned SMT IBM model 1 outperforms Apertium

2013-08-30 Thread Kevin Brubeck Unhammer
it's time to > move on to more meaningful texts. That's why I believe it would be a > help in developing new language pairs. It's hardly natural language though. You'll perhaps get a good translator of block world texts, but how often d

Re: [Apertium-stuff] Old fashoned SMT IBM model 1 outperforms Apertium

2013-08-30 Thread Kevin Brubeck Unhammer
requent outside the block world corpus? How did you come by that information? If you've already got a real frequency list, it's a waste of time to make a new one from text that you know is not natural. The same argument goes for grammatical constructions. -- Kevin Brubeck Unhammer h

Re: [Apertium-stuff] [Fwd: The Sami-Estonian application that just went in.]

2013-09-03 Thread Kevin Brubeck Unhammer
Francis Tyers writes: > Hello all, > > We just got a project funded for Finnish-Estonian and Finnish-North > Sámi! > > Go Apertium! :D Congrats =D -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Icelandic <-> Swedish

2013-09-06 Thread Kevin Brubeck Unhammer
in order to make a tarball. This can be uploaded to sourceforge, but perhaps you want to increase the version number first? See http://wiki.apertium.org/wiki/Making_a_release -- Kevin Brubeck Unhammer GPG: 0x766AC60

Re: [Apertium-stuff] Icelandic <-> Swedish

2013-09-08 Thread Kevin Brubeck Unhammer
Tihomir Rangelov writes: > On 6.9.2013, at 08:30, Kevin Brubeck Unhammer wrote: >> apertium.org runs only released tarballs. I made some minor changes to >> the makefile so you should be able to type "./autogen.sh && make dist" >> in order to make a tarba

Re: [Apertium-stuff] null flush in transfer

2013-09-08 Thread Kevin Brubeck Unhammer
eason that it doesn't work as is. > > Fran One confusing thing is that when null_flush is true, transfer calls transfer_wrapper_null_flush which then calls transfer (now with null_flush false, but internal_null_flush true). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Icelandic <-> Swedish

2013-09-08 Thread Kevin Brubeck Unhammer
Tihomir Rangelov writes: > Kærar þakkir! > When does it appear on the web service? Someone with ssh access needs to install it; I don't know who currently has access – anyone? -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] null flush in transfer

2013-09-08 Thread Kevin Brubeck Unhammer
Kevin Brubeck Unhammer writes: > Francis Tyers writes: > >> Hello all, >> >> Can anyone confirm that null flush (-z) option in transfer works with >> the -b (no bidix) option ? [...] Mistook you to mean "lt-proc -b", but apertium-transfer -z -b also w

Re: [Apertium-stuff] null flush in transfer

2013-09-09 Thread Kevin Brubeck Unhammer
; morpho_stream.setEndOfFile(false); } I'm not 100 % sure if "alpha[0][eos] = 1;" is needed. It seems to give the same result without it too, but it is set when starting the tagging and doesn't hurt so I included it. Felipe (or anyone else who understands the HMM): d

Re: [Apertium-stuff] The tsx-files in sv-da

2013-09-09 Thread Kevin Brubeck Unhammer
output | dwdiff --color --diff-input | less $ diff -U1 old-da-sv-output new-da-sv-output | dwdiff --color --diff-input | less [1] Unless you have an army of competent human translators happy to post-edit lots of texts for you every time you make a change just so you can run WER. -- Kev

Re: [Apertium-stuff] The tsx-files in sv-da

2013-09-10 Thread Kevin Brubeck Unhammer
search?searchon=names&keywords=dwdiff -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] The tsx-files in sv-da

2013-09-10 Thread Kevin Brubeck Unhammer
ur Debian has a different version of it than my Ubuntu (dwdiff 2.0.4). Anyway, you can do $ dwdiff -C1 --color old-sv-da-output new-sv-da-output | less to get the same effect (it's just a bit slower with huge files in my experience). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] The tsx-files in sv-da

2013-09-11 Thread Kevin Brubeck Unhammer
d, not the "diff -U1 | dwdiff" version. Or, with your blockworld files, you run $ dwdiff -C1 --color blockworld.evaluation.da blockworld.evaluation.sv.translated | less If there is no difference, it should not show any output (this happened when I tried it on the very little Danish text I

Re: [Apertium-stuff] Tagger training sv-da

2013-09-13 Thread Kevin Brubeck Unhammer
> > Wikipedia very rarely has good translations between languages. Fortunately, training the tagger doesn't require parallel text, just monolingual text. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Comparing versions of a language pair

2013-09-15 Thread Kevin Brubeck Unhammer
contribute (sometime in the summer 2012?) with the most recent version. Just "svn up -rN" where N is your first commit. Do $ svn log > thelog to download the commit log to find your first commit. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Comparing versions of a language pair

2013-09-16 Thread Kevin Brubeck Unhammer
ux terminal to read, they should show up as green and red. Maybe your "less" command is deficient and does not show colour? You could try without "|less" if your terminal lets you scroll anyway. -- Kevin Brubeck Un

Re: [Apertium-stuff] One-to-two relations in the pair sv-da

2013-09-19 Thread Kevin Brubeck Unhammer
've just replaced .prob files without changing their file names, then no, you don't need new modes files.) -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] How does the tagger choose the source word

2013-09-23 Thread Kevin Brubeck Unhammer
^blå/blå/blå$ > ^kon/ko/kon/ko/ko/kon/kon$ > ^./.$ If you want to test this properly, use "lt-proc -e", since your dictionary has compounding tags on it. The tags / should never appear in output. --

Re: [Apertium-stuff] One-to-two relations in the pair sv-da

2013-09-23 Thread Kevin Brubeck Unhammer
^ta$ lt-proc with bidix gives: ^ta/take/get/grab$ lrx-proc (lexical selection) gives: ^grab$ and that is passed on to transfer rules (which ensure you write "grab the cone" and not just "grab cone", but they are not responsible fo

Re: [Apertium-stuff] How does the tagger choose the source word

2013-09-23 Thread Kevin Brubeck Unhammer
hological disambiguation includes choosing noun vs verb (English ^fly/fly/fly/fly$), choosing different lemmas of nouns (Swedish ^kon/ko/kon$), or even different analyses of the same lemma+PoS (English ^sheep/sheep/sheep$). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] How does the tagger choose the source word

2013-09-23 Thread Kevin Brubeck Unhammer
> Or am I wrong? Does the tagger work more like a language model i.e. take > the actual words into account? You can put lemmas in the TSX if you want (example: http://wiki.apertium.org/wiki/Tagger_training#Writing_a_TSX_file ), but if I understand correctly it won't take forms/lemmas

Re: [Apertium-stuff] Finding patterns

2013-10-06 Thread Kevin Brubeck Unhammer
kistán$}$^punt{^.$}$ $ echo between Kazakhstan and outer space | apertium -d . en-es-chunker ^Pr{^entre$}$ ^nom{^Kazajistán<3><4>$}$ ^cnj{^y$}$ ^nom_adj{^espacio<3><4>$ ^exterior<4>$}$^punt{^.$}$ What's the use case? -- Kevin Brubeck Unhammer GPG: 0x766AC60C

[Apertium-stuff] Can we remove version numbers from pkg-config .pc files?

2013-10-15 Thread Kevin Brubeck Unhammer
1] https://www.flameeyes.eu/autotools-mythbuster/pkgconfig/index.html [2] https://www.flameeyes.eu/autotools-mythbuster/pkgconfig/pkg_check_modules.html -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Can we remove version numbers from pkg-config .pc files?

2013-10-30 Thread Kevin Brubeck Unhammer
Francis Tyers writes: > On Tue, 15 Oct 2013 at 11:47 +0200, Kevin Brubeck Unhammer > wrote: [...] >> If we rename the .pc files, all language pairs need one change to >> configure.ac, but hopefully this will be the last one they need :-) > > I think t

Re: [Apertium-stuff] apertium-sme-nob error

2013-10-31 Thread Kevin Brubeck Unhammer
context b9:b >: i: b9:b _ X3: j:i WARNING! The conflict is unresolvable. There is a <=-rule conflict between "e7 Rising" and "Illative Singular Metaphony2 SUBCASE: Vx=e7 Vy=á". E.g. in context b9:b i: b9:b _ W9: W6: j:i

Re: [Apertium-stuff] Instalando apertium en VPS

2013-11-11 Thread Kevin Brubeck Unhammer
usr/local/bin:${PATH} in your ~/.bashrc (or ~/.profile or ~/.bash_profile or whichever file you use), opening a new terminal, and trying again. This should make /usr/bin preferred over /usr/local/bin. If that doesn't work, can you try $ /usr/local/bin/pe
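
The PATH-ordering point in the snippet above can be illustrated with a minimal sketch, assuming a POSIX shell. The two temporary directories and the stand-in script named tool are hypothetical; they only show that the directory listed earlier in PATH is the one whose program gets run.

```shell
# Two throwaway directories stand in for /usr/bin and /usr/local/bin.
first=$(mktemp -d)
second=$(mktemp -d)

# Put a script with the same name in both; each reports where it lives.
printf '#!/bin/sh\necho first\n' > "$first/tool"
printf '#!/bin/sh\necho second\n' > "$second/tool"
chmod +x "$first/tool" "$second/tool"

# Inside the subshell, the directory listed earlier in PATH is searched first.
resolved=$(PATH="$first:$second:$PATH"; tool)
echo "$resolved"

# Swapping the order changes which copy is found.
swapped=$(PATH="$second:$first:$PATH"; tool)
echo "$swapped"

rm -rf "$first" "$second"
```

The same check on a real system is "command -v perl", which prints the copy that the current PATH resolves to.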

Re: [Apertium-stuff] Instalando apertium en VPS

2013-11-13 Thread Kevin Brubeck Unhammer
use5005threads=undef useithreads=undef useithreads=undef means /usr/local/bin/perl can't be used. [...] > root@host [~]# /usr/bin/perl -V // gives the same output as the previous one Exactly the same? Most importantly, did /usr/bin/perl -V also say "useithreads=undef"? (The perl on my

Re: [Apertium-stuff] Guidelines: t1x or t2x?

2013-11-13 Thread Kevin Brubeck Unhammer
bit more zoomed out, there a verb preposition tag might make us change the preposition chunk. (If I were to do it again, I would probably wait with inserting the preposition chunk until t2x. That is, t1x would add e.g. to the noun chunk and to the verb chunk, while t2x would decide which one to us

Re: [Apertium-stuff] Instalando apertium en VPS

2013-11-14 Thread Kevin Brubeck Unhammer
you would have to check with them. If none of that works, http://stackoverflow.com/a/19063960/69663 suggests editing /usr/share/automake-1.11/Automake/Config.pm to say our $perl_threads = 0; instead of "our $perl_threads = 1;" – it smells like a hack, but it may work (if you have root).

Re: [Apertium-stuff] Windows installation problems

2013-11-21 Thread Kevin Brubeck Unhammer
> should really either update that installer, or remove it. Seems like it needs an update, yes: http://www.google-melange.com/gci/task/view/google/gci2013/6396457749839872 http://superuser.com/a/628401 Does the source for that installer exi

Re: [Apertium-stuff] Windows installation problems

2013-11-21 Thread Kevin Brubeck Unhammer
"Jimmy O'Regan" writes: > On 21 November 2013 15:05, Kevin Brubeck Unhammer > wrote: >> "Jimmy O'Regan" >> writes: >> >>> I'm not 100% about this, but there was a problem with Cygwin recently >>> - IIRC, certain

Re: [Apertium-stuff] Task ideas for Google Code In

2013-12-07 Thread Kevin Brubeck Unhammer
demo app" for other app developers to build on; but I think the plan was to have a separate, more fancy app, so your ideas could certainly go into a new app based on the current one. But then I think we need some mentors who know the android stuff to hang out on #apertium (hint, hint). [

Re: [Apertium-stuff] GSOC Idea: Take a language pair and make it state of the art

2014-02-05 Thread Kevin Brubeck Unhammer
> one. +1 -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] GSOC idea: make an app for Iphone/Ipad

2014-02-06 Thread Kevin Brubeck Unhammer
one, run it through OCR and Apertium on the server and send back the translation in Kazakh. And the web page would run not only on iOS. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

[Apertium-stuff] lt-trim

2014-02-07 Thread Kevin Brubeck Unhammer
ll get to try it out. If the powers that be accept the code, I can merge this into lttoolbox (without all the #ifdef DEBUG statements =P). [1] http://wiki.apertium.org/wiki/Talk:Automatically_trimming_a_monodix#.23-type_multiwords -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] lt-trim

2014-02-09 Thread Kevin Brubeck Unhammer
Kevin Brubeck Unhammer writes: > There is one caveat: (group element) is not handled yet. The man > page notes how to work around that. I have an idea for handling the > group element[1], but I'm not sure when I'll get to try it out. With the latest version on https://

Re: [Apertium-stuff] lt-trim

2014-02-10 Thread Kevin Brubeck Unhammer
Kevin Brubeck Unhammer writes: > Regarding multiwords with both and at once, I'm not even sure > what those should look like. But when I tested with ca-en they seem to > be included, except these inf+es entries which keep getting trimmed > out: > > canvi

Re: [Apertium-stuff] lt-trim

2014-02-14 Thread Kevin Brubeck Unhammer
it with a warning that it's still considered experimental). -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Non Standard dictionary format

2014-03-11 Thread Kevin Brubeck Unhammer
typically used for marking language variants. E.g. in an English dix, you might have alt="en_GB" on the entry for "colour", and alt="en_US" on the entry for "color". Then the alt.xsl script called with en_GB as the option would include only those entries that h

Re: [Apertium-stuff] Fwd: Automatic localization of Android applications

2014-03-11 Thread Kevin Brubeck Unhammer
ed fine with the html-noent formatter. The guy wanted to translate Android core resources into Nynorsk; however, he never found out how to contribute his translations to the Android project (and neither did I) so that died down pretty quickly. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Non Standard dictionary format

2014-03-11 Thread Kevin Brubeck Unhammer
Gaurav Agrawal writes: > Hello Kevil, > > Thanks for answering my queries, I have some further doubts. > > Thanks and Regards, > Gaurav Agrawal > M.Tech CSE > IIIT, Hyderabad > > IRC #ergaurav2 > > On Tue, Mar 11, 2014 at 1:36 PM, Kevin Brubeck Unhammer >

Re: [Apertium-stuff] Fwd: Automatic localization of Android applications

2014-03-11 Thread Kevin Brubeck Unhammer
"Jim O'Regan" writes: > On 11 March 2014 08:12, Kevin Brubeck Unhammer > wrote: >> Mikel Artetxe writes: >> >> [...] >> >>> 3) The real work to do would be writing a deformatter/reformatter for >>> string resources, which use

Re: [Apertium-stuff] [GSOC] Code/Improving support for non-standard text input

2014-03-18 Thread Kevin Brubeck Unhammer
mputer science at Pennsylvania State > University and last semester I had an NLP class. I'm familiar with the > input normalization task. I speak English, Arabic and French very > fluently. > > Is there any specific language that I should target? All of them :-) but perhaps s

Re: [Apertium-stuff] Improving support for non-standard text input

2014-03-18 Thread Kevin Brubeck Unhammer
-> birthday. So we > have to handle them separately. And to do this first we have to > classify them. > > For translation I have just written a bash script > while read line; do echo $line | apertium en-eo; done < Tweets That would run much faster as apertium en-eo <
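
A minimal sketch of why the quoted per-line loop is slow, assuming a POSIX shell. The translate function is a hypothetical stand-in built on tr (it is not Apertium), used only so the example runs anywhere; any filter that reads standard input behaves the same way.

```shell
# Hypothetical stand-in for the translator: a filter from stdin to stdout.
translate() { tr 'a-z' 'A-Z'; }

input=$(mktemp)
printf 'one line\nanother line\n' > "$input"

# Per-line loop: the filter is started once for every input line.
looped=$(while read -r line; do echo "$line" | translate; done < "$input")

# Single invocation: the filter starts once and streams the whole file.
streamed=$(translate < "$input")

# Both approaches give the same text; only the number of process starts differs.
[ "$looped" = "$streamed" ] && echo "same output"

rm -f "$input"
```

With a real translation pipeline, each start has to load dictionaries and rules again, so the start-up cost of the loop grows with the number of lines.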

Re: [Apertium-stuff] GSOC Idea: Take a language pair and make it state of the art

2014-03-18 Thread Kevin Brubeck Unhammer
od idea, you should certainly do it, but it's not really a deliverable. [1] http://wiki.apertium.org/wiki/Easy_dictionary_maintenance -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-03-25 Thread Kevin Brubeck Unhammer
Francis Tyers writes: > Hello everyone, > > I think it is time to do a new release of apertium and lttoolbox. +1 -- Kevin Brubeck Unhammer Sent from my emacs

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-03-25 Thread Kevin Brubeck Unhammer
hunk.dtd tagger.dtd transfer.dtd - acx.rng modes.rng transfer.rng (I would prefer to rm {dix,acx}.* from apertium, and otherwise either have one format for all files, or both formats for all files.) -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-03-25 Thread Kevin Brubeck Unhammer
"Jim O'Regan" writes: > On 25 March 2014 13:14, Kevin Brubeck Unhammer wrote: [...] >> While on the subject of things that could be sorted out before a >> release, why are some XML validation files .rng and some .dtd? And why >> does apertium

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-03-27 Thread Kevin Brubeck Unhammer
"Jim O'Regan" writes: > On 25 March 2014 22:15, Kevin Brubeck Unhammer wrote: >> "Jim O'Regan" writes: [...] >> So it should be fine to just change >> >> @echo "$(XMLLINT) --dtdvalid $(apertiumdir)/dix.dtd --noout >

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-04-29 Thread Kevin Brubeck Unhammer
Kevin Brubeck Unhammer writes: > "Jim O'Regan" writes: > >> On 25 March 2014 22:15, Kevin Brubeck Unhammer >> wrote: >>> "Jim O'Regan" writes: > > [...] > >>> So it should be fine to just change >>> >>

[Apertium-stuff] minor releases for eu-es, sme-nob, kaz-tat and ca-it

2014-05-06 Thread Kevin Brubeck Unhammer
ekend: apertium-kaz-tat 0.2.1 apertium-sme-nob-0.5.1 apertium-ca-it 0.1.1 apertium-eu-es-0.3.3 Apart from the mentioned syntax errors, I haven't touched dictionaries, only Makefile.am and configure.ac, so they're as testvoqued as they were the previous release. -- Kevin Brubeck Unhammer

Re: [Apertium-stuff] Information about word reordering

2014-05-08 Thread Kevin Brubeck Unhammer
script to do 0El1 … instead of [0]El[1] and call apertium with the "-f html-noent" format option instead of "-f none" (since the latter will break as soon as you get any "special symbol" as input). -- Kevin Bru

Re: [Apertium-stuff] Information about word reordering

2014-05-08 Thread Kevin Brubeck Unhammer
e' > > with the links reordered. Ah, then numbered superblanks should _not_ work, we try not to reorder superblanks since it could mess up HTML. (Tino Didriksen suggested a solution involving treating some blanks as glued to the words and some as non-reorderable, but no one's tried i

Re: [Apertium-stuff] Need help with testing language corpus

2014-05-18 Thread Kevin Brubeck Unhammer
the corpus from Wikipedia dumps. Hence, we're looking > for help with this > > For more details, check out this link: > http://piratepad.net/LanguagesTested or contact me/sushain. Thanks! Do you just need a plain text corpus in various languages? -- Kevin Brubeck Unhammer GPG

[Apertium-stuff] minor memory leak fixed in reformatters

2014-05-21 Thread Kevin Brubeck Unhammer
I just committed a little bugfix to trunk/apertium (reported by Vee / v21 on IRC, thanks!). On non-ASCII input, the reformatters would eat memory until they dropped. If you run apertium on large corpora and have memory issues you might want to update :-) -- Kevin Brubeck Unhammer GPG

Re: [Apertium-stuff] A design limitation: perfect format handling in transfer may be impossible

2014-05-26 Thread Kevin Brubeck Unhammer
://wiki.apertium.org/wiki/Reordering_superblanks The sketched solution seems to me like it should deal with all of the above issues. Comments please :-) -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] Interested in continuing the pair German<->Dutch

2014-06-09 Thread Kevin Brubeck Unhammer
C to discuss it. If upper management agrees, we can probably get you commit access. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] new release of lttoolbox and Apertium

2014-06-11 Thread Kevin Brubeck Unhammer
use lt-proc for non-Apertium things. But it would require changing modes files for any pairs that want to take advantage of it … I think maybe a hardcoded ignore-list in lttoolbox would be more helpful to more users. Are there other use-cases than soft-hyphens? Or cases where we want to _not_ ignore the s

Re: [Apertium-stuff] Machine translation aid on LimeSurvey Manual

2014-06-16 Thread Kevin Brubeck Unhammer
from ScaleMT that might affect things is that APY uses three-letter language codes (ISO-639-3); I have no idea how to debug it in the mediawiki thing though. -- Kevin Brubeck Unhammer GPG: 0x766AC60C

Re: [Apertium-stuff] What version of apertium is production ready?

2014-07-08 Thread Kevin Brubeck Unhammer
rtium/files/lttoolbox/ and https://sourceforge.net/projects/apertium/files/apertium/ -- Kevin Brubeck Unhammer GPG: 0x766AC60C
