Re: Russian Language Model for Joshua

2016-07-15 Thread Matt Post
no worries I got it packed. will email later tonight. 

matt (from my phone)

> On Jul 15, 2016, at 6:32 PM, Mattmann, Chris A (3980) 
>  wrote:
> 
> Will do.
> 
> Adding Paul Zimdars - do we have an Amazon machine that has > 256GB
> of memory? How much would that cost?
> 
> Cheers,
> Chris
> 
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
>> On 7/15/16, 1:42 PM, "Matt Post"  wrote:
>> 
>> All right, started trying to recompile. If you have a machine with > 256 GB 
>> of memory, it might be more efficient for me to give you the raw ARPA file 
>> and for you to compile it. We'll see how it goes. Ping me in a day if you 
>> don't hear from me.
>> 
>> matt
>> 
>> 
>>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) 
>>>  wrote:
>>> 
>>> Yes please! :)
>>> 
>>> Sent from my iPhone
>>> 
 On Jul 15, 2016, at 1:39 PM, Matt Post  wrote:
 
 I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM 
 compiles of it failed in the past, but I'll try again. I expect it to be 
 about 8 GB when that's done. Do you want it?
 
 matt
 
 
> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) 
>  wrote:
> 
> Hey Folks,
> 
> Anyone have a Russian Language Model for Joshua? Lewis was working on
> one, not sure if he has it but just broadening the question.
> 
> Cheers,
> Chris
> 
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++
>> 



Re: Russian Language Model for Joshua

2016-07-15 Thread Tom Barber
Street price is:

r3.8xlarge 32 104 244 2 x 320 SSD $2.66 per Hour



--

Director Meteorite.bi - Saiku Analytics Founder
Tel: +44(0)5603641316

(Thanks to the Saiku community we reached our Kickstart

goal, but you can always help by sponsoring the project
)

On 15 July 2016 at 23:32, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Will do.
>
> Adding Paul Zimdars - do we have an Amazon machine that has > 256GB
> of memory? How much would that cost?
>
> Cheers,
> Chris
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++
>
>
>
>
>
>
>
>
>
>
> On 7/15/16, 1:42 PM, "Matt Post"  wrote:
>
> >All right, started trying to recompile. If you have a machine with > 256
> GB of memory, it might be more efficient for me to give you the raw ARPA
> file and for you to compile it. We'll see how it goes. Ping me in a day if
> you don't hear from me.
> >
> >matt
> >
> >
> >> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
> >>
> >> Yes please! :)
> >>
> >> Sent from my iPhone
> >>
> >>> On Jul 15, 2016, at 1:39 PM, Matt Post  wrote:
> >>>
> >>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM
> compiles of it failed in the past, but I'll try again. I expect it to be
> about 8 GB when that's done. Do you want it?
> >>>
> >>> matt
> >>>
> >>>
>  On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
> 
>  Hey Folks,
> 
>  Anyone have a Russian Language Model for Joshua? Lewis was working on
>  one, not sure if he has it but just broadening the question.
> 
>  Cheers,
>  Chris
> 
>  ++
>  Chris Mattmann, Ph.D.
>  Chief Architect
>  Instrument Software and Science Data Systems Section (398)
>  NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>  Office: 168-519, Mailstop: 168-527
>  Email: chris.a.mattm...@nasa.gov
>  WWW:  http://sunset.usc.edu/~mattmann/
>  ++
>  Director, Information Retrieval and Data Science Group (IRDS)
>  Adjunct Associate Professor, Computer Science Department
>  University of Southern California, Los Angeles, CA 90089 USA
>  WWW: http://irds.usc.edu/
>  ++
> >>>
> >
>


Re: Avoiding master failures with CI

2016-07-15 Thread Tom Barber
Don't ask about github pushing its like the antichrist! ;)

--

Director Meteorite.bi - Saiku Analytics Founder
Tel: +44(0)5603641316

(Thanks to the Saiku community we reached our Kickstart

goal, but you can always help by sponsoring the project
)

On 15 July 2016 at 23:31, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Hey Matt,
>
> Apache infra supports Travis CI - just file a ticket and they will
> set it up :)
>
> Cheers,
> Chris
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++
>
>
>
>
>
>
>
>
>
>
> On 7/15/16, 2:05 PM, "Matt Post"  wrote:
>
> >Question for Chris and/or Lewis:
> >
> >So, Kellen and I took a look at this today, and it looks like a good
> solution. The problem is that it integrates with projects hosted on Github
> that you have write access to. In order to make use of this, we'd need to
> rearrange the setup we have.
> >
> >Currently, we push to a repo at git.apache.org, and that is then pushed
> down to github.com/apache/incubator-joshua. This lets us use the Github
> repo for receiving things like pull requests and so on, but we do not have
> write access to it, so merges and so on have to be handled manually.
> >
> >To use Travis-ci, we'd need to re-enginneer this. Apache would need to
> give us write access to github.com/apache/incubator-joshua, or we'd need
> to use another official host for Joshua. We'd then use git.apache.org as
> the mirror, instead of the other way around.
> >
> >Is there any way that this could be done? I understand Apache's arguments
> about keeping discussions at home, since github may not last forever.
> However, it seems like we could do this if we use git.apache.org as the
> backup mirror, and continue to use JIRA for discussions and so on. In
> general, Github has a lot of tools that could help with development. It
> would be nice if we could make use of them while still checking off
> Apache's logging requirements.
> >
> >matt
> >
> >
> >
> >> On Jul 11, 2016, at 6:50 PM, kellen sunderland <
> kellen.sunderl...@gmail.com> wrote:
> >>
> >> Sorry should have provided the link to this page:
> https://travis-ci.org/ .
> >> If you scroll down a bit on that page there's a Pull Request flow
> section,
> >> it's the flow I'd be most in favour of.  There's also a decent (but
> rushed)
> >> demo here: https://www.youtube.com/watch?v=Uft5KBimzyk .  We actually
> don't
> >> need to do a lot of the work that he demos, i.e. no node or gulp
> >> configuration.  Our setup is close enough to default a default java
> project
> >> that we just have to tell it to build java 8 and then it runs maven
> >> properly.
> >>
> >> Using a CI server would have some aspects that are similar to the
> branching
> >> document you mention, and some benefits that are a bit orthogonal.
> Most of
> >> these benefits have to do with unit testing, which isn't covered in the
> doc.
> >>
> >> First the orthogonal benefits:  The main benefit we would get from
> using CI
> >> is that we guarantee code in our repo is never broken.  That is to say
> >> tests always pass and it always builds correctly.  CI servers are really
> >> useful to prevent problems where one developer may have everything
> working
> >> properly on his/her machine, but when they later realize it's not
> working
> >> on another devs machine.  A good example of this is the
> class-based-lm-test
> >> we pushed recently.  It works fine for me locally but it would fail for
> >> anyone without kenlm.so.  There are many other examples (javadoc errors,
> >> code style, etc) but what will happen in these cases is we'll see a big
> >> obvious 'The build has problems' message in the PR page on Github.  If
> the
> >> CI server runs of all of our code quality checks and finds that
> everything
> >> is good we'll get a big 'This PR is ready to merge' message.
> >>
> >> Now to the part that overlaps a bit with branching.  There are various
> >> branching strategies that we could adopt for the project.  The master /
> dev
> >> branch one is a possibility.  I'd suggest we try commit code strictly in
> >> PRs rather than pushing to git.  This would be the equivalent of feature
> >> branching from your link.  The 

Re: Russian Language Model for Joshua

2016-07-15 Thread Mattmann, Chris A (3980)
Will do.

Adding Paul Zimdars - do we have an Amazon machine that has > 256GB
of memory? How much would that cost?

Cheers,
Chris

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++










On 7/15/16, 1:42 PM, "Matt Post"  wrote:

>All right, started trying to recompile. If you have a machine with > 256 GB of 
>memory, it might be more efficient for me to give you the raw ARPA file and 
>for you to compile it. We'll see how it goes. Ping me in a day if you don't 
>hear from me.
>
>matt
>
>
>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) 
>>  wrote:
>> 
>> Yes please! :)
>> 
>> Sent from my iPhone
>> 
>>> On Jul 15, 2016, at 1:39 PM, Matt Post  wrote:
>>> 
>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM 
>>> compiles of it failed in the past, but I'll try again. I expect it to be 
>>> about 8 GB when that's done. Do you want it?
>>> 
>>> matt
>>> 
>>> 
 On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) 
  wrote:
 
 Hey Folks,
 
 Anyone have a Russian Language Model for Joshua? Lewis was working on
 one, not sure if he has it but just broadening the question.
 
 Cheers,
 Chris
 
 ++
 Chris Mattmann, Ph.D.
 Chief Architect
 Instrument Software and Science Data Systems Section (398)
 NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
 Office: 168-519, Mailstop: 168-527
 Email: chris.a.mattm...@nasa.gov
 WWW:  http://sunset.usc.edu/~mattmann/
 ++
 Director, Information Retrieval and Data Science Group (IRDS)
 Adjunct Associate Professor, Computer Science Department
 University of Southern California, Los Angeles, CA 90089 USA
 WWW: http://irds.usc.edu/
 ++
>>> 
>


Re: Avoiding master failures with CI

2016-07-15 Thread Matt Post
Question for Chris and/or Lewis:

So, Kellen and I took a look at this today, and it looks like a good solution. 
The problem is that it integrates with projects hosted on Github that you have 
write access to. In order to make use of this, we'd need to rearrange the setup 
we have.

Currently, we push to a repo at git.apache.org, and that is then pushed down to 
github.com/apache/incubator-joshua. This lets us use the Github repo for 
receiving things like pull requests and so on, but we do not have write access 
to it, so merges and so on have to be handled manually.

To use Travis-ci, we'd need to re-enginneer this. Apache would need to give us 
write access to github.com/apache/incubator-joshua, or we'd need to use another 
official host for Joshua. We'd then use git.apache.org as the mirror, instead 
of the other way around.

Is there any way that this could be done? I understand Apache's arguments about 
keeping discussions at home, since github may not last forever. However, it 
seems like we could do this if we use git.apache.org as the backup mirror, and 
continue to use JIRA for discussions and so on. In general, Github has a lot of 
tools that could help with development. It would be nice if we could make use 
of them while still checking off Apache's logging requirements.

matt



> On Jul 11, 2016, at 6:50 PM, kellen sunderland  
> wrote:
> 
> Sorry should have provided the link to this page: https://travis-ci.org/ .
> If you scroll down a bit on that page there's a Pull Request flow section,
> it's the flow I'd be most in favour of.  There's also a decent (but rushed)
> demo here: https://www.youtube.com/watch?v=Uft5KBimzyk .  We actually don't
> need to do a lot of the work that he demos, i.e. no node or gulp
> configuration.  Our setup is close enough to default a default java project
> that we just have to tell it to build java 8 and then it runs maven
> properly.
> 
> Using a CI server would have some aspects that are similar to the branching
> document you mention, and some benefits that are a bit orthogonal.  Most of
> these benefits have to do with unit testing, which isn't covered in the doc.
> 
> First the orthogonal benefits:  The main benefit we would get from using CI
> is that we guarantee code in our repo is never broken.  That is to say
> tests always pass and it always builds correctly.  CI servers are really
> useful to prevent problems where one developer may have everything working
> properly on his/her machine, but when they later realize it's not working
> on another devs machine.  A good example of this is the class-based-lm-test
> we pushed recently.  It works fine for me locally but it would fail for
> anyone without kenlm.so.  There are many other examples (javadoc errors,
> code style, etc) but what will happen in these cases is we'll see a big
> obvious 'The build has problems' message in the PR page on Github.  If the
> CI server runs of all of our code quality checks and finds that everything
> is good we'll get a big 'This PR is ready to merge' message.
> 
> Now to the part that overlaps a bit with branching.  There are various
> branching strategies that we could adopt for the project.  The master / dev
> branch one is a possibility.  I'd suggest we try commit code strictly in
> PRs rather than pushing to git.  This would be the equivalent of feature
> branching from your link.  The reason I'd suggest that approach is that
> from what I've seen it'll be dead simple to get working with Github and
> Travis, and it gives us the same goal of having a stable master branch.
> 
> If you'd like we can walk through setting this up together on a forked
> version of our Github repo.  We could do a quick example of how code would
> be pushed and merged.  I should be available for a google hangout some time
> this week if that works for you?
> 
> -Kellen
> 
> 
> On Mon, Jul 11, 2016 at 10:51 PM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
> 
>> CI = continuous integration :)
>> 
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> On 7/11/16, 4:50 PM, "Matt Post"  wrote:
>> 
>>> This sounds fine to me. What does CI stand for?
>>> 
>>> Another thing we should do, which might be complementary to this, is just
>> be more 

Re: Russian Language Model for Joshua

2016-07-15 Thread Matt Post
All right, started trying to recompile. If you have a machine with > 256 GB of 
memory, it might be more efficient for me to give you the raw ARPA file and for 
you to compile it. We'll see how it goes. Ping me in a day if you don't hear 
from me.

matt


> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) 
>  wrote:
> 
> Yes please! :)
> 
> Sent from my iPhone
> 
>> On Jul 15, 2016, at 1:39 PM, Matt Post  wrote:
>> 
>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM compiles 
>> of it failed in the past, but I'll try again. I expect it to be about 8 GB 
>> when that's done. Do you want it?
>> 
>> matt
>> 
>> 
>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) 
>>>  wrote:
>>> 
>>> Hey Folks,
>>> 
>>> Anyone have a Russian Language Model for Joshua? Lewis was working on
>>> one, not sure if he has it but just broadening the question.
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> ++
>>> Chris Mattmann, Ph.D.
>>> Chief Architect
>>> Instrument Software and Science Data Systems Section (398)
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 168-519, Mailstop: 168-527
>>> Email: chris.a.mattm...@nasa.gov
>>> WWW:  http://sunset.usc.edu/~mattmann/
>>> ++
>>> Director, Information Retrieval and Data Science Group (IRDS)
>>> Adjunct Associate Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> WWW: http://irds.usc.edu/
>>> ++
>> 



Re: Russian Language Model for Joshua

2016-07-15 Thread Mattmann, Chris A (3980)
Yes please! :)

Sent from my iPhone

> On Jul 15, 2016, at 1:39 PM, Matt Post  wrote:
> 
> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM compiles 
> of it failed in the past, but I'll try again. I expect it to be about 8 GB 
> when that's done. Do you want it?
> 
> matt
> 
> 
>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) 
>>  wrote:
>> 
>> Hey Folks,
>> 
>> Anyone have a Russian Language Model for Joshua? Lewis was working on
>> one, not sure if he has it but just broadening the question.
>> 
>> Cheers,
>> Chris
>> 
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++
> 


[jira] [Closed] (JOSHUA-281) split2files.pl support script no longer exists hence pipeline fails

2016-07-15 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney closed JOSHUA-281.
---
Resolution: Invalid

This is not a bug at all, my input parameters for the pipeline.pl invocation 
were incorrect.

> split2files.pl support script no longer exists hence pipeline fails
> ---
>
> Key: JOSHUA-281
> URL: https://issues.apache.org/jira/browse/JOSHUA-281
> Project: Joshua
>  Issue Type: Bug
>  Components: pipeline
>Affects Versions: 6.0.5
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 6.1
>
>
> When I attempt to run a pipeline, I get the following
> {code}
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua(master) $ ../bin/pipeline.pl  
> --rundir . --type hiero --corpus 
> /usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en 
> --tune 
> /usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.tune 
> --test 
> /usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.test 
> --source en --target ru --rundir experiment_1/1 --readme "Russian model 
> generation experiment 1 run 1" --mbr
> [train-copy-and-filter] rebuilding...
>   
> dep=/usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.en
>  [CHANGED]
>   
> dep=/usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.ru
>  [CHANGED]
>   dep=/usr/local/incubator-joshua/experiment_1/1/data/train/train.en [NOT 
> FOUND]
>   dep=/usr/local/incubator-joshua/experiment_1/1/data/train/train.ru [NOT 
> FOUND]
>   cmd=/usr/local/incubator-joshua/scripts/training/paste 
> /usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.en 
> /usr/local/jpl/xdata/joshua_experiments/russian_model/commoncrawl.ru-en.ru | 
> /usr/local/incubator-joshua/scripts/training/filter-empty-lines.pl | 
> /usr/local/incubator-joshua/scripts/training/split2files.pl 
> /usr/local/incubator-joshua/experiment_1/1/data/train/train.en 
> /usr/local/incubator-joshua/experiment_1/1/data/train/train.ru
>   JOB FAILED (return code 127)
> /bin/bash: /usr/local/incubator-joshua/scripts/training/split2files.pl: No 
> such file or directory
> {code}
> The following commit changed the name of the file
> {code}
> Repository: incubator-joshua
> Updated Branches:
>   refs/heads/master 09fb6a2d3 -> f02bd279e
> combined split2files implementations
> Project: http://git-wip-us.apache.org/repos/asf/incubator-joshua/repo
> Commit: 
> http://git-wip-us.apache.org/repos/asf/incubator-joshua/commit/f02bd279
> Tree: http://git-wip-us.apache.org/repos/asf/incubator-joshua/tree/f02bd279
> Diff: http://git-wip-us.apache.org/repos/asf/incubator-joshua/diff/f02bd279
> Branch: refs/heads/master
> Commit: f02bd279e892408c9eca2a2a241f21f59cb105e9
> Parents: 09fb6a2
> Author: Matt Post 
> Authored: Wed May 18 09:12:07 2016 -0400
> Committer: Matt Post 
> Committed: Wed May 18 09:12:07 2016 -0400
> --
>  scripts/support/split2files  | 44 +++
>  scripts/support/splittabs.pl | 42 -
>  scripts/training/pipeline.pl |  8 ++---
>  scripts/training/split2files.pl  | 38 ---
>  scripts/training/trim_parallel_corpus.pl |  2 +-
>  5 files changed, 49 insertions(+), 85 deletions(-)
> --
> {code}
> I'll submit a PR to do the simple string replace... which is hopefully all 
> that is wrong here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Russian Language Model for Joshua

2016-07-15 Thread Mattmann, Chris A (3980)
Hey Folks,

Anyone have a Russian Language Model for Joshua? Lewis was working on
one, not sure if he has it but just broadening the question.

Cheers,
Chris

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++







Re: MT marathon registration is open

2016-07-15 Thread Tommaso Teofili
I'll try to attend too.

Regards,
Tommaso

Il giorno mer 13 lug 2016 alle ore 00:40 Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> ha scritto:

> I will see about registering as well :)
>
> I have BigTranslate up and working if anyone is interested. I am
> currently evaluating it on the XDATA employment corpus with Lingo24
> but next is Joshua (and hoping to use Bing Translate too). If anyone
> has an Amazon unlimited key for translation to send my way would
> love to add it to the mix too :)
>
> http://github.com/chrismattmann/bigtranslate/
>
> Cheers,
> Chris
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++
>
>
>
>
>
>
>
>
>
>
> On 7/12/16, 5:12 PM, "kellen sunderland" 
> wrote:
>
> >Thanks for forwarding Matt.  I think a fair number of people from my team
> >will want to attend.  I'll pass around the registration link.
> >
> >-Kellen
> >On Jul 12, 2016 11:01 PM, "Matt Post"  wrote:
> >
> >> Hi everyone,
> >>
> >> We had talked a while ago about Joshua projects for MT Marathon in
> Prague.
> >> Registration (free) is now open. Let me know if you're planning to go
> and
> >> we can make some plans!
> >>
> >> http://ufal.mff.cuni.cz/mtm16/registration
> >>
> >> matt
> >>
> >>
>


[GitHub] incubator-joshua pull request #31: Refactored unit tests to all use TestNG, ...

2016-07-15 Thread ThePasswordIsPassword
Github user ThePasswordIsPassword closed the pull request at:

https://github.com/apache/incubator-joshua/pull/31


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---