[jira] [Assigned] (JOSHUA-288) Port fast_align to java

2016-08-04 Thread Matt Post (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Post reassigned JOSHUA-288:


Assignee: Matt Post

> Port fast_align to java
> ---
>
> Key: JOSHUA-288
> URL: https://issues.apache.org/jira/browse/JOSHUA-288
> Project: Joshua
>  Issue Type: New Feature
>Reporter: Matt Post
>Assignee: Matt Post
>Priority: Minor
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> It would be great to have a Java port of fast_align, so that we don't have to 
> worry about compiling it, and could distribute it via Maven.
> https://github.com/clab/fast_align



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (JOSHUA-107) Verbosity levels

2016-08-04 Thread Matt Post (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Post resolved JOSHUA-107.
--
Resolution: Fixed

Completed a while ago.

> Verbosity levels
> 
>
> Key: JOSHUA-107
> URL: https://issues.apache.org/jira/browse/JOSHUA-107
> Project: Joshua
>  Issue Type: Bug
>Reporter: Matt Post
>Assignee: Matt Post
> Fix For: 6.2
>
>
> Joshua should support verbosity levels with a command-line switch, so it's 
> easy to shut it up with something like {{-v 0}} or {{-q}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (JOSHUA-22) Parallelize MBR computation

2016-08-04 Thread Matt Post (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-22?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Post closed JOSHUA-22.
---
Resolution: Fixed

Closing this issue as something unlikely to be done any time soon, and also as 
something that is more properly done inside the decoder over the hypergraph.

> Parallelize MBR computation
> ---
>
> Key: JOSHUA-22
> URL: https://issues.apache.org/jira/browse/JOSHUA-22
> Project: Joshua
>  Issue Type: Bug
>Reporter: Joshua Decoder
> Fix For: 6.2
>
>
> MBR should be multithreaded.  This would be easy to add following the model 
> used in the InputManager.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (JOSHUA-95) Vocabulary locking

2016-08-04 Thread Matt Post (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-95?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Post closed JOSHUA-95.
---
Resolution: Fixed

This is a very old issue that I believe has been fixed.

> Vocabulary locking
> --
>
> Key: JOSHUA-95
> URL: https://issues.apache.org/jira/browse/JOSHUA-95
> Project: Joshua
>  Issue Type: Bug
>Reporter: Matt Post
>Assignee: Juri Ganitkevitch
> Fix For: 6.2
>
>
> Vocabulary::id() is still synchronized and a potential point of contention. 
> It would be nice to resolve this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (JOSHUA-100) Add Shen et al. (2008) dependency LM

2016-08-04 Thread Matt Post (JIRA)

 [ 
https://issues.apache.org/jira/browse/JOSHUA-100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Post closed JOSHUA-100.

Resolution: Fixed

> Add Shen et al. (2008) dependency LM
> 
>
> Key: JOSHUA-100
> URL: https://issues.apache.org/jira/browse/JOSHUA-100
> Project: Joshua
>  Issue Type: New Feature
>Reporter: Matt Post
>Assignee: Matt Post
> Fix For: 6.2
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Language Pack English-Japanese

2016-08-04 Thread Matt Post
Hi Toshiki,

Have you been able to gather any parallel data?

matt


> On Jul 22, 2016, at 3:50 PM, Henry Saputra  wrote:
> 
> HI Toshiki,
> 
> For this kind of discussion, let's have it in the dev@ list.
> 
> You can ask the question to dev@joshua.incubator.apache.org.
> 
> Thanks,
> 
> Henry
> 
> On Thu, Jul 21, 2016 at 9:46 PM, IGA Tosiki  wrote:
> 
>> Hi Matt,
>> 
>> Thanks for your reply!
>> 
>> I'm happy to read your mail, I want to help you Japanese-English language
>> pack.
>> And YES, I mean translation memories by TMS/XLIFF. But I may convert
>> TMS to what you specified format.
>> 
>> And also I knew English to Japanese is very difficult, but also I
>> believe sample of English-Japanese language pack will attract many
>> Japanese people to use Joshua.
>> 
>> Regards,
>> Toshiki
>> 
>> 2016-07-22 12:42 GMT+09:00 Matt Post :
>>> Hi,
>>> 
>>> There is no Japanese--English language pack, but I would be happy to
>> build one if you could help by pointing me to data. What we need is
>> parallel data in the form of sentences that are translations of each other.
>> If you have access to this or pointers to where I could find some, I would
>> be happy to build it. There are likely standard datasets available; people
>> like Graham Neubig (http://www.phontron.com) have been working on this
>> for a while.
>>> 
>>> What are TMS and LTIFF? Are you talking about translation memories?
>>> 
>>> As a side note, translation between English and Japanese is very
>> difficult and tends not to be very good. One approach that helps is
>> translating from trees and forests. Joshua does not have this capability at
>> the moment.
>>> 
>>> Sincerely,
>>> matt
>>> 
>>> 
 On Jul 21, 2016, at 11:28 PM, IGA Tosiki  wrote:
 
 Hi team,
 
 I got interest about Joshua, and language pack. I am Japanese, and I
 want to know around Japanese language pack.
 
 Is there any plan about building Japanese-English language pack?
 I believe TMS or LTIFF will usefull to building such language pack. I
 have many OSS based TMS between English-Japanese. Is there any path
 using TMX or LTIFF for input of Joshua language pack?
 
 Best regards,
 Toshiki Iga
>>> 
>>