[jira] [Assigned] (JOSHUA-288) Port fast_align to java
[ https://issues.apache.org/jira/browse/JOSHUA-288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Post reassigned JOSHUA-288: Assignee: Matt Post > Port fast_align to java > --- > > Key: JOSHUA-288 > URL: https://issues.apache.org/jira/browse/JOSHUA-288 > Project: Joshua > Issue Type: New Feature >Reporter: Matt Post >Assignee: Matt Post >Priority: Minor > Original Estimate: 168h > Remaining Estimate: 168h > > It would be great to have a Java port of fast_align, so that we don't have to > worry about compiling it, and could distribute it via Maven. > https://github.com/clab/fast_align -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (JOSHUA-107) Verbosity levels
[ https://issues.apache.org/jira/browse/JOSHUA-107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Post resolved JOSHUA-107. -- Resolution: Fixed Completed a while ago. > Verbosity levels > > > Key: JOSHUA-107 > URL: https://issues.apache.org/jira/browse/JOSHUA-107 > Project: Joshua > Issue Type: Bug >Reporter: Matt Post >Assignee: Matt Post > Fix For: 6.2 > > > Joshua should support verbosity levels with a command-line switch, so it's > easy to shut it up with something like {{-v 0}} or {{-q}}. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Closed] (JOSHUA-22) Parallelize MBR computation
[ https://issues.apache.org/jira/browse/JOSHUA-22?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Post closed JOSHUA-22. --- Resolution: Fixed Closing this issue as something unlikely to be done any time soon, and also as something that is more properly done inside the decoder over the hypergraph. > Parallelize MBR computation > --- > > Key: JOSHUA-22 > URL: https://issues.apache.org/jira/browse/JOSHUA-22 > Project: Joshua > Issue Type: Bug >Reporter: Joshua Decoder > Fix For: 6.2 > > > MBR should be multithreaded. This would be easy to add following the model > used in the InputManager. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Closed] (JOSHUA-95) Vocabulary locking
[ https://issues.apache.org/jira/browse/JOSHUA-95?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Post closed JOSHUA-95. --- Resolution: Fixed This is a very old issue that I believe has been fixed. > Vocabulary locking > -- > > Key: JOSHUA-95 > URL: https://issues.apache.org/jira/browse/JOSHUA-95 > Project: Joshua > Issue Type: Bug >Reporter: Matt Post >Assignee: Juri Ganitkevitch > Fix For: 6.2 > > > Vocabulary::id() is still synchronized and a potential point of contention. > It would be nice to resolve this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Closed] (JOSHUA-100) Add Shen et al. (2008) dependency LM
[ https://issues.apache.org/jira/browse/JOSHUA-100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Post closed JOSHUA-100. Resolution: Fixed > Add Shen et al. (2008) dependency LM > > > Key: JOSHUA-100 > URL: https://issues.apache.org/jira/browse/JOSHUA-100 > Project: Joshua > Issue Type: New Feature >Reporter: Matt Post >Assignee: Matt Post > Fix For: 6.2 > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Re: Language Pack English-Japanese
Hi Toshiki, Have you been able to gather any parallel data? matt > On Jul 22, 2016, at 3:50 PM, Henry Saputrawrote: > > HI Toshiki, > > For this kind of discussion, let's have it in the dev@ list. > > You can ask the question to dev@joshua.incubator.apache.org. > > Thanks, > > Henry > > On Thu, Jul 21, 2016 at 9:46 PM, IGA Tosiki wrote: > >> Hi Matt, >> >> Thanks for your reply! >> >> I'm happy to read your mail, I want to help you Japanese-English language >> pack. >> And YES, I mean translation memories by TMS/XLIFF. But I may convert >> TMS to what you specified format. >> >> And also I knew English to Japanese is very difficult, but also I >> believe sample of English-Japanese language pack will attract many >> Japanese people to use Joshua. >> >> Regards, >> Toshiki >> >> 2016-07-22 12:42 GMT+09:00 Matt Post : >>> Hi, >>> >>> There is no Japanese--English language pack, but I would be happy to >> build one if you could help by pointing me to data. What we need is >> parallel data in the form of sentences that are translations of each other. >> If you have access to this or pointers to where I could find some, I would >> be happy to build it. There are likely standard datasets available; people >> like Graham Neubig (http://www.phontron.com) have been working on this >> for a while. >>> >>> What are TMS and LTIFF? Are you talking about translation memories? >>> >>> As a side note, translation between English and Japanese is very >> difficult and tends not to be very good. One approach that helps is >> translating from trees and forests. Joshua does not have this capability at >> the moment. >>> >>> Sincerely, >>> matt >>> >>> On Jul 21, 2016, at 11:28 PM, IGA Tosiki wrote: Hi team, I got interest about Joshua, and language pack. I am Japanese, and I want to know around Japanese language pack. Is there any plan about building Japanese-English language pack? I believe TMS or LTIFF will usefull to building such language pack. I have many OSS based TMS between English-Japanese. Is there any path using TMX or LTIFF for input of Joshua language pack? Best regards, Toshiki Iga >>> >>