Hi Toshiki, Have you been able to gather any parallel data?
matt > On Jul 22, 2016, at 3:50 PM, Henry Saputra <henry.sapu...@gmail.com> wrote: > > HI Toshiki, > > For this kind of discussion, let's have it in the dev@ list. > > You can ask the question to dev@joshua.incubator.apache.org. > > Thanks, > > Henry > > On Thu, Jul 21, 2016 at 9:46 PM, IGA Tosiki <igap...@gmail.com> wrote: > >> Hi Matt, >> >> Thanks for your reply! >> >> I'm happy to read your mail, I want to help you Japanese-English language >> pack. >> And YES, I mean translation memories by TMS/XLIFF. But I may convert >> TMS to what you specified format. >> >> And also I knew English to Japanese is very difficult, but also I >> believe sample of English-Japanese language pack will attract many >> Japanese people to use Joshua. >> >> Regards, >> Toshiki >> >> 2016-07-22 12:42 GMT+09:00 Matt Post <p...@cs.jhu.edu>: >>> Hi, >>> >>> There is no Japanese--English language pack, but I would be happy to >> build one if you could help by pointing me to data. What we need is >> parallel data in the form of sentences that are translations of each other. >> If you have access to this or pointers to where I could find some, I would >> be happy to build it. There are likely standard datasets available; people >> like Graham Neubig (http://www.phontron.com) have been working on this >> for a while. >>> >>> What are TMS and LTIFF? Are you talking about translation memories? >>> >>> As a side note, translation between English and Japanese is very >> difficult and tends not to be very good. One approach that helps is >> translating from trees and forests. Joshua does not have this capability at >> the moment. >>> >>> Sincerely, >>> matt >>> >>> >>>> On Jul 21, 2016, at 11:28 PM, IGA Tosiki <igap...@gmail.com> wrote: >>>> >>>> Hi team, >>>> >>>> I got interest about Joshua, and language pack. I am Japanese, and I >>>> want to know around Japanese language pack. >>>> >>>> Is there any plan about building Japanese-English language pack? >>>> I believe TMS or LTIFF will usefull to building such language pack. I >>>> have many OSS based TMS between English-Japanese. Is there any path >>>> using TMX or LTIFF for input of Joshua language pack? >>>> >>>> Best regards, >>>> Toshiki Iga >>> >>