This is an open invitation public session Seminar title: The successes and challenges of making low-data languages available in online automatic translation portals and software.
Date: Tues, August 17, 2010 Location: Microsoft, Redmond, Washington, USA Room: Building 99, Room 1919 Time: 2:00PM-3:30PM Speaker: Jeff Allen Directions to building 99 http://research.microsoft.com/en-us/labs/redmond/visit.aspx Summary: The majority of development work and deployment of machine translation (MT) technologies over the past several decades have been for international languages. Only a few projects for low-data/low-density/low resource/sparse-data/less-prevalent/lesser-commonly taught/minority languages have led to successful prototypes and products. There are a certain number of technical, logistical, social, educational and other factors which influence and impact the potential success of implementing systems for such languages. This talk will cover many of the lessons learned from previous projects, and some of the pitfalls to avoid. It will also demonstrate how the recent efforts for making Haitian Creole available for Haiti Disaster Relief had a certain level of success in record time because of the ability to build upon previous work. Yet, there were also obstacles with have been problematic and remain a concern for this language and for other less-prevalent languages. Lastly, the discussion will mention some ways to enable proactive, forward thinking projects, using some bootstrapping methods, to reduce the risk of situations which can result from working in a primarily reactive mode. _______________________________________________ Mt-list mailing list
