On 11 Jul 07, at 15:29, Arthur Buijs wrote:

The overhead of using po-files in the translation process is minimal
(except for the initial trying out).

It is not minimal when you have to modify the tagged links to fit the source. In OmegaT, that is done automatically, without you even noticing it.

Also, all the <emph> tags, when they need to be moved or edited, require more work in a text-based editor than in OmegaT (if done the way I suggested).

Of course, using the PO files in a PO editor or in OmegaT will not make much difference in terms of editing the matches. The problem _is_ which source file you choose to work with and what relation it has to the original format (here: HTML -> SDF -> PO; by the time you reach the PO stage, there is almost no relation left).

So I am really talking about not using PO, because _that_ requires handling the files as text, while using the modified .sdf allows them to be handled as HTML (which considerably reduces the amount of editing).

Of course this is only true if a
usable tmx-file is available. My advice would be to find a better way
to generate tmx-files and use po-files for the translation task.

The TMXs provided by Rafaella were similar to the ones produced by the translate-toolkit processes (oo2po -> po2tmx), and neither corresponded to the source PO file in terms of the number of "\" characters in the escape sequences. They corresponded to the original .sdf file, which is what originally prompted me to use the original .sdf file as source. The rest of the hack I proposed on 7/7 follows from that.
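To make the mismatch concrete, here is a minimal sketch. The sample strings are invented, not taken from the real files, but the doubling itself follows from the PO format escaping backslashes a second time:

```python
# Sketch: why the TMX segments did not match the PO file.
# A literal "\<" in the .sdf source is written "\\<" inside a .po file,
# because the PO format escapes backslashes again on disk.
# (Sample strings are illustrative, not taken from the real files.)

def backslash_count(segment: str) -> int:
    """Count the literal backslashes in a segment."""
    return segment.count("\\")

def po_unescape(segment: str) -> str:
    """Undo the PO-level backslash doubling to recover the .sdf/TMX form."""
    return segment.replace("\\\\", "\\")

sdf_text = r"Choose \<emph\>File - Open\</emph\>"      # as in the .sdf and the TMX
po_text  = r"Choose \\<emph\\>File - Open\\</emph\\>"  # as stored in the .po file
```

So a TMX built from the .sdf can never match the PO text byte for byte; one side has to be normalized first.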


The general problem does not come only from the TMX, but from the fact that .sdf is already an intermediate format (which you then convert to yet another intermediate format, PO).

The original conversion requires escape sequences, and _that_ is what forces the files to be handled as text, when they could just as well be handled as pure and simple HTML, which most translation tools support.
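To illustrate what "handled as HTML" would mean, here is a sketch of the unescaping step. I am assuming the .sdf escaping is a simple backslash-before-character scheme (e.g. "\<" for "<"); the real rule set may be larger:

```python
def sdf_to_html(text: str) -> str:
    """Strip .sdf-style backslash escapes so the string reads as plain HTML.

    Assumption: every escape is a backslash followed by the literal
    character (e.g. "\\<" for "<"); the actual .sdf rules may differ.
    """
    out = []
    i = 0
    while i < len(text):
        if text[i] == "\\" and i + 1 < len(text):
            out.append(text[i + 1])   # keep the character, drop the backslash
            i += 2
        else:
            out.append(text[i])
            i += 1
    return "".join(out)
```

Once unescaped, a tool that understands HTML can protect the tags by itself, instead of the translator juggling backslashes by hand.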

The TMX problem is yet another problem.

Here, we have the following structure for the TMXs:

(new source segment)
(old target translation, if present)

A _real_ TMX should be:

(old source segment)
(old target translation)

So the current process is very confusing and does not allow TMX-supporting tools (like OmegaT, or even OLT) to fully leverage the contents of the source, which is the real function of the TMX file.

Plus, the TMXs do not reflect the structure of the actual source file (PO), which makes them yet another problem.


Of course, I am commenting on the process only with the perspective of allowing translation contributors to have access to a translation workflow that supports the use of computer aided translation tools. Right now the process that is suggested by the file formats available for OOo's localization does not facilitate this at all.

Another of Sun's projects, namely NetBeans, manages to fully leverage legacy translations thanks to simple source file formats (the UI files are plain Java properties and the Help files are plain HTML): the whole source files are matched against the legacy translations and the result is output to TMX for very easy translation (in OmegaT or any other TMX-supporting tool, although OmegaT is the most used tool there).
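That kind of matching can be sketched in a few lines. This is not the NetBeans tooling itself, just the idea; real .properties files also have continuation lines and \uXXXX escapes, which I skip here:

```python
def parse_properties(lines):
    """Minimal Java .properties reader: "key=value" lines, "#"/"!" comments.
    (Continuation lines and \\uXXXX escapes are deliberately ignored.)"""
    props = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props

def align_for_tmx(en_lines, localized_lines):
    """Pair English and legacy localized values by key: these pairs are
    exactly the (old source, old target) units a TMX should contain."""
    en = parse_properties(en_lines)
    loc = parse_properties(localized_lines)
    return [(en[k], loc[k]) for k in sorted(en) if k in loc]
```

Because both sides are the plain source format, no intermediate conversion (and no escaping) gets between the translator and the text.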

As long as OOo sticks to intermediate file formats (.sdf/.po/.xliff) with the current unstable conversion processes, hacks will be necessary to reach the level of efficiency other communities have already reached. And _that_ is really too bad.


Cheers,

Jean-Christophe Helary (fr)
