Re: Migrate website translations to PO [was: Re: When and how can we migrate out of CVS and WML ?]

2010-08-11 Thread Helge Kreutzmann
Hello,
throwing my knowledge from the German translation in ...

On Wed, Aug 11, 2010 at 09:28:14AM -0400, David Prévot wrote:
> Le 06/08/2010 17:23, Gerfried Fuchs a écrit :
> > Hi!
> > 
> > * Andrei Popescu  [2010-08-05 09:17:59 CEST]:
> >> On Vi, 30 iul 10, 11:15:17, Andrei Popescu wrote:
> >>>
> >>> Moving to .po probably needs a coordinated effort including at least the 
> >>> coordinators from all the languages that have more than just a few 
> >>> translated pages.
> >>>
> >>> Is there some wiki page about this project? I can start one, but not 
> >>> until tomorrow.
> >>
> >> It took a bit longer, but the page is 
> >> http://wiki.debian.org/DebianWebsitePO

Additional pro: Some parts of the website (namely vote) have several
reocurring paragraphs, where I maintain a dedicated script to save the
trouble of retranslation. Unfortunately, nobody so far was interested in
converting it to po (cf. #364913). This will only help, if po is used
rightly, i.e. with variables (as the perl script I'm maintaining
essentially does by using regular expressions).

On the con side: Working with huge paragraphs with po is a pain,
especially with limited screen space. More below.

> >> Input from people familiar with po4a and (other) translators would be 
> >> highly needed.

:-))

> >  Often parts aren't properly in context, moved around within the po file
> > and get confusing when only working on the po file. If done carefully
> > this might be solved but it is something that shouldn't be ignored for
> > proper decision making.
> 
> I fail to understand the issue here : when working on a blank WML file,
> there is no context at all, the only context is in the original file,
> which needs to be used when translating a WML file, and can also be used
> when working on a PO file.

Blank wml file? I usually copy the english file to the German location
and then work on the english text, replacing it paragraph by paragraph
with the translation. I have all the context I need. Whats unclear for
me is if there will be one, one per directory, or thousands of po
files. The saved work could only be gained if there are as few as
possible po files (e.g. for vote, for News, for DWN, ...), but this
would be a hassle as well, as huge files would need to be moved back and forth
into CVS. Also if a team has several translators, they might "fight"
over a certain file, even if they work on different parts. Of course,
context could be problematic, and 

If there are however lots of small po files, then I somehow fail to
see the advantage (except for the rare case of moving paragraphs).
Review is easily done on updated translations using "cvs diff", both
on the original and the translation. Nothing else required.

> >  This is related to that po is for translating more-or-less text
> > snippets that are meant to be able to stand on their own. Having a text
> > seperated into multiple strings, with always the english part in between
> > does IMHO block some quality possibilities of having the text flow
> > naturally because it doesn't make the final proofreading as easy.
> 
> On the contrary, providing the original text while asking for review
> makes it easier for reviewer to understand what it is about (and
> eventually spot translation mistakes), without needing them to search
> for the ad-hoc part of the original text somewhere on the website.

Yes, this eases review in a certain way. I don't see text flow issues,
just that for large po files the (wanted) reuse of original text might
look like a cloze, so people might miss parts even though they are
there.

Btw. this depends how you do your review. Why not sending two files to
your reviewers? Or teach them CVS, so they can use "CVS diff" as well?
Also I would not let them search, I'd provide the link myself, if
needed be. So it really boils down to review methods and standards in
each team.

> >  Also, translating longer paragraphs gets annoying, especially when the
> > original gets changed. It will mark the string as fuzzy and the
> > translator has to dig around in a longer paragraph about what actually
> > has changed. One solution to this might be the --previous switch which
> > keeps the former string in there for comparison -- but are there
> > translation tools that support that properly and can hilight the changes
> > in a wdiff form? Maybe I missed some development in that area, feel free
> > to enlighten me. As long as such a tool isn't available I consider that
> > as a real issue.
> 
> It's one of the feature of Lokalize, don't know if it is implemented in
> other tools, but yes: Lokalize provide a colored diff inline between the
> old original text and the new one, and make it easy to spot what has
> been changed on the paragraph.

Well, the decision should not be based on a single tool. I use vim,
and I haven't checked it in Squeeze yet, but in Lenny I don't get such
a help. IMHO at least some tools should provide the help. Also I found 
CVS diff very helpful (much
b

Re: Migrate website translations to PO [was: Re: When and how can we migrate out of CVS and WML ?]

2010-08-11 Thread David Prévot
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Le 06/08/2010 17:23, Gerfried Fuchs a écrit :
>   Hi!
> 
> * Andrei Popescu  [2010-08-05 09:17:59 CEST]:
>> On Vi, 30 iul 10, 11:15:17, Andrei Popescu wrote:
>>>
>>> Moving to .po probably needs a coordinated effort including at least the 
>>> coordinators from all the languages that have more than just a few 
>>> translated pages.
>>>
>>> Is there some wiki page about this project? I can start one, but not 
>>> until tomorrow.
>>
>> It took a bit longer, but the page is 
>> http://wiki.debian.org/DebianWebsitePO
>>
>> Input from people familiar with po4a and (other) translators would be 
>> highly needed.
> 
>  While some translators are familiar with working on po files I see some
> issues that bother me myself when working on po files:

Thanks for your remarks, I'll try to answer some of them. Even if, as a
translator, I'm convinced that PO files is a nice tool to handle
translations, especially updates, I wonder if this question of migrating
the translation of the website to PO files shouldn't be asked to
translators, in order to have their feedback, thus CCing debian-i18n.
Please continue the discussion on debian-www.

>  Often parts aren't properly in context, moved around within the po file
> and get confusing when only working on the po file. If done carefully
> this might be solved but it is something that shouldn't be ignored for
> proper decision making.

I fail to understand the issue here : when working on a blank WML file,
there is no context at all, the only context is in the original file,
which needs to be used when translating a WML file, and can also be used
when working on a PO file.

>  This is related to that po is for translating more-or-less text
> snippets that are meant to be able to stand on their own. Having a text
> seperated into multiple strings, with always the english part in between
> does IMHO block some quality possibilities of having the text flow
> naturally because it doesn't make the final proofreading as easy.

On the contrary, providing the original text while asking for review
makes it easier for reviewer to understand what it is about (and
eventually spot translation mistakes), without needing them to search
for the ad-hoc part of the original text somewhere on the website.

>  Also, translating longer paragraphs gets annoying, especially when the
> original gets changed. It will mark the string as fuzzy and the
> translator has to dig around in a longer paragraph about what actually
> has changed. One solution to this might be the --previous switch which
> keeps the former string in there for comparison -- but are there
> translation tools that support that properly and can hilight the changes
> in a wdiff form? Maybe I missed some development in that area, feel free
> to enlighten me. As long as such a tool isn't available I consider that
> as a real issue.

It's one of the feature of Lokalize, don't know if it is implemented in
other tools, but yes: Lokalize provide a colored diff inline between the
old original text and the new one, and make it easy to spot what has
been changed on the paragraph.

>  The last issue I see is with the the core way how po works: If it finds
> an untranslated or fuzzy string it will put the english original into
> the place. This might be something useful for applications to specificly
> support work-in-progress approaches and not render a translation invalid
> for a string that might only be an error message or such - but then I
> don't consider this as an acceptable approach for the website. It would
> be quite confusing for people to see a mix of english and their own
> language on the same page and switch like from every paragraph to the
> next. I *do* consider it better in that cases to have a potential
> (minorly) outdated page but completely in the native language than a mix
> of english and their language.

It might be possible to trick the usual PO workflow, by keeping the
generated WML file in VCS, and update it if and only if the translator
updates the PO file.

Anyway, even if I understand the "please keep fully (even outdated)
translated pages" argument, I don't think it applies to the whole site.
For example, I think it would be better if developers' related stuff
would be kept up to date, even if not the whole page is translated
(rationale: developers needs to interact in English anyway, even if
translated documentation is helpful, up to date documentation is more
important).

>  Also, in some areas we do encourage adding language specific
> information - I'm not too sure how that should work with po4a. Also, in
> some specific situations it happened that translators have changed the
> formating of a page (like seperating/merging two paragraphs) and it
> might make sense for them to keep that possibility. Different languages
> do have different representation requirements.

Actually po4a can handle addendum, which is a nice way to add some more
information (it is o

Re: Migrate website translations to PO [was: Re: When and how can we migrate out of CVS and WML ?]

2010-08-06 Thread Gerfried Fuchs
Hi!

* Andrei Popescu  [2010-08-05 09:17:59 CEST]:
> On Vi, 30 iul 10, 11:15:17, Andrei Popescu wrote:
> > 
> > Moving to .po probably needs a coordinated effort including at least the 
> > coordinators from all the languages that have more than just a few 
> > translated pages.
> > 
> > Is there some wiki page about this project? I can start one, but not 
> > until tomorrow.
> 
> It took a bit longer, but the page is 
> http://wiki.debian.org/DebianWebsitePO
> 
> Input from people familiar with po4a and (other) translators would be 
> highly needed.

 While some translators are familiar with working on po files I see some
issues that bother me myself when working on po files:

 Often parts aren't properly in context, moved around within the po file
and get confusing when only working on the po file. If done carefully
this might be solved but it is something that shouldn't be ignored for
proper decision making.

 This is related to that po is for translating more-or-less text
snippets that are meant to be able to stand on their own. Having a text
seperated into multiple strings, with always the english part in between
does IMHO block some quality possibilities of having the text flow
naturally because it doesn't make the final proofreading as easy.

 Also, translating longer paragraphs gets annoying, especially when the
original gets changed. It will mark the string as fuzzy and the
translator has to dig around in a longer paragraph about what actually
has changed. One solution to this might be the --previous switch which
keeps the former string in there for comparison -- but are there
translation tools that support that properly and can hilight the changes
in a wdiff form? Maybe I missed some development in that area, feel free
to enlighten me. As long as such a tool isn't available I consider that
as a real issue.

 The last issue I see is with the the core way how po works: If it finds
an untranslated or fuzzy string it will put the english original into
the place. This might be something useful for applications to specificly
support work-in-progress approaches and not render a translation invalid
for a string that might only be an error message or such - but then I
don't consider this as an acceptable approach for the website. It would
be quite confusing for people to see a mix of english and their own
language on the same page and switch like from every paragraph to the
next. I *do* consider it better in that cases to have a potential
(minorly) outdated page but completely in the native language than a mix
of english and their language.

 Also, in some areas we do encourage adding language specific
information - I'm not too sure how that should work with po4a. Also, in
some specific situations it happened that translators have changed the
formating of a page (like seperating/merging two paragraphs) and it
might make sense for them to keep that possibility. Different languages
do have different representation requirements.

 I would like to know how this approach would like to tackle those.

 Thanks!
Rhonda
-- 
"Lediglich 11 Prozent der Arbeitgeber sind der Meinung, dass jeder
Mensch auch ein Privatleben haben sollte."
-- http://www.karriere.at/artikel/884/


-- 
To UNSUBSCRIBE, email to debian-www-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100806212300.ga30...@anguilla.debian.or.at



Re: Migrate website translations to PO [was: Re: When and how can we migrate out of CVS and WML ?]

2010-08-05 Thread David Prévot
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Le 05/08/2010 03:17, Andrei Popescu a écrit :
> On Vi, 30 iul 10, 11:15:17, Andrei Popescu wrote:
>>
>> Moving to .po probably needs a coordinated effort including at least the
>> coordinators from all the languages that have more than just a few
>> translated pages.

Sure, but is there really a need to *force* up to date translation to be
handled with po4a ? Don't get me wrong, I really agree it would be an
improvement to handle translation with po4a, but if possible, it would
be nice to keep the ability to continue using directly WML in language
teams (at least for the beginning): convert existing translation is
still a manual process (well described in po4a(7)), so it would probably
better to provide an hybrid system: if the PO file exists, use it, if
not, use the "normal" translated WML file as usual.

The advantage of the hybrid system, will be to permit every team to test
the new system (find and fix some issues), and gradually upgrade to po4a
use (and once done, VCS issue won't be a problem to maintain up to date
translations since every information needed will be directly in the PO
files).

>> Is there some wiki page about this project? I can start one, but not
>> until tomorrow.
>
> It took a bit longer, but the page is
> http://wiki.debian.org/DebianWebsitePO
>
> Input from people familiar with po4a and (other) translators would be
> highly needed.

Thanks, I'm currently adding those previous remarks (and some other ones
I've in mind: disk space will double; a recent version of po4a should be
used to handle correctly addendum only if it exists, I think the
backported one will do, etc.)

Cheers.

David

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.10 (GNU/Linux)

iEYEARECAAYFAkxayTMACgkQ18/WetbTC/ojuQCeOT4yyemWpudJszCvLK6bmjjS
m3wAoIvHQGkjg4J+w6hDKxWl9pAfhr6P
=/H3M
-END PGP SIGNATURE-


-- 
To UNSUBSCRIBE, email to debian-www-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/4c5ac934.7050...@tilapin.org



Migrate website translations to PO [was: Re: When and how can we migrate out of CVS and WML ?]

2010-08-05 Thread Andrei Popescu
On Vi, 30 iul 10, 11:15:17, Andrei Popescu wrote:
> 
> Moving to .po probably needs a coordinated effort including at least the 
> coordinators from all the languages that have more than just a few 
> translated pages.
> 
> Is there some wiki page about this project? I can start one, but not 
> until tomorrow.

It took a bit longer, but the page is 
http://wiki.debian.org/DebianWebsitePO

Input from people familiar with po4a and (other) translators would be 
highly needed.

Regards,
Andrei
-- 
Offtopic discussions among Debian users and developers:
http://lists.alioth.debian.org/mailman/listinfo/d-community-offtopic


signature.asc
Description: Digital signature