Hi Everybody,

Lately, I returned to the 'tag elimination' idea because some of our
translators cannot cope with the tags they see in Virtaal. (We are doing
translation offline.) With a little regex from Stack Overflow
<http://stackoverflow.com/a/8784436/1993440>,
I produced the Python script below and got 'clean' text. Here is my entire
'process':

1) po2csv -i x.po -o y.csv => get CSV
2) Open y.csv in LO and delete columns A and C
3) Run clear-tags.py => get y-clean.csv
4) Open y-clean.csv in LO, run =MOD(ROW(L2),2) in a separate column, filter
the empty rows created by po2csv and delete those.

Here <https://docs.google.com/file/d/0B1xCocIHKT5jS3FELXhCdVJ6Mm8/edit> is
a sample file I did this way.

As you see, this is a lot of work for so many po files. But if this is found
to be good and correct, I think we can make it into a bigger script that
can do this repeatedly for a given directory.

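"clear-tags.py"
import re

# Read the whole CSV produced by po2csv as one string.
with open('y.csv') as f:
    text = f.read()

# Replace every <...> tag with a single space.
clean = re.sub(r'<[^>]+>', ' ', text)

# Write the tag-free text to a new file.
with open('y-clean.csv', 'w') as f:
    f.write(clean)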
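
A rough sketch of how steps 3 and 4 might be folded into one script that runs over a whole directory. It is only an illustration under some assumptions: the CSV files are UTF-8, the function names (strip_tags, clean_csv, clean_directory) are made up for this example, and, unlike clear-tags.py, it also collapses the extra spaces left behind by removed tags. It keeps all columns, so deleting columns A and C would still have to be done separately. Rows that contain nothing but tags are dropped along with the empty ones.

```python
import csv
import re
from pathlib import Path

TAG_RE = re.compile(r'<[^>]+>')  # same pattern as in clear-tags.py


def strip_tags(text):
    """Replace each <...> tag with a space, then collapse runs of whitespace."""
    return ' '.join(TAG_RE.sub(' ', text).split())


def clean_csv(src, dst):
    """Copy src to dst with tags stripped and empty rows dropped."""
    with open(src, newline='', encoding='utf-8') as fin, \
         open(dst, 'w', newline='', encoding='utf-8') as fout:
        writer = csv.writer(fout)
        for row in csv.reader(fin):
            cleaned = [strip_tags(cell) for cell in row]
            if any(cleaned):  # skip rows that are empty after cleaning
                writer.writerow(cleaned)


def clean_directory(src_dir, dst_dir):
    """Clean every *.csv in src_dir into dst_dir as <name>-clean.csv."""
    out = Path(dst_dir)
    out.mkdir(parents=True, exist_ok=True)
    for src in sorted(Path(src_dir).glob('*.csv')):
        clean_csv(src, out / (src.stem + '-clean.csv'))
```

It could be called as clean_directory('csv-in', 'csv-clean'), where both directory names are placeholders. Writing to a separate output directory avoids re-cleaning files that were already processed.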


On Mon, Dec 10, 2012 at 12:28 PM, Yaron Shahrabani <sh.ya...@gmail.com> wrote:

> I still need some info from the translation maintainers:
> What type of tags are there in the translation? (I'm not sure about the
> type of tags).
>
> And Tadele: could you please show me some types of the tags you want to
> eliminate?
>
> (Please type the entire text and what should be the output).
>
> Kind regards,
>
> Yaron Shahrabani
>
> <Hebrew translator>
>
>
>
>
> On Mon, Dec 10, 2012 at 10:21 AM, Tadele Assefa <milky...@gmail.com> wrote:
>
>> Yaron,
>>
>> If it can be made to clear the tags, I think your idea of 'regex' is
>> brilliant. I was thinking of converting the po files to csv, opening
>> them in a text editor and removing similar tags by search and replace,
>> and the leftovers by hand... which is a rather tiresome job for so
>> many files.
>>
>> Please expand the regex as much as possible and release it.
>>
>> Thanks,
>>
>> On Sun, Dec 9, 2012 at 11:25 AM, Yaron Shahrabani <sh.ya...@gmail.com> wrote:
>> > Hey guys!
>> > This is great news!
>> > If you need, we can probably compose some sort of regex that eliminates
>> > the tags; after accomplishing that task we can use Poedit to produce an
>> > HTML output.
>> >
>> > Kind regards,
>> > Yaron Shahrabani.
>> >
>> > Yaron Shahrabani
>> >
>> > <Hebrew translator>
>> >
>> >
>> >
>> >
>> > On Sat, Dec 8, 2012 at 9:27 PM, Tadele Assefa <milky...@gmail.com> wrote:
>> >>
>> >> Dear All,
>> >>
>> >> We are translating LibreOffice to Sidama (in Ethiopia). Our
>> >> problem is this: some of our team members are language experts who do
>> >> not know the tags etc. in the po files (or, in fact, computer usage).
>> >> As the Help po files are so large, we proposed to print the po files
>> >> without the tags as pure text; then, after the language guys finish
>> >> the translation on paper, we will type and tag the translations. Is
>> >> there a tool we can use to accomplish this?
>> >>
>> >> --
>> >> Regards,
>> >> ___________________________
>> >> Tadele Assefa
>> >> Managing Director
>> >>
>> >> Cell: +25-911-84-13-84
>> >> Think Green – Please do not print this email unless you really need to
>> >>
>> >> --
>> >> Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
>> >> Problems?
>> >> http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
>> >> Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
>> >> List archive: http://listarchives.libreoffice.org/global/l10n/
>> >> All messages sent to this list will be publicly archived and cannot be
>> >> deleted
>> >>
>> >
>>
>>
>>
>> --
>> Regards,
>> ___________________________
>> Tadele Assefa
>> Managing Director
>>
>>
>> Cell: +25-911-84-13-84
>> Think Green – Please do not print this email unless you really need to
>>
>
>


-- 
Regards,
___________________________
Tadele Assefa
Managing Director

Cell: +25-911-84-13-84
Think Green – Please do not print this email unless you really need to

