Tadele,

I'm wondering if pocommentclean from the Translate Toolkit is the
droid you are looking for.

http://docs.translatehouse.org/projects/translate-toolkit/en/latest/commands/pocommentclean.html

cjl

On Sat, Jan 19, 2013 at 2:57 PM, Tadele Assefa <milky...@gmail.com> wrote:
> Hi Everybody,
>
> Lately, I returned to the 'tag elimination' idea because some of our
> translators cannot cop up with the tags they see in Virtaal. (We are doing
> translation offline). And with a little regex from stackoverflow
> <http://stackoverflow.com/a/8784436/1993440> ,
> i produced a python script below and got 'clean' text. Here is my entire
> 'process':
>
> 1) po2csv -i x.po -o y.csv => Get CSV
> 2) Open y.csv in LO and Delete columns A and C
> 3) run clear-tags.py => get y-clean.csv
> 4) Open y-clean.csv in LO, run =MOD(ROW(L2),2) on separate column, Filter
> the empty row created by po2csv and delete those.
> Here<https://docs.google.com/file/d/0B1xCocIHKT5jS3FELXhCdVJ6Mm8/edit>
> is
> a sample file i did this way.
>
> As you see this is heavy to do for so many po files. But if this is found
> to be good and correct, I think we can make it into a bigger script that
> can do this repeatedly for a given directory.
>
> "clear-tags.py"
> import re
> f = open('y.csv')
> text = f.read()
> f.close()
>
> clean = re.sub('<[^>]+>', ' ', text)
>
> f = open('y-clean.csv', 'w')
> f.write(clean)
> f.close()
>
>
> On Mon, Dec 10, 2012 at 12:28 PM, Yaron Shahrabani <sh.ya...@gmail.com>wrote:
>
>> I still need some info from the translation maintainers:
>> What type of tags are there in the translation? (I'm not sure about the
>> type of tags).
>>
>> And Tadele: could you please show me some types of the tags you want to
>> eliminate?
>>
>> (Please type the entire text and what should be the output).
>>
>> Kind regards,
>>
>> Yaron Shahrabani
>>
>> <Hebrew translator>
>>
>>
>>
>>
>> On Mon, Dec 10, 2012 at 10:21 AM, Tadele Assefa <milky...@gmail.com>wrote:
>>
>>> Yaron,
>>>
>>> If it can be made to cleare the tags, i think your idea of 'regex' is
>>> brilliant. I was thinking of converting the po to csv open them in
>>> text editor and remove similar tags by search and replace, and the
>>> left overs by hand.... which is more of tiresome job for so many
>>> files.
>>>
>>> Please expand the regex as much as possible and release it.
>>>
>>> Thanks,
>>>
>>> On Sun, Dec 9, 2012 at 11:25 AM, Yaron Shahrabani <sh.ya...@gmail.com>
>>> wrote:
>>> > Hey guys!
>>> > These are great news!
>>> > If you need we can probably compose some sort of regex that eliminated
>>> the
>>> > tags, after accomplishing that task we can use Poedit to produce an HTML
>>> > output.
>>> >
>>> > Kind regards,
>>> > Yaron Shahrabani.
>>> >
>>> > Yaron Shahrabani
>>> >
>>> > <Hebrew translator>
>>> >
>>> >
>>> >
>>> >
>>> > On Sat, Dec 8, 2012 at 9:27 PM, Tadele Assefa <milky...@gmail.com>
>>> wrote:
>>> >>
>>> >> Dear All,
>>> >>
>>> >> We are translating the Libreoffice to Sidama (in Ethiopia). Our
>>> >> problem is this; some of our team members are language experts who do
>>> >> not know tags etc in the po files AND (computer usage infact). As the
>>> >> Help po files are so large we proposed to print the po files without
>>> >> the tags as pure text and then after the language guys finish the
>>> >> translation on paper, we will type and tag the translations. is there
>>> >> a tool we can use to accomplish this?
>>> >>
>>> >> --
>>> >> Regards,*
>>> >> ___________________________
>>> >> Tadele Assefa
>>> >> Managing Director*
>>> >> *
>>> >>
>>> >> Cell: +25-911-84-13-84*
>>> >> *Think Green – Please do not print this email unless you really need
>>> to*
>>> >>
>>> >> --
>>> >> Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
>>> >> Problems?
>>> >> http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
>>> >> Posting guidelines + more:
>>> http://wiki.documentfoundation.org/Netiquette
>>> >> List archive: http://listarchives.libreoffice.org/global/l10n/
>>> >> All messages sent to this list will be publicly archived and cannot be
>>> >> deleted
>>> >>
>>> >
>>>
>>>
>>>
>>> --
>>> Regards,
>>> ___________________________
>>> Tadele Assefa
>>> Managing Director
>>>
>>>
>>> Cell: +25-911-84-13-84
>>> Think Green – Please do not print this email unless you really need to
>>>
>>
>>
>
>
> --
> Regards,*
> ___________________________
> Tadele Assefa
> Managing Director*
> *
>
> Cell: +25-911-84-13-84*
> *Think Green – Please do not print this email unless you really need to*
>
> --
> Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
> Problems? 
> http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
> Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
> List archive: http://listarchives.libreoffice.org/global/l10n/
> All messages sent to this list will be publicly archived and cannot be deleted
>

-- 
Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/l10n/
All messages sent to this list will be publicly archived and cannot be deleted

Reply via email to