Tadele, I'm wondering if pocommentclean from the Translate Toolkit is the droid you are looking for.
http://docs.translatehouse.org/projects/translate-toolkit/en/latest/commands/pocommentclean.html

cjl

On Sat, Jan 19, 2013 at 2:57 PM, Tadele Assefa <milky...@gmail.com> wrote:
> Hi Everybody,
>
> Lately, I returned to the 'tag elimination' idea because some of our
> translators cannot cope with the tags they see in Virtaal. (We are doing
> translation offline.) With a little regex from Stack Overflow
> <http://stackoverflow.com/a/8784436/1993440>, I produced the Python script
> below and got 'clean' text. Here is my entire 'process':
>
> 1) po2csv -i x.po -o y.csv => get CSV
> 2) Open y.csv in LO and delete columns A and C
> 3) Run clear-tags.py => get y-clean.csv
> 4) Open y-clean.csv in LO, run =MOD(ROW(L2),2) on a separate column, filter
>    the empty rows created by po2csv and delete those.
>
> Here <https://docs.google.com/file/d/0B1xCocIHKT5jS3FELXhCdVJ6Mm8/edit> is
> a sample file I did this way.
>
> As you see, this is heavy to do for so many po files. But if this is found
> to be good and correct, I think we can make it into a bigger script that
> can do this repeatedly for a given directory.
>
> "clear-tags.py"
> import re
>
> # Read the CSV exported by po2csv
> with open('y.csv') as f:
>     text = f.read()
>
> # Replace every <...> tag with a single space
> clean = re.sub('<[^>]+>', ' ', text)
>
> with open('y-clean.csv', 'w') as f:
>     f.write(clean)
>
>
> On Mon, Dec 10, 2012 at 12:28 PM, Yaron Shahrabani <sh.ya...@gmail.com> wrote:
>
>> I still need some info from the translation maintainers:
>> What type of tags are there in the translation? (I'm not sure about the
>> type of tags).
>>
>> And Tadele: could you please show me some types of the tags you want to
>> eliminate?
>>
>> (Please type the entire text and what should be the output).
>>
>> Kind regards,
>>
>> Yaron Shahrabani
>>
>> <Hebrew translator>
>>
>> On Mon, Dec 10, 2012 at 10:21 AM, Tadele Assefa <milky...@gmail.com> wrote:
>>
>>> Yaron,
>>>
>>> If it can be made to clear the tags, I think your idea of 'regex' is
>>> brilliant. I was thinking of converting the po to csv, opening them in a
>>> text editor and removing similar tags by search and replace, and the
>>> leftovers by hand... which is a rather tiresome job for so many
>>> files.
>>>
>>> Please expand the regex as much as possible and release it.
>>>
>>> Thanks,
>>>
>>> On Sun, Dec 9, 2012 at 11:25 AM, Yaron Shahrabani <sh.ya...@gmail.com> wrote:
>>> > Hey guys!
>>> > This is great news!
>>> > If you need, we can probably compose some sort of regex that eliminates
>>> > the tags; after accomplishing that task we can use Poedit to produce an
>>> > HTML output.
>>> >
>>> > Kind regards,
>>> > Yaron Shahrabani
>>> >
>>> > <Hebrew translator>
>>> >
>>> > On Sat, Dec 8, 2012 at 9:27 PM, Tadele Assefa <milky...@gmail.com> wrote:
>>> >>
>>> >> Dear All,
>>> >>
>>> >> We are translating LibreOffice to Sidama (in Ethiopia). Our
>>> >> problem is this: some of our team members are language experts who do
>>> >> not know the tags etc. in the po files (or, in fact, computer usage).
>>> >> As the Help po files are so large, we proposed to print the po files
>>> >> without the tags as pure text, and then, after the language guys finish
>>> >> the translation on paper, we will type and tag the translations. Is
>>> >> there a tool we can use to accomplish this?
>>> >>
>>> >> --
>>> >> Regards,
>>> >> ___________________________
>>> >> Tadele Assefa
>>> >> Managing Director
>>> >>
>>> >> Cell: +25-911-84-13-84
>>> >> Think Green – Please do not print this email unless you really need to
>>> >>
>>> >> --
>>> >> Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
>>> >> Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
>>> >> Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
>>> >> List archive: http://listarchives.libreoffice.org/global/l10n/
>>> >> All messages sent to this list will be publicly archived and cannot be
>>> >> deleted
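
For what it's worth, the "bigger script for a given directory" idea from the quoted message could look roughly like the sketch below. This is only an illustration under a few assumptions, not something taken from the Translate Toolkit: the function names (strip_tags, clean_csv, clean_directory) are made up here, the "-clean.csv" suffix just follows the y-clean.csv name used in the thread, and the files are assumed to be UTF-8. It reuses the same '<[^>]+>' pattern, but runs it per CSV cell rather than on the raw file, also collapses the leftover runs of spaces (including line breaks inside a cell), and skips rows that end up empty. It does not delete columns A and C, so that step would still be done separately.

```python
import csv
import re
from pathlib import Path

TAG_RE = re.compile(r"<[^>]+>")  # same pattern as in clear-tags.py


def strip_tags(text):
    """Replace each <...> tag with a space, then collapse repeated whitespace."""
    return " ".join(TAG_RE.sub(" ", text).split())


def clean_csv(src, dst):
    """Copy src to dst with tags removed from every cell; skip blank rows."""
    with open(src, newline="", encoding="utf-8") as fin, \
         open(dst, "w", newline="", encoding="utf-8") as fout:
        writer = csv.writer(fout)
        for row in csv.reader(fin):
            cleaned = [strip_tags(cell) for cell in row]
            if any(cleaned):  # drop rows that are empty after cleaning
                writer.writerow(cleaned)


def clean_directory(directory):
    """Write a NAME-clean.csv next to every NAME.csv in directory."""
    written = []
    for src in sorted(Path(directory).glob("*.csv")):
        if src.stem.endswith("-clean"):
            continue  # do not re-process output from an earlier run
        dst = src.with_name(src.stem + "-clean.csv")
        clean_csv(src, dst)
        written.append(dst)
    return written
```

Calling clean_directory("some/folder") after running po2csv on each po file would then leave one cleaned CSV per input file in that folder. Whether a plain regex is safe enough for every tag in the Help files is a separate question that this sketch does not answer.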