Hi everybody,

Lately I returned to the 'tag elimination' idea because some of our translators cannot cope with the tags they see in Virtaal. (We are doing the translation offline.) With a little regex from Stack Overflow <http://stackoverflow.com/a/8784436/1993440>, I produced the Python script below and got 'clean' text. Here is my entire 'process':

1) po2csv -i x.po -o y.csv => get the CSV
2) Open y.csv in LO and delete columns A and C
3) Run clear-tags.py => get y-clean.csv
4) Open y-clean.csv in LO, run =MOD(ROW(L2),2) in a separate column, then filter out the empty rows created by po2csv and delete them.

Here <https://docs.google.com/file/d/0B1xCocIHKT5jS3FELXhCdVJ6Mm8/edit> is a sample file I made this way.

As you can see, this is heavy work for so many po files. But if this is found to be good and correct, I think we can turn it into a bigger script that does this repeatedly for a given directory.

"clear-tags.py"

    import re

    # Read the CSV exported by po2csv (columns A and C already removed).
    # The encoding is assumed to be UTF-8.
    with open('y.csv', encoding='utf-8') as f:
        text = f.read()

    # Replace every <...> tag with a single space.
    clean = re.sub('<[^>]+>', ' ', text)

    # Write the tag-free text to a new file.
    with open('y-clean.csv', 'w', encoding='utf-8') as f:
        f.write(clean)

On Mon, Dec 10, 2012 at 12:28 PM, Yaron Shahrabani <sh.ya...@gmail.com> wrote:

> I still need some info from the translation maintainers:
> What type of tags are there in the translation? (I'm not sure about the
> type of tags).
>
> And Tadele: could you please show me some types of the tags you want to
> eliminate?
>
> (Please type the entire text and what should be the output).
>
> Kind regards,
>
> Yaron Shahrabani
>
> <Hebrew translator>
>
> On Mon, Dec 10, 2012 at 10:21 AM, Tadele Assefa <milky...@gmail.com> wrote:
>
>> Yaron,
>>
>> If it can be made to clear the tags, I think your idea of 'regex' is
>> brilliant. I was thinking of converting the po to csv, opening them in a
>> text editor and removing similar tags by search and replace, and the
>> leftovers by hand... which is a rather tiresome job for so many
>> files.
>>
>> Please expand the regex as much as possible and release it.
>>
>> Thanks,
>>
>> On Sun, Dec 9, 2012 at 11:25 AM, Yaron Shahrabani <sh.ya...@gmail.com>
>> wrote:
>> > Hey guys!
>> > These are great news!
>> > If you need we can probably compose some sort of regex that eliminates
>> > the tags, after accomplishing that task we can use Poedit to produce an
>> > HTML output.
>> >
>> > Kind regards,
>> > Yaron Shahrabani.
>> >
>> > Yaron Shahrabani
>> >
>> > <Hebrew translator>
>> >
>> > On Sat, Dec 8, 2012 at 9:27 PM, Tadele Assefa <milky...@gmail.com> wrote:
>> >>
>> >> Dear All,
>> >>
>> >> We are translating the Libreoffice to Sidama (in Ethiopia). Our
>> >> problem is this; some of our team members are language experts who do
>> >> not know tags etc in the po files AND (computer usage in fact). As the
>> >> Help po files are so large we proposed to print the po files without
>> >> the tags as pure text and then after the language guys finish the
>> >> translation on paper, we will type and tag the translations. Is there
>> >> a tool we can use to accomplish this?
>> >>
>> >> --
>> >> Regards,
>> >> ___________________________
>> >> Tadele Assefa
>> >> Managing Director
>> >>
>> >> Cell: +25-911-84-13-84
>> >> Think Green – Please do not print this email unless you really need to
>> >>
>> >> --
>> >> Unsubscribe instructions: E-mail to l10n+h...@global.libreoffice.org
>> >> Problems?
>> >> http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
>> >> Posting guidelines + more:
>> >> http://wiki.documentfoundation.org/Netiquette
>> >> List archive: http://listarchives.libreoffice.org/global/l10n/
>> >> All messages sent to this list will be publicly archived and cannot be
>> >> deleted
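
A rough sketch of how the 'bigger script' for a whole directory might look, and how it could take over the manual LO steps 2 to 4. It is only a sketch under assumptions, not a tested tool: it assumes the files from po2csv are UTF-8 and keep the source text in the second column (the one left after deleting A and C), and the function names strip_tags, clean_csv and clean_directory are made up for illustration. It applies the same regex cell by cell through Python's csv module, so a stray '<' in one row cannot swallow text from the next, and it drops rows that end up empty.

```python
import csv
import re
from pathlib import Path

# Same pattern as in clear-tags.py: anything between '<' and the next '>'.
TAG_RE = re.compile(r"<[^>]+>")


def strip_tags(text):
    """Replace each <...> tag with a space, then collapse repeated whitespace."""
    return " ".join(TAG_RE.sub(" ", text).split())


def clean_csv(src, dst, column=1):
    """Copy one column of src to dst without tags, skipping rows left empty."""
    with open(src, newline="", encoding="utf-8") as fin, \
         open(dst, "w", newline="", encoding="utf-8") as fout:
        writer = csv.writer(fout)
        for row in csv.reader(fin):
            if len(row) <= column:
                continue  # blank or short row: nothing to keep
            text = strip_tags(row[column])
            if text:
                writer.writerow([text])


def clean_directory(directory):
    """Clean every *.csv in directory, writing <name>-clean.csv beside it."""
    for src in sorted(Path(directory).glob("*.csv")):
        if src.stem.endswith("-clean"):
            continue  # do not re-process earlier output
        clean_csv(src, src.with_name(src.stem + "-clean.csv"))
```

Calling clean_directory('.') after running po2csv on each po file would write one y-clean.csv per y.csv. Note that the regex also removes any ordinary text that happens to sit between '<' and '>', and that collapsing whitespace flattens multi-line strings into one line, so the output should be checked against a few original files before printing.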