On Wed, 2005-02-02 at 14:17 +0300, Anton Danilov wrote: > It's more than 32000 records, so it won't open. Actually, I did so, but > OO Calc is not the tool for l10n word counts. When you count words for > l10n, you must have filters that > > * Count words only in specific language (and support unicode encodings) > * Can omit <xml tags></xml tags> and count only what's inside > * Count only unique words > > I solved this problem with a couple of perl scripts and a combination of > cat, sort, uniq, and wc linux tools, although for some tasks I used OO > Calc as a tool. Anyway, thanks for your follow-up.
For things in PO format you can use pocount from the Translate Toolkit - the same one that does oo2po. I use it quite extensively to check wordcounts. I have never checked it with xml or any other tags. Might need a fix :) > > Anton. > > Vitor Domingos wrote: > > Anton I. Danilov wrote on 01/27/2005 08:09 AM: > > > >> Hi all, > >> > >> I've got two questions about the localization tools. > >> > >> First, what editor is the best suitable for UTF-8 encoded files, like > >> sdf-files from the OpenOffice UI? > > > > > > OpenOffice.org Calc. Just open as an CSV file. > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] -- Dwayne Bailey 083 443 7114 >From a mesh network in White River, Mpumalanga, South Africa --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
