On Wed, 2005-02-02 at 14:17 +0300, Anton Danilov wrote:
> It's more than 32000 records, so it won't open. Actually, I did so, but 
> OO Calc is not the tool for l10n word counts. When you count words for 
> l10n, you must have filters that
> 
> * Count words only in specific language (and support unicode encodings)
> * Can omit <xml tags></xml tags> and count only what's inside
> * Count only unique words
> 
> I solved this problem with a couple of perl scripts and a combination of 
> cat, sort, uniq, and wc linux tools, although for some tasks I used OO 
> Calc as a tool. Anyway, thanks for your follow-up.

For things in PO format you can use pocount from the Translate Toolkit -
the same one that does oo2po.

I use it quite extensively to check wordcounts.  I have never checked it
with xml or any other tags.  Might need a fix :)


> 
> Anton.
> 
> Vitor Domingos wrote:
> > Anton I. Danilov wrote on 01/27/2005 08:09 AM:
> > 
> >> Hi all,
> >>
> >> I've got two questions about the localization tools.
> >>
> >> First, what editor is the best suitable for UTF-8 encoded files, like
> >> sdf-files from the OpenOffice UI?
> > 
> > 
> > OpenOffice.org Calc. Just open as an CSV file.
> > 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
-- 
Dwayne Bailey
083 443 7114

>From a mesh network in White River, Mpumalanga, South Africa


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to