On Fri, 2011-07-08 at 01:17 -0700, David Haslam wrote:
> For projects that begin at USFM (or earlier), it would be great to develop a
> tool that analyses character frequency of the text (for the whole Bible)
> apart from all the USFM tags, etc.

Done for USFM.

sword-tools/modules/misc_cleanup/usfm_charmap.pl

Anything build from XML (this includes files coming out of e.g. a styled
MS word document, once exported properly, e.g to abiword.xml) the
previously mentioned will do the job largely. Shortcomings there would
be verse and chapter numbers are usually part of the pain text.

Peter


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to