On Fri, 2011-07-08 at 01:17 -0700, David Haslam wrote: > For projects that begin at USFM (or earlier), it would be great to develop a > tool that analyses character frequency of the text (for the whole Bible) > apart from all the USFM tags, etc.
Done for USFM. sword-tools/modules/misc_cleanup/usfm_charmap.pl Anything build from XML (this includes files coming out of e.g. a styled MS word document, once exported properly, e.g to abiword.xml) the previously mentioned will do the job largely. Shortcomings there would be verse and chapter numbers are usually part of the pain text. Peter _______________________________________________ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page