--- On Sun, 5/12/13, Rafael Skodlar <ra...@linwin.com> wrote:

> > It can be used as one, certainly, but then so can
> Postscript.
> >
> 
> By that logic, we would program web sites code in assembler.
> But then 
> millions use inches and feet even when they tweet ...

You can also do crazy things such as writing an HTML document completely in 
UTF-8 codes, they're &# followed by 4 digits and a ; Leading zeros can be left 
out.

7 characters to encode 1, and for any character in English and several other 
languages, completely pointless when extended ASCII contains all the required 
characters, including left and right single and double quotes.

That brings up a nifty utility I had someone create, it's a text string 
swapper. The input is simple, just a text file with pairs of lines. The program 
has a user interface to enter the swaps but it's easier to write the input file 
in a text editor.

The contents of the first line will be swapped with the contents of the second, 
third with fourth etc.

I used it for 'cleaning' HTML prior to converting ebooks and other documents to 
read on a Palm LifeDrive because Palm OS doesn't support unicode. (Palm OS 5 
was new enough it should have had Unicode!)

It could be useful for doing extremely fast, multiple swaps on any type of text 
file. Sure you can do find and replace all in Word and other text editing 
software, but just one at a time. This utility can do many swaps in a single 
pass.

For example you want to change all instances of G0 to G00 *and* all instances 
of 1002 1005 to 1002-1006 (including the space between) *and* all instances of 
meep with Beep, this can do it.

However it does have a bit of a bug. The number of swap pairs it can handle is 
limited. Try to do too many and it'll bugger up both the file you're working on 
and the swap pairs input file. I tried to make a universal UTF-8 code to ASCII 
input file and it choked on it.

I do have the source code and compiled versions if anyone wants them to tinker 
with and possibly fix the bug. It was written in (IIRC) Visual C 2005 - some 
part of Microsoft's 2005 programming suite. It works perfectly fine as long as 
it's not pushed past its limit. ;)

I didn't experiment to find out the maximum number of swap pairs before it goes 
gaga. Since I didn't write it, I've no idea where to look for the bug or what'd 
be causing it. The author did it for free, got the essential functionality 
working. Good enough!

On the expanded functionality wishlist is the ability to load different swap 
lists for different uses. As it is, it just reads replacementfile.txt in the 
same folder as the executable so to use more than one the user has to keep 
other lists elsewhere or rename the one to use.

It could even be used (if the bug is fixed) for multiple pass encryption and 
decryption by using a series of swap lists to re-arrange all the characters.

If you're interested I can zip up the lot, including replacement lists I made, 
and send it to you.

------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and 
their applications. This 200-page book is written by three acclaimed 
leaders in the field. The early access version is available now. 
Download your free book today! http://p.sf.net/sfu/neotech_d2d_may
_______________________________________________
Emc-users mailing list
Emc-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/emc-users

Reply via email to