Magic Banana does it again ... with
"tr -s ' ' < file | sed 's/ $//' | sort -u"
I plugged in the filename thusly:
> tr -s ' ' < HolkitsorN.txt | sed 's/ $//' | sort -u > HolkitsorN-MBd.txt
The result: the original 13.1 kB shrank to 8.8 kB, with no residual
duplicate lines.
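As far as I can tell, the three stages do the following. This is only a minimal sketch: the function name squeeze_trim_unique and the sample lines are made up for illustration and are not taken from HolkitsorN.txt.

```shell
# Hypothetical helper wrapping the three stages of the pipeline.
squeeze_trim_unique() {
    # tr -s ' '    : squeeze each run of spaces into a single space
    # sed 's/ $//' : delete one space at the end of a line, if present
    # sort -u      : sort, keeping only one copy of identical lines
    tr -s ' ' | sed 's/ $//' | sort -u
}

# Made-up sample: one entry written three ways, differing only in spacing.
printf '%s\n' 'a.example  10.0.0.1' 'a.example 10.0.0.1 ' 'a.example 10.0.0.1' |
    squeeze_trim_unique
```

Plain sort -u on the same sample would keep all three lines, since they are not byte-for-byte identical until the spacing has been normalized.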
The LibreOffice Calc spreadsheet shrank from 332 rows to 224 rows; before
sorting and trying to remove duplicates, just with LibreOffice Calc, there
were 2461 rows. Clearly, this has to be a multi-step process. Running sort -u
by itself had no effect on the file size ... even after I "laundered" the
file by emailing it to myself and copying it from the email text into Leafpad.
I tried to parse MB's expression ... I'm OK until sed 's/ $//' because in
man sed it looks like it might be sed 's///' instead. Am I missing something?
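One possible reading, assuming the general form given in man sed is s/regexp/replacement/: here the regexp would be ' $' (a space followed by the end of the line) and the replacement would be empty, so a single trailing space gets deleted. A small sketch to check that reading; the function name strip_trailing_space and the sample input are made up:

```shell
# s/REGEXP/REPLACEMENT/ with REGEXP = ' $' (a space at the end of the line)
# and an empty REPLACEMENT, so one trailing space is deleted.
strip_trailing_space() {
    sed 's/ $//'
}

# Two made-up lines, the first with a trailing space. The second sed only
# wraps each line in brackets so that the line ends become visible.
printf 'abc \nabc\n' | strip_trailing_space | sed 's/.*/[&]/'
```

If the reading is right, both lines should come out as [abc].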
A residual task remains: The sorting process only works for me if I pad the
one- and two-digit octets with leading zeroes ... which have to be stripped
out afterwards. Is there a "sort" in any freedom-compatible application which
can put numbers in numerical order?
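One thing that may be worth trying is sort's own -t, -k and -n options, which split each line into fields and compare chosen fields as numbers. The sketch below assumes every line begins with a dotted-quad IPv4 address, which may not match the real layout of HolkitsorN.txt; the function name sort_ipv4 and the sample addresses are made up.

```shell
# Sort lines that start with an IPv4 address, octet by octet, numerically.
#   -t .      : use the dot as the field separator
#   -kN,Nn    : compare field N only, as a number
sort_ipv4() {
    sort -t . -k1,1n -k2,2n -k3,3n -k4,4n
}

# Made-up addresses; no zero-padding is needed.
printf '%s\n' 10.0.0.1 9.255.1.1 10.0.0.20 10.0.0.3 | sort_ipv4
```

With this, 10.0.0.3 should land before 10.0.0.20 without any leading zeroes.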
Thanks & best regards,
George Langford