Re: Reverse cut...

2002-12-02 Thread Andrew D Jewell
If you use the coreutils from http://alexautils.sourceforge.net/ (some docs at http://alexautils.sourceforge.net/text.html the package itself at http://alexautils.sourceforge.net/coreutils-4.5.3-alexa02.tar.gz) the version of cut has many many extensions, including the one you're after. Use :

Re: cut bug?

2002-11-07 Thread Andrew D Jewell
Alternately, you can used the enhanced cut in coreutils-4.5.3-alexa02.tar.gz at http://alexautils.sourceforge.net/ cut, as well as all the other field based tools, supports --dw to set the delimiter to "whitespace" and does what you want. adj At 11:16 AM -0700 11/7/02, Bob Proulx wrote: Bi

Re: fold: add indentation option

2002-11-06 Thread Andrew D Jewell
At 6:35 AM +0800 11/4/02, Dan Jacobson wrote: An enhancement to fold(1) would be to add a selectable indentation string to folded lines. I agree completely, and have added the feature to coreutils-4.5.3-alexa02.tar.gz at http://alexautils.sourceforge.net/ adj

redundancy, and how to reduce it

2002-07-14 Thread Andrew D Jewell
When adding new options to gnu utilities, I am always annoyed that I have to add each option in four places : long-options short-options usage() whatever.texi The solution I've come up with is to replace the glocal "stuct option" definition in the main program source with typedef struct linef

Re: PATCH -- new option for cut(1) -- whitespace delimited fields

2002-06-24 Thread Andrew D Jewell
Also, the version (from 2.0.21) at http://alexautils.sourceforge.net has this feature as --dw, as well as many other convenience delimiters, input and output, for all tools that deal with fields. adj At 1:25 AM -0700 6/22/02, Ian Bruce wrote: >As discussed previously on this list >(http://mai

Re: tab as sort's field-separator

2002-06-16 Thread Andrew D Jewell
What you've got here is a shell problem, not a sort problem. Although I'm sure I've done it in the past, I can't at the moment figure out how to make it happen in tcsh. One option is to grab the improved textutils from http://alexautils.sourceforge.net where all field based tools have '--dt' to

Re: sort -n

2002-06-14 Thread Andrew D Jewell
-n, --numeric-sort compare according to string numerical value -g, --general-numeric-sort compare according to general numerical value -n is fast and for integers only -g is slower, and sorts as by strtod, which is what you want. adj At 2:08 PM + 6/14/02, Apua Paquola wrote:

better --version in textutils

2002-05-29 Thread Andrew D Jewell
Something I find useful : adds the glibc version to the "--version" display. Requires AC_CHECK_HEADERS(gnu/libc-version.h) in configure.ac adj --- old-version-etc.c Wed May 29 11:30:01 2002 +++ version-etc.c Wed May 29 10:38:07 2002 @@ -32,6 +32,12 @@ # define _(Text) Text #endif +

Re: cut 2.0a

2002-05-12 Thread Andrew D Jewell
Bad News : gnu textutils must be 100% compatible with the past Good News : I needn't be. See : http://alexautils.sourceforge.com Which has this change for cut, plus many other improvements to gnu utils. adj At 4:35 PM -0400 5/10/02, Hacksaw wrote: > >The cut program could be changed to do th

Re: Error in sort

2002-04-08 Thread Andrew D Jewell
Instead of sort -k2 -k3n you want to say sort -k2,2 -k3n otherwise, you're saying "from 2 to the end" I don't like the semantics myself, but that's the way it goes. adj At 4:00 PM +0200 4/8/02, Andreas Erb wrote: >Hi, >while sorting a file I encountered a strange behavior: > >sorting this

discussion seeds

2002-02-19 Thread Andrew D Jewell
At Alexa, we have huge amounts of data (100's of terabytes) on a network of cheap UNIX machines (somewhere around 1000 such machines). The standard textutils distribution needs some changes to be maximally useful to us in this environment. I would like to describe some of the changes we've mad

join can fail on large files

2002-02-14 Thread Andrew D Jewell
The way join works is this : For a given match, read all the matching lines from both files into memory, and then print out the join results. The problem : If you're joining really big files (hundreds of gigabytes), the matching lines from one file or the other can exceed your process limit