Re: [Trisquel-users] Removing unwanted carriage returns

amenex Sun, 22 Mar 2020 12:38:08 -0700

I'll restate the problem, unencumbered by distracting arrays of colons andhexadecimals.

All 387,000 rows fall into one of three types, each IP address appearing onlyonce in the first column:


Type A: $1("key" IP address), $2(CIDR block), $3(country code), $4(AS number)

Type B: $1(IP address falling within the $2CIDR block of Type A), $2(Type A's"key" IP address, repeated many times in successive rows)

Type C: $1(hostname), $2(Ip address from which $1hostname can be resolved),$3(CIDR block), $4(country code), $5(AS number)


(Type C is not very populous and can be handled with Leafpad)

The desired script:

awk should locate Type A's $1Key and find all the Type B rows whose$2Key match $1's Key, and thencopy Type A's columns $2, $3 & $4 in place of Type B's column $2 inevery instance of a match with Type A's $1Key

I have found a small number of Type A rows with no data, but those I can lookup with whois and fix easily.

The already looked-up hostnames are the only non-IP data in the $1 columns ofTypes A & B, so awk can safely

concentrate on all the Columns $1.

Also, all the IP addresses of looked-up hostnames will not reappear asnot-looked-up IP addresses.

If awk can do everything described above with the first Type A $1Key beforeproceeding, even if thatinvolves searching the entire 370,000 rows once for each Type A $1Key, thenwe're on the right track.


George Langford

Re: [Trisquel-users] Removing unwanted carriage returns

Reply via email to