Hello all, I would like to identify or eliminate pairs of "words" from different lines.
An example (all words are seperated by a tab: 53_G16I9RF01EUP2C 53_G16I9RF02JZUJU 53_G16I9RF02JZUJU 53_G16I9RF01EUP2C 53_G16I9RF02JZV1E 33_G0JCAX402GV9YC 53_G16I9RF02JZV1E 33_G16I9RF02FOVF0 or: A B B A C D E F Line one and two contains the same words but in inverted order. I would like to eliminate one of these "duplicates". I thought it could work with process duplicate lines with: [a-z0-9_]{17}\t[a-z0-9_]{17} but this didn't work. Then I added two other columns with inverted order: 53_G16I9RF01EUP2C 53_G16I9RF02JZUJU 53_G16I9RF02JZUJU 53_G16I9RF01EUP2C 53_G16I9RF02JZUJU 53_G16I9RF01EUP2C 53_G16I9RF01EUP2C 53_G16I9RF02JZUJU 53_G16I9RF02JZV1E 33_G0JCAX402GV9YC 33_G0JCAX402GV9YC 53_G16I9RF02JZV1E 53_G16I9RF02JZV1E 33_G16I9RF02FOVF0 33_G16I9RF02FOVF0 53_G16I9RF02JZV1E 53_G16I9RF02JZV1E 41_G0JCAX402GBVAW 41_G0JCAX402GBVAW 53_G16I9RF02JZV1E or: A B B A B A A B C D D E F G G F .. In this case line one contains the same pairs as line two (A B at the beginning - line 1; A B at the end of the line 2 Then I tried again process duplicate lines with: [a-z0-9_]{17}\t[a-z0-9_]{17} It didn't work and I don't understand why. I would be glad if someone could help me out with this. Perhaps there is a more simple way to do this Thanks a lot in advance archaeal -- You received this message because you are subscribed to the "BBEdit Talk" discussion group on Google Groups. To post to this group, send email to bbedit@googlegroups.com To unsubscribe from this group, send email to bbedit+unsubscr...@googlegroups.com For more options, visit this group at <http://groups.google.com/group/bbedit?hl=en> If you have a feature request or would like to report a problem, please email "supp...@barebones.com" rather than posting to the group. Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>