Dear Expert programmers,
I apologise if this mail is off topic here.
I have a list of elements like these:
Contr1 SPR-10 SPR-101 SPR-125 SPR-137 SPR-139 SPR-143
contr2 SPR-1 SPR-15 SPR-126 SPR-128 SPR-141 SPR-148
contr3 SPR-106 SPR-130 SPR-135 SPR-138 SPR-139 SPR-145
contr4 SPR-124 SPR-125 SPR-130 SPR-139 SPR-144 SPR-148
There are several common elements prefixed with SPR-.
Although these elements are sorted in ascending order
within each row, the common elements are difficult to spot.
One has to find them by eyeballing.
It would be wonderful if these elements could be aligned
in columns by inserting gaps.
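
To make the goal concrete, here is a minimal sketch in Python of the kind of output meant here. It is only an illustration under stated assumptions, not an alignment algorithm in the bioinformatics sense: it assumes no item is repeated within a row and that every item ends in "-<number>", so one column can be made per distinct item. The names align_rows and sort_key and the gap symbol "-" are made up for this example.

```python
def align_rows(rows, gap="-"):
    """Give every distinct item its own column; fill missing cells with a gap.

    rows maps a row label to its list of items.
    Assumption: no item occurs twice within the same row.
    """
    def sort_key(item):
        # Assumption: items look like "SPR-139", so sort by the numeric part.
        prefix, _, number = item.rpartition("-")
        return (prefix, int(number))

    # One column per distinct item, in ascending numeric order.
    columns = sorted({item for items in rows.values() for item in items},
                     key=sort_key)
    aligned = {}
    for label, items in rows.items():
        present = set(items)
        aligned[label] = [c if c in present else gap for c in columns]
    return columns, aligned


rows = {
    "Contr1": "SPR-10 SPR-101 SPR-125 SPR-137 SPR-139 SPR-143".split(),
    "contr2": "SPR-1 SPR-15 SPR-126 SPR-128 SPR-141 SPR-148".split(),
    "contr3": "SPR-106 SPR-130 SPR-135 SPR-138 SPR-139 SPR-145".split(),
    "contr4": "SPR-124 SPR-125 SPR-130 SPR-139 SPR-144 SPR-148".split(),
}

columns, aligned = align_rows(rows)
width = max(len(c) for c in columns)
for label, cells in aligned.items():
    # Pad every cell to the same width so shared items line up vertically.
    print(label.ljust(8), " ".join(cell.ljust(width) for cell in cells))
```

With this layout a shared item such as SPR-139 lands in the same column for every row that contains it. If a row can contain the same item more than once, this simple approach no longer applies.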
In the bioinformatics world, this is the same problem as
protein or DNA alignment.
Example:
If there are 3 sequences, DNA1, DNA2 and DNA3, with the
following sequences:
DNA1: ATAAAATTAA
DNA2: AAAAATAT
DNA3: TAATAATAA
DNA1 ATAAAATTAA
DNA2 A AAAA TA T
DNA3 TA AtAAT AA
In the second block, these 3 sequences are aligned by introducing gaps.
In real DNA and protein sequence alignment, more
complex algorithms and scoring schemes are used to produce
a better-scoring alignment.
Unfortunately, I cannot apply these
algorithms/programs to my data, because they
are made for DNA and protein sequences.
I googled for some word matchers. There are programs
available; however, they align the words without introducing
gaps, so ultimately I cannot see the common items
clearly lined up (I may be wrong here; they
might do better than I think).
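
For two rows at a time, a gap-inserting alignment of arbitrary words can be sketched with difflib.SequenceMatcher from the Python standard library. This is only a rough sketch: it is pairwise, not a multiple alignment, and difflib uses its own matching heuristic rather than the scoring schemes of DNA/protein tools. The function name align_pair and the gap symbol "-" are made up for this example.

```python
from difflib import SequenceMatcher


def align_pair(a, b, gap="-"):
    """Align two lists of words, inserting gaps so equal words share a column."""
    out_a, out_b = [], []
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        left, right = list(a[i1:i2]), list(b[j1:j2])
        if tag == "equal":
            # Matching stretch: words sit opposite each other.
            out_a += left
            out_b += right
        else:
            # Unmatched stretch ("replace", "delete" or "insert"):
            # words from a go opposite gaps, then words from b go opposite gaps.
            out_a += left + [gap] * len(right)
            out_b += [gap] * len(left) + right
    return out_a, out_b


row1 = "SPR-10 SPR-101 SPR-125 SPR-137 SPR-139 SPR-143".split()
row4 = "SPR-124 SPR-125 SPR-130 SPR-139 SPR-144 SPR-148".split()

out1, out4 = align_pair(row1, row4)
width = max(len(w) for w in row1 + row4)
print("Contr1", " ".join(w.ljust(width) for w in out1))
print("contr4", " ".join(w.ljust(width) for w in out4))
```

Because it compares sequences position by position, this approach does not require the words in a row to be unique, but extending it to many rows at once would need a real multiple alignment method.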
My question to the community is: are there any
programs that would generate a multiple alignment on
user-defined data? I am sure that the idea of multiple
alignment algorithms must have been extended to other areas of
science, economics or LINGUISTICS.
Could anyone help me align my data? I have
a total of 50 unique words (SPR-1, SPR-2, SPR-3 and so on,
though not exactly in that order or with those digits). For
some Control elements I have 120 such words in a row
(think of this as a sequence of 120 words).
So if I have to do this in Excel I will spend the rest
of my happy life doing that :-)
To show that I tried, I have pasted my attempt
below (it derailed completely).
So, dear respected members, do you have any suggestions
for such programs that I can use in this world of
CS?
Thank you.
S
Contr1 SPR-10 SPR-15 SPR-101 SPR-106
SPR-138
SPR-139 SPR-140 SPR-144 SPR-148
contr2 SPR-1 SPR-10 SPR-101 SPR-130
SPR-138
SPR-139 SPR-142 SPR-144 SPR-148
contr3 SPR-15 SPR-16 SPR-17 SPR-106 SPR-130 SPR-135
SPR-139 SPR-144 SPR-181
_______________________________________________
Tutor maillist - [email protected]
http://mail.python.org/mailman/listinfo/tutor