Dear group:

I have 50 thousand lists. My aim is to search a pattern in the alphabetical 
strings (these are protein sequence strings).


MMSASRLAGTLIPAMAFLSCVRPESWEPC VEVVP 
NITYQCMELNFYKIPDNLPFSTKNLDLSFNPLRHLGSYSFFSFPELQVLDLSRCEIQTIED

my aim is to find the list of string that has V*VVP. 

myseq = 'MMSASRLAGTLIPAMAFLSCVRPESWEPC VEVVP 
NITYQCMELNFYKIPDNLPFSTKNLDLSFNPLRHLGSYSFFSFPELQVLDLSRCEIQTIED'

if re.search('V*VVP',myseq):
print myseq 

the problem with this is, I am also finding junk with just VVP or VP etc. 

How can I strictly search for V*VVP only. 

Thanks for help. 

Hs
_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor

Reply via email to