hi everybody

Beema shafreen Tue, 21 Aug 2007 02:37:18 -0700

hi everybody, i have  written to fetch the url, and accesstje nm and np
entries
my code:
import re
import urllib2
import time
Gene_id=raw_input("Please enter the gene_id:")
fh = urllib2.urlopen('
http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene&cmd=search&term='+Gene_id)
for line in fh.readlines():
        pattern = re.compile('(NM_\d+.\d{0,5})</a>.*(NP_\d+.\d{0,5})</a>')
        m = pattern.search(line)
        if m:
                nm_entry = m.group(1)
                np_entry =  m.group(2)
                length = len(np_entry)
                #data = raw_input("There are %s entry, They are:" %(length))
                fh1 = urllib2.urlopen('
http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?val='+nm_entry)
                for line1 in fh1.readlines():
                        p1 = re.compile('source\s*(\d{1}.*\d+)')
                        m1 = p1.search(line1)
                        if m1:
                                seq = m1.group(1)
                                seq_len = seq.split('..')
                                print nm_entry, 'Length of NM_seq:',
seq_len[1],np_entry



                fh1.close()
fh.close()
time.sleep(2)

in my result :
Please enter the gene_id:
(after this i want to get the text and data) eg., there are 11 entries  and
the nm and np entry the final print statement. i have include the
highlighted text in code where it is repeaded since inside the looping
please check about the following code and post your comments and where is
include the text to get the result properly
#data = raw_input("There are %s entry, They are:" %(length))



regards
shafreen

-- 
http://mail.python.org/mailman/listinfo/python-list

hi everybody

Reply via email to