Hi there!
I have a file like this:
file.txt
programs        sample  gene
program1        sample1 TP53
program1        sample1 TP53
program1        sample2 PRNP
program1        sample2 ATF3
program2        sample1 TP53
program2        sample1 PRNP
program2        sample2 TRIM32
program2        sample2 TLK1
program2        sample2 KIT


with open("prova.csv") as p:
    for i in p:
        # strip the newline and split each line on tabs
        lines = i.rstrip("\n").split("\t")
        print(lines)

['programs ', 'sample', 'gene', 'values']
['program1', 'sample1', 'TP53', '2']
['program1', 'sample1', 'TP53', '3']
['program1', 'sample2', 'PRNP', '4']
['program1', 'sample2', 'ATF3', '3']
['program2', 'sample1', 'TP53', '2']
['program2', 'sample1', 'PRNP', '5']
['program2', 'sample2', 'TRIM32', '4']
['program2', 'sample2', 'TLK1', '4']


I want to create a dictionary whose values are sets of gene names:

example:
dic = {}


dic['program1-sample1'] = {'TP53'}
dic['program1-sample2'] = {'PRNP', 'ATF3'}
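
A minimal sketch of one way such a dictionary could be built, assuming Python 3, a tab-separated file with a header row, and the program, sample and gene columns in the order shown above. The function name build_gene_sets and the in-memory sample text are hypothetical, used only to keep the example self-contained:

```python
import csv
import io
from collections import defaultdict


def build_gene_sets(handle):
    """Map 'program-sample' keys to the set of genes seen for that pair."""
    gene_sets = defaultdict(set)
    reader = csv.reader(handle, delimiter="\t")
    next(reader, None)  # skip the header row
    for row in reader:
        if len(row) < 3:
            continue  # ignore blank or short lines
        program, sample, gene = row[0], row[1], row[2]
        # a set keeps each gene once, so repeated rows collapse
        gene_sets[program + "-" + sample].add(gene)
    return dict(gene_sets)


# Hypothetical in-memory stand-in for the file, using a few rows from above.
sample_text = (
    "programs\tsample\tgene\n"
    "program1\tsample1\tTP53\n"
    "program1\tsample1\tTP53\n"
    "program1\tsample2\tPRNP\n"
    "program1\tsample2\tATF3\n"
    "program2\tsample1\tTP53\n"
    "program2\tsample1\tPRNP\n"
)
dic = build_gene_sets(io.StringIO(sample_text))
print(sorted(dic["program1-sample2"]))
```

With a real file, the open file object from open("prova.csv") could be passed in place of the io.StringIO wrapper.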

With a dictionary like that I can compare two sets, and so compare the
programs according to the genes they report.
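
The kind of comparison meant here might look like the sketch below, which uses Python's built-in set operators. The dictionary is filled in by hand from the rows shown earlier, and the variable names are only illustrative:

```python
# Hypothetical dictionary, filled in by hand from the rows shown earlier.
dic = {
    "program1-sample1": {"TP53"},
    "program2-sample1": {"TP53", "PRNP"},
}

a = dic["program1-sample1"]
b = dic["program2-sample1"]

shared = a & b       # genes reported by both programs for sample1
only_in_a = a - b    # genes reported only by program1
only_in_b = b - a    # genes reported only by program2

print(sorted(shared), sorted(only_in_a), sorted(only_in_b))
```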
Thanks in advance for your help!

