HI there!!!
I have afile like this:
file.txt
programs sample gene
program1 sample1 TP53
program1 sample1 TP53
program1 sample2 PRNP
program1 sample2 ATF3
program2 sample1 TP53
program2 sample1 PRNP
program2 sample2 TRIM32
program2 sample2 TLK1
program2 sample2 KIT
with open("prova.csv") as p:
for i in p:
...: lines = i.rstrip("\n").split("\t")
...: print lines
...:
['programs ', 'sample', 'gene', 'values']
['program1', 'sample1', 'TP53', '2']
['program1', 'sample1', 'TP53', '3']
['program1', 'sample2', 'PRNP', '4']
['program1', 'sample2', 'ATF3', '3']
['program2', 'sample1', 'TP53', '2']
['program2', 'sample1', 'PRNP', '5']
['program2', 'sample2', 'TRIM32', '4']
['program2', 'sample2', 'TLK1', '4']
I want to create a dictionary with set data with the names of the genes:
example:
dic = {}
dic['program1-sample1] = set(TP53)
dic['program1-sample2] = set(TP53,PRNP,ATF3)
So If I have a dictionary like that I can compare two set I will compare the
capacity of the programs in function of the gene show.
Thanks in advance for your help!
_______________________________________________
Tutor maillist - [email protected]
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor