Re: unique values of a Dictionary list (removing duplicate elements of a list)
Chad Kellerman wrote: > On Fri, May 21, 2010 at 8:07 AM, Chad Kellerman > wrote: > >> >> >> On Fri, May 21, 2010 at 7:50 AM, Peter Otten <__pete...@web.de> wrote: >> >>> Chad Kellerman wrote: >>> >>> > Python users, >>> > I am parsing an AIX trace file and creating a dictionary >>> containing >>> > keys (PIDS) and values (a list of TIDS). With PIDS being unique >>> > process ids >>> > and TIDS, being a list of thread ids. My function populates the keys >>> > so that they are unique, but my list contains duplicates. >>> > >>> > Can someone point me in the right direction so that my dictionary >>> > value >>> > does not contain duplicate elements? >>> > >>> > >>> > here is what I got. >>> > >>> > >>> > >>> > pidtids = {} >>> > >>> > # --- function to add pid and tid to a dictionary >>> > def addpidtids(pid,tid): >>> > pidtids.setdefault(pid,[]).append(tid) >>> >>> Use a set instead of a list (and maybe a defaultdict): >>> >>> from collections import defaultdict >>> >>> pidtids = defaultdict(set) >>> >>> def addpidtids(pid, tid): >>>pidtids[pid].add(tid) >>> >>> Peter >>> >> >> Thanks. I guess I should have posted this in my original question. >> >> I'm on 2.4.3 looks like defautldict is new in 2.5. >> >> I'll see if I can upgrade. >> >> Thanks again. >> > > > instead of upgrading.. (probably be faster to use techniques in available > 2.4.3) > > Couldn't I check to see if the pid exists (has_key I believe) and then > check if the tid is a value, in the the list for that key, prior to > passing it to the function? > > Or would that be too 'expensive'? No. pidtids = {} def addpidtids(pid, tid): if pid in pidtids: pidtids[pid].add(tid) else: pidtids[pid] = set((tid,)) should be faster than def addpidtids(pid, tid): pidtids.setdefault(pid, set()).add(tid) and both should work in python2.4. Peter -- http://mail.python.org/mailman/listinfo/python-list
Re: unique values of a Dictionary list (removing duplicate elements of a list)
On Fri, May 21, 2010 at 8:07 AM, Chad Kellerman wrote: > > > On Fri, May 21, 2010 at 7:50 AM, Peter Otten <__pete...@web.de> wrote: > >> Chad Kellerman wrote: >> >> > Python users, >> > I am parsing an AIX trace file and creating a dictionary >> containing >> > keys (PIDS) and values (a list of TIDS). With PIDS being unique process >> > ids >> > and TIDS, being a list of thread ids. My function populates the keys so >> > that they are unique, but my list contains duplicates. >> > >> > Can someone point me in the right direction so that my dictionary >> > value >> > does not contain duplicate elements? >> > >> > >> > here is what I got. >> > >> > >> > >> > pidtids = {} >> > >> > # --- function to add pid and tid to a dictionary >> > def addpidtids(pid,tid): >> > pidtids.setdefault(pid,[]).append(tid) >> >> Use a set instead of a list (and maybe a defaultdict): >> >> from collections import defaultdict >> >> pidtids = defaultdict(set) >> >> def addpidtids(pid, tid): >>pidtids[pid].add(tid) >> >> Peter >> > > Thanks. I guess I should have posted this in my original question. > > I'm on 2.4.3 looks like defautldict is new in 2.5. > > I'll see if I can upgrade. > > Thanks again. > instead of upgrading.. (probably be faster to use techniques in available 2.4.3) Couldn't I check to see if the pid exists (has_key I believe) and then check if the tid is a value, in the the list for that key, prior to passing it to the function? Or would that be too 'expensive'? > > > >> >> -- >> >> http://mail.python.org/mailman/listinfo/python-list >> > > > > -- > A grasshopper walks into a bar and the bartender says "Hey, we have a drink > named after you." And the grasshopper says "Really, You have a drink named > Murray?" > -- A grasshopper walks into a bar and the bartender says "Hey, we have a drink named after you." And the grasshopper says "Really, You have a drink named Murray?" -- http://mail.python.org/mailman/listinfo/python-list
Re: unique values of a Dictionary list (removing duplicate elements of a list)
On Fri, May 21, 2010 at 7:50 AM, Peter Otten <__pete...@web.de> wrote: > Chad Kellerman wrote: > > > Python users, > > I am parsing an AIX trace file and creating a dictionary containing > > keys (PIDS) and values (a list of TIDS). With PIDS being unique process > > ids > > and TIDS, being a list of thread ids. My function populates the keys so > > that they are unique, but my list contains duplicates. > > > > Can someone point me in the right direction so that my dictionary > > value > > does not contain duplicate elements? > > > > > > here is what I got. > > > > > > > > pidtids = {} > > > > # --- function to add pid and tid to a dictionary > > def addpidtids(pid,tid): > > pidtids.setdefault(pid,[]).append(tid) > > Use a set instead of a list (and maybe a defaultdict): > > from collections import defaultdict > > pidtids = defaultdict(set) > > def addpidtids(pid, tid): >pidtids[pid].add(tid) > > Peter > Thanks. I guess I should have posted this in my original question. I'm on 2.4.3 looks like defautldict is new in 2.5. I'll see if I can upgrade. Thanks again. > > -- > http://mail.python.org/mailman/listinfo/python-list > -- A grasshopper walks into a bar and the bartender says "Hey, we have a drink named after you." And the grasshopper says "Really, You have a drink named Murray?" -- http://mail.python.org/mailman/listinfo/python-list
Re: unique values of a Dictionary list (removing duplicate elements of a list)
Chad Kellerman wrote: > Python users, > I am parsing an AIX trace file and creating a dictionary containing > keys (PIDS) and values (a list of TIDS). With PIDS being unique process > ids > and TIDS, being a list of thread ids. My function populates the keys so > that they are unique, but my list contains duplicates. > > Can someone point me in the right direction so that my dictionary > value > does not contain duplicate elements? > > > here is what I got. > > > > pidtids = {} > > # --- function to add pid and tid to a dictionary > def addpidtids(pid,tid): > pidtids.setdefault(pid,[]).append(tid) Use a set instead of a list (and maybe a defaultdict): from collections import defaultdict pidtids = defaultdict(set) def addpidtids(pid, tid): pidtids[pid].add(tid) Peter -- http://mail.python.org/mailman/listinfo/python-list
unique values of a Dictionary list (removing duplicate elements of a list)
Python users, I am parsing an AIX trace file and creating a dictionary containing keys (PIDS) and values (a list of TIDS). With PIDS being unique process ids and TIDS, being a list of thread ids. My function populates the keys so that they are unique, but my list contains duplicates. Can someone point me in the right direction so that my dictionary value does not contain duplicate elements? here is what I got. pidtids = {} # --- function to add pid and tid to a dictionary def addpidtids(pid,tid): pidtids.setdefault(pid,[]).append(tid) # --- function to parse a file def grep(pattern, fileObj, include_line_nums=False): r=[] compgrep = re.compile(pattern) for line_num, line in enumerate(fileObj): if compgrep.search(line): info = line.split() p = info[7].lstrip("pid=") t = info[8].lstrip("tid=") addpidtids(p,t) # process trace.int tf = open(tracefile, 'r') grep("cmd=java pid",tf) tf.close() Any help would be greatly appreciated. Thanks, Chad -- A grasshopper walks into a bar and the bartender says "Hey, we have a drink named after you." And the grasshopper says "Really, You have a drink named Murray?" -- http://mail.python.org/mailman/listinfo/python-list