On Sat, 26 Apr 2008 23:44:17 +0530, Rustom Mody wrote: > Over years Ive collected tgz's of my directories. I would like to diff > and uniq them > > Now I guess it would be quite simple to write a script that does a walk > or find through a pair of directory trees, makes a SHA1 of each file and > then sorts out the files whose SHA1s are the same/different. What is > more difficult for me to do is to write a visual/gui tool to help me do > this. > > I would guess that someone in the python world must have already done it > [The alternative is to use some of the tools that come with version > control systems like git. But if I knew more about that option I would > not be stuck with tgzs in the first place ;-)] > > So if there is such software known please let me know. > > PS Also with the spam flood that has hit the python list I dont know if > this mail is being read at all or Ive fallen off the list!
It doesn't have a GUI, but here's a python program I wrote for dividing large collections of files up into identical groups: http://stromberg.dnsalias.org/~strombrg/equivalence-classes.html -- http://mail.python.org/mailman/listinfo/python-list