so it still unfinished :) around 1GB for 1033268 words :) (comes from a top unix command)
Paul > i was also thinking on doing it like that by pip-ing to 'sort | uniq -c | sort -nr' , but i'm pleased if Python can handle it. (well but maybe Python is slower? will check later...) Klaas > i do not know about intern construct, i will have look, but when googling i first found a post from Raymond Hettinger so i'm going to mess my mental space :) http://mail.python.org/pipermail/python-dev/2003-November/040433.html best regards. -- http://mail.python.org/mailman/listinfo/python-list