On 11/07/2011 02:43 PM, Juan Declet-Barreto wrote:
Hi,
Can anyone provide links or basic info on memory management, variable
dereferencing, or the like? I have a script that traverses a file structure
using os.walk and adds directory names to a list. It works for a small number
of directories, but when I set it loose on a directory with thousands of
dirs/subdirs, it crashes the DOS session and also the Python shell (when I run
it from the shell). This makes it difficult to figure out whether the allocated
memory or heap space for the DOS/shell session has overflowed, or why it is
crashing.
Juan Declet-Barreto
I don't have any reference to point you to, but CPython's memory
management is really pretty simple. However, it's important to tell us
which build of Python you're using, as there are several, with very
different memory rules. For example, Jython, which is Python running in
a Java VM, lets the Java garbage collector handle things, and it
behaves entirely differently. Likewise, the OS may be relevant. You're
using Windows-style terminology, but that doesn't prove you're on
Windows, nor does it say which version.
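If it helps, a couple of lines at the top of the script will report
exactly which build and OS you're on (this is all standard-library
introspection, nothing specific to your setup; 2.x print statement
assumed):

import sys, platform

# Interpreter: full version string, implementation, and bitness
print sys.version
print platform.python_implementation(), platform.architecture()[0]
# Operating system and version
print platform.platform()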
Assuming 32-bit CPython 2.7 on XP, the principles are simple. When an
object is no longer accessible, it gets garbage collected*. So if you
build a list inside a function, and the only reference to it is the
function's local variable, the whole list will be freed when the
function exits. The mistakes many people make are unnecessarily using
globals, and building lists when iterables would work just as well.
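As a sketch of what I mean (assuming your goal is roughly to visit or
count directory names; substitute whatever your script actually does
with them), keep the walk inside a function and consume os.walk lazily
instead of accumulating everything in one global list:

import os

def count_dirs(top):
    """Walk the tree, processing directory names as they arrive."""
    total = 0
    for dirpath, dirnames, filenames in os.walk(top):
        # Handle each batch here (write to a file, count, filter, ...)
        # rather than appending them all to an ever-growing global list.
        total += len(dirnames)
    return total

print count_dirs(r"C:\some\big\tree")   # placeholder path

Everything local to count_dirs becomes unreachable, and is freed, as
soon as the function returns.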
The tool on XP to tell how much memory is in use is Task Manager. As
you point out, it's hard to catch a short-running app in the act. So
you want to add a counter to your code (global), and see how high it
gets when it crashes. Then put a test in your code for that counter
value, and do an "input" somewhat earlier.
At that point, see how much memory the program is actually using.
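Something along these lines, where PAUSE_AT is a number you'd tune
after seeing how high the counter gets before the crash (raw_input,
since this is 2.x):

import os

counter = 0          # global progress counter
PAUSE_AT = 50000     # set this somewhat below the count where it died

def walk_dirs(top):
    global counter
    dirs = []
    for dirpath, dirnames, filenames in os.walk(top):
        dirs.extend(dirnames)
        counter += 1
        if counter == PAUSE_AT:
            raw_input("At %d dirs; check Task Manager, then press Enter"
                      % counter)
    return dirs

walk_dirs(r"C:\some\big\tree")   # placeholder path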
Now, when an object is freed, a new one of the same size is likely to
re-use the space immediately. But if they're all different sizes, it's
somewhat statistical, and you can get fragmentation. When Python's pool
is full, it asks the OS for more (perhaps using swap space), but I
don't think it ever gives it back. So your memory use acts as a kind of
high-water mark. That's why it's problematic to build a huge data
structure, walk through it, and then delete it: the script will
probably continue to show the peak memory use indefinitely.
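If you'd rather get the numbers from inside the script than eyeball
Task Manager, the Win32 API exposes both the current and the peak
working set. A rough ctypes sketch (Windows-only; treat the structure
layout as something to verify against the PROCESS_MEMORY_COUNTERS
documentation rather than gospel):

import ctypes
from ctypes import wintypes

class PROCESS_MEMORY_COUNTERS(ctypes.Structure):
    _fields_ = [("cb", wintypes.DWORD),
                ("PageFaultCount", wintypes.DWORD),
                ("PeakWorkingSetSize", ctypes.c_size_t),
                ("WorkingSetSize", ctypes.c_size_t),
                ("QuotaPeakPagedPoolUsage", ctypes.c_size_t),
                ("QuotaPagedPoolUsage", ctypes.c_size_t),
                ("QuotaPeakNonPagedPoolUsage", ctypes.c_size_t),
                ("QuotaNonPagedPoolUsage", ctypes.c_size_t),
                ("PagefileUsage", ctypes.c_size_t),
                ("PeakPagefileUsage", ctypes.c_size_t)]

def memory_usage():
    """Return (current, peak) working set of this process, in bytes."""
    pmc = PROCESS_MEMORY_COUNTERS()
    pmc.cb = ctypes.sizeof(pmc)
    ctypes.windll.psapi.GetProcessMemoryInfo(
        ctypes.windll.kernel32.GetCurrentProcess(),
        ctypes.byref(pmc), pmc.cb)
    return pmc.WorkingSetSize, pmc.PeakWorkingSetSize

print "current %d bytes, peak %d bytes" % memory_usage()

The peak value is exactly that high-water mark: it only goes up.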
* (Technically, CPython uses reference counting. When an object's
reference count reaches zero, the object is freed immediately. The real
garbage collector does lazier scanning, mainly to reclaim reference
cycles.)
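A quick way to see the difference: drop the last reference to a plain
object and a weakref callback fires immediately; tie the object into a
reference cycle and nothing happens until the cycle collector runs.
(The names here are just for demonstration.)

import gc, weakref

class Node(object):
    pass

def gone(ref):
    print "object was freed"

a = Node()
r1 = weakref.ref(a, gone)
a = None            # refcount hits zero: callback fires right here

b = Node()
b.self_ref = b      # reference cycle keeps the count above zero
r2 = weakref.ref(b, gone)
b = None            # nothing prints yet; the cycle still holds it
gc.collect()        # the cycle detector frees it, callback fires now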