On 23/01/17 11:15, Markus Laker wrote:

A 2GiB disk file caused tinycat.d to use over 4GiB of memory.


When extending arrays, a common approach is to double the size every time you run out of space. This guarantees an amortized O(1) cost per append.

Unfortunately, this also guarantees that we will never have enough space freed by previous copies to reuse existing memory:

100-byte array

increase

100 bytes free
200-byte array

increase

300 bytes free
400-byte array

etc. The array will always be bigger than the total amount of space we freed: after k doublings the array occupies 2^k * 100 bytes, while only 100 + 200 + ... + 2^(k-1) * 100 = (2^k - 1) * 100 bytes have been freed.
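Here is a minimal sketch, in D, of that bookkeeping (the 100-byte starting size and the doubling policy are taken from the example above; as before, I'm assuming the old block can only be released after its contents have been copied into the new one):

import std.stdio;

void main()
{
    size_t size  = 100;  // live array, in bytes (initial size from the example above)
    size_t freed = 0;    // total bytes released by earlier copies

    foreach (step; 1 .. 7)
    {
        freed += size;   // the old block is released only after the copy
        size  *= 2;      // doubling growth policy
        writefln("step %s: %s bytes free, %s bytes array", step, freed, size);
    }
}

Every line it prints shows the freed total trailing the live array by exactly the original 100 bytes, so the new block never fits into the space we have given back.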

If, instead of increasing the size by 100%, we increase it by a smaller percentage of the previous size, we still get an amortized O(1) append cost (with a somewhat higher constant factor; that's the trade-off), but we can now eventually reuse memory. Let's say we increase by 50% each time:

100 array

increase

100 free
150 array

increase

250 free
225 array

increase

475 free
338 array

increase

813 free
507 array

The next increase will require 761 bytes, but we already have 813 bytes free, so we can allocate the new array inside memory already freed by earlier copies, keeping the heap from growing any further.
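The same sketch, parameterised by the growth factor, finds that point automatically (sizes are rounded half up, which matches the 338, 507 and 761 figures above; in-place growth is still assumed to be unavailable):

import std.stdio;

void main()
{
    enum growthFactor = 1.5;  // 50% enlargement per step
    size_t size  = 100;       // live array, in bytes
    size_t freed = 0;         // total bytes released by earlier copies

    foreach (step; 1 .. 20)
    {
        // size of the next block, rounded half up
        auto newSize = cast(size_t)(size * growthFactor + 0.5);

        if (freed >= newSize)
        {
            writefln("step %s: the new %s-byte block fits into the %s bytes already freed",
                     step, newSize, freed);
            break;
        }

        freed += size;   // old block released after the copy
        size   = newSize;
        writefln("step %s: %s free, %s array", step, freed, size);
    }
}

With growthFactor = 1.5 it reproduces the walkthrough above and stops at step 5, where the 761-byte block fits into the 813 bytes already freed; with 2.0 the condition is never met.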

Of course, if we integrated the allocation with the move, so that the space occupied by the current array could also count towards the new block, we could have reused previously freed memory as early as allocation 3; but that trick, if available, would help the 100% case just as well, so I'm assuming we can't do it.

Of course, if, instead of 50%, we increase by even less (say, 20%), we can reuse previously freed memory even sooner.
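Under the same model, running the sketch above with growthFactor = 1.2 gives block sizes 100, 120, 144, 173, ...; by the time the 173-byte block is needed, 220 bytes (100 + 120) have already been freed, so reuse becomes possible at the third enlargement rather than the fifth.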

I am assuming that this is the problem that manifests itself in this use scenario. I would suggest solving it at the language level, rather than the library level.

Shachar
